New features we'd like to support: - [ ] Tool Calling - [ ] Structured Outputs - [ ] Reasoning Outputs - [ ] Automatic Prefix Caching - [ ] Speculative Decoding - [ ] Quantization For any other features not mentioned here, please add them to this thread