What's Changed
- qwen2.5 long context by @ysjprojects in #1933
- debug installing `torch` for Thunder by @Borda in #2072
- Fix in `convert_hf_checkpoint` related to Gemma 3 by @mseeger in #2062
- debug some failing standalone tests with compiler by @Borda in #2075
- Qwen3 MoE by @ysjprojects in #2060
- Deferring import of torch in config to allow faster import by @JackUrb in #2079
- update bug-report/issue with reproducing in Studio by @Borda in #2081
- ci: show the longest tests for improvement by @Borda in #2083
- docs: Add documentation for OpenAI-compatible API in LitGPT deployment by @bhimrazy in #2082
- doc: add `n_query_groups` to attention notation table by @raishish in #2092
- doc: add comments for clarifying query / KV groups by @raishish in #2093
- Complete pending todos in testing by @raishish in #2088
- pin: restrict datasets version to <4.0.0 for compatibility by @Borda in #2095
- add/debug Lit CI by @Borda in #2094
- add `api_path` parameter to cli options to allow custom API endpoint configuration by @botirk38 in #2080
- Fix commands in pretrain.md by @H4kor in #2097
New Contributors
- @JackUrb made their first contribution in #2079
- @raishish made their first contribution in #2092
- @botirk38 made their first contribution in #2080
- @H4kor made their first contribution in #2097
Full Changelog: v0.5.9...v0.5.10