Commit 1cb813b
server : add speculative decoding support (ggml-org#10455)
* server : add speculative decoding support
ggml-ci
* server : add helper function slot.can_speculate()
ggml-ci
1 parent fa4365c commit 1cb813b
1 file changed, 300 insertions(+), 141 deletions(-)