-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Open
Labels
Description
- Use
llama_decodeinstead of deprecatedllama_evalinLlamaclass - Implement batched inference support for
generateandcreate_completionmethods inLlamaclass - Add support for streaming / infinite completion
giangluu352001, harry-pham-wise, JackKCWong, bb-worm, ChristianWeyer and 45 moresengiv, ArtyomZemlyak, hamishc, bioshazard, gerred and 16 moreesmeetu, robertritz, zhengzhanpeng, hamishc, ngupta10 and 12 more