@mukel thank you for creating this project! I would like to discuss the following topics:
- Please enable the Discussions tab for posts like this one, which are not real "issues".
- Do you plan on releasing Llama 3 code?
- Do you plan on supporting quantized Llama models with the Java Vector API?
- Can you run a benchmark against llamafile, whose vectorized version (AVX, NEON) claims to be the performance king for inference? (I am deciding between using your project and wrapping the llamafile C code with the Java 22 foreign function APIs.)
- Do you plan to implement model training as well? If so, take a look at Andrej Karpathy's llm.c repo.