GSoC 2025: "Implement Open LLM Models with JAX and Flax" Project #4674
Unanswered
akshayatam
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi everyone,
I'm currently preparing a proposal for the “Implement Open LLM Models with JAX and Flax” project for Google Summer of Code 2025, and I’m very excited about the opportunity to contribute to the JAX and Flax ecosystems. I noticed the email list on the page is unresponsive at the time of posting.
I have a background in machine learning, and recently completed a project where I built a DC-GAN from scratch using JAX to deepen my understanding of the framework. I’ve also worked on hate speech detection using DistilBERT, as well as a machine translation system using RetNet (a novel alternative to multi-head attention).
This project aligns closely with my goal of developing expertise in model implementation and optimization using JAX and Flax, and contributing reference models and educational material that helps others build with these tools.
As I finalize my proposal, I would really appreciate feedback or clarification on a few points:
flax.nnx
was used for the miniGPT tutorial on the project description page. Should the project useflax.linen
,flax.nnx
, or both depending on the use case?I’m committed to making meaningful contributions during and beyond GSoC and would love to align my proposal with what would be most useful to the community.
Beta Was this translation helpful? Give feedback.
All reactions