@juk1329 juk1329 commented Aug 28, 2025

In the general_vlm_pytorch_huggingface branch, I added a fine-tuning and evaluation pipeline for vision-language models (VLMs). Previously, the only pipeline was for language models and handled text data only. The new VLM pipeline adds support for vision data.

Like the language model pipeline, the VLM pipeline runs within the FastTrack environment, and all paths are redirected to FastTrack’s vFolder.

Since VLMs require more user configuration than language models, the setup now involves four YAML files. Details can be found in the README.md.

If you have time, I would greatly appreciate any feedback, questions, or suggestions on the code or commit history.

Thank you very much for your time and review.

juk1329 and others added 30 commits July 30, 2025 14:06
@rapsealk rapsealk self-requested a review August 28, 2025 07:06