Base Fine-Tune model

#8
by bdytx5 - opened

Could you release the base fine-tuned model without the CoT training? I am writing an article on this. Thanks!

Hi, the base model is Llama-3.2-11B-Vision-Instruct: https://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct
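For reference, loading that base model with the standard transformers Mllama integration looks roughly like this (a minimal sketch, assuming transformers >= 4.45 and access to the gated meta-llama repo):

```python
# Sketch: load the base Llama-3.2-11B-Vision-Instruct model from the link above.
import torch
from transformers import MllamaForConditionalGeneration, AutoProcessor

model_id = "meta-llama/Llama-3.2-11B-Vision-Instruct"

model = MllamaForConditionalGeneration.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # bf16 keeps the 11B model within a single large GPU
    device_map="auto",
)
processor = AutoProcessor.from_pretrained(model_id)
```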

Xkev changed discussion status to closed

Sorry, I meant the "Direct Training" model trained on the LLaVA 100k dataset (i.e., trained only on the answers rather than on the full CoT data). From the paper: "Here, LLaVA-o1 (with Direct Training) refers to the model trained directly on the original VQA dataset's Q&A pairs."
