We built this model by fine-tuning tabtoyou/KoLLaVA-v1.5-Synatra-7b (Mistral-based).
Training used LoRA (r=128, alpha=256) with lr=2e-5 and mm_projector_lr=2e-5.
The training data was CCTV images with bounding-box (BBox) annotations, and we trained for 3 epochs.
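
For reference, the settings above can be collected in a small config object. This is only a minimal sketch, not our actual training script: the names `TrainConfig` and `build_param_groups` and the example parameter names are hypothetical, and the alpha / r scaling is the standard LoRA convention rather than something specific to this model. It uses only the Python standard library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainConfig:
    # Values taken from this model card; the field names are illustrative.
    base_model: str = "tabtoyou/KoLLaVA-v1.5-Synatra-7b"
    lora_r: int = 128
    lora_alpha: int = 256
    learning_rate: float = 2e-5
    mm_projector_lr: float = 2e-5
    num_train_epochs: int = 3

    @property
    def lora_scaling(self) -> float:
        # Standard LoRA scales the low-rank update by alpha / r.
        return self.lora_alpha / self.lora_r


def build_param_groups(param_names, cfg):
    """Split parameter names into two groups, each with its own learning rate.

    Assumption: projector parameters can be recognized by the substring
    "mm_projector" in their name.
    """
    projector = [n for n in param_names if "mm_projector" in n]
    others = [n for n in param_names if "mm_projector" not in n]
    return [
        {"params": others, "lr": cfg.learning_rate},
        {"params": projector, "lr": cfg.mm_projector_lr},
    ]


cfg = TrainConfig()
# Hypothetical parameter names, for illustration only.
example_names = [
    "model.layers.0.self_attn.q_proj.lora_A.weight",
    "model.mm_projector.0.weight",
]
groups = build_param_groups(example_names, cfg)
```

With r=128 and alpha=256 the LoRA scaling factor is 2.0. Since both learning rates are 2e-5 here, the two groups train at the same rate; keeping them separate just makes it easy to change the projector rate independently.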
We are building a multimodal LLM for Kolon!