File size: 775 Bytes
0b2005c 8910f1d |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 |
---
license: other
---
# Model Card for llava-polyglot-ko-1.3b-hf
## Model Description
` llava-polyglot-ko-1.3b-hf ` is a model based on polyglot-ko-13b.
We use llava for the vision question answering.
You can see [‘demo.py’]( https://github.com/LearnItAnyway/llava_gpt_neox/blob/main/demo.py) and [‘llava_gpt_neox.py’]( https://github.com/LearnItAnyway/llava_gpt_neox/blob/main/llava_gpt_neox.py).
Currently, the model has been trained on small vision question answer dataset (approx, 10k) with 1.3b (small) model.
## TODO
- Multi-turn chat based on the image
- Larger LLM
- More pretraining on for the vision-text adapter
## References
- [`LLaVA`](https://github.com/haotian-liu/LLaVA)
- [`polyglot korean`](https://huggingface.co/EleutherAI/polyglot-ko-1.3b)
|