--- license: apache-2.0 tags: - generated_from_trainer - instruction fine-tuning model-index: - name: flan-t5-small-distil-v2 results: [] language: - en pipeline_tag: text2text-generation widget: - text: >- how can I become more healthy? example_title: example --- # LaMini-FLAN-T5-77M [![Model License](https://img.shields.io/badge/Model%20License-CC%20By%20NC%204.0-red.svg)]() This model is one of our LaMini model series in paper "[LaMini: A Diverse Herd of Distilled Models from Large-Scale Instructions](https://github.com/mbzuai-nlp/lamini)". This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on [LaMini dataset](https://huggingface.co/datasets/MBZUAI/LaMini-instruction) that contains 2.58M samples for instruction fine-tuning. For more information about our dataset, please refer to our [project repository](https://github.com/mbzuai-nlp/lamini/). You can view other LaMini model series as follow. Note that not all models are performing as well. More details can be seen in our paper.
Base model | LaMini series (#parameters) | |||
---|---|---|---|---|
T5 | LaMini-T5-61M | LaMini-T5-223M | LaMini-T5-738M | |
Flan-T5 | LaMini-Flan-T5-77M | LaMini-Flan-T5-248M | LaMini-Flan-T5-783M | |
Cerebras-GPT | LaMini-Cerebras-111M | LaMini-Cerebras-256M | LaMini-Cerebras-590M | LaMini-Cerebras-1.3B |
GPT-2 | LaMini-GPT-124M | LaMini-GPT-774M | LaMini-GPT-1.5B | |
GPT-Neo | LaMini-Neo-125M | LaMini-Neo-1.3B | ||
GPT-J | coming soon | |||
LLaMA | coming soon |