yulan-team/YuLan-Mini
Text Generation
A highly capable 2.4B-parameter lightweight LLM trained on only 1T tokens of pre-training data, released with all training details.
Note: The model and optimizer states of the last curriculum phase before learning-rate annealing.
Note: The model and optimizer states of the 20th curriculum phase.