Automatic data mixture method for large language model pre-training
Sea AI Lab
company
Verified
AI & ML interests
None defined yet.
models
38
sail/Sailor-14B-Chat
Text Generation
•
Updated
•
24
•
11
sail/scaling-with-vocab-trained-tokenizers
Updated
•
2
sail/scaling-vocab-3b-32k-overtrain
Text Generation
•
Updated
•
39
sail/scaling-vocab-3b-43k-overtrain
Text Generation
•
Updated
•
11
sail/Zephyr-7B-DICE-Iter2
Text Generation
•
Updated
•
7
sail/Zephyr-7B-DICE-Iter1
Text Generation
•
Updated
•
7
sail/Llama-3-Base-8B-DICE-Iter2
Text Generation
•
Updated
•
6
•
2
sail/Llama-3-Base-8B-DICE-Iter1
Text Generation
•
Updated
•
6
•
1
sail/Sailor-0.5B-Chat-gguf
Updated
•
366
•
4
sail/Sailor-1.8B-Chat-gguf
Updated
•
267
•
3