A collection of baselines trained by 🔥 flame
AI & ML interests
None defined yet.
Recent Activity
View all activity
Collections
12
models
38

fla-hub/transformer-340M-4K-0.5B-20480-lr3e-4-decay0.1-sqrt
Updated

fla-hub/transformer-340M-4K-0.5B-20480-lr3e-4-cosine
Updated
•
1

fla-hub/rwkv7-0.1B-g1
Question Answering
•
Updated
•
120
•
2

fla-hub/rwkv7-191M-world
Text Generation
•
Updated
•
364
•
1

fla-hub/rwkv7-2.9B-world
Text Generation
•
Updated
•
1.05k
•
3

fla-hub/rwkv7-0.4B-world
Text Generation
•
Updated
•
7
•
1

fla-hub/rwkv7-1.5B-world
Text Generation
•
Updated
•
735
•
7

fla-hub/rwkv7-168M-pile
Text Generation
•
Updated
•
275
•
5

fla-hub/transformer-3B-qwen2.5
Updated
•
20

fla-hub/transformer-3B-qwen2.5-instruct
Updated
•
378