adol
adol01
AI & ML interests
None yet
Recent Activity
New activity
about 2 months ago
Qwen/Qwen2-1.5B:Hello, is this 1.5B model trained from scratch, or is it distilled like LLaMA 3.2?
Organizations
None yet
adol01's activity
Hello, is this 1.5B model trained from scratch, or is it distilled like LLaMA 3.2?
#7 opened about 2 months ago
by
adol01
MMLU Performance After Token Training
#3 opened 2 months ago
by
adol01
Do you plan to open-source the training code?
2
#1 opened 4 months ago
by
adol01