adol's picture

3

adol

adol01

AI & ML interests

None yet

Recent Activity

New activity about 2 months ago

Qwen/Qwen2-1.5B:Hello, is this 1.5B model trained from scratch, or is it distilled like LLaMA 3.2?

View all activity

Organizations

None yet

adol01's activity

New activity in Qwen/Qwen2-1.5B about 2 months ago

Hello, is this 1.5B model trained from scratch, or is it distilled like LLaMA 3.2?

#7 opened about 2 months ago by

New activity in TRI-ML/DCLM-1B 2 months ago

MMLU Performance After Token Training

#3 opened 2 months ago by

New activity in Alibaba-NLP/gte-multilingual-base 4 months ago

Do you plan to open-source the training code?

#1 opened 4 months ago by