Michael N's picture

3

Michael N

mnoukhov

·

http://mnoukhov.github.io

AI & ML interests

Representation learning for functional language

Recent Activity

updated a model 12 days ago

mnoukhov/SmolLM2-135M-tldr-sft

updated a model 12 days ago

mnoukhov/SmolLM2-360M-tldr-sft

New activity 12 days ago

trl-lib/tldr:Extra space at start of completion

View all activity

Organizations

mnoukhov's activity

New activity in trl-lib/tldr 12 days ago

Extra space at start of completion

#2 opened 12 days ago by

commented a paper about 1 month ago

Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models

Paper • 2410.18252 • Published Oct 23 • 5 •

New activity in arianhosseini/pythia410m-tldr-dpo-1b-relbl-10k about 1 year ago

change to correct base model

#1 opened about 1 year ago by