
Mohamed Anwar

mootje

AI & ML interests

None yet

Recent Activity

liked a Space 10 days ago
ysharma/drag-and-drop-kanban-board
liked a Space 17 days ago
maldons77/ai-storyboard-creator
reacted to BramVanroy's post with 👀 over 1 year ago
The InstructGPT paper mentions that they insert 10% pretraining data during SFT, which they find improves the effect of PPO (IIUC). Has anyone else done later ablations on this? I've only seen the inverse suggested, mixing in SFT data during pretraining.

Organizations

None yet

models 0

None public yet

datasets 0

None public yet