Collection related to the paper "Training a Generally Curious Agent" (project page: https://paprika-llm.github.io/)
Fahim Tajwar
ftajwar
AI & ML interests
LLMs, RLHF
Recent Activity
updated a model about 1 month ago: guanning-ai/SmolLM-Checkpoints-Final-0124
published a model about 1 month ago: guanning-ai/SmolLM-Checkpoints-Final-0124
updated a model about 1 month ago: guanning-ai/SmolLM-Checkpoints-Final-0123