Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
5
1
8
Alex Havrilla
Dahoas
Follow
JavierNYC's profile picture
cokeroluwafemi's profile picture
ShylockH's profile picture
65 followers
·
0 following
https://dahoas.github.io/
dahoas
AI & ML interests
NLP, RL
Recent Activity
updated
a dataset
about 5 hours ago
Dahoas/MATH
published
a dataset
about 5 hours ago
Dahoas/MATH
updated
a dataset
about 1 month ago
Dahoas/numina-synthetic
View all activity
Articles
Illustrating Reinforcement Learning from Human Feedback (RLHF)
Dec 9, 2022
•
134
Organizations
Papers
3
arxiv:
2412.02980
arxiv:
2403.04642
arxiv:
2402.10963
models
33
Sort: Recently updated
Dahoas/gptj-rm-IHP
Updated
Mar 8, 2023
•
2
Dahoas/gptneox-response-full-static-sft
Text Generation
•
Updated
Mar 6, 2023
•
19
•
1
Dahoas/pythia-1B-response-full-static-sft
Text Generation
•
Updated
Mar 6, 2023
•
16
•
1
Dahoas/pythia-125M-response-full-static-sft
Text Generation
•
Updated
Mar 6, 2023
•
22
•
1
Dahoas/synthetic-pythia-6B-rm-sft-response
Text Generation
•
Updated
Mar 2, 2023
•
25
Dahoas/pythia-6B-sft-response-full-static
Text Generation
•
Updated
Feb 27, 2023
•
18
•
1
Dahoas/gptj-6B-response-full-static-sft
Text Generation
•
Updated
Feb 15, 2023
•
11
•
1
Dahoas/pythia-6B-rm-response-full-hh
Updated
Feb 15, 2023
Dahoas/gptj-response-full-sft
Text Generation
•
Updated
Feb 15, 2023
•
20
•
1
Dahoas/pythia-6b-rm-response-only-full-hh
Text Generation
•
Updated
Feb 14, 2023
•
18
Expand 33 models
datasets
148
Sort: Recently updated
Dahoas/MATH
Viewer
•
Updated
about 5 hours ago
•
12.5k
Dahoas/numina-synthetic
Viewer
•
Updated
Dec 23, 2024
•
361k
•
51
Dahoas/aimo-validation-aime
Viewer
•
Updated
Dec 11, 2024
•
90
•
81
Dahoas/qwen-1.5-4B-default-positives-epoch-1-100
Viewer
•
Updated
Dec 6, 2024
•
290k
•
57
Dahoas/qwen-1.5-4B-tree-positives-epoch-2-100
Viewer
•
Updated
Dec 6, 2024
•
491k
•
52
Dahoas/qwen-1.5-4B-tree-positives-epoch-1-100
Viewer
•
Updated
Dec 5, 2024
•
477k
•
48
Dahoas/qwen-1.5-4B-epoch-1-test-100
Viewer
•
Updated
Nov 28, 2024
•
498k
•
42
Dahoas/qwen-1.5-4B-K-100-test
Viewer
•
Updated
Nov 5, 2024
•
500k
•
49
Dahoas/MATH_train_K_100_qwen_1.5_4B_outputs
Viewer
•
Updated
Oct 22, 2024
•
750k
•
39
Dahoas/MATH-K-100-train
Viewer
•
Updated
Sep 12, 2024
•
750k
•
1.87k
•
2
Expand 148 datasets