Temperature's effect on the performance of long chain reasoning models. Why was 0.7 used for the evals?
1
#6 opened 2 days ago
by
j456
Great work!
#5 opened 8 days ago
by
Daemontatox
License of your model
1
#4 opened 9 days ago
by
chewkokwah
Evaluation
1
#3 opened 10 days ago
by
PSM24
Merge with 32b coder?
12
#2 opened 12 days ago
by
RDson