PolarisEvals/leaderboard-data • 1.14M • 14
PolarisEvals/llm_dataset_completness_2stage_score_mini • 10 • 9
PolarisEvals/llm_dataset_completness_2stage_score • 54.3k • 18
PolarisEvals/llm_dataset_completness_2stage_justification_score • 54.3k • 13
PolarisEvals/llm_dataset_completness_2stage • 54.3k • 9
PolarisEvals/shikib_dataset_completeness_2stage_unittest • 5.47k • 63
PolarisEvals/shikib_dataset_completeness_2stage_unittest_debug • 100 • 16
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest_response • 5.47k • 24
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest • 912 • 16
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_filtering_debug • 100 • 7
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts • 912 • 10
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions_filtering_debug • 100 • 9
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions • 982 • 10
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_gpt-4-0613_outputs_json_True_debug • 100 • 6
PolarisEvals/training_criteria_dpo_distill_relevance_gpt-4-0613_outputs_json_True_debug • 100 • 9
PolarisEvals/training_criteria_dpo_distill • 912 • 9
PolarisEvals/synqa_hudson_300_samples_relevance_gpt-4-0613_outputs_json_True_debug • 100 • 9
PolarisEvals/synqa_hudson_300_samples_completeness_gpt-4-0613_outputs_json_True_debug • 100 • 7
PolarisEvals/synqa_hudson_300_samples • 1.5k • 9
PolarisEvals/synqa_hudson_300_samples_clarity_gpt-4-0613_outputs_json_True_debug • 100 • 7
PolarisEvals/synqa_hudson_300_queries_rubrics_score_completeness_gpt-4-0613_outputs_json_True • 10 • 11
PolarisEvals/synqa_hudson_300_queries_rubrics_score_completeness_gpt-4-0613_outputs_json_False • 10 • 6
PolarisEvals/synqa_hudson_300_queries_rubrics_score • 7.5k • 6
PolarisEvals/synqa_hudson_300_samples_gpt-4-0613_outputs • 81 • 11