leaderboard / .gitignore
Miaoran000's picture
minor updates
5c4aa1e
raw
history blame
316 Bytes
auto_evals/
venv/
__pycache__/
.env
.ipynb_checkpoints
*ipynb
.vscode/
gpt_4_evals/
human_evals/
eval-queue/
eval-results/
auto_evals/
eval-queue-bk/
eval-results-bk/
src/assets/model_counts.html
generation_results/
Hallucination Leaderboard Results
dataset_stats.py
get_comparison.py
GPT-4-Turbo_v.s._GPT-4o.csv