5 7 18

saitejautpala

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

upvoted an article 10 days ago

BigCodeArena: Judging code generations end to end with code executions

upvoted a paper 28 days ago

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs

View all activity

Organizations

None yet

upvoted a paper 6 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published 10 days ago • 28

upvoted an article 10 days ago

Article

BigCodeArena: Judging code generations end to end with code executions

•

13 days ago

• 16

upvoted a paper 28 days ago

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs

Paper • 2509.09677 • Published Sep 11 • 33

liked a Space about 2 months ago

BigCodeArena

🚀

Compare two AI models by sending them code and seeing their responses

liked 2 datasets 3 months ago

agent-evals/hal_traces

Updated 12 days ago • 2.13k • 2

cais/hle

Viewer • Updated Sep 10 • 2.5k • 12.2k • 493

upvoted a collection 3 months ago

NextCoder

Collection

NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated Jul 9 • 71

upvoted a paper 5 months ago

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 122

upvoted a paper 6 months ago

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 70

authored a paper 7 months ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 152

upvoted a paper 7 months ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 152

updated a Space 8 months ago

Arena Annotation Progress

😻

Display battle counts per annotator

updated a model over 1 year ago

saitejautpala/bert-base-yelp-reviews

Text Classification • 0.1B • Updated May 24, 2024 • 16

liked 2 models about 2 years ago

stabilityai/stablelm-3b-4e1t

Text Generation • 3B • Updated Mar 7, 2024 • 19.5k • 310

microsoft/phi-1

Text Generation • 1B • Updated Apr 29, 2024 • 4.42k • 215

liked a Space about 2 years ago

13.6k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots

liked 3 models about 2 years ago

liked a dataset about 2 years ago

bigcode/bigcode-pii-dataset

Viewer • Updated May 15, 2023 • 12.1k • 38 • 51

saitejautpala

AI & ML interests

Recent Activity

Organizations

saitejautpala's activity

BigCodeArena: Judging code generations end to end with code executions

BigCodeArena

Arena Annotation Progress

Open LLM Leaderboard