4 9 12

Tianyang Liu

tianyang

https://leolty.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 29 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

upvoted an article about 1 month ago

BigCodeArena: Judging code generations end to end with code executions

authored a paper 5 months ago

Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models

View all activity

Organizations

upvoted a paper 29 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9 • 35

upvoted an article about 1 month ago

Article

BigCodeArena: Judging code generations end to end with code executions

•

Oct 7

• 17

authored 2 papers 5 months ago

Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models

Paper • 2411.08733 • Published Nov 13, 2024 • 1

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

upvoted a paper 5 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

liked a dataset 5 months ago

LLM360/guru-RL-92k

Viewer • Updated Aug 20 • 91.9k • 2.73k • 36

authored a paper 9 months ago

Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs

Paper • 2502.19411 • Published Feb 26 • 2

upvoted a paper 9 months ago

Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs

Paper • 2502.19411 • Published Feb 26 • 2

liked a Space 9 months ago

SWE Arena

🏢

SWE-Arena: Compare & Test Best AI Chatbots for Code

liked a Space about 1 year ago

Decentralized Arena Leaderboard

🥇

View and compare LLM evaluations across various domains

authored a paper about 1 year ago

LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models

Paper • 2404.05221 • Published Apr 8, 2024 • 1

upvoted a paper about 1 year ago

LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models

Paper • 2404.05221 • Published Apr 8, 2024 • 1

liked a Space over 1 year ago

10.8k

AI Comic Factory

👩

Create your own AI comic with a single prompt

New activity in tianyang/repobench_java_v1.1 over 1 year ago

Error when loading the dataset

#2 opened over 1 year ago by

Bilibili

upvoted a collection over 1 year ago

💫 StarCoder2

Collection

StarCoder2 models and datasets! • 8 items • Updated Mar 1, 2024 • 89

New activity in bigcode/starcoder2-evaluation over 1 year ago

RepoBench

#1 opened over 1 year ago by

tianyang

authored a paper over 1 year ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 149

upvoted a paper over 1 year ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 149

liked a model over 1 year ago

bigcode/starcoder2-15b

Text Generation • 16B • Updated Jun 5, 2024 • 5.11k • 641

updated a dataset over 1 year ago

tianyang/repobench_java_v1.1

Viewer • Updated Feb 27, 2024 • 26.1k • 39

Tianyang Liu

AI & ML interests

Recent Activity

Organizations

tianyang's activity

BigCodeArena: Judging code generations end to end with code executions

SWE Arena

Decentralized Arena Leaderboard

AI Comic Factory

Error when loading the dataset

RepoBench