yugongwzx

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

OJBench: A Competition Level Code Benchmark For Large Language Models

authored a paper about 2 months ago

CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery

authored a paper about 2 months ago

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

OJBench: A Competition Level Code Benchmark For Large Language Models

Paper • 2506.16395 • Published Jun 19 • 4

authored 4 papers about 2 months ago

CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery

Paper • 2406.08587 • Published Jun 12, 2024 • 16

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Paper • 2409.03810 • Published Sep 5, 2024 • 36

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Paper • 2502.16614 • Published Feb 23 • 27

OJBench: A Competition Level Code Benchmark For Large Language Models

Paper • 2506.16395 • Published Jun 19 • 4

upvoted 2 papers 8 months ago

ProgCo: Program Helps Self-Correction of Large Language Models

Paper • 2501.01264 • Published Jan 2 • 27

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published Dec 23, 2024 • 48

upvoted a paper 11 months ago

Toward General Instruction-Following Alignment for Retrieval-Augmented Generation

Paper • 2410.09584 • Published Oct 12, 2024 • 49

updated 3 datasets 12 months ago

updated 3 models 12 months ago

yugongwzx/demo2

Text Generation • Updated Sep 12, 2024 • 6

yugongwzx/demo1

Updated Sep 12, 2024

yugongwzx/demo

Updated Sep 12, 2024

updated a dataset 12 months ago

yugongwzx/test2

Updated Sep 12, 2024 • 1

updated 2 models 12 months ago

yugongwzx/test1

Text Generation • Updated Sep 12, 2024 • 5

yugongwzx/test

Text Generation • Updated Sep 12, 2024 • 9

upvoted a paper 12 months ago

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Paper • 2409.03810 • Published Sep 5, 2024 • 36

upvoted a paper about 1 year ago

We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?

Paper • 2407.01284 • Published Jul 1, 2024 • 82

yugongwzx

AI & ML interests

Recent Activity

Organizations

yugongwzx's activity