WildEval

non-profit

wild_eval

AI & ML interests

None defined yet.

Recent Activity

ChengsongHuang

authored a paper 6 days ago

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published 9 days ago • 75

ChengsongHuang

authored 2 papers 9 days ago

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

Paper • 2601.22628 • Published 14 days ago • 34

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Paper • 2602.03845 • Published 9 days ago • 25

ChengsongHuang

submitted a paper to Daily Papers 11 days ago

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

Paper • 2601.22628 • Published 14 days ago • 34

ChengsongHuang

authored a paper about 1 month ago

RelayLLM: Efficient Reasoning via Collaborative Decoding

Paper • 2601.05167 • Published Jan 8 • 31

ChengsongHuang

submitted a paper to Daily Papers about 1 month ago

RelayLLM: Efficient Reasoning via Collaborative Decoding

Paper • 2601.05167 • Published Jan 8 • 31

ChengsongHuang

authored a paper about 1 month ago

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Paper • 2601.03986 • Published Jan 7 • 34

ChengsongHuang

submitted a paper to Daily Papers about 1 month ago

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Paper • 2601.03986 • Published Jan 7 • 34

ChengsongHuang

authored a paper about 2 months ago

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published Dec 2, 2025 • 54

faezeb

authored a paper 3 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 61

ChengsongHuang

authored a paper 3 months ago

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published Nov 19, 2025 • 43

yuntian-deng

authored a paper 4 months ago

TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar

Paper • 2510.14972 • Published Oct 16, 2025 • 35

DongfuJiang

authored 2 papers 4 months ago

Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning

Paper • 2509.22824 • Published Sep 26, 2025 • 21

VideoScore2: Think before You Score in Generative Video Evaluation

Paper • 2509.22799 • Published Sep 26, 2025 • 26

yuntian-deng

authored 2 papers 4 months ago

Interactive Training: Feedback-Driven Neural Network Optimization

Paper • 2510.02297 • Published Oct 2, 2025 • 43

Why Can't Transformers Learn Multiplication? Reverse-Engineering Reveals Long-Range Dependency Pitfalls

Paper • 2510.00184 • Published Sep 30, 2025 • 17

ChengsongHuang

authored a paper 5 months ago

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 103

DongfuJiang

authored a paper 5 months ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1, 2025 • 78

ChengsongHuang

authored 2 papers 6 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27, 2025 • 84

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7, 2025 • 130

Team members: 9
