YangWang92

yangwang92

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

deepseek-ai/DeepSeek-V2-Lite

liked a model 6 days ago

openai/gpt-oss-safeguard-20b

new activity 10 days ago

allenai/dolma3_longmino_mix-100B-1125: cranemath/shard_00000138-withid.jsonl.zst file corrupted

authored 2 papers 9 months ago

SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning

Paper • 2506.08989 • Published Jun 10, 2025 • 14

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17, 2025 • 45

authored a paper over 1 year ago

VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models

Paper • 2409.17066 • Published Sep 25, 2024 • 28