taicheng guo's picture

29 6

taicheng guo

taicheng

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

upvoted a paper 6 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

upvoted a paper 6 days ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

View all activity

Organizations

taicheng's activity

upvoted 3 papers 6 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 9 days ago • 75

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 10 days ago • 230

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 10 days ago • 83

upvoted a paper 12 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 16 days ago • 47

upvoted a paper 26 days ago

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published 30 days ago • 73

upvoted a paper 29 days ago

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published Dec 17, 2024 • 91

upvoted a paper about 1 month ago

What indeed can GPT models do in chemistry? A comprehensive benchmark on eight tasks

Paper • 2305.18365 • Published May 27, 2023 • 4

upvoted a paper about 2 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 42

liked a model 3 months ago

Qwen/Qwen2-0.5B

Text Generation • Updated Oct 22, 2024 • 136k • 123

upvoted a collection 3 months ago

Power-LM

Dense & MoE LLMs trained with power learning rate scheduler. • 4 items • Updated Oct 17, 2024 • 15

upvoted 2 papers 3 months ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 63

MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions

Paper • 2409.12958 • Published Sep 19, 2024 • 8

updated 8 models 4 months ago

taicheng/zephyr-7b-align-scan-0.0-0.0-linear-1

Text Generation • Updated Sep 28, 2024 • 7

taicheng/zephyr-7b-align-scan-0.0-0.0-polynomial-1

Text Generation • Updated Sep 28, 2024 • 12

taicheng/zephyr-7b-align-scan-0.0-0.0-cosine-2

Text Generation • Updated Sep 28, 2024 • 11

taicheng/zephyr-7b-align-scan-0.0-0.0-polynomial-2

Text Generation • Updated Sep 28, 2024 • 11

taicheng/zephyr-7b-align-scan-0.0-0.0-polynomial-3

Text Generation • Updated Sep 28, 2024 • 7

taicheng/zephyr-7b-align-scan-0.0-0.0-linear-3

Text Generation • Updated Sep 28, 2024 • 8

taicheng/zephyr-7b-align-scan

Text Generation • Updated Sep 28, 2024 • 9

taicheng/zephyr-7b-align-scan-1e-07-0.27-polynomial-1.0

Updated Sep 28, 2024