Ji-Xiang

AI & ML interests

None yet

Recent Activity

updated a collection about 9 hours ago

Ji-Xiang's activity

upvoted a collection about 9 hours ago

Ovis2

Our latest advancement in multi-modal large language models (MLLMs) • 7 items • Updated 4 days ago • 36

upvoted a collection 2 days ago

Breeze 2 Family

Llama-Breeze2 is a multi-modal language model family specifically intended for Traditional Chinese use. BreezyVoice is a Taiwan Mandarin TTS • 5 items • Updated 3 days ago • 11

upvoted an article 3 days ago

Article

Open-source DeepResearch – Freeing our search agents

13 days ago

• 984

upvoted a collection 4 days ago

CritiqueFineTuning

The dataset and models for CritiqueFineTuning • 4 items • Updated 14 days ago • 2

upvoted 2 papers 4 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 18 days ago • 54

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10, 2024 • 31

upvoted a collection 4 days ago

Zonos-v0.1

3 items • Updated 4 days ago • 6

upvoted a collection 8 days ago

DeepSeek-R1-abliterated

7 items • Updated 16 days ago • 85

upvoted a collection 9 days ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 21 days ago • 99

upvoted a collection 13 days ago

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated 6 days ago • 74

upvoted 2 collections 14 days ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 6 days ago • 69

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 4 days ago • 90

upvoted an article 15 days ago

Article

Gradio spaces are the perfect agent tools!

about 1 month ago

• 14

upvoted a collection 15 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 11 items • Updated 5 days ago • 65

upvoted an article 15 days ago

Article

Open-R1: Update #1

By

and 7 others •

15 days ago

• 279

upvoted an article 16 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

20 days ago

• 747

upvoted a collection 17 days ago

DeepSeek-V3

3 items • Updated Jan 6 • 183

upvoted a collection 18 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 20 days ago • 343

upvoted an article 19 days ago

Article

LLM Data Engineering 3: Data Collection Magic, Methods for Obtaining Top-Tier Training Data

Jun 4, 2024

• 16

upvoted a collection 20 days ago

Thinking/Reasoning Datasets

16 items • Updated 17 days ago • 2