Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji PRO
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked a model 4 days ago
google/gemma-4-12B-it
liked a model 4 days ago
google/diffusiongemma-26B-A4B-it
liked a model about 1 month ago
openai/privacy-filter