Oleh Shliazhko

olmer

ollmer

AI & ML interests

Large Language Models, code generation, retrieval

Recent Activity

upvoted a paper about 2 months ago

How to Train Your LLM Web Agent: A Statistical Diagnosis

Paper • 2507.04103 • Published Jul 5 • 48

liked a Space 3 months ago

Agents-MCP-Hackathon/VulnBuster

🛡

AI Security Agent: Multi-MCP Code Vulnerability Scanner

upvoted a paper 3 months ago

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20 • 78

upvoted an article 4 months ago

DABStep: Data Agent Benchmark for Multi-step Reasoning

Article • and 5 others • Feb 4 • 107

upvoted a paper 7 months ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 193

liked a Space 7 months ago

3.15k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 7 months ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Article • and 3 others • Feb 4 • 172

liked a dataset 7 months ago

OpenLeecher/lmsys_chat_1m_clean

Viewer • Updated Dec 31, 2024 • 273k • 275 • 77

liked a model 8 months ago

Qwen/QwQ-32B-Preview

Text Generation • 33B • Updated Jan 12 • 107k • 1.74k

updated a collection over 1 year ago

Self-improvement

Collection

2 items • Updated Apr 21, 2024

upvoted 2 papers over 1 year ago

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18, 2024 • 56

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22, 2024 • 84

liked a Space over 1 year ago

1.42k

Big Code Models Leaderboard

📈

Search and submit code models for evaluation

liked a model over 1 year ago

AetherResearch/Cerebrum-1.0-7b

Text Generation • 7B • Updated Mar 13, 2024 • 5 • 51

upvoted a paper over 1 year ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 152

liked a model over 1 year ago

microsoft/phi-2

Text Generation • 3B • Updated Apr 29, 2024 • 684k • 3.39k

liked 2 models almost 2 years ago

mistralai/Mistral-7B-Instruct-v0.1

Text Generation • 7B • Updated Jul 24 • 163k • 1.79k

mistralai/Mistral-7B-v0.1

Text Generation • 7B • Updated Jul 24 • 381k • 3.95k