3 72 207

YYY

zzfive

ZZfive

AI & ML interests

None yet

Recent Activity

liked a Space 22 days ago

hf-audio/open_asr_leaderboard

liked a Space 22 days ago

linoyts/Flux2-Klein-Face-Swap

updated a collection 22 days ago

video

View all activity

Organizations

None yet

upvoted 2 papers 8 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14, 2025 • 146

Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation

Paper • 2508.07901 • Published Aug 11, 2025 • 40

upvoted a paper 10 months ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published Jun 23, 2025 • 33

upvoted 4 papers 11 months ago

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published May 12, 2025 • 135

upvoted 6 papers about 1 year ago

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17, 2025 • 51

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 308

MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis

Paper • 2502.18924 • Published Feb 26, 2025 • 16

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Paper • 2504.00595 • Published Apr 1, 2025 • 37

Wan: Open and Advanced Large-Scale Video Generative Models

Paper • 2503.20314 • Published Mar 26, 2025 • 61

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18, 2025 • 154

upvoted an article about 1 year ago

Article

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

Jan 23, 2025

•

192

upvoted 3 papers about 1 year ago

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published Mar 13, 2025 • 93

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published Feb 26, 2025 • 38

Audio-FLAN: A Preliminary Release

Paper • 2502.16584 • Published Feb 23, 2025 • 36

upvoted an article about 1 year ago

Article

The Large Language Model Course

Jan 16, 2025

•

226

upvoted a paper about 1 year ago

HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution

Paper • 2501.10045 • Published Jan 17, 2025 • 10

upvoted a paper over 1 year ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14, 2025 • 302

YYY

AI & ML interests

Recent Activity

Organizations

zzfive's activity

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

The Large Language Model Course