Xin Li

lixin67

WilliamLeeBravo

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

upvoted a paper 3 days ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

upvoted a paper 3 days ago

MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice

View all activity

Organizations

None yet

lixin67's activity

upvoted an article 3 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

3 days ago

• 244

upvoted 3 papers 3 days ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published 3 days ago • 55

MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice

Paper • 2503.05978 • Published 7 days ago • 31

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published 4 days ago • 73

upvoted 2 articles 23 days ago

Article

Open R1: Update #2

and 6 others •

Feb 10

• 204

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 808

upvoted an article 24 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 154

upvoted 2 papers 2 months ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 37

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 65

upvoted a collection 3 months ago

Open Image Preferences

Collection

Containing all artifacts for the Stable Diffusion 3.5L vs Flux Dev image preference community sprint. • 14 items • Updated Dec 19, 2024 • 9

upvoted 6 papers 3 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 57

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 140

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Paper • 2412.09604 • Published Dec 12, 2024 • 35

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 94

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 111

Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions

Paper • 2412.08737 • Published Dec 11, 2024 • 53

upvoted 2 articles 6 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14, 2024

• 60

Article

Key Insights into the Law of Vision Representations in MLLMs

•

Sep 2, 2024

• 18

upvoted an article 7 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

•

Aug 19, 2024

• 77

upvoted an article 9 months ago

Article

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

•

Jun 11, 2024

• 56