MiniMax-01: Scaling Foundation Models with Lightning Attention • Paper 2501.08313 • Published 4 days ago
Falcon3 Collection • The Falcon3 family of open foundation models: pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 10 days ago
Apollo: An Exploration of Video Understanding in Large Multimodal Models • Paper 2412.10360 • Published Dec 13, 2024
Qwen2.5-Coder Collection • Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024
Qwen2.5 Collection • Qwen2.5 language models, including pretrained and instruction-tuned variants in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024
Article • The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare • Published Apr 19, 2024
Qwen2-VL Collection • Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024
Gemma 2 2B Release Collection • The 2.6B-parameter version of Gemma 2. • 6 items • Updated Dec 13, 2024
Nemotron 4 340B Collection • Nemotron-4: open models for synthetic data generation (SDG), including Base, Instruct, and Reward models. • 4 items • Updated 1 day ago
Granite Code Models Collection • A series of code models trained by IBM and released under the Apache 2.0 license; both the base pretrained and instruct models are included. • 23 items • Updated Dec 18, 2024