Cosmo's picture

Cosmo

cosmojg

·

https://cosmo.red

AI & ML interests

Machine learning and computational neuroscience

Recent Activity

liked a model about 8 hours ago

tomg-group-umd/huginn-0125

liked a model about 8 hours ago

Zyphra/Zonos-v0.1-transformer

liked a model about 8 hours ago

Zyphra/Zonos-v0.1-hybrid

View all activity

Organizations

None yet

cosmojg's activity

upvoted a paper 6 days ago

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Paper • 2412.10302 • Published Dec 13, 2024 • 16

upvoted a collection 6 days ago

DeepSeek-VL2

5 items • Updated 3 days ago • 63

upvoted an article 7 days ago

Article

Open-source DeepResearch – Freeing our search agents

8 days ago

• 919

upvoted a paper 12 days ago

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published 16 days ago • 30

upvoted a collection 12 days ago

Selene-1-Mini

13 items • Updated 4 days ago • 9

upvoted an article 12 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

15 days ago

• 712

upvoted an article 15 days ago

Article

Mastering Long Contexts in LLMs with KVPress

By

and 1 other •

20 days ago

• 62

upvoted a collection 15 days ago

SmolVLM 256M & 500M

Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 20 days ago • 68

upvoted an article 15 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

20 days ago

• 124

upvoted a collection 15 days ago

DeepSeek-R1

8 items • Updated 22 days ago • 475

upvoted 2 collections 19 days ago

InternVL2.5

Better than InternVL 2.0 • 18 items • Updated Jan 10 • 83

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 131

upvoted a collection 22 days ago

FuseO1-Preview

System-II Reasoning Fusion of LLMs • 10 items • Updated 12 days ago • 17

upvoted 2 articles 22 days ago

Article

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

Feb 27, 2024

• 50

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

By

•

22 days ago

• 60

upvoted a paper 25 days ago

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Paper • 2501.09751 • Published 26 days ago • 47

upvoted an article 28 days ago

Article

Diving into MiniMax01 405B MoE

By

•

28 days ago

• 17

upvoted a paper 28 days ago

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers

Paper • 2410.10629 • Published Oct 14, 2024 • 11

upvoted 2 collections about 1 month ago

Deepseek V3 (All Versions)

Deepseek V3 - available in bf16, original, and GGUF formats, with support for 2, 3, 4, 5, 6 and 8-bit quantized versions. • 3 items • Updated 8 days ago • 32

Cosmos

The collection of Cosmos models • 31 items • Updated 26 days ago • 259