Sugato Ray's picture

Sugato Ray PRO

sugatoray

·

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

upvoted an article about 8 hours ago

Open-R1: a fully open reproduction of DeepSeek-R1

upvoted an article 1 day ago

Welcome to Inference Providers on the Hub 🔥

reacted to m-ric's post with 🔥 1 day ago

𝗧𝗵𝗲 𝗛𝘂𝗯 𝘄𝗲𝗹𝗰𝗼𝗺𝗲𝘀 𝗲𝘅𝘁𝗲𝗿𝗻𝗮𝗹 𝗶𝗻𝗳𝗲𝗿𝗲𝗻𝗰𝗲 𝗽𝗿𝗼𝘃𝗶𝗱𝗲𝗿𝘀! ✅ Hosting our own inference was not enough: now the Hub 4 new inference providers: fal, Replicate, SambaNova Systems, & Together AI. Check model cards on the Hub: you can now, in 1 click, use inference from various providers (cf video demo) Their inference can also be used through our Inference API client. There, you can use either your custom provider key, or your HF token, then billing will be handled directly on your HF account, as a way to centralize all expenses. 💸 Also, PRO users get 2$ inference credits per month! Read more in the announcement 👉 https://huggingface.co/blog/inference-providers

View all activity

Organizations

sugatoray's activity

upvoted an article about 8 hours ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

2 days ago

• 383

upvoted an article 1 day ago

Article

Welcome to Inference Providers on the Hub 🔥

2 days ago

• 147

upvoted a paper 1 day ago

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Paper • 2501.13928 • Published 6 days ago • 12

upvoted a paper 2 days ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published 7 days ago • 38

upvoted a collection 3 days ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 3 days ago • 83

upvoted an article 5 days ago

Article

We now support VLMs in smolagents!

6 days ago

• 62

upvoted a paper 6 days ago

MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation

Paper • 2501.06713 • Published 18 days ago • 1

upvoted an article 6 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

7 days ago

• 89

upvoted a collection 6 days ago

SmolVLM 256M & 500M

Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 6 days ago • 60

upvoted 5 collections 7 days ago

SmolLM2 - Smashed

Many variations of SmolLM2 with many variation techniques • 15 items • Updated 29 days ago • 1

Text-to-video Generation (Zeroscope, ...)

6 items • Updated Mar 27, 2024 • 4

Image Classification (ResNet, ViT, MobileNet, ...)

524 items • Updated Mar 27, 2024 • 4

Text-to-text Generation Models (LLMs, Llama, GPT, ...)

5165 items • Updated 14 minutes ago • 13

Text-to-image Generation Models (Diffusion, LCM...)

57 items • Updated May 8, 2024 • 8

upvoted a paper 7 days ago

FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces

Paper • 2501.12909 • Published 7 days ago • 62

upvoted 2 collections 7 days ago

GTE models

General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated 9 days ago • 20

GTE ModernBERT

GTE Models Based on ModernBERT • 2 items • Updated 9 days ago • 12

upvoted a paper 8 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 15 days ago • 270

upvoted a collection 8 days ago

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 27 items • Updated 3 days ago • 101

upvoted an article 8 days ago

Article

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

By

•

14 days ago

• 40