Simon Pagezy's picture

Simon Pagezy

pagezyhf

AI & ML interests

Healthcare ML

Recent Activity

Organizations

Hugging Face's profile picture AWS Inferentia and Trainium's profile picture Hugging Face Optimum's profile picture Hugging Test Lab's profile picture Hugging Face OSS Metrics's profile picture Core ML Projects's profile picture Blog-explorers's profile picture Amazon SageMaker's profile picture Enterprise Explorers's profile picture Paris AI Running Club's profile picture Google Cloud ๐Ÿค๐Ÿป Hugging Face's profile picture PagezyTest's profile picture

pagezyhf's activity

upvoted 2 articles 6 days ago
view article
Article

Hugging Face on AMD Instinct MI300 GPU

โ€ข 12
view article
Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

โ€ข 47
upvoted an article 14 days ago
view article
Article

Open-source DeepResearch โ€“ Freeing our search agents

โ€ข 1.02k
posted an update 19 days ago
view post
Post
1661
We published https://huggingface.co/blog/deepseek-r1-aws!

If you are using AWS, give a read. It is a running document to showcase how to deploy and fine-tune DeepSeek R1 models with Hugging Face on AWS.

We're working hard to enable all the scenarios, whether you want to deploy to Inference Endpoints, Sagemaker or EC2; with GPUs or with Trainium & Inferentia.

We have full support for the distilled models, DeepSeek-R1 support is coming soon!! I'll keep you posted.

Cheers
  • 1 reply
ยท
published an article 20 days ago
view article
Article

How to deploy and fine-tune DeepSeek models on AWS

โ€ข 45
reacted to m-ric's post with ๐Ÿš€ 20 days ago
view post
Post
4003
๐—ง๐—ต๐—ฒ ๐—›๐˜‚๐—ฏ ๐˜„๐—ฒ๐—น๐—ฐ๐—ผ๐—บ๐—ฒ๐˜€ ๐—ฒ๐˜…๐˜๐—ฒ๐—ฟ๐—ป๐—ฎ๐—น ๐—ถ๐—ป๐—ณ๐—ฒ๐—ฟ๐—ฒ๐—ป๐—ฐ๐—ฒ ๐—ฝ๐—ฟ๐—ผ๐˜ƒ๐—ถ๐—ฑ๐—ฒ๐—ฟ๐˜€!

โœ… Hosting our own inference was not enough: now the Hub 4 new inference providers: fal, Replicate, SambaNova Systems, & Together AI.

Check model cards on the Hub: you can now, in 1 click, use inference from various providers (cf video demo)

Their inference can also be used through our Inference API client. There, you can use either your custom provider key, or your HF token, then billing will be handled directly on your HF account, as a way to centralize all expenses.

๐Ÿ’ธ Also, PRO users get 2$ inference credits per month!

Read more in the announcement ๐Ÿ‘‰ https://huggingface.co/blog/inference-providers
  • 1 reply
ยท
New activity in deepseek-ai/DeepSeek-R1 21 days ago
New activity in amazon-sagemaker/repository-metadata 21 days ago

Update modal.json

#29 opened 21 days ago by
pagezyhf
upvoted an article 21 days ago
view article
Article

Welcome to Inference Providers on the Hub ๐Ÿ”ฅ

โ€ข 377
reacted to merve's post with ๐Ÿ‘ 23 days ago
view post
Post
5157
Oof, what a week! ๐Ÿฅต So many things have happened, let's recap! merve/jan-24-releases-6793d610774073328eac67a9

Multimodal ๐Ÿ’ฌ
- We have released SmolVLM -- tiniest VLMs that come in 256M and 500M, with it's retrieval models ColSmol for multimodal RAG ๐Ÿ’—
- UI-TARS are new models by ByteDance to unlock agentic GUI control ๐Ÿคฏ in 2B, 7B and 72B
- Alibaba DAMO lab released VideoLlama3, new video LMs that come in 2B and 7B
- MiniMaxAI released Minimax-VL-01, where decoder is based on MiniMax-Text-01 456B MoE model with long context
- Dataset: Yale released a new benchmark called MMVU
- Dataset: CAIS released Humanity's Last Exam (HLE) a new challenging MM benchmark

LLMs ๐Ÿ“–
- DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 660B reasoning models by DeepSeek, and six distilled dense models, on par with o1 with MIT license! ๐Ÿคฏ
- Qwen2.5-Math-PRM: new math models by Qwen in 7B and 72B
- NVIDIA released AceMath and AceInstruct, new family of models and their datasets (SFT and reward ones too!)

Audio ๐Ÿ—ฃ๏ธ
- Llasa is a new speech synthesis model based on Llama that comes in 1B,3B, and 8B
- TangoFlux is a new audio generation model trained from scratch and aligned with CRPO

Image/Video/3D Generation โฏ๏ธ
- Flex.1-alpha is a new 8B pre-trained diffusion model by ostris similar to Flux
- tencent released Hunyuan3D-2, new 3D asset generation from images
ยท
upvoted an article 27 days ago
view article
Article

Mastering Long Contexts in LLMs with KVPress

By nvidia and 1 other โ€ข
โ€ข 62