Simon Pagezy

pagezyhf

AI & ML interests

Healthcare ML

pagezyhf's activity

updated a dataset 5 days ago

amazon-sagemaker/repository-metadata

Preview • Updated 5 days ago • 426 • 1

upvoted 2 articles 6 days ago

Article

Hugging Face on AMD Instinct MI300 GPU

May 21, 2024

• 12

Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

7 days ago

• 47

updated a dataset 8 days ago

huggingface/documentation-images

Viewer • Updated 32 minutes ago • 50 • 3.33M • 49

liked a Space 12 days ago

329

NeuralJam

🚂

EscapeExpress : LLM AI detective puzzle game.

upvoted an article 14 days ago

Article

Open-source DeepResearch – Freeing our search agents

15 days ago

• 1.02k

posted an update 19 days ago

Post

1661

We published https://huggingface.co/blog/deepseek-r1-aws!

If you are using AWS, give it a read. It is a living document that shows how to deploy and fine-tune DeepSeek R1 models with Hugging Face on AWS.

We're working hard to enable all the scenarios, whether you want to deploy to Inference Endpoints, SageMaker, or EC2, with GPUs or with Trainium & Inferentia.

We have full support for the distilled models; DeepSeek-R1 support is coming soon! I'll keep you posted.

Cheers

1 reply

published an article 20 days ago

Article

How to deploy and fine-tune DeepSeek models on AWS

20 days ago

• 45

reacted to m-ric's post with 🚀 20 days ago

Post

4003

The Hub welcomes external inference providers!

✅ Hosting our own inference was not enough, so the Hub now has 4 new inference providers: fal, Replicate, SambaNova Systems, & Together AI.

Check model cards on the Hub: you can now use inference from various providers in 1 click (cf. video demo).

Their inference can also be used through our Inference API client. There, you can use either your custom provider key or your HF token; with the HF token, billing is handled directly on your HF account, as a way to centralize all expenses.

💸 Also, PRO users get $2 of inference credits per month!

Read more in the announcement 👉 https://huggingface.co/blog/inference-providers

1 reply

New activity in deepseek-ai/DeepSeek-R1 21 days ago

problem with using serverless inference

#78 opened 21 days ago by

manju2345

New activity in amazon-sagemaker/repository-metadata 21 days ago

Update modal.json

#29 opened 21 days ago by

pagezyhf

upvoted an article 21 days ago

Article

Welcome to Inference Providers on the Hub 🔥

22 days ago

• 377

New activity in deepseek-ai/DeepSeek-R1-Distill-Llama-70B 21 days ago

Amazon Sagemaker deployment failing with CUDA OutOfMemory error

#10 opened 22 days ago by

neelkapadia

New activity in Qwen/Qwen2-VL-7B-Instruct 23 days ago

Anyone able to deploy an inference endpoint on sagemaker?

#71 opened about 1 month ago by

TeoGX

reacted to merve's post with 👍 23 days ago

Post

5157

Oof, what a week! 🥵 So many things have happened, let's recap! merve/jan-24-releases-6793d610774073328eac67a9

Multimodal 💬
- We have released SmolVLM, the tiniest VLMs, which come in 256M and 500M, along with its retrieval models, ColSmol, for multimodal RAG 💗
- UI-TARS are new models by ByteDance to unlock agentic GUI control 🤯 in 2B, 7B and 72B
- Alibaba DAMO lab released VideoLlama3, new video LMs that come in 2B and 7B
- MiniMaxAI released Minimax-VL-01, whose decoder is based on the MiniMax-Text-01 456B MoE model with long context
- Dataset: Yale released a new benchmark called MMVU
- Dataset: CAIS released Humanity's Last Exam (HLE), a new challenging MM benchmark

LLMs 📖
- DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 660B reasoning models by DeepSeek, plus six distilled dense models, on par with o1 and MIT-licensed! 🤯
- Qwen2.5-Math-PRM: new math models by Qwen in 7B and 72B
- NVIDIA released AceMath and AceInstruct, a new family of models and their datasets (SFT and reward ones too!)

Audio 🗣️
- Llasa is a new speech synthesis model based on Llama that comes in 1B, 3B, and 8B
- TangoFlux is a new audio generation model trained from scratch and aligned with CRPO

Image/Video/3D Generation ⏯️
- Flex.1-alpha is a new 8B pre-trained diffusion model by ostris, similar to Flux
- Tencent released Hunyuan3D-2, a new model for 3D asset generation from images

7 replies

upvoted an article 27 days ago

Article

Mastering Long Contexts in LLMs with KVPress

and 1 other •

27 days ago

• 62

liked 3 models 27 days ago

upvoted a collection 27 days ago

DeepSeek-R1

Collection

8 items • Updated 29 days ago • 511