gg-hf-gm


Recent Activity

reach-vb 
posted an update 3 months ago
Excited to onboard FeatherlessAI on Hugging Face as an Inference Provider - they bring a fleet of 6,700+ LLMs on-demand on the Hugging Face Hub 🤯

Starting today, you'll be able to access all of those LLMs (OpenAI compatible) on HF model pages and via OpenAI client libraries too! 💥

Go play with it today: https://huggingface.co/blog/inference-providers-featherless

P.S. They're also bringing on more GPUs to support all your concurrent requests!
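
If you want to try it from Python, here's a minimal sketch using huggingface_hub's InferenceClient with the Featherless provider; the model id below is just an illustrative assumption, see the blog for the full list:

```python
# Minimal sketch: chat completion routed through Featherless AI on the Hub.
# Requires a recent huggingface_hub and an HF token (e.g. via HF_TOKEN).
from huggingface_hub import InferenceClient

client = InferenceClient(provider="featherless-ai")  # route via HF Inference Providers

response = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",  # assumed example model id
    messages=[{"role": "user", "content": "Say hello from Featherless!"}],
)
print(response.choices[0].message.content)
```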
ariG23498 
posted an update 3 months ago
🚨 Implement KV Cache from scratch in pure PyTorch. 🚨

We have documented everything we learned while implementing a KV Cache in nanoVLM. Joint work with @kashif @lusxvr @andito @pcuenq

Blog: hf.co/blog/kv-cache
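
To give a flavour of the idea, here's a minimal sketch of a per-layer KV cache in pure PyTorch; the class and variable names are illustrative assumptions, not the blog's exact code:

```python
# Minimal sketch: cache past keys/values so each decode step only
# projects and attends with the newest token instead of the full prefix.
import torch

class KVCache:
    def __init__(self, num_layers: int):
        # One entry per layer, each of shape (batch, heads, seq, head_dim).
        self.keys = [None] * num_layers
        self.values = [None] * num_layers

    def update(self, layer: int, k: torch.Tensor, v: torch.Tensor):
        # Append the new token's K/V along the sequence dim, return the full cache.
        if self.keys[layer] is None:
            self.keys[layer], self.values[layer] = k, v
        else:
            self.keys[layer] = torch.cat([self.keys[layer], k], dim=2)
            self.values[layer] = torch.cat([self.values[layer], v], dim=2)
        return self.keys[layer], self.values[layer]

# At each decode step, attention uses the cached keys/values:
# k_all, v_all = cache.update(layer_idx, k_new, v_new)
# scores = q_new @ k_all.transpose(-2, -1) / k_all.shape[-1] ** 0.5
# out = torch.softmax(scores, dim=-1) @ v_all
```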
reach-vb 
posted an update 4 months ago
hey hey @mradermacher - VB from Hugging Face here, we'd love to onboard you over to our optimised xet backend! 💥

as you know we're in the process of upgrading our storage backend to xet (which helps us scale and offer blazingly fast upload/download speeds too): https://huggingface.co/blog/xet-on-the-hub. Now that we're certain the backend can scale even with big models like Llama 4/Qwen 3, we're moving to the next phase of inviting impactful orgs and users on the Hub over. As you are a big part of the open source ML community, we would love to onboard you next and create some excitement about it in the community too!

in terms of actual steps - it should be as simple as one of the org admins joining hf.co/join/xet - we'll take care of the rest.

p.s. you'd need the latest hf_xet-enabled version of the huggingface_hub lib, but everything else should be the same: https://huggingface.co/docs/hub/storage-backends#using-xet-storage

p.p.s. this is fully backwards compatible so everything will work as it should! 🤗
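
for reference, a minimal sketch of what this looks like in practice - the repo id and file name below are placeholders; once the org is enabled, regular huggingface_hub uploads go through xet transparently:

```python
# Minimal sketch: a normal upload that uses the xet backend once enabled.
# Install with xet support: pip install -U "huggingface_hub[hf_xet]"
from huggingface_hub import HfApi

api = HfApi()  # assumes HF_TOKEN is set in the environment

api.upload_file(
    path_or_fileobj="model.safetensors",  # placeholder local file
    path_in_repo="model.safetensors",
    repo_id="your-org/your-model",        # placeholder repo id
)
```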
philschmid 
posted an update 5 months ago
Gemini 2.5 Flash is here! We're excited to launch our first hybrid reasoning Gemini model. In 2.5 Flash, developers can turn thinking off.

**TL;DR:**
- 🧠 Controllable "Thinking" with a thinking budget of up to 24k tokens
- 🌌 1 million token multimodal input context for text, image, video, audio, and PDF
- 🛠️ Function calling, structured output, Google Search & code execution.
- 🏦 $0.15 per 1M input tokens; $0.60 (thinking off) or $3.50 (thinking on) per 1M output tokens (thinking tokens are billed as output tokens)
- 💡 Knowledge cutoff of January 2025
- 🚀 Rate limits - free tier: 10 RPM, 500 req/day
- 🏅 Outperforms 2.0 Flash on every benchmark

Try it ⬇️
https://aistudio.google.com/prompts/new_chat?model=gemini-2.5-flash-preview-04-17
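
Or from Python, a minimal sketch of the thinking budget control with the google-genai SDK; the budget value, prompt, and key are illustrative:

```python
# Minimal sketch: call Gemini 2.5 Flash with an explicit thinking budget.
# Setting thinking_budget=0 turns thinking off entirely.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder key

response = client.models.generate_content(
    model="gemini-2.5-flash-preview-04-17",
    contents="Explain KV caching in one paragraph.",
    config=types.GenerateContentConfig(
        thinking_config=types.ThinkingConfig(thinking_budget=1024)  # tokens of thinking
    ),
)
print(response.text)
```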