Article: Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints (May 1, 2024)
Article: Train 400x faster Static Embedding Models with Sentence Transformers
Article: Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference
Collection: Cosmos Tokenizer. A suite of image and video tokenizers (13 items).
Collection: Llama 3.3. Hosts the transformers and original repos of Llama 3.3 (1 item, updated Dec 6, 2024).
Paper: ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration (arXiv:2409.09506, published Sep 14, 2024)
Paper: Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis (arXiv:2410.23320, published Oct 30, 2024)
Collection: GTE models. General text embedding models released by Tongyi Lab of Alibaba Group (19 items).
Paper: Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference (arXiv:2412.13663)
Collection: ModernBERT. Bringing BERT into modernity via both architecture changes and scaling (3 items).
Paper: Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation (arXiv:2412.14015)
Collection: Embedding Model Datasets. A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers (67 items, updated Jul 3, 2024).
Collection: FalconMamba 7B. Features the FalconMamba 7B base model, the instruction-tuned version, their 4-bit and GGUF variants, and the demo (15 items).
Collection: Bamba. Bamba models, a hybrid architecture based on Mamba2, trained on open data (8 items).
Collection: Falcon3. The Falcon3 family of open foundation models, a set of pretrained and instruct LLMs ranging from 1B to 10B parameters (40 items).
Collection: Gemma 2 2B Release. The 2.6B-parameter version of Gemma 2 (6 items, updated Dec 13, 2024).