Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
12
2
Ksenia Se
Kseniase
Follow
gschex's profile picture
ggalpi's profile picture
sudanenator's profile picture
146 followers
·
42 following
https://www.turingpost.com/
Kseniase_
ksenia-se
kseniase.bsky.social
AI & ML interests
None yet
Recent Activity
posted
an
update
about 14 hours ago
Today, we spoke with Snowflake’s AI Research Team Leads, Yuxiong He and Samyam Rajbhandari (@samyam) (he is also one the researchers behind https://huggingface.co/papers/2401.08671 and other DeepSpeed papers) Collaborating with their co-authors to reduce inference costs for enterprise-specific tasks, they observed that inputs are often significantly larger than outputs. This is because it’s in the nature of enterprises to analyze enormous amounts of information trying to extract valuable insights, which are much shorter. To address this, they developed SwiftKV https://huggingface.co/papers/2410.03960, an optimization that reduces LLM inference costs by up to 75% for Meta Llama LLMs, enhancing efficiency and performance in enterprise AI tasks. Today they are open-sourcing SwiftKV (https://huggingface.co/Snowflake/Llama-3.1-SwiftKV-8B-Instruct) and ArcticTrainging Platform. In our new episode "15 minutes with a Researcher" they explain how SwiftKV works, its applicability to other architectures, its limitations, and additional methods to further reduce computation costs in inference. Watch the full 15 min interview here (https://youtu.be/9x1k7eXe-6Q?si=4_HQOyi1CPHgvlrx)
published
an
article
about 15 hours ago
Topic 23: What is LLM Inference, it's challenges and solutions for it
upvoted
an
article
4 days ago
🌁#83: GAN is back
View all activity
Articles
Topic 23: What is LLM Inference, it's challenges and solutions for it
about 15 hours ago
•
2
🌁#83: GAN is back
4 days ago
•
5
🦸🏻#7: From Agentic AI to Physical AI
6 days ago
•
4
🅰️ℹ️ 1️⃣0️⃣1️⃣ **What is HtmlRAG, Multimodal RAG and Agentic RAG?**
8 days ago
•
6
🌁#82: AI and ML in Real Life
11 days ago
•
15
AI in 2025: A Combinatorial Explosion of Possibilities, but NOT AGI
14 days ago
•
3
🦸🏻#2: Your Go-To Vocabulary to Navigate the World of AI Agents and Agentic Workflows
21 days ago
•
9
🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI?
23 days ago
•
7
🌁#81: Key AI Concepts to Follow in 2025
25 days ago
•
24
Organizations
Kseniase
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a Space
10 days ago
Running
472
📈
Scaling test-time compute
liked
a Space
about 1 month ago
Running
on
CPU Upgrade
9.13k
👩🎨
AI Comic Factory
Create your own AI comic with a single prompt