AI & ML interests

LLMs for language and code + Time series and geospatial foundation models

Recent Activity

mrutkows  updated a collection about 7 hours ago
Granite Quantized Models

ariG23498 
posted an update about 1 month ago
New post is live!

This time we cover some major updates to transformers.

🤗
danielhanchen 
posted an update about 2 months ago
Run DeepSeek-V3.1 locally on 170GB RAM with Dynamic 1-bit GGUFs!🐋
GGUFs: unsloth/DeepSeek-V3.1-GGUF

The 715GB model gets reduced to 170GB (about -76% size) by smartly quantizing layers.

The 1-bit GGUF passes all our code tests, and we fixed the chat template for llama.cpp-supported backends.

Guide: https://docs.unsloth.ai/basics/deepseek-v3.1
Xenova 
posted an update about 2 months ago
Okay this is insane... WebGPU-accelerated semantic video tracking, powered by DINOv3 and Transformers.js! 🤯
Demo (+ source code): webml-community/DINOv3-video-tracking

This will revolutionize AI-powered video editors... which can now run 100% locally in your browser, no server inference required (costs $0)! 😍

How does it work? 🤔
1️⃣ Generate and cache image features for each frame
2️⃣ Create a list of embeddings for selected patch(es)
3️⃣ Compute cosine similarity between each patch and the selected patch(es)
4️⃣ Highlight those whose score is above some threshold

... et voilà! 🥳
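
For readers who want to see steps 2 to 4 in code, here is a minimal Python sketch. It is not the demo's actual implementation (which runs in the browser with Transformers.js); the function names, the toy 3-dimensional features, the 0.6 threshold, and the choice to score each patch by its best match against any selected patch are all assumptions made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def highlight_patches(frame_features, selected, threshold=0.6):
    """Return indices of patches whose best similarity to any selected
    embedding is above the threshold (hypothetical helper, assumed scoring rule)."""
    if not selected:
        return []
    highlighted = []
    for index, patch in enumerate(frame_features):
        # Step 3: compare this patch against every selected embedding.
        score = max(cosine_similarity(patch, ref) for ref in selected)
        # Step 4: keep the patch only if it clears the threshold.
        if score > threshold:
            highlighted.append(index)
    return highlighted


# Step 1 stand-in: made-up cached features for one frame with four patches.
frame = [
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
# Step 2: the user selected patch 0, so its embedding is the reference list.
selected = [frame[0]]

print(highlight_patches(frame, selected))  # prints [0, 1]
```

Appending embeddings picked from other frames to `selected` is one way the cross-frame selections could feed into the same scoring loop.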

You can also make selections across frames to improve temporal consistency! This is super useful if the object changes its appearance slightly throughout the video.

Excited to see what the community builds with it!
Xenova 
posted an update 2 months ago
The next generation of AI-powered websites is going to be WILD! 🤯

In-browser tool calling & MCP are finally here, allowing LLMs to interact with websites programmatically.

To show what's possible, I built a demo using Liquid AI's new LFM2 model, powered by 🤗 Transformers.js: LiquidAI/LFM2-WebGPU

As always, the demo is open source (you can find the code under the "Files" tab), so I'm excited to see how the community builds upon this! 🚀
Xenova 
posted an update 3 months ago
Introducing Voxtral WebGPU: State-of-the-art audio transcription directly in your browser! 🤯
🗣️ Transcribe videos, meeting notes, songs and more
🔐 Runs on-device, meaning no data is sent to a server
🌎 Multilingual (8 languages)
🤗 Completely free (forever) & open source

That's right, we're running Mistral's new Voxtral-Mini-3B model 100% locally in-browser on WebGPU, powered by Transformers.js and ONNX Runtime Web! 🔥

Try it out yourself! 👇
webml-community/Voxtral-WebGPU