AI & ML interests

vision , multimedia , gradio, accessibility & cool demos

TeamTonic's activity

AtAndDevΒ 
posted an update 1 day ago
view post
Post
881
Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.
prithivMLmodsΒ 
posted an update 1 day ago
view post
Post
1483
Gemma-3-4B : Image and Video Inference πŸ–ΌοΈπŸŽ₯

🧀Space: prithivMLmods/Imagineo-Chat

@gemma3-4b : {Tag + Space_+ 'prompt'}
@gemma3-4b-video : {Tag + Space_+ 'prompt'}
By default, it runs: prithivMLmods/Qwen2-VL-OCR-2B-Instruct

Additionally, I have also tested Aya-Vision 8B vs Custom Qwen2-VL-OCR for OCR with test case samples on messy handwriting for experimental purposes to optimize edge device VLMs for Optical Character Recognition.

πŸ“œRead the blog here: https://huggingface.co/blog/prithivMLmods/aya-vision-vs-qwen2vl-ocr-2b
  • 1 reply
Β·
not-lainΒ 
posted an update 1 day ago
prithivMLmodsΒ 
posted an update 2 days ago
TonicΒ 
posted an update 7 days ago
view post
Post
1004
πŸ™‹πŸ»β€β™‚οΈHey there folks,

Did you know that you can use ModernBERT to detect model hallucinations ?

Check out the Demo : Tonic/hallucination-test

See here for Medical Context Demo : MultiTransformer/tonic-discharge-guard

check out the model from KRLabs : KRLabsOrg/lettucedect-large-modernbert-en-v1

and the library they kindly open sourced for it : https://github.com/KRLabsOrg/LettuceDetect

πŸ‘†πŸ»if you like this topic please contribute code upstream πŸš€

  • 2 replies
Β·
ZennyKennyΒ 
posted an update 8 days ago
view post
Post
482
It took me a while, but I've finally got it working: ZennyKenny/note-to-text

Using a Meta LLaMa checkpoint from Unsloth and some help from the HF community, you can capture handwritten notes and convert them into digital format in just a few second.

Really exciting times for AI builders on Hugging Face.
  • 2 replies
Β·
TonicΒ 
posted an update 8 days ago
view post
Post
644
Powered by KRLabsOrg/lettucedect-large-modernbert-en-v1 from KRLabsOrg.

Detect hallucinations in answers based on context and questions using ModernBERT with 8192-token context support!

### Model Details
- **Model Name**: [lettucedect-large-modernbert-en-v1]( KRLabsOrg/lettucedect-large-modernbert-en-v1)
- **Organization**: [KRLabsOrg](https://huggingface.co/KRLabsOrg)
- **Github**: [https://github.com/KRLabsOrg/LettuceDetect](https://github.com/KRLabsOrg/LettuceDetect)
- **Architecture**: ModernBERT (Large) with extended context support up to 8192 tokens
- **Task**: Token Classification / Hallucination Detection
- **Training Dataset**: [RagTruth]( wandb/RAGTruth-processed)
- **Language**: English
- **Capabilities**: Detects hallucinated spans in answers, provides confidence scores, and calculates average confidence across detected spans.

LettuceDetect excels at processing long documents to determine if an answer aligns with the provided context, making it a powerful tool for ensuring factual accuracy.
prithivMLmodsΒ 
posted an update 8 days ago
ZennyKennyΒ 
posted an update 13 days ago
view post
Post
1874
I've spent most of time working with AI on user-facing apps like Chatbots and TextGen, but today I decided to work on something that I think has a lot of applications for Data Science teams: ZennyKenny/comment_classification

This Space supports uploading a user CSV and categorizing the fields based on user-defined categories. The applications of AI in production are truly endless. πŸš€
prithivMLmodsΒ 
posted an update 16 days ago
view post
Post
5847
Dropping some of the custom fine-tunes based on SigLIP2,
with a single/multi label classification problem type! πŸŒ€πŸ§€

- AI vs Deepfake vs Real : prithivMLmods/AI-vs-Deepfake-vs-Real-Siglip2
- Deepfake Detect : prithivMLmods/Deepfake-Detect-Siglip2
- Fire Detection : prithivMLmods/Fire-Detection-Siglip2
- Deepfake Quality Assess : prithivMLmods/Deepfake-Quality-Assess-Siglip2
- Guard Against Unsafe Content : prithivMLmods/Guard-Against-Unsafe-Content-Siglip2

🌠Collection : prithivMLmods/siglip2-custom-67bcdb2de8fe96b99fb4e19e
prithivMLmodsΒ 
posted an update 19 days ago
view post
Post
5821
It's really interesting about the deployment of a new state of matter in Majorana 1: the world’s first quantum processor powered by topological qubits. If you missed this news this week, here are some links for you:

πŸ…±οΈTopological qubit arrays: https://arxiv.org/pdf/2502.12252

βš›οΈ Quantum Blog: https://azure.microsoft.com/en-us/blog/quantum/2025/02/19/microsoft-unveils-majorana-1-the-worlds-first-quantum-processor-powered-by-topological-qubits/

πŸ“– Read the story: https://news.microsoft.com/source/features/innovation/microsofts-majorana-1-chip-carves-new-path-for-quantum-computing/

πŸ“ Majorana 1 Intro: https://youtu.be/Q4xCR20Dh1E?si=Z51DbEYnZFp_88Xp

πŸŒ€The Path to a Million Qubits: https://youtu.be/wSHmygPQukQ?si=TS80EhI62oWiMSHK
Β·
prithivMLmodsΒ 
posted an update 23 days ago
view post
Post
3917
Dino: The Minimalist Multipurpose Chat System 🌠
Agent-Dino : prithivMLmods/Agent-Dino
Github: https://github.com/PRITHIVSAKTHIUR/Agent-Dino

By default, it performs the following tasks:
{Text-to-Text Generation}, {Image-Text-Text Generation}
@image: Generates an image using Stable Diffusion xL.
@3d: Generates a 3D mesh.
@web: Web search agents.
@rAgent: Initiates a reasoning chain using Llama mode for coding explanations.
@tts1-♀, @tts2-β™‚: Voice generation (Female and Male voices).
@yolo : Object Detection
ZennyKennyΒ 
posted an update 23 days ago
view post
Post
2196
Really excited to start contributing to the SWE Arena project: https://swe-arena.com/

Led by IBM PhD fellow @terryyz , our goal is to advance research in code generation and app development by frontier LLMs.

ZennyKennyΒ 
posted an update 25 days ago
view post
Post
1985
Okay this is pretty crazy. Snowflake has CortexAI and Uber is already teasing QueryGPT, both of which prominently feature plain text to SQL features to query your database.

I decided to see how hard it would be to put together something similar using πŸ€— smolagents. Turns out, it was pretty straightforward. I managed to get it done in London Luton airport this afternoon.

ZennyKenny/sqlAgent
  • 2 replies
Β·
prithivMLmodsΒ 
posted an update 25 days ago
view post
Post
4489
The last week of Impression Craft Arts and sketches from strangerzonehfπŸŽ¨πŸ§‘πŸ»β€πŸŽ¨

- Collection : strangerzonehf/Flux-Ultimate-LoRA-Collection

Adapters:
+ Ld-Art : strangerzonehf/Ld-Art
+ Animeopix-Flux : strangerzonehf/Animeopix-Flux
+ Flux-Super-Paint-LoRA : strangerzonehf/Flux-Super-Paint-LoRA
+ CinematicShot-Pics-Flux : strangerzonehf/cinematicShot-Pics-Flux
+ Oil-Wall-Art-Flux : strangerzonehf/Oil-Wall-Art-Flux
+ Pixelo-Flux : strangerzonehf/Pixelo-Flux
+ Abstract-Shattered : strangerzonehf/Abstract-Shattered
+ Neon-Impressionism-Flux : strangerzonehf/Neon-Impressionism-Flux
+ NewG-Art : strangerzonehf/NewG-Art

πŸͺ§Demo : prithivMLmods/FLUX-LoRA-DLC
πŸ€—Page : https://huggingface.co/strangerzonehf
AtAndDevΒ 
posted an update 26 days ago
view post
Post
2422
@nroggendorff is that you sama?
  • 2 replies
Β·
ZennyKennyΒ 
posted an update about 1 month ago
view post
Post
3432
I've completed the first unit of the just-launched Hugging Face Agents Course. I would highly recommend it, even for experienced builders, because it is a great walkthrough of the smolagents library and toolkit.
prithivMLmodsΒ 
posted an update about 1 month ago
view post
Post
4279
QwQ Edge Gets a Small Update..! πŸ’¬
try now: prithivMLmods/QwQ-Edge

πŸš€Now, you can use the following commands for different tasks:

πŸ–ΌοΈ @image 'prompt...' β†’ Generates an image
πŸ”‰@tts1 'prompt...' β†’ Generates speech in a female voice
πŸ”‰ @tts2 'prompt...' β†’ Generates speech in a male voice
πŸ…°οΈ@text 'prompt...' β†’ Enables textual conversation (If not specified, text-to-text generation is the default mode)

πŸ’¬Multimodality Support : prithivMLmods/Qwen2-VL-OCR-2B-Instruct
πŸ’¬For text generation, the FastThink-0.5B model ensures quick and efficient responses, prithivMLmods/FastThink-0.5B-Tiny
πŸ’¬Image Generation: sdxl lightning model, SG161222/RealVisXL_V4.0_Lightning

Github: https://github.com/PRITHIVSAKTHIUR/QwQ-Edge

graph TD
    A[User Interface] --> B[Chat Logic]
    B --> C{Command Type}
    C -->|Text| D[FastThink-0.5B]
    C -->|Image| E[Qwen2-VL-OCR-2B]
    C -->|@image| F[Stable Diffusion XL]
    C -->|@tts| G[Edge TTS]
    D --> H[Response]
    E --> H
    F --> H
    G --> H
TonicΒ 
posted an update about 1 month ago
view post
Post
2354
πŸ™‹πŸ»β€β™‚οΈhey there folks ,

Goedel's Theorem Prover is now being demo'ed on huggingface : Tonic/Math

give it a try !
prithivMLmodsΒ 
posted an update about 1 month ago
view post
Post
4842
o3-Mini and Deepseek R1
Worked out with some famous and weird examples.

πŸ”₯Blog: https://huggingface.co/blog/prithivMLmods/o3-mini-vs-deepseek-r1

Prompt : Using HTML, CSS, and JavaScript in a single HTML file to create a simulation of the solar system. Pay extreme attention to the UI to make it as intuitive as possible. Ensure that every planet appears as a sphere and is labeled with its corresponding name.

example 1: o3 Mini , example 2: Deepseek R1

Q2 : https://huggingface.co/blog/prithivMLmods/o3-mini-vs-deepseek-r1#q2--web-solar-system-explorer
  • 1 reply
Β·