Tahir's picture

Tahir

TahirC

·

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

openbmb/VoxCPM1.5

liked a model 2 days ago

zai-org/GLM-4.6V-Flash

reacted to prithivMLmods's post with 🤗 4 days ago

One speech model with seven voices, streamlined with multimodal capabilities for vision tasks. Performs vision(image-text) to audio inference with Qwen2.5-VL + VibeVoice-Realtime-0.5B. Vision to VibeVoice (EN) - The demo is live. 🗣️🔥 🤗 Vision-to-VibeVoice-en [Demo]: https://huggingface.co/spaces/prithivMLmods/Vision-to-VibeVoice-en ✨ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations ✨ Speech [VibeVoice-Realtime-0.5B]: https://huggingface.co/microsoft/VibeVoice-Realtime-0.5B ✨ Vision [Qwen2.5-VL]: https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct To know more about it, visit the app page or the respective model page!

View all activity

Organizations

liked a model 1 day ago

openbmb/VoxCPM1.5

Text-to-Speech • Updated 5 days ago • 711 • 90

liked a model 2 days ago

zai-org/GLM-4.6V-Flash

Image-Text-to-Text • 10B • Updated about 18 hours ago • 10k • • 275

reacted to prithivMLmods's post with 🤗 4 days ago

Post

3499

One speech model with seven voices, streamlined with multimodal capabilities for vision tasks. Performs vision(image-text) to audio inference with Qwen2.5-VL + VibeVoice-Realtime-0.5B. Vision to VibeVoice (EN) - The demo is live. 🗣️🔥

🤗 Vision-to-VibeVoice-en [Demo]: prithivMLmods/Vision-to-VibeVoice-en
✨ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
✨ Speech [VibeVoice-Realtime-0.5B]: microsoft/VibeVoice-Realtime-0.5B
✨ Vision [Qwen2.5-VL]: Qwen/Qwen2.5-VL-7B-Instruct

To know more about it, visit the app page or the respective model page!

6 replies

·

liked 2 models 5 days ago

Quark-Vision/Live-Avatar

Image-to-Video • Updated 1 day ago • 93

microsoft/VibeVoice-Realtime-0.5B

Text-to-Speech • 1B • Updated 1 day ago • 67.7k • 626

New activity in nvidia/gliner-PII 9 days ago

55+ labels ? where i can get the list

#6 opened 12 days ago by

liked 3 models 12 days ago

microsoft/Fara-7B

Image-Text-to-Text • 8B • Updated 9 days ago • 34.6k • 432

fal/FLUX.2-Tiny-AutoEncoder

Updated 13 days ago • 455 • 50

Tongyi-MAI/Z-Image-Turbo

Text-to-Image • Updated 2 days ago • 233k • • 2.46k

liked a model 13 days ago

nvidia/gliner-PII

Token Classification • Updated 3 days ago • 1.83k • 40

liked 4 models 14 days ago

Qwen/Qwen3Guard-Gen-0.6B

Text Generation • 0.8B • Updated Nov 7 • 103k • 47

Qwen/Qwen3Guard-Gen-4B

Text Generation • 4B • Updated Nov 7 • 11.6k • 31

meta-llama/Llama-Guard-3-8B

Text Generation • 8B • Updated Oct 11, 2024 • 56.2k • • 249

black-forest-labs/FLUX.2-dev

Image-to-Image • Updated 13 days ago • 215k • • 956

liked 2 models 19 days ago

Photoroom/prx-1024-t2i-beta

Text-to-Image • Updated 8 days ago • 1.28k • 77

tencent/HunyuanVideo-1.5

Text-to-Video • Updated about 23 hours ago • 14.1k • • 829

liked a model 21 days ago

Intel/Magistral-Small-2509-int4-AutoRound

2B • Updated Nov 4 • 131 • 3

upvoted an article 21 days ago

Article

We’re open-sourcing our text-to-image model and the process behind it

28 days ago

•

74

liked 2 models 23 days ago

cpatonn/Qwen3-VL-32B-Instruct-AWQ-4bit

Image-Text-to-Text • 7B • Updated Oct 21 • 1.94k • 4

Qwen/Qwen3-VL-32B-Instruct-GGUF

Image-Text-to-Text • 33B • Updated Nov 1 • 5.51k • 10