Ishan Gajbhiye

n3rdium

https://n3rdium.dev

N3RDIUM

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model

liked a model 13 days ago

NousResearch/DeepHermes-3-Llama-3-8B-Preview

liked a Space 4 months ago

mrfakename/E2-F5-TTS

Organizations

None yet

n3rdium's activity

upvoted a paper 12 days ago

video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model

Paper • 2502.11775 • Published 13 days ago • 8

liked a model 13 days ago

NousResearch/DeepHermes-3-Llama-3-8B-Preview

Text Generation • Updated 11 days ago • 9.93k • 266

liked a Space 4 months ago

1.96k

F5-TTS

🗣

F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)

liked a model 4 months ago

OuteAI/OuteTTS-0.1-350M

Text-to-Speech • Updated Nov 27, 2024 • 4.98k • 300

upvoted a paper 4 months ago

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Paper • 2410.13863 • Published Oct 17, 2024 • 37

liked 2 models 5 months ago

Zyphra/Zamba2-7B-Instruct

Text Generation • Updated 17 days ago • 2.41k • 88

rhymes-ai/Aria

Image-Text-to-Text • Updated Jan 27 • 26.1k • 617

upvoted a collection 5 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 573

upvoted 2 papers 6 months ago

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Paper • 2408.13257 • Published Aug 23, 2024 • 26

Scalable Autoregressive Image Generation with Mamba

Paper • 2408.12245 • Published Aug 22, 2024 • 26

liked a model 6 months ago

Salesforce/xLAM-7b-fc-r-gguf

Updated Jan 24 • 388 • 23

upvoted a paper 6 months ago

Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering

Paper • 2408.09702 • Published Aug 19, 2024 • 11

liked a model 7 months ago

Groq/Llama-3-Groq-8B-Tool-Use

Text Generation • Updated Aug 27, 2024 • 1.03k • 273

liked a model 8 months ago

Trelis/Meta-Llama-3-8B-Instruct-function-calling

Text Generation • Updated Jul 23, 2024 • 213 • 43

liked a Space 9 months ago

ChatTTS Free

🔥

Generate audio from text input

liked a model 9 months ago

2Noise/ChatTTS

Text-to-Audio • Updated Oct 22, 2024 • 2.92k • 1.5k

liked a Space 9 months ago

1.89k

Stable Diffusion XL on TPUv5e

🏋

Generate images from text prompts with various styles

upvoted a paper 9 months ago

Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

Paper • 2405.15574 • Published May 24, 2024 • 55

liked 2 models 9 months ago

mistralai/Mixtral-8x22B-Instruct-v0.1

Text Generation • Updated Oct 3, 2024 • 154k • 717

mistral-community/Mixtral-8x22B-v0.1

Text Generation • Updated Jul 1, 2024 • 3.93k • 674