NG's picture

120 205

NG

SirRa1zel

·

AI & ML interests

Text-to-Speech, Translation, Object Detection

Recent Activity

upvoted a collection about 9 hours ago

upvoted a paper about 9 hours ago

MangaNinja: Line Art Colorization with Precise Reference Following

upvoted a collection about 9 hours ago

Visual Document Retrieval

View all activity

Organizations

None yet

SirRa1zel's activity

upvoted a collection about 9 hours ago

OuteTTS 0.3

4 items • Updated 1 day ago • 13

upvoted a paper about 9 hours ago

MangaNinja: Line Art Colorization with Precise Reference Following

Paper • 2501.08332 • Published 2 days ago • 45

upvoted a collection about 9 hours ago

Visual Document Retrieval

A collection of models, datasets, and spaces in the VDR series • 5 items • Updated 6 days ago • 8

upvoted a paper about 9 hours ago

UnCommon Objects in 3D

Paper • 2501.07574 • Published 3 days ago • 10

upvoted a paper 3 days ago

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published 6 days ago • 54

upvoted a paper 4 days ago

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published 9 days ago • 40

liked a dataset 5 days ago

DAMO-NLP-SG/multimodal_textbook

Updated 5 days ago • 7.91k • 110

liked a model 6 days ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated about 17 hours ago • 18.1k • 1.68k

liked a Space 6 days ago

Running on Zero

Kokoro TTS

Now in 5 languages!

upvoted a collection 9 days ago

Cosmos

The collection of Cosmos models • 31 items • Updated 6 days ago • 230

liked a Space 27 days ago

Jupyter Agent

upvoted a paper about 1 month ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 45

liked a model about 1 month ago

jianzongwu/DiffSensei

Updated Dec 11, 2024 • 32

liked a dataset about 1 month ago

jianzongwu/MangaZero

Viewer • Updated Dec 11, 2024 • 32.7k • 138 • 21

liked a Space about 1 month ago

Background Removal Arena

upvoted a collection about 1 month ago

[MASK] is All You Need

Code, dataset, and pretrained model • 5 items • Updated Nov 29, 2024 • 9

liked 3 Spaces about 1 month ago

Translation-Agent-WebUI

Translation-Agent-WebUI

MindSearch

Running on Zero

Indic Parler-TTS

A demo of Indic Parler-TTS

liked a model about 1 month ago

ai4bharat/indic-parler-tts

Text-to-Speech • Updated Dec 9, 2024 • 16.8k • 89