5 3 4

Ghosh

Sreyan88

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

authored a paper about 1 month ago

Do Audio-Language Models Understand Linguistic Variations?

authored a paper about 1 month ago

Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation

View all activity

Organizations

None yet

Sreyan88's activity

authored 4 papers about 1 month ago

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24 • 19

Do Audio-Language Models Understand Linguistic Variations?

Paper • 2410.16505 • Published Oct 21 • 1

Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation

Paper • 2410.13198 • Published Oct 17 • 9

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

Paper • 2409.09213 • Published Sep 13 • 12

commented a paper about 1 month ago

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24 • 19 •

liked a Space about 1 month ago

Running on Zero

📚

Synthio Stable Audio Open

Stable audio open model from Synthio paper.

liked a model about 1 month ago

sonalkum/synthio-stable-audio-open

Updated Oct 19 • 2

commented a paper about 1 month ago

Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation

Paper • 2410.13198 • Published Oct 17 • 9 •

upvoted a paper about 2 months ago

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Paper • 2410.02056 • Published Oct 2 • 6

commented a paper about 2 months ago

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Paper • 2410.02056 • Published Oct 2 • 6 •

commented a paper 2 months ago

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

Paper • 2409.09213 • Published Sep 13 • 12 •

liked 2 Spaces 5 months ago

Running on Zero

🏆

GAMA-IT

Running on Zero

🌍

GAMA

upvoted a paper 5 months ago

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Paper • 2406.11768 • Published Jun 17 • 20

authored 3 papers 5 months ago

data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup

Paper • 2211.01246 • Published Nov 2, 2022

CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP

Paper • 2404.00415 • Published Mar 30

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Paper • 2406.11768 • Published Jun 17 • 20

commented a paper 5 months ago

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Paper • 2406.11768 • Published Jun 17 • 20 •

upvoted a collection 6 months ago

Math

Collection

46 items • Updated May 31 • 9

authored a paper 6 months ago

VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap

Paper • 2405.15683 • Published May 24