Collections

Collections including paper arxiv:2308.11466

Papers - Training - Speed - Reduced Training Time

SONAR: Sentence-Level Multimodal and Language-Agnostic Representations

Paper • 2308.11466 • Published Aug 22, 2023 • 1

Papers - Fine-tuning - Decoder Only - Frozen Encoder Weights

SONAR: Sentence-Level Multimodal and Language-Agnostic Representations

Paper • 2308.11466 • Published Aug 22, 2023 • 1

Papers - Text - Embedding - Sentence

2D Matryoshka Sentence Embeddings

Paper • 2402.14776 • Published Feb 22 • 6
AnglE-optimized Text Embeddings

Paper • 2309.12871 • Published Sep 22, 2023 • 2
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations

Paper • 2308.11466 • Published Aug 22, 2023 • 1

Papers - Text - Embedding - Sentence - SONAR

In-Context Example Selection via Similarity Search Improves Low-Resource Machine Translation

Paper • 2408.00397 • Published Aug 1 • 10
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations

Paper • 2308.11466 • Published Aug 22, 2023 • 1

Papers - Audio - STT - ASR

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

Paper • 2303.00747 • Published Mar 1, 2023 • 4
Custom Data Augmentation for low resource ASR using Bark and Retrieval-Based Voice Conversion

Paper • 2311.14836 • Published Nov 24, 2023 • 2
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations

Paper • 2308.11466 • Published Aug 22, 2023 • 1
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training

Paper • 2108.06209 • Published Aug 7, 2021 • 1

Papers - Audio - TTS

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions

Paper • 1712.05884 • Published Dec 16, 2017 • 2
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

Paper • 2403.16973 • Published Mar 25 • 2
High Fidelity Neural Audio Compression

Paper • 2210.13438 • Published Oct 24, 2022 • 4
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis

Paper • 2404.03204 • Published Apr 4 • 7

Papers - Fine-tuning - Multilingual

RakutenAI-7B: Extending Large Language Models for Japanese

Paper • 2403.15484 • Published Mar 21 • 12
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations

Paper • 2308.11466 • Published Aug 22, 2023 • 1

Papers - Multimodal - Audio

A Multimodal Approach to Device-Directed Speech Detection with Large Language Models

Paper • 2403.14438 • Published Mar 21 • 2
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations

Paper • 2308.11466 • Published Aug 22, 2023 • 1

Papers - Audio - Training

A Multimodal Approach to Device-Directed Speech Detection with Large Language Models

Paper • 2403.14438 • Published Mar 21 • 2
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Paper • 2403.17694 • Published Mar 26 • 10
FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23 • 29
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations

Paper • 2308.11466 • Published Aug 22, 2023 • 1

Papers - Automatic Speech Recognition

Streaming Transformer ASR with Blockwise Synchronous Beam Search

Paper • 2006.14941 • Published Jun 25, 2020 • 2
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models

Paper • 2403.14438 • Published Mar 21 • 2
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations

Paper • 2308.11466 • Published Aug 22, 2023 • 1
