coml
/

Safetensors
English
File size: 597 Bytes
5d4286b
 
9251d0f
 
 
 
 
 
5d4286b
9251d0f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
---
license: mit
language:
- en
datasets:
- openslr/librispeech_asr
base_model:
- facebook/hubert-base-ls960
---

# Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach

**Paper:** https://arxiv.org/abs/2410.00025
Presented at EMNLP 2024.

This branch contains the HuBERT model fine-tuned with phoneme classification on train-clean-100.
See the companion repository: https://github.com/bootphon/spokenlm-phoneme.

Use it like this:
```python
from phonslm import HuBERTPhoneme

model = HuBERTPhoneme.from_pretrained("coml/hubert-phoneme-classification")
```