---
language: it
license: mit
---
# ChefBERTo 👨‍🍳
**chefberto-italian-cased** is a BERT model obtained by adaptively fine-tuning the [**bert-base-italian-xxl-cased**](https://huggingface.co/dbmdz/bert-base-italian-xxl-cased) model with a masked language modeling (MLM) objective on Italian cooking recipes (approximately 50k sentences, 2.6M words).
The model was trained by Cristiano De Nobili ([@denocris](https://twitter.com/denocris)) for [VINHOOD](https://www.vinhood.com/en/).
# Perplexity
Both models were evaluated on a test set of 9k sentences about food (lower perplexity is better).
| Model | Perplexity |
| ------ | ------ |
| chefberto-italian-cased | **1.84** |
| bert-base-italian-xxl-cased | 2.85 |
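
The card does not state how perplexity was computed. A common convention for language models is to report the exponential of the mean per-token negative log-likelihood, and the sketch below illustrates only that conversion. The function name `perplexity_from_losses` and the loss values are hypothetical and are not taken from the ChefBERTo evaluation.

```python
import math


def perplexity_from_losses(losses):
    """Return exp(mean loss), where each loss is a per-token
    negative log-likelihood measured in nats."""
    if not losses:
        raise ValueError("need at least one loss value")
    return math.exp(sum(losses) / len(losses))


# Hypothetical per-token losses, for illustration only.
example_losses = [0.5, 0.7, 0.6]
print(perplexity_from_losses(example_losses))
```

Under this definition, a perplexity of 1.0 corresponds to a model that assigns probability 1 to every target token, so values closer to 1 indicate a better fit to the test set.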
# Usage
```python
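from transformers import AutoModel, AutoTokenizer

# Model identifier on the Hugging Face Hub
model_name = "vinhood/chefberto-italian-cased"

# Load the tokenizer that matches the model's vocabulary
tokenizer = AutoTokenizer.from_pretrained(model_name)

# AutoModel loads the bare encoder, without a task-specific head
model = AutoModel.from_pretrained(model_name)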
```