metadata
language:
- lb
license: mit
tags:
- luxembourgish
- lëtzebuergesch
- text generation
- transfer learning
model-index:
- name: LuxGPT2-basedEN
results:
- task:
type: text-generation
name: Text Generation
dataset:
type: LuxembourgishTestDataset
name: Luxembourgish Test Dataset
metrics:
- type: accuracy
value: '0.35'
- name: LuxGPT2-basedEN
results:
- task:
type: text-generation
name: Text Generation
dataset:
type: LuxembourgishTestDataset
name: Luxembourgish Test Dataset
metrics:
- type: perplexity
value: '45.08'
LuxGPT-2 based GER
GPT-2 model for Text Generation in luxembourgish language, trained on 711 MB of text data, consisting of RTL.lu news articles, comments, parlament speeches, the luxembourgish Wikipedia, Newscrawl, Webcrawl and subtitles. Created via transfer learning with an English base model, feature space mapping from LB on Base feature space and gradual layer freezing. The training took place on a 32 GB Nvidia Tesla V100
- with One Cycle policy for the learning rate
- with the help of fastai's LR finder
- for 49.2 hours
- for 18 epochs and 8 cycles
- using the fastai library
Usage
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("laurabernardy/LuxGPT2-basedEN")
model = AutoModelForCausalLM.from_pretrained("laurabernardy/LuxGPT2-basedEN")
Limitations and Biases
See the GPT2 model card for considerations on limitations and bias. See the GPT2 documentation for details on GPT2.