README.md · laurabernardy/LuxGPT-basedEN at main

metadata

language:
  - lb
license: mit
tags:
  - luxembourgish
  - lëtzebuergesch
  - text generation
  - transfer learning
model-index:
  - name: LuxGPT2-basedEN
    results:
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          type: LuxembourgishTestDataset
          name: Luxembourgish Test Dataset
        metrics:
          - type: accuracy
            value: '0.35'
  - name: LuxGPT2-basedEN
    results:
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          type: LuxembourgishTestDataset
          name: Luxembourgish Test Dataset
        metrics:
          - type: perplexity
            value: '45.08'

LuxGPT-2 based GER

GPT-2 model for Text Generation in luxembourgish language, trained on 711 MB of text data, consisting of RTL.lu news articles, comments, parlament speeches, the luxembourgish Wikipedia, Newscrawl, Webcrawl and subtitles. Created via transfer learning with an English base model, feature space mapping from LB on Base feature space and gradual layer freezing. The training took place on a 32 GB Nvidia Tesla V100

with One Cycle policy for the learning rate
with the help of fastai's LR finder
for 49.2 hours
for 18 epochs and 8 cycles
using the fastai library

Usage

from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("laurabernardy/LuxGPT2-basedEN")
model = AutoModelForCausalLM.from_pretrained("laurabernardy/LuxGPT2-basedEN")

Limitations and Biases

See the GPT2 model card for considerations on limitations and bias. See the GPT2 documentation for details on GPT2.