Stefan Schweter PRO
stefan-it
AI & ML interests
Flair Library, NER & PoS Tagging, LM Pretraining (mostly encoder-only), Historical Language Models
Recent Activity
updated
a model
about 10 hours ago
stefan-it/bert5urk
reacted
to
davanstrien's
post
with š„
2 days ago
š Big step for multilingual AI data!
The Hugging Face community has rated educational content in languages spoken by 1.6 billion people! New additions:
ā¢ Japanese
ā¢ Italian
ā¢ Old High German
Learn more and contribute: https://huggingface.co/blog/davanstrien/fineweb2-community
These ratings can help enhance training data for major world languages.
Articles
Organizations
Posts
1
Post
1401
My latest project is the outcome of the last 2+ years working with TPUs from the amazing TPU Research Cloud (TRC) program and training Encoder-only LMs with the TensorFlow Model Garden library.
š Link: https://github.com/stefan-it/model-garden-lms
An overview of some features:
- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS
I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!
š Model Hub Link: https://huggingface.co/model-garden-lms
If you find these resources useful, please give them a like!
Made from Bavarian Oberland with ā¤ļø and š„Ø.
š Link: https://github.com/stefan-it/model-garden-lms
An overview of some features:
- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS
I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!
š Model Hub Link: https://huggingface.co/model-garden-lms
If you find these resources useful, please give them a like!
Made from Bavarian Oberland with ā¤ļø and š„Ø.
Collections
14
My pretrained LMs on FineWeb datasets - part of my TensorFlow Model Garden LMs project
A Collection of Historical Multilingual Language Models
-
dbmdz/bert-base-historic-multilingual-cased
Fill-Mask ā¢ Updated ā¢ 72 ā¢ 7 -
dbmdz/bert-base-historic-multilingual-64k-td-cased
Fill-Mask ā¢ Updated ā¢ 109 ā¢ 1 -
hmbyt5-preliminary/byt5-small-historic-multilingual-span20-flax
Text2Text Generation ā¢ Updated ā¢ 70 -
hmteams/teams-base-historic-multilingual-discriminator
Updated ā¢ 7
models
1334
stefan-it/bert5urk
Updated
ā¢
76
ā¢
3
stefan-it/bort-full
Fill-Mask
ā¢
Updated
ā¢
44
stefan-it/span-marker-gelectra-large-germeval14
Token Classification
ā¢
Updated
ā¢
25
ā¢
2
stefan-it/zeitungs-lm-v1
Updated
ā¢
6
ā¢
4
stefan-it/wav2vec2-large-xlsr-53-basque
Automatic Speech Recognition
ā¢
Updated
ā¢
3.52k
stefan-it/german-gpt2-larger
Text Generation
ā¢
Updated
ā¢
564
ā¢
8
stefan-it/xlstm-german-wikipedia
Text Generation
ā¢
Updated
ā¢
23
ā¢
7
stefan-it/flair-barner-wiki-coarse-gbert-large
Token Classification
ā¢
Updated
ā¢
21
ā¢
1
stefan-it/flair-clean-conll-5
Token Classification
ā¢
Updated
ā¢
6
stefan-it/flair-clean-conll-4
Token Classification
ā¢
Updated
ā¢
10
datasets
12
stefan-it/senti-anno
Viewer
ā¢
Updated
ā¢
929
ā¢
117
stefan-it/offenseval2020_tr
Viewer
ā¢
Updated
ā¢
35.3k
ā¢
86
stefan-it/dewiki-20230701-nltk-corpus
Viewer
ā¢
Updated
ā¢
39.4M
ā¢
71
ā¢
2
stefan-it/germeval14_no_wikipedia
Preview
ā¢
Updated
ā¢
97
stefan-it/histnero
Viewer
ā¢
Updated
ā¢
217k
ā¢
73
stefan-it/HisGermaNER
Preview
ā¢
Updated
ā¢
348
ā¢
2
stefan-it/co-funer
Preview
ā¢
Updated
ā¢
58
stefan-it/german-dbmdz-bert-corpus
Viewer
ā¢
Updated
ā¢
52.8M
ā¢
182
ā¢
2
stefan-it/span-marker-base-model-detection
Viewer
ā¢
Updated
ā¢
28
ā¢
63
stefan-it/flair-base-model-detection
Viewer
ā¢
Updated
ā¢
52
ā¢
52
ā¢
1