--- datasets: - togethercomputer/RedPajama-Data-V2 language: - de pipeline_tag: text-generation library_name: transformers license: other --- # LLäMmlein 1B This is a German Tinyllama 1B language model trained from scratch using the [Tinyllama](https://github.com/jzhang38/TinyLlama) codebase on the German portion of [RedPajama V2](https://huggingface.co/datasets/togethercomputer/RedPajama-Data-V2). Find more details on our [page](https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/)! ### Usage ```python from transformers import AutoModelForCausalLM, AutoTokenizer model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/LLaMmlein_1B") tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/LLaMmlein_1B") ``` ### Evaluation We evaluated our results on the [Superkleber](https://lsx-uniwue.github.io/SuperGLEBer-site/) benchmark.