LSX-UniWue
/

LLaMmlein_1B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

LLaMmlein_1B / README.md

JanPf's picture

Update README.md

72c4a8c verified 3 days ago

|

history blame contribute delete

913 Bytes

	---
	datasets:
	- togethercomputer/RedPajama-Data-V2
	language:
	- de
	pipeline_tag: text-generation
	library_name: transformers
	license: other
	---

	# LLäMmlein 1B

	This is a German Tinyllama 1B language model trained from scratch using the [Tinyllama](https://github.com/jzhang38/TinyLlama) codebase on the German portion of [RedPajama V2](https://huggingface.co/datasets/togethercomputer/RedPajama-Data-V2).
	Find more details on our [page](https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/) and our [preprint](arxiv.org/abs/2411.11171)!


	### Usage

	```python
	from transformers import AutoModelForCausalLM, AutoTokenizer

	model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/LLaMmlein_1B")

	tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/LLaMmlein_1B")
	```


	### Evaluation
	We evaluated our results on the [Superkleber](https://lsx-uniwue.github.io/SuperGLEBer-site/) benchmark.