jonabur committed
Commit a4690c9 • 1 Parent(s): a3641fb

update checkpoint list

README.md CHANGED

@@ -16,12 +16,11 @@ language:
 
 # Viking 13B
 
-**NOTE:** We are aware of an incompatibility with HF transformers that impacts finetuning and are working to correct it.
-
-
 _**NOTE:** This is a **research checkpoint** of a model for which **training has not been completed.** It is being provided in its current state for research and testing purposes. **Care should be taken when using the outputs of the model.** Once pretraining has completed we intend to release additional instruction-tuned and chat-tuned varieties._
 
-Viking 13B is a 13B parameter decoder-only transformer pretrained on Finnish,
+Viking 13B is a 13B parameter decoder-only transformer pretrained on Finnish,
+English, Swedish, Danish, Norwegian, Icelandic and code. It is being trained
+on 2 trillion tokens (1.3 trillion as of this release). Viking 13B is a fully open source model and is made available under the Apache 2.0 License.
 
 Viking was created in a collaboration between the [TurkuNLP group](https://turkunlp.org/) of the University of Turku, [SiloGen](https://www.silo.ai/silogen) from [Silo AI](https://www.silo.ai/), and [High Performance Language Technologies](https://hplt-project.org/) (HPLT). Training was conducted on the [LUMI supercomputer](https://www.lumi-supercomputer.eu/), using compute resources generously provided by [CSC](https://csc.fi/) - IT Center for Science, Finland.
 
@@ -96,6 +95,13 @@ Training checkpoints are available as branches in the repository. Checkpoints w
 * [800B](https://huggingface.co/LumiOpen/Viking-13B/tree/800B)
 * [900B](https://huggingface.co/LumiOpen/Viking-13B/tree/900B)
 * [1000B](https://huggingface.co/LumiOpen/Viking-13B/tree/1000B)
+* [1100B](https://huggingface.co/LumiOpen/Viking-13B/tree/1100B)
+* [1200B](https://huggingface.co/LumiOpen/Viking-13B/tree/1200B)
+* [1300B](https://huggingface.co/LumiOpen/Viking-13B/tree/1300B)
+* [1400B](https://huggingface.co/LumiOpen/Viking-13B/tree/1400B)
+* [1500B](https://huggingface.co/LumiOpen/Viking-13B/tree/1500B)
+* [1600B](https://huggingface.co/LumiOpen/Viking-13B/tree/1600B)
+* [1700B](https://huggingface.co/LumiOpen/Viking-13B/tree/1700B)
 
 The transformers library allows you to load a checkpoint from a branch as follows:
 
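The loading example that follows this line in the README is cut off by the diff view. A minimal sketch of the usual transformers pattern, assuming only that the checkpoint branch name is passed through `from_pretrained`'s `revision` argument:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# "revision" selects the git branch (or tag/commit) to load from the
# repository, so a training checkpoint is chosen by its branch name.
model = AutoModelForCausalLM.from_pretrained(
    "LumiOpen/Viking-13B",
    revision="1000B",  # any branch from the list above, e.g. "1700B"
    torch_dtype="auto",
)
tokenizer = AutoTokenizer.from_pretrained("LumiOpen/Viking-13B")
```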
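Because every checkpoint is a branch of the same repository, the available checkpoints can also be discovered programmatically instead of being read off the list above. A minimal sketch using `huggingface_hub`'s `list_repo_refs` (an illustration, not part of the README):

```python
from huggingface_hub import list_repo_refs

# Each training checkpoint (e.g. 800B ... 1700B) is stored as its own
# branch, so listing the repo's branches enumerates the checkpoints.
refs = list_repo_refs("LumiOpen/Viking-13B")
checkpoints = sorted(ref.name for ref in refs.branches if ref.name != "main")
print(checkpoints)
```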