LumiOpen
/

Viking-13B

Text Generation

text-generation-inference

Model card Files Files and versions

jonabur commited on Jun 25, 2024

Commit

b9b3e0f

·

1 Parent(s): 8f2c8dd

update readme

Files changed (1) hide show

README.md +4 -3

README.md CHANGED Viewed

@@ -16,8 +16,6 @@ language:
 # Viking 13B
-_**NOTE:** This is a **research checkpoint** of a model for which **training has not been completed.** It is being provided in its current state for research and testing purposes. **Care should be taken when using the outputs of the model.** Once pretraining has completed we intend to release additional instruction-tuned and chat-tuned varieties._
 Viking 13B is a 13B parameter decoder-only transformer pretrained on Finnish,
 English, Swedish, Danish, Norwegian, Icelandic and code.  It is being trained
 on 2 trillion tokens (1.3 trillion as of this release). Viking 13B is a fully open source model and is made available under the Apache 2.0 License.
@@ -38,7 +36,7 @@ Viking is the second set of models released by LumiOpen and is available at
 [Viking 33B](https://huggingface.co/LumiOpen/Viking-33B)
 ## Model Overview
-_**NOTE:** In addition to being an early research release, Viking is a base model which needs further fine tuning for most use cases._
 Viking is a generative pretrained transformer using a LLaMA-like GPT architecture, and makes use of rotary positional embeddings and flash attention.
@@ -102,6 +100,9 @@ Training checkpoints are available as branches in the repository.  Checkpoints w
 * [1500B](https://huggingface.co/LumiOpen/Viking-13B/tree/1500B)
 * [1600B](https://huggingface.co/LumiOpen/Viking-13B/tree/1600B)
 * [1700B](https://huggingface.co/LumiOpen/Viking-13B/tree/1700B)
 The transformers library allows you to load a checkpoint from a branch as follows:

 # Viking 13B
 Viking 13B is a 13B parameter decoder-only transformer pretrained on Finnish,
 English, Swedish, Danish, Norwegian, Icelandic and code.  It is being trained
 on 2 trillion tokens (1.3 trillion as of this release). Viking 13B is a fully open source model and is made available under the Apache 2.0 License.
 [Viking 33B](https://huggingface.co/LumiOpen/Viking-33B)
 ## Model Overview
+_**NOTE:** This is a base model which needs further fine tuning for most use cases._
 Viking is a generative pretrained transformer using a LLaMA-like GPT architecture, and makes use of rotary positional embeddings and flash attention.
 * [1500B](https://huggingface.co/LumiOpen/Viking-13B/tree/1500B)
 * [1600B](https://huggingface.co/LumiOpen/Viking-13B/tree/1600B)
 * [1700B](https://huggingface.co/LumiOpen/Viking-13B/tree/1700B)
+* [1800B](https://huggingface.co/LumiOpen/Viking-13B/tree/1800B)
+* [1900B](https://huggingface.co/LumiOpen/Viking-13B/tree/1900B)
+* [2000B](https://huggingface.co/LumiOpen/Viking-13B/tree/2000B)
 The transformers library allows you to load a checkpoint from a branch as follows: