Update README.md
README.md
CHANGED
@@ -24,21 +24,6 @@ We adopted exactly the same architecture and tokenizer as Llama 2. This means Ti
#### This Model
This is an intermediate checkpoint with 240K steps and 503B tokens.

-#### Releases Schedule
-We will be rolling out intermediate checkpoints following the below schedule. We also include some baseline models for comparison.
-
-| Date       | HF Checkpoint | Tokens | Step | HellaSwag Acc_norm |
-|------------|-------------------------------------------------|--------|------|---------------------|
-| Baseline   | [StableLM-Alpha-3B](https://huggingface.co/stabilityai/stablelm-base-alpha-3b) | 800B | -- | 38.31 |
-| Baseline   | [Pythia-1B-intermediate-step-50k-105b](https://huggingface.co/EleutherAI/pythia-1b/tree/step50000) | 105B | 50k | 42.04 |
-| Baseline   | [Pythia-1B](https://huggingface.co/EleutherAI/pythia-1b) | 300B | 143k | 47.16 |
-| 2023-09-04 | [TinyLlama-1.1B-intermediate-step-50k-105b](https://huggingface.co/PY007/TinyLlama-1.1B-step-50K-105b) | 105B | 50k | 43.50 |
-| 2023-09-16 | -- | 500B | -- | -- |
-| 2023-10-01 | -- | 1T | -- | -- |
-| 2023-10-16 | -- | 1.5T | -- | -- |
-| 2023-10-31 | -- | 2T | -- | -- |
-| 2023-11-15 | -- | 2.5T | -- | -- |
-| 2023-12-01 | -- | 3T | -- | -- |

#### How to use
You will need the transformers>=4.31
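Since the retained "How to use" section only pins the dependency (transformers>=4.31), here is a minimal, hedged sketch of loading such an intermediate checkpoint with the Hugging Face `transformers` library. The repo id below is the 50K-step checkpoint named in the removed schedule table; the exact repo id of the 240K-step / 503B-token checkpoint this README describes is not stated in the hunk, so treat it as a placeholder.

```python
# Minimal sketch (assumption: standard Hugging Face loading flow; the diff only
# states the transformers>=4.31 requirement, not the exact snippet the README uses).
from transformers import AutoModelForCausalLM, AutoTokenizer

# Repo id taken from the removed schedule table; swap in the 240K-step / 503B-token
# checkpoint's repo id, which this diff does not name.
repo_id = "PY007/TinyLlama-1.1B-step-50K-105b"

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

prompt = "The TinyLlama project aims to"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```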