T-pro-it-1.0 is a model built upon the Qwen 2.5 model family and incorporates both continual pre-training and alignment techniques.
### 📚 Dataset

- **Pre-training Stage 1:** 100B tokens of diverse Russian data from Common Crawl, books, code, and proprietary datasets, mixed with replayed English data (English is included because it is the primary language of the base model).
- **Pre-training Stage 2:** 40B tokens, a mix of instruction and pre-training data.
- **Supervised Fine-Tuning (SFT):** 1B tokens, a mix of diverse instruction data.
- **Preference Tuning:** 1B tokens, training the model to be helpful.

## 📊 Benchmarks

In each row, the best score is shown in **bold** and the second-best is <u>underlined</u>.

Proprietary models:

| Benchmark                                      | T-pro-it-1.0          | GPT-4o                       | GPT-4o-mini               | GigaChat Max 1.0.26.20 |
|------------------------------------------------|-----------------------|------------------------------|---------------------------|------------------------|
| [MERA](https://mera.a-ai.ru)                   | <u>0.629</u>          | **0.642**                    | 0.57                      | 0.588                  |
| [MaMuRaMu](https://mera.a-ai.ru/ru/tasks/22)   | <u>0.841</u>          | **0.874**                    | 0.779                     | 0.824                  |
| ruMMLU-PRO                                     | <u>0.665</u>          | **0.713**                    | 0.573                     | 0.535                  |
| ruGSM8K                                        | **0.941**             | <u>0.931</u>                 | 0.888                     | 0.892                  |
| ruMATH                                         | **0.776**             | <u>0.771</u>                 | 0.724                     | 0.589                  |
| ruMBPP                                         | **0.805**             | <u>0.802</u>                 | 0.79                      | 0.626                  |
| [ruCodeEval](https://mera.a-ai.ru/ru/tasks/23) | 0.432 / 0.626 / 0.677 | <u>0.529 / 0.649 / 0.683</u> | **0.704 / 0.753 / 0.768** | 0.077 / 0.093 / 0.098  |
| Arena-Hard-Ru                                  | **90.17**             | <u>84.87</u>                 | 81                        | -                      |
| MT Bench Ru                                    | <u>8.7</u>            | **8.706**                    | 8.45                      | 8.53                   |
| Alpaca Eval Ru                                 | <u>47.61</u>          | **50**                       | 45.51                     | 38.13                  |

Open-source models:

| Benchmark                                      | T-pro-it-1.0              | Qwen-2.5-32B-Instruct | RuAdapt-Qwen-32B-Instruct-v1 | gemma-2-27b-it        | Llama-3.3-70B-Instruct |
|------------------------------------------------|---------------------------|-----------------------|------------------------------|-----------------------|------------------------|
| [MERA](https://mera.a-ai.ru)                   | **0.629**                 | 0.578                 | <u>0.615</u>                 | 0.574                 | 0.567                  |
| [MaMuRaMu](https://mera.a-ai.ru/ru/tasks/22)   | **0.841**                 | <u>0.824</u>          | 0.812                        | 0.768                 | 0.818                  |
| ruMMLU-PRO                                     | **0.665**                 | 0.637                 | 0.631                        | 0.470                 | <u>0.653</u>           |
| ruGSM8K                                        | **0.941**                 | 0.926                 | 0.923                        | 0.894                 | <u>0.934</u>           |
| ruMATH                                         | **0.776**                 | 0.727                 | <u>0.742</u>                 | 0.538                 | 0.636                  |
| ruMBPP                                         | 0.805                     | **0.825**             | <u>0.813</u>                 | 0.708                 | 0.77                   |
| [ruCodeEval](https://mera.a-ai.ru/ru/tasks/23) | **0.432 / 0.626 / 0.677** | 0.06 / 0.098 / 0.116  | <u>0.426 / 0.561 / 0.598</u> | 0.259 / 0.586 / 0.689 | 0.112 / 0.166 / 0.189  |
| Arena-Hard-Ru                                  | **90.17**                 | 74.54                 | <u>80.23</u>                 | 66.4                  | 76.51                  |
| MT Bench Ru                                    | **8.7**                   | 8.15                  | <u>8.39</u>                  | 7.96                  | 8.26                   |
| Alpaca Eval Ru                                 | **47.61**                 | 35.01                 | <u>43.15</u>                 | 38.82                 | -                      |

Detailed evaluation results can be found in our [Habr post](https://habr.com/ru/companies/tbank/articles/865582/).

## 👨‍💻 Examples of usage

### HF Usage
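
A minimal sketch of chat-style generation with `transformers`: the model name, system prompt, and chat template match the vLLM example below, while the seed and `max_new_tokens=256` are illustrative choices rather than required settings.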

```python
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

torch.manual_seed(42)  # illustrative: fix the seed for repeatable sampling

model_name = "t-tech/T-pro-it-1.0"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",  # load in the checkpoint's native precision
    device_map="auto",   # place weights on the available GPU(s)
)

prompt = "Напиши стих про машинное обучение"  # "Write a poem about machine learning"
messages = [
    # System prompt: "You are T-pro, a virtual assistant at T-Technologies.
    # Your task is to be a helpful dialogue assistant."
    {"role": "system", "content": "Ты T-pro, виртуальный ассистент в Т-Технологии. Твоя задача - быть полезным диалоговым ассистентом."},
    {"role": "user", "content": prompt}
]

# Render the conversation with the model's chat template, then tokenize it.
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(**model_inputs, max_new_tokens=256)
# Keep only the newly generated tokens, dropping the prompt.
generated_ids = [
    output_ids[len(input_ids):]
    for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)
```

Output:

```
...
Поиск закономерностей — его цель, открыть тайны бытия.
От распознавания лиц до понимания речи,
Машинное обучение — это ключ, что открывает двери.
```

### vLLM Usage

```python
from transformers import AutoTokenizer
from vllm import LLM, SamplingParams

model_name = "t-tech/T-pro-it-1.0"
tokenizer = AutoTokenizer.from_pretrained(model_name)
llm = LLM(model=model_name)
sampling_params = SamplingParams(temperature=0.3, max_tokens=8192)

prompt = "Напиши стих про машинное обучение"  # "Write a poem about machine learning"
messages = [
    # System prompt: "You are T-pro, a virtual assistant at T-Technologies.
    # Your task is to be a helpful dialogue assistant."
    {"role": "system", "content": "Ты T-pro, виртуальный ассистент в Т-Технологии. Твоя задача - быть полезным диалоговым ассистентом."},
    {"role": "user", "content": prompt}
]

# Tokenize the conversation with the model's chat template.
prompt_token_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True)

outputs = llm.generate(prompt_token_ids=prompt_token_ids, sampling_params=sampling_params)

generated_text = [output.outputs[0].text for output in outputs]
print(generated_text)
```
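
Because the conversation is tokenized with the Hugging Face chat template before being handed to vLLM, generation runs on the exact token ids of the templated prompt rather than on re-templated raw text. The `temperature=0.3` setting keeps sampling fairly conservative, and `max_tokens=8192` leaves headroom for long completions.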
|