ai-forever commited on
Commit
b53a90f
1 Parent(s): e59c5eb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -94,14 +94,14 @@ model-index:
94
 
95
  ### Summary
96
 
97
- The model corrects spelling errors and typos by bringing all the words in the text to the norm of the Russian language.
98
- Corrector was trained based on the model [M2M100-1.2B](https://huggingface.co/facebook/m2m100_1.2B).
99
  An extensive dataset with “artificial” errors was taken as a training corpus: the corpus was assembled on the basis of the Russian-language Wikipedia and transcripts of Russian-language videos, then typos and spelling errors were automatically introduced into it using the library [SAGE](https://github.com/ai-forever/sage).
100
 
101
  ### Public references
102
  - [SAGE library announcement](https://youtu.be/yFfkV0Qjuu0), DataFest 2023
103
  - [Paper about synthetic error generation methods](https://www.dialog-21.ru/media/5914/martynovnplusetal056.pdf), Dialogue 2023
104
- - [Paper about SAGE and our best solution](https://arxiv.org/abs/2308.09435), Review EACL 2024
105
 
106
 
107
  ### Examples
 
94
 
95
  ### Summary
96
 
97
+ The model corrects spelling and punctuation errors and typos by bringing all the words in the text to the norm of the Russian language.
98
+ Corrector had been trained based on the model [FRED-T5-large](https://huggingface.co/ai-forever/FRED-T5-large).
99
  An extensive dataset with “artificial” errors was taken as a training corpus: the corpus was assembled on the basis of the Russian-language Wikipedia and transcripts of Russian-language videos, then typos and spelling errors were automatically introduced into it using the library [SAGE](https://github.com/ai-forever/sage).
100
 
101
  ### Public references
102
  - [SAGE library announcement](https://youtu.be/yFfkV0Qjuu0), DataFest 2023
103
  - [Paper about synthetic error generation methods](https://www.dialog-21.ru/media/5914/martynovnplusetal056.pdf), Dialogue 2023
104
+ - [SAGE EACL 2024 paper](https://aclanthology.org/2024.findings-eacl.10/)
105
 
106
 
107
  ### Examples