ai-forever/T5-large-spell

@@ -15,12 +15,13 @@ tags:
 ### Summary
 The model corrects spelling errors and typos by bringing all words in the text to the standard English language.
 The proofreader was trained based on the [T5-large](https://huggingface.co/t5-large) model.
-An extensive dataset with “artificial” errors was taken as a training corpus: the corpus was assembled on the basis of the English-language Wikipedia and News blogs, then typos and spelling errors were automatically introduced into it using the functionality of the [SAGE] library (https://github.com /orgs/ai-forever/sage).
-### Articles and speeches
-- [Speech about the SAGE library](https://youtu.be/yFfkV0Qjuu0), DataFest 2023
-- [Article about synthetic error generation methods](https://www.dialog-21.ru/media/5914/martynovnplusetal056.pdf), Dialogue 2023
-- [Article about SAGE and our best solution](https://arxiv.org/abs/2308.09435), Review EACL 2024
 ### Examples
 | Input | Output |
@@ -78,7 +79,7 @@ print(answer)
 ```
 ## Resources
-- [SAGE library code with augmentation methods, access to datasets and open models](https://github.com/orgs/ai-forever/sage), GitHub
 - [ruM2M100-1.2B](https://huggingface.co/ai-forever/RuM2M100-1.2B), HuggingFace
 - [ruM2M100-418M](https://huggingface.co/ai-forever/RuM2M100-420M), HuggingFace
 - [FredT5-large-spell](https://huggingface.co/ai-forever/FRED-T5-large-spell), HuggingFace
@@ -86,7 +87,7 @@ print(answer)
 ## License
 The [T5-large](https://huggingface.co/t5-large) model, on which our solution is based, and its source code are supplied under the APACHE-2.0 license.
-Our solution is supplied under the MIT license.
 ## Specifications
 - File size: 3 Gb;
@@ -96,4 +97,4 @@ Our solution is supplied under the MIT license.
 - Developer: SberDevices, AGI NLP
 ## Contacts
-For questions related to the operation and application of the model, please contact the product manager: Pavel Lebedev PIgLebedev@sberbank.ru.

 ### Summary
 The model corrects spelling errors and typos by bringing all words in the text to the standard English language.
 The proofreader was trained based on the [T5-large](https://huggingface.co/t5-large) model.
+An extensive dataset with “artificial” errors was taken as a training corpus: the corpus was assembled on the basis of the English-language Wikipedia and News blogs, then typos and spelling errors were automatically introduced into it using the functionality of the [SAGE library](https://github.com/orgs/ai-forever/sage).
+### Public references
+- [SAGE library announcement](https://youtu.be/yFfkV0Qjuu0), DataFest 2023
+- [Paper about synthetic error generation methods](https://www.dialog-21.ru/media/5914/martynovnplusetal056.pdf), Dialogue 2023
+- [Paper about SAGE and our best solution](https://arxiv.org/abs/2308.09435), Review EACL 2024
+- [Path_to_model](https://huggingface.co/ai-forever/T5-large-spell)
 ### Examples
 | Input | Output |
 ```
 ## Resources
+- [SAGE library](https://github.com/orgs/ai-forever/sage), GitHub
 - [ruM2M100-1.2B](https://huggingface.co/ai-forever/RuM2M100-1.2B), HuggingFace
 - [ruM2M100-418M](https://huggingface.co/ai-forever/RuM2M100-420M), HuggingFace
 - [FredT5-large-spell](https://huggingface.co/ai-forever/FRED-T5-large-spell), HuggingFace
 ## License
 The [T5-large](https://huggingface.co/t5-large) model, on which our solution is based, and its source code are supplied under the APACHE-2.0 license.
+Our solution is supplied under MIT license.
 ## Specifications
 - File size: 3 Gb;
 - Developer: SberDevices, AGI NLP
 ## Contacts
+nikita.martynov.98@list.ru
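
The Summary in this model card says the training corpus was built by taking clean English text and automatically introducing typos and spelling errors with the SAGE library. The sketch below illustrates that general idea only. It is not the SAGE API and does not reproduce SAGE's error-generation methods: `inject_typos` and `make_pairs` are hypothetical helpers, and the four edit operations and the `rate` parameter are assumptions chosen for illustration.

```python
import random


# Hypothetical helper for illustration only; this is NOT part of the SAGE library.
def inject_typos(text: str, rate: float = 0.1, seed: int = 0) -> str:
    """Corrupt letters in `text` with random character-level edits.

    Each letter is corrupted with probability `rate`. Non-letter characters
    (spaces, digits, punctuation) are always copied unchanged.
    """
    rng = random.Random(seed)  # local RNG so results are reproducible per seed
    alphabet = "abcdefghijklmnopqrstuvwxyz"
    chars = list(text)
    out = []
    i = 0
    while i < len(chars):
        c = chars[i]
        if c.isalpha() and rng.random() < rate:
            op = rng.choice(["delete", "substitute", "insert", "swap"])
            if op == "delete":
                pass  # drop the character entirely
            elif op == "substitute":
                out.append(rng.choice(alphabet))  # replace with a random letter
            elif op == "insert":
                out.append(c)
                out.append(rng.choice(alphabet))  # add a stray letter after it
            else:
                # swap with the next character, but only if it is also a letter
                if i + 1 < len(chars) and chars[i + 1].isalpha():
                    out.append(chars[i + 1])
                    out.append(c)
                    i += 1  # the next character has already been emitted
                else:
                    out.append(c)
        else:
            out.append(c)
        i += 1
    return "".join(out)


# Hypothetical helper: build (noisy source, clean target) pairs for a
# sequence-to-sequence spelling corrector.
def make_pairs(sentences, rate: float = 0.1, seed: int = 0):
    return [(inject_typos(s, rate, seed + k), s) for k, s in enumerate(sentences)]


if __name__ == "__main__":
    clean = ["The model corrects spelling errors and typos."]
    for noisy, target in make_pairs(clean, rate=0.15, seed=42):
        print(noisy, "->", target)
```

Each pair keeps the clean sentence as the target and the corrupted one as the input, which is the shape of data a text-to-text model such as T5 is fine-tuned on. Uniform random edits are a simplification made for this sketch.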