dieineb committed on
Commit
f45d27d
1 Parent(s): 6fd3520

Update README.md

Files changed (1)
  1. README.md +50 -20
README.md CHANGED
@@ -4,27 +4,32 @@ library_name: transformers
4
  language:
5
  - en
6
  - pt
 
7
  ---
8
- # Model Description
9
 
10
- The transformer-eng-por model is used for text classification.
11
 
12
- The model was trained with a ...More information is needed.
13
 
14
- ## Details
15
- - Size: 23,805,216 parameters
16
- - Dataset:
17
- - Languages: English, Portuguese
18
- - Number of Training
19
- - Batch size: 32
20
- - Optimizer: rmsprop
21
- - Learning Rate: 0.001
22
- - Hardware: TeslaT4
23
- - This repository has the source [code used](https://github.com/Nkluge-correa/teeny-tiny_castle/blob/master/ML%20Intro%20Course/16_sequence_to_sequence.ipynb) to train this model.
24
 
25
- ## Usage
26
 
27
- ```
 
 
28
  import tensorflow as tf
29
  import numpy as np
30
  import string
@@ -138,16 +143,41 @@ Portuguese translation:
138
  [start] não faça isso [end]
139
  --------------------------------------------------
140
  ```
141
  # Cite as 🤗
142
- ```
143
  @misc{teenytinycastle,
144
  doi = {10.5281/zenodo.7112065},
145
- url = {https://huggingface.co/AiresPucrs/transformer-eng-por},
146
  author = {Nicholas Kluge Corr{\^e}a},
147
  title = {Teeny-Tiny Castle},
148
- year = {2023},
149
- publisher = {HuggingFace},
150
- journal = {HuggingFace repository},
 
151
  }
152
  ```
153
  ## License
 
4
  language:
5
  - en
6
  - pt
7
+ pipeline_tag: translation
8
  ---
9
+ # Transformer-eng-por
10
 
11
+ ## Model Overview
12
 
13
+ The transformer-eng-por model is a transformer trained for English-to-Portuguese translation.
14
 
15
+ ### Details
16
 
17
+ - **Size:** 23,805,216 parameters
18
+ - **Model type:** Transformer
19
+ - **Optimizer:** `rmsprop`
20
+ - **Number of Epochs:** 30
21
+ - **Embedding dimensionality:** 256
22
+ - **Dense dimensionality:** 2048
23
+ - **Attention heads:** 8
24
+ - **Vocabulary size:** 20000
25
+ - **Sequence length:** 20
26
+ - **Hardware:** Tesla T4
27
+ - **Emissions:** Not measured
28
+ - **Total Energy Consumption:** Not measured
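
As a rough sanity check on the figures above, the sketch below recomputes the parameter count from the listed hyperparameters. It rests on assumptions the README does not state: one encoder block and one decoder block in the common Keras encoder-decoder layout, learned token and position embeddings on each side, multi-head attention whose per-head key size equals the embedding size, a two-layer feed-forward block, layer normalization after each sub-block, and a final dense projection onto the vocabulary. All function names are illustrative.

```python
# Hyperparameters as listed in the model card.
EMBED_DIM = 256
DENSE_DIM = 2048
NUM_HEADS = 8
VOCAB_SIZE = 20000
SEQ_LEN = 20


def dense_params(n_in, n_out):
    """Weights plus biases of a fully connected layer."""
    return n_in * n_out + n_out


def embedding_params():
    """Learned token embeddings plus learned position embeddings (assumed)."""
    return VOCAB_SIZE * EMBED_DIM + SEQ_LEN * EMBED_DIM


def attention_params(key_dim=EMBED_DIM):
    """Multi-head attention; the per-head key_dim is an assumption."""
    inner = NUM_HEADS * key_dim
    qkv = 3 * dense_params(EMBED_DIM, inner)   # query, key, value projections
    out = dense_params(inner, EMBED_DIM)       # output projection
    return qkv + out


def feed_forward_params():
    """Two dense layers: expand to DENSE_DIM, then project back."""
    return dense_params(EMBED_DIM, DENSE_DIM) + dense_params(DENSE_DIM, EMBED_DIM)


def layer_norm_params():
    """Scale and offset vectors of one layer normalization."""
    return 2 * EMBED_DIM


def total_params():
    # Assumed: encoder has one attention block, decoder has two.
    encoder = attention_params() + feed_forward_params() + 2 * layer_norm_params()
    decoder = 2 * attention_params() + feed_forward_params() + 3 * layer_norm_params()
    output_head = dense_params(EMBED_DIM, VOCAB_SIZE)
    return 2 * embedding_params() + encoder + decoder + output_head


print(f"{total_params():,}")
```

Under these assumptions the total comes to 23,805,216, which matches the size listed above. A different layout could in principle yield the same number, so this is a consistency check rather than a description of the actual architecture.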
29
 
30
+ ### How to Use
31
+
32
+ ```python
33
  import tensorflow as tf
34
  import numpy as np
35
  import string
 
143
  [start] não faça isso [end]
144
  --------------------------------------------------
145
  ```
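
The sample outputs above are wrapped in `[start]` and `[end]` markers. If only the translated text is needed, the markers can be removed in a post-processing step. A minimal sketch, where `strip_markers` is a hypothetical helper that is not part of the repository:

```python
def strip_markers(decoded, start="[start]", end="[end]"):
    """Return the text between the start and end markers, if present."""
    text = decoded.strip()
    if text.startswith(start):
        text = text[len(start):]
    # Cut at the first end marker; anything after it is discarded.
    end_pos = text.find(end)
    if end_pos != -1:
        text = text[:end_pos]
    return text.strip()


print(strip_markers("[start] não faça isso [end]"))
```

Strings without markers pass through unchanged apart from whitespace trimming.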
146
+ ## Intended Use
147
+
148
+ This model was created for research purposes only. Specifically, it was designed to translate sentences from English to Portuguese. We do not recommend any application of this model outside this scope.
149
+
150
+
151
+ ## Performance Metrics
152
+
153
+ Accuracy: 76.46%
154
+
155
+
156
+ ## Training Data
157
+
158
+ [English-Portuguese translation](https://www.kaggle.com/datasets/nageshsingh/englishportuguese-translation).
159
+
160
+ The dataset consists of a set of English and Portuguese sentences.
161
+
162
+ ## Limitations
163
+
164
+ When translating `What is its name?` to `[start] o que é o seu nome [end]`, the `transformer` model makes a gender assumption,
165
+ even though the source sentence is not gendered (`[start] qual é o nome dele [end]`).
166
+ Errors like these are not uncommon in NLP; algorithmic bias is one of the major problems associated with using language models
167
+ in real applications. For this reason, we do not recommend using this model in real-world applications.
168
+ It was developed solely for academic and educational purposes.
169
+
170
  # Cite as 🤗
171
+ ```bibtex
172
  @misc{teenytinycastle,
173
  doi = {10.5281/zenodo.7112065},
174
+ url = {https://github.com/Nkluge-correa/teeny-tiny_castle},
175
  author = {Nicholas Kluge Corr{\^e}a},
176
  title = {Teeny-Tiny Castle},
177
+ year = {2024},
178
+ publisher = {GitHub},
179
+ journal = {GitHub repository},
180
+ note = {Last updated 12 January 2024},
181
  }
182
  ```
183
  ## License