Commit de7d2e6 by rmayormartins (verified) · 1 Parent(s): e455b80

Update README.md

Files changed (1): README.md (+101 -3)
@@ -1,3 +1,101 @@
- ---
- license: ecl-2.0
- ---
+ ---
+ license: ecl-2.0
+ datasets:
+ - mozilla-foundation/common_voice_11_0
+ language:
+ - en
+ - pt
+ metrics:
+ - accuracy
+ library_name: transformers
+ tags:
+ - code
+ ---
+
+ # Speech Portuguese (Brazilian) Accent Classifier
+
+ 🎙️🤖🇧🇷
+
+ This project is a speech accent classifier that distinguishes Brazilian Portuguese accents from other accents.
+
+ ## Project Overview
+
+ This application uses a trained model to classify speech accents into two categories:
+ 1. Portuguese (Brazilian)
+ 2. Other
+
+ The model is based on the author's work [results] and uses the Portuguese portion of the Common Voice dataset (version 11.0) from the Mozilla Foundation.
+
+ ## Dataset
+
+ The project uses the Portuguese subset of the Common Voice dataset:
+ - Dataset: "mozilla-foundation/common_voice_11_0", "pt"
+
+ Brazilian accents included in the dataset:
+ - Português do Brasil, Região Sul do Brasil
+ - Paulistano
+ - Paulista, Brasileiro
+ - Carioca
+ - Mato Grosso
+ - Mineiro
+ - Interior Paulista
+ - Gaúcho
+ - Nordestino
+ - And various regional mixes
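The accent labels listed above have to be collapsed into the two target classes before training. The following is a minimal sketch of one way to do that, not the author's preprocessing code. The names `BRAZILIAN_ACCENTS`, `LABELS`, `to_binary_label`, and `load_portuguese_split` are hypothetical, the exact-match rule and the label ids are assumptions, and the sketch assumes each Common Voice row carries an `accent` string. The dataset is gated on the Hugging Face Hub, so loading it may require accepting its terms and authenticating first.

```python
# Accent strings copied from the list above. The "various regional mixes"
# are not enumerated in the README, so they are left out of this sketch.
BRAZILIAN_ACCENTS = {
    "Português do Brasil, Região Sul do Brasil",
    "Paulistano",
    "Paulista, Brasileiro",
    "Carioca",
    "Mato Grosso",
    "Mineiro",
    "Interior Paulista",
    "Gaúcho",
    "Nordestino",
}

# Assumed label ids; the README does not state which id maps to which class.
LABELS = {0: "Other", 1: "Portuguese (Brazilian)"}


def to_binary_label(accent):
    """Return 1 for a listed Brazilian accent, 0 otherwise (assumed rule)."""
    if not accent:
        return 0
    return 1 if accent.strip() in BRAZILIAN_ACCENTS else 0


def load_portuguese_split(split="train"):
    """Load the Portuguese subset; needs the `datasets` package and Hub access."""
    from datasets import load_dataset  # imported lazily: optional dependency

    return load_dataset("mozilla-foundation/common_voice_11_0", "pt", split=split)
```

With a loaded split, something like `ds.map(lambda row: {"label": to_binary_label(row["accent"])})` would attach the binary label to every row.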
+
+ ## Model and Processor
+
+ The project uses the following base model and processor:
+ - Base Model: "facebook/wav2vec2-base-960h"
+ - Processor: loaded with `Wav2Vec2Processor.from_pretrained`
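The snippet below sketches how the base checkpoint and processor named above might be loaded with the `transformers` library. The README does not say which model class was used, so `Wav2Vec2ForSequenceClassification` with two labels is an assumption here, and `load_processor_and_model` is a hypothetical helper name. Loading this checkpoint into a classification class leaves the new classification head randomly initialized until it is fine-tuned.

```python
BASE_MODEL = "facebook/wav2vec2-base-960h"
NUM_LABELS = 2  # "Portuguese (Brazilian)" and "Other"


def load_processor_and_model(checkpoint=BASE_MODEL, num_labels=NUM_LABELS):
    """Load the processor and a wav2vec2 model with a classification head.

    Assumption: the README does not name the model class, so
    Wav2Vec2ForSequenceClassification is used as a plausible choice.
    """
    # Imported lazily so the constants above can be used without transformers.
    from transformers import Wav2Vec2ForSequenceClassification, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(checkpoint)
    model = Wav2Vec2ForSequenceClassification.from_pretrained(
        checkpoint, num_labels=num_labels
    )
    return processor, model
```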
+
+ ## Model Versions
+
+ Three versions of the model were trained with different configurations:
+
+ 1. **(OLD) v 1.1**:
+    - Epochs: 3
+    - Training samples: 1000
+    - Validation samples: 200
+
+ 2. **(OLD) v 1.2**:
+    - Epochs: 10
+    - Training samples: 1000
+    - Validation samples: 500
+
+ 3. **(NEW) v 1.3**:
+    - Epochs: 20
+    - Training samples: 5000
+    - Validation samples: 1000
+
+ All models were trained on Google Colab Pro using a high-RAM GPU runtime.
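To compare the three configurations at a glance, the sketch below records them in a small data structure and computes how many training examples each run processes in total (epochs times training samples). The class `VersionConfig` and its field names are hypothetical and only mirror the numbers listed above; they are not part of the author's training code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VersionConfig:
    name: str
    epochs: int
    train_samples: int
    val_samples: int

    @property
    def samples_seen(self):
        """Training examples processed over all epochs."""
        return self.epochs * self.train_samples


# Values copied from the version list above.
VERSIONS = [
    VersionConfig("v1.1", epochs=3, train_samples=1000, val_samples=200),
    VersionConfig("v1.2", epochs=10, train_samples=1000, val_samples=500),
    VersionConfig("v1.3", epochs=20, train_samples=5000, val_samples=1000),
]

for cfg in VERSIONS:
    print(f"{cfg.name}: {cfg.samples_seen} training examples processed")
```

By this measure v1.3 processes 100,000 training examples, ten times as many as v1.2 and more than thirty times as many as v1.1.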
+
+ ## Model Structure (files)
+
+ Each version of the model includes the following files:
+ results config.json | preprocessor_config.json | model.safetensors | special_tokens_map.json | tokenizer_config.json | vocab.json
+
+ ## How to Use
+
+ Test the model by recording or uploading an audio file. Short sentences are recommended for testing.
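For readers who want to run the classifier outside the demo, the sketch below shows one possible inference path: load a fine-tuned model directory, resample the audio to 16 kHz, and turn the two output scores into a label. It is a sketch under assumptions, not the application's actual code. The label order in `ID2LABEL`, the helper names, the `model_dir` argument, and the use of `librosa` and `Wav2Vec2ForSequenceClassification` are all hypothetical choices.

```python
import math

ID2LABEL = {0: "Other", 1: "Portuguese (Brazilian)"}  # assumed label order


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def pick_label(logits):
    """Return (label, probability) for the highest-scoring class."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return ID2LABEL[best], probs[best]


def classify_file(path, model_dir):
    """Classify one audio file; needs torch, librosa and transformers."""
    import librosa
    import torch
    from transformers import Wav2Vec2ForSequenceClassification, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_dir)
    model = Wav2Vec2ForSequenceClassification.from_pretrained(model_dir)
    model.eval()
    # wav2vec2-base-960h was trained on 16 kHz mono audio.
    waveform, _ = librosa.load(path, sr=16000, mono=True)
    inputs = processor(waveform, sampling_rate=16000, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()
    return pick_label(logits)
```

`pick_label` can be tried without any model: equal scores such as `[1.0, 1.0]` give each class a probability of 0.5.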
+
+ ## License
+
+ This project is licensed under the Educational Community License, Version 2.0 (ECL-2.0).
+
+ ## Developer Information
+
+ Developed by Ramon Mayor Martins (2024)
+ - Email: rmayormartins@gmail.com
+ - Homepage: https://rmayormartins.github.io/
+ - Twitter: @rmayormartins
+ - GitHub: https://github.com/rmayormartins
+
+ ## Acknowledgements
+
+ Special thanks to the Instituto Federal de Santa Catarina (Federal Institute of Santa Catarina, IFSC), São José campus, Brazil.
+
+ ## Contact
+
+ For questions or suggestions, please contact the developer using the information provided above.