MrBananaHuman committed on
Commit 97311a6 · verified · 1 Parent(s): bc65833

Update README.md

  1. README.md +24 -1
README.md CHANGED
@@ -2,4 +2,27 @@
  license: apache-2.0
  ---
 
- training named entity recognizer example: https://colab.research.google.com/drive/1WirfVhJIbKH70stuLRPhiPr2CexZiGuP?usp=sharing
+ # Korean Character BERT Model
+
+ This repository hosts a Korean character-level (syllable-level) BERT model: a compact transformer encoder for Korean language processing tasks. Text is tokenized one syllable at a time, which fits the way written Korean composes words from syllable blocks.
+
+ ## Features
+
+ - Vocabulary Size: The vocabulary has 7,477 tokens, focused on Korean syllables. Keeping it this small keeps the embedding table compact and processing efficient.
+ - Transformer Encoder Layers: The model uses only 3 transformer encoder layers, trading some model capacity for lower compute cost so that it can run in settings ranging from mobile devices to servers.
+ - License: The model is released under the Apache License 2.0, which permits both academic and commercial use provided the license and copyright notices are retained.
+
+ ## Getting Started
+
+ ```python
+ # Load the tokenizer and the masked-language-model weights from the Hugging Face Hub
+ from transformers import AutoTokenizer, AutoModelForMaskedLM
+
+ tokenizer = AutoTokenizer.from_pretrained("MrBananaHuman/char_ko_bert_small")
+ model = AutoModelForMaskedLM.from_pretrained("MrBananaHuman/char_ko_bert_small")
+ ```
+ Example of training a named entity recognizer: https://colab.research.google.com/drive/1WirfVhJIbKH70stuLRPhiPr2CexZiGuP?usp=sharing
+
+ ## Contact
+
+ For questions about the model or potential collaborations, contact me at mrbananahuman.kim@gmail.com.
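
The README added in this commit describes tokenizing Korean text at the syllable level with a 7,477-token vocabulary. As a rough illustration of that idea, the sketch below splits a string into one token per character and maps each token to an id. It is not the tokenizer shipped with the model: `syllable_tokenize`, `encode`, and `toy_vocab` are hypothetical names, the `[UNK]` fallback and the dropping of whitespace are assumptions, and the real vocabulary should be loaded with `AutoTokenizer` as shown in the diff.

```python
# Illustrative sketch only: NOT the tokenizer shipped with the model.

# Unicode range of precomposed Hangul syllable blocks.
HANGUL_SYLLABLE_START = 0xAC00  # '가'
HANGUL_SYLLABLE_END = 0xD7A3    # '힣'


def is_hangul_syllable(ch):
    """Return True if ch is a precomposed Hangul syllable block."""
    return HANGUL_SYLLABLE_START <= ord(ch) <= HANGUL_SYLLABLE_END


def syllable_tokenize(text):
    """Split text into one token per non-whitespace character.

    Dropping whitespace is an assumption made for this sketch.
    """
    return [ch for ch in text if not ch.isspace()]


def encode(tokens, vocab, unk_token="[UNK]"):
    """Map tokens to ids, falling back to the unknown-token id."""
    unk_id = vocab[unk_token]
    return [vocab.get(tok, unk_id) for tok in tokens]


# Hypothetical toy vocabulary; the real model's vocabulary has 7,477 entries.
toy_vocab = {"[UNK]": 0, "한": 1, "국": 2, "어": 3}

tokens = syllable_tokenize("한국어 모델")
ids = encode(tokens, toy_vocab)

# Number of code points in the Hangul Syllables block.
total_syllables = HANGUL_SYLLABLE_END - HANGUL_SYLLABLE_START + 1

print(tokens)
print(ids)
print(total_syllables)
```

Because every token is a single character, the vocabulary is bounded by the number of distinct characters seen in training rather than by the number of distinct words or subwords, which is what makes a vocabulary in the low thousands workable.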