Traditional Chinese corpus collection for LLM training (pre-training, instruction-tuning, and RLHF/alignment).
Oscar, Li
liswei
AI & ML interests
Multimodal Deep Learning, Natural Language Processing, Efficient Fine-Tuning
Organizations
None yet
Collections: 2
models: 6
liswei/Taiwan-ELM
liswei/Taiwan-ELM-1_1B-Instruct • Text Generation
liswei/Taiwan-ELM-270M-Instruct • Text Generation • 51
liswei/Taiwan-ELM-1_1B • Text Generation • 43 • 1
liswei/Taiwan-ELM-270M • Text Generation • 20 • 1
liswei/EmojiLMSeq2SeqLoRA • Text2Text Generation • 1.28k
datasets: 10
liswei/Taiwan-Text-Excellence-2B • 1.78M • 1 • 3
liswei/PromptPair-TW • 119k • 2 • 1
liswei/news-collection-zhtw • 592k • 5
liswei/wikinews-zhtw-dedup • 8.37k • 4
liswei/wikipedia-zhtw-dedup • 1.18M • 15
liswei/common-crawl-zhtw • 2.71M • 3 • 1
liswei/coct-en-zhtw-dedup • 217k • 3
liswei/c4-zhtw • 4.86M • 4
liswei/rm-static-zhTW • 81.4k • 30
liswei/NTU-Tree • 478 • 5 • 2