kimhyeongjun commited on
Commit
f4fec9c
ยท
verified ยท
1 Parent(s): 834c95f

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: other
4
+ base_model: NousResearch/Hermes-3-Llama-3.1-8B
5
+ tags:
6
+ - llama-factory
7
+ - full
8
+ - unsloth
9
+ - generated_from_trainer
10
+ model-index:
11
+ - name: kimhyeongjun/Hermes-3-Llama-3.1-8B-Ko-Finance-Advisors
12
+ results: []
13
+ ---
14
+
15
+
16
+ # kimhyeongjun/Hermes-3-Llama-3.1-8B-Ko-Finance-Advisors
17
+
18
+ This model is a fine-tuned version of [NousResearch/Hermes-3-Llama-3.1-8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B) on the Korean_synthetic_financial_dataset_21K.
19
+
20
+ ์ด ๋ชจ๋ธ์€ ํ•œ๊ตญ_ํ•ฉ์„ฑ_๊ธˆ์œต_๋ฐ์ดํ„ฐ์…‹_21K์˜ [NousResearch/Hermes-3-Llama-3.1-8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B)๋ฅผ ๋ฏธ์„ธ ์กฐ์ •ํ•œ ๋ฒ„์ „์ž…๋‹ˆ๋‹ค.
21
+
22
+ ## Model description
23
+ Based on finance PDF data collected directly from the web, we refined the raw data using the 'meta-llama/Meta-Llama-3.1-70B-Instruct' model.
24
+ After generating synthetic data based on the cleaned data, we further evaluated the quality of the generated data using the 'meta-llama/Llama-Guard-3-8B' and 'RLHFlow/ArmoRM-Llama3-8B-v0.1' models.
25
+ We then used 'Alibaba-NLP/gte-large-en-v1.5' to extract embeddings and applied Faiss to perform Jaccard distance-based nearest neighbor analysis to construct the final dataset of 21k, which is multidimensional and sophisticated.
26
+
27
+ ์›น์—์„œ ์ง์ ‘ ์ˆ˜์ง‘ํ•œ ๊ธˆ์œต ๊ด€๋ จ PDF ๋ฐ์ดํ„ฐ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ, 'meta-llama/Meta-Llama-3.1-70B-Instruct' ๋ชจ๋ธ์„ ํ™œ์šฉํ•˜์—ฌ ์›์‹œ ๋ฐ์ดํ„ฐ๋ฅผ ์ •์ œํ•˜์˜€์Šต๋‹ˆ๋‹ค.
28
+ ์ •์ œ๋œ ๋ฐ์ดํ„ฐ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•œ ํ›„, 'meta-llama/Llama-Guard-3-8B' ๋ฐ 'RLHFlow/ArmoRM-Llama3-8B-v0.1' ๋ชจ๋ธ์„ ํ†ตํ•ด ์ƒ์„ฑ๋œ ๋ฐ์ดํ„ฐ์˜ ํ’ˆ์งˆ์„ ์‹ฌ์ธต์ ์œผ๋กœ ํ‰๊ฐ€ํ•˜์˜€์Šต๋‹ˆ๋‹ค.
29
+ ์ด์–ด์„œ 'Alibaba-NLP/gte-large-en-v1.5'๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ž„๋ฒ ๋”ฉ์„ ์ถ”์ถœํ•˜๊ณ , Faiss๋ฅผ ์ ์šฉํ•˜์—ฌ ์ž์นด๋“œ ๊ฑฐ๋ฆฌ ๊ธฐ๋ฐ˜์˜ ๊ทผ์ ‘ ์ด์›ƒ ๋ถ„์„์„ ์ˆ˜ํ–‰ํ•จ์œผ๋กœ์จ ๋‹ค์ฐจ์›์ ์ด๊ณ  ์ •๊ตํ•œ ์ตœ์ข… ๋ฐ์ดํ„ฐ์…‹ 21k์„ ๊ตฌ์„ฑํ•˜์˜€์Šต๋‹ˆ๋‹ค.
30
+
31
+
32
+ ## sample
33
+
34
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/619d8e31c21bf5feb310bd82/gJ6hnvAV2Qx9774AFFwQe.png)
35
+
36
+ ### Framework versions
37
+
38
+ - Transformers 4.44.2
39
+ - Pytorch 2.4.0+cu121
40
+ - Datasets 2.21.0
41
+ - Tokenizers 0.19.1
42
+
43
+