fine-tuned gpt-4o
logs/openai-gpt-4o-fine-tuned.txt
ADDED
@@ -0,0 +1,95 @@
+Evaluating model: ft:gpt-4o-2024-08-06:mastercard::A0krlVBL
+loading train/test data files
+DatasetDict({
+    train: Dataset({
+        features: ['chinese', 'english'],
+        num_rows: 4528
+    })
+    test: Dataset({
+        features: ['chinese', 'english'],
+        num_rows: 1133
+    })
+})
+--------------------------------------------------
+chinese: 老耿端起枪,眯缝起一只三角眼,一搂扳机响了枪,冰雹般的金麻雀劈哩啪啦往下落,铁砂子在柳枝间飞迸着,嚓嚓有声。
+--------------------------------------------------
+english: Old Geng picked up his shotgun, squinted, and pulled the trigger. Two sparrows crashed to the ground like hailstones as shotgun pellets tore noisily through the branches.
+*** Evaluating with num_shots: 0
+100%|██████████| 1133/1133 [19:12<00:00, 1.02s/it]
+gpt-4o/epochs-01 metrics: {'meteor': 0.37829977579665147, 'sacrebleu': {'score': 11.961183004083166, 'counts': [12783, 4604, 2130, 1067], 'totals': [29124, 27991, 26875, 25764], 'precisions': [43.89163576431809, 16.448144046300598, 7.925581395348837, 4.141437664958858], 'bp': 0.9640596438285618, 'sys_len': 29124, 'ref_len': 30190}, 'bleu_scores': {'bleu': 0.11961183004083172, 'precisions': [0.4389163576431809, 0.16448144046300597, 0.07925581395348837, 0.04141437664958857], 'brevity_penalty': 0.9640596438285618, 'length_ratio': 0.9646902947996026, 'translation_length': 29124, 'reference_length': 30190}, 'rouge_scores': {'rouge1': 0.42429274225313846, 'rouge2': 0.17523641258245923, 'rougeL': 0.369425658597467, 'rougeLsum': 0.369423462411946}, 'accuracy': 0.00088261253309797, 'correct_ids': [77]}
+Evaluating model: ft:gpt-4o-2024-08-06:mastercard::A1E8HfT2
+loading train/test data files
+DatasetDict({
+    train: Dataset({
+        features: ['chinese', 'english'],
+        num_rows: 4528
+    })
+    test: Dataset({
+        features: ['chinese', 'english'],
+        num_rows: 1133
+    })
+})
+--------------------------------------------------
+chinese: 老耿端起枪,眯缝起一只三角眼,一搂扳机响了枪,冰雹般的金麻雀劈哩啪啦往下落,铁砂子在柳枝间飞迸着,嚓嚓有声。
+--------------------------------------------------
+english: Old Geng picked up his shotgun, squinted, and pulled the trigger. Two sparrows crashed to the ground like hailstones as shotgun pellets tore noisily through the branches.
+*** Evaluating with num_shots: 0
+100%|██████████| 1133/1133 [17:26<00:00, 1.08it/s]
+gpt-4o/epochs-02 metrics: {'meteor': 0.378362442260616, 'sacrebleu': {'score': 11.94987840396735, 'counts': [12801, 4612, 2127, 1061], 'totals': [29101, 27968, 26852, 25740], 'precisions': [43.98817910037456, 16.490274599542335, 7.9211976761507525, 4.121989121989122], 'bp': 0.9632701331199107, 'sys_len': 29101, 'ref_len': 30190}, 'bleu_scores': {'bleu': 0.11949878403967351, 'precisions': [0.4398817910037456, 0.16490274599542334, 0.07921197676150753, 0.04121989121989122], 'brevity_penalty': 0.9632701331199107, 'length_ratio': 0.9639284531301755, 'translation_length': 29101, 'reference_length': 30190}, 'rouge_scores': {'rouge1': 0.4244959919113983, 'rouge2': 0.17487441168118406, 'rougeL': 0.36961992190392845, 'rougeLsum': 0.3696134312327403}, 'accuracy': 0.00088261253309797, 'correct_ids': [77]}
+Evaluating model: ft:gpt-4o-2024-08-06:mastercard::A1EEMdkp
+loading train/test data files
+DatasetDict({
+    train: Dataset({
+        features: ['chinese', 'english'],
+        num_rows: 4528
+    })
+    test: Dataset({
+        features: ['chinese', 'english'],
+        num_rows: 1133
+    })
+})
+--------------------------------------------------
+chinese: 老耿端起枪,眯缝起一只三角眼,一搂扳机响了枪,冰雹般的金麻雀劈哩啪啦往下落,铁砂子在柳枝间飞迸着,嚓嚓有声。
+--------------------------------------------------
+english: Old Geng picked up his shotgun, squinted, and pulled the trigger. Two sparrows crashed to the ground like hailstones as shotgun pellets tore noisily through the branches.
+*** Evaluating with num_shots: 0
+100%|██████████| 1133/1133 [18:51<00:00, 1.00it/s]
+gpt-4o/epochs-03 metrics: {'meteor': 0.37969689267707335, 'sacrebleu': {'score': 12.039126826084276, 'counts': [12800, 4627, 2152, 1077], 'totals': [29117, 27984, 26868, 25756], 'precisions': [43.9605728612151, 16.53444825614637, 8.009528063123419, 4.181549930113372], 'bp': 0.9638194215435517, 'sys_len': 29117, 'ref_len': 30190}, 'bleu_scores': {'bleu': 0.1203912682608427, 'precisions': [0.43960572861215097, 0.1653444825614637, 0.08009528063123418, 0.041815499301133714], 'brevity_penalty': 0.9638194215435517, 'length_ratio': 0.9644584299436899, 'translation_length': 29117, 'reference_length': 30190}, 'rouge_scores': {'rouge1': 0.4253565511807691, 'rouge2': 0.1762509031111424, 'rougeL': 0.3709362250350436, 'rougeLsum': 0.37090080700005995}, 'accuracy': 0.00088261253309797, 'correct_ids': [77]}
+Evaluating model: ft:gpt-4o-2024-08-06:mastercard::A1LIBJeg
+loading train/test data files
+DatasetDict({
+    train: Dataset({
+        features: ['chinese', 'english'],
+        num_rows: 4528
+    })
+    test: Dataset({
+        features: ['chinese', 'english'],
+        num_rows: 1133
+    })
+})
+--------------------------------------------------
+chinese: 老耿端起枪,眯缝起一只三角眼,一搂扳机响了枪,冰雹般的金麻雀劈哩啪啦往下落,铁砂子在柳枝间飞迸着,嚓嚓有声。
+--------------------------------------------------
+english: Old Geng picked up his shotgun, squinted, and pulled the trigger. Two sparrows crashed to the ground like hailstones as shotgun pellets tore noisily through the branches.
+*** Evaluating with num_shots: 0
+100%|██████████| 1133/1133 [18:37<00:00, 1.01it/s]
+gpt-4o/epochs-04 metrics: {'meteor': 0.3784063502724225, 'sacrebleu': {'score': 11.971696060849887, 'counts': [12785, 4606, 2132, 1069], 'totals': [29104, 27971, 26855, 25743], 'precisions': [43.92866959868059, 16.467055164277287, 7.938931297709924, 4.152585168783747], 'bp': 0.9633731468442895, 'sys_len': 29104, 'ref_len': 30190}, 'bleu_scores': {'bleu': 0.11971696060849878, 'precisions': [0.4392866959868059, 0.16467055164277286, 0.07938931297709924, 0.04152585168783747], 'brevity_penalty': 0.9633731468442893, 'length_ratio': 0.9640278237827095, 'translation_length': 29104, 'reference_length': 30190}, 'rouge_scores': {'rouge1': 0.4238707558842832, 'rouge2': 0.17481350068875362, 'rougeL': 0.3695909612883631, 'rougeLsum': 0.36945188893013925}, 'accuracy': 0.00088261253309797, 'correct_ids': [77]}
+Evaluating model: ft:gpt-4o-2024-08-06:mastercard::A1LKnnlI
+loading train/test data files
+DatasetDict({
+    train: Dataset({
+        features: ['chinese', 'english'],
+        num_rows: 4528
+    })
+    test: Dataset({
+        features: ['chinese', 'english'],
+        num_rows: 1133
+    })
+})
+--------------------------------------------------
+chinese: 老耿端起枪,眯缝起一只三角眼,一搂扳机响了枪,冰雹般的金麻雀劈哩啪啦往下落,铁砂子在柳枝间飞迸着,嚓嚓有声。
+--------------------------------------------------
+english: Old Geng picked up his shotgun, squinted, and pulled the trigger. Two sparrows crashed to the ground like hailstones as shotgun pellets tore noisily through the branches.
+*** Evaluating with num_shots: 0
+100%|██████████| 1133/1133 [19:22<00:00, 1.03s/it]
+gpt-4o/epochs-05 metrics: {'meteor': 0.3797983554067647, 'sacrebleu': {'score': 12.064338049153614, 'counts': [12791, 4641, 2157, 1081], 'totals': [29111, 27978, 26862, 25750], 'precisions': [43.93871732334856, 16.58803345485739, 8.029930757203484, 4.1980582524271846], 'bp': 0.9636134724514155, 'sys_len': 29111, 'ref_len': 30190}, 'bleu_scores': {'bleu': 0.12064338049153608, 'precisions': [0.43938717323348564, 0.16588033454857387, 0.08029930757203485, 0.041980582524271844], 'brevity_penalty': 0.9636134724514155, 'length_ratio': 0.9642596886386221, 'translation_length': 29111, 'reference_length': 30190}, 'rouge_scores': {'rouge1': 0.4248543304997294, 'rouge2': 0.1768290761907465, 'rougeL': 0.370949573624763, 'rougeLsum': 0.370875336303769}, 'accuracy': 0.00088261253309797, 'correct_ids': [77]}
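As a sanity check on the logged numbers, the `sacrebleu` score can be recomputed from the `counts`, `totals`, `sys_len` and `ref_len` fields of a metrics line. The sketch below assumes the standard corpus-level BLEU definition (brevity penalty times the geometric mean of the four n-gram precisions, with no smoothing, which is safe here because every count is nonzero). The function name `bleu_from_stats` is illustrative and not part of the evaluation code in this commit.

```python
import math


def bleu_from_stats(counts, totals, sys_len, ref_len):
    """Corpus BLEU on a 0-100 scale from n-gram match counts and totals."""
    # n-gram precisions in percent, as in the logged 'precisions' field
    precisions = [100.0 * c / t for c, t in zip(counts, totals)]
    # Brevity penalty: 1 when the system output is at least as long as the reference
    bp = 1.0 if sys_len >= ref_len else math.exp(1.0 - ref_len / sys_len)
    # Geometric mean of the precisions, computed in log space
    log_mean = sum(math.log(p) for p in precisions) / len(precisions)
    return bp * math.exp(log_mean)


# Values copied from the gpt-4o/epochs-01 metrics line
score = bleu_from_stats(
    counts=[12783, 4604, 2130, 1067],
    totals=[29124, 27991, 26875, 25764],
    sys_len=29124,
    ref_len=30190,
)
print(round(score, 4))
```

Under these assumptions the result should agree with the logged `score` for epoch 1 (about 11.96). Because the five checkpoints differ by roughly 0.1 BLEU, the same check on the other lines mostly confirms that the extra epochs changed very little.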
notebooks/00c_Data Analysis_Fine_Tuned.ipynb
CHANGED
The diff for this file is too large to render.
notebooks/02a_Fine_Tune_GPT-4o.ipynb
ADDED
The diff for this file is too large to render.
results/mac-results_few_shots_openai.csv
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:77b51fa5c206f2d23a7ccb5b33206dd97d5a05b7c11c781dad95b579c4245574
+size 3491089
results/mac-results_fine_tuned_metrics.csv
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:9debbd64265afdd7d85f030adb03733490e1c45e47501c28cd3e8d6fba3654cc
+size 12038
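The result CSVs are stored as Git LFS pointers, so their contents are not visible in this diff. To compare checkpoints directly from logs/openai-gpt-4o-fine-tuned.txt, each metrics line can be parsed with `ast.literal_eval`, since the payload is a Python dict literal. This is a sketch only: the regular expression and the `parse_metrics_line` helper are assumptions based on the line format seen in the log, and the sample line is abbreviated to a few fields. Reading `accuracy` as an exact-match rate is also an assumption, although the logged value is consistent with one correct id out of 1133 test rows.

```python
import ast
import re

# Matches lines such as "gpt-4o/epochs-01 metrics: {...}" (format assumed from the log)
LINE_RE = re.compile(r"^(?P<run>\S+) metrics: (?P<payload>\{.*\})\s*$")


def parse_metrics_line(line):
    """Return (run_name, metrics_dict), or None if the line is not a metrics line."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    # literal_eval only accepts Python literals, so it is safe on log text
    return match.group("run"), ast.literal_eval(match.group("payload"))


# Abbreviated copy of the gpt-4o/epochs-05 metrics line
sample = (
    "gpt-4o/epochs-05 metrics: {'meteor': 0.3797983554067647, "
    "'sacrebleu': {'score': 12.064338049153614}, "
    "'accuracy': 0.00088261253309797, 'correct_ids': [77]}"
)
run, metrics = parse_metrics_line(sample)

# One correct id over the 1133 test rows reported by the DatasetDict dump
exact_matches = len(metrics["correct_ids"])
print(run, metrics["sacrebleu"]["score"], exact_matches / 1133)
```

Lines that do not match the pattern, such as the progress bars and the dataset summary, return `None` and can be skipped when iterating over the whole file.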