---
license: apache-2.0
library_name: sklearn
tags:
- tabular-classification
- baseline-trainer
---
## Baseline model trained on accentcombinedlenous8ktq9 to classify accent
**Metrics of the best model:**

| Metric | Value |
|---|---|
| accuracy | 0.947980 |
| recall_macro | 0.749094 |
| precision_macro | 0.622545 |
| f1_macro | 0.656714 |

Best model: `LogisticRegression(C=1, class_weight='balanced', max_iter=1000)`

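The gap between accuracy and the macro-averaged scores above is easier to read with the definitions in hand. The following is a minimal pure-Python sketch of macro averaging, in which every class counts equally regardless of how many samples it has. The function name `macro_scores` and the toy labels are illustrative assumptions, not part of this model or its training data.

```python
from collections import Counter


def macro_scores(y_true, y_pred):
    """Return (accuracy, recall_macro, precision_macro, f1_macro).

    Each per-class score is computed separately and then averaged with
    equal weight per class. A class with no true (or no predicted)
    samples contributes 0.0 for the undefined ratio.
    """
    labels = sorted(set(y_true) | set(y_pred))
    n_true = Counter(y_true)  # samples per true class
    n_pred = Counter(y_pred)  # samples per predicted class
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)

    recalls, precisions, f1s = [], [], []
    for label in labels:
        recall = hits[label] / n_true[label] if n_true[label] else 0.0
        precision = hits[label] / n_pred[label] if n_pred[label] else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        recalls.append(recall)
        precisions.append(precision)
        f1s.append(f1)

    n = len(labels)
    accuracy = sum(hits.values()) / len(y_true)
    return accuracy, sum(recalls) / n, sum(precisions) / n, sum(f1s) / n


# Hypothetical, imbalanced toy labels: 8 samples of "a", 2 of "b".
y_true = ["a"] * 8 + ["b"] * 2
y_pred = ["a"] * 8 + ["a", "b"]
accuracy, recall_macro, precision_macro, f1_macro = macro_scores(y_true, y_pred)
```

On this toy input the accuracy is 0.9 while `recall_macro` is 0.75, because the single error falls on the rare class. A similar effect may explain why the model's accuracy is well above its macro scores, though the per-class breakdown is not reported here.
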
**Model pipeline:**

The fitted model is a two-step pipeline: `EasyPreprocessor`, configured with the detected column types below, followed by `LogisticRegression(C=1, class_weight='balanced', max_iter=1000)` (step name `logisticregression`).

| feature | continuous | dirty_float | low_card_int | ... | date | free_string | useless |
|---|---|---|---|---|---|---|---|
| word | False | False | False | ... | False | True | False |
| kana | False | False | False | ... | False | True | False |
| kind | False | False | False | ... | False | False | False |
| morae | False | False | False | ... | False | False | False |
| pos | False | False | False | ... | False | False | False |
| etym | False | False | False | ... | False | False | False |
| jilen | False | False | False | ... | False | False | False |
| kanalen | False | False | False | ... | False | False | False |

The `...` column stands for columns elided in the original output (8 rows x 7 columns).

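The selected estimator uses `class_weight='balanced'`. In scikit-learn this setting weights each class by `n_samples / (n_classes * class_count)`, so rare classes count more during fitting. The sketch below reproduces that formula in plain Python. The helper name `balanced_class_weights` and the toy labels are illustrative assumptions and do not come from the training data.

```python
from collections import Counter


def balanced_class_weights(y):
    """Compute per-class weights as n_samples / (n_classes * class_count)."""
    counts = Counter(y)
    n_samples = len(y)
    n_classes = len(counts)
    return {
        label: n_samples / (n_classes * count)
        for label, count in counts.items()
    }


# Hypothetical labels: class "a" is four times as frequent as class "b".
labels = ["a"] * 8 + ["b"] * 2
weights = balanced_class_weights(labels)
```

Here the frequent class gets weight 0.625 and the rare class gets 2.5, so both classes contribute the same total weight (5.0 each) to the loss.
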
**Disclaimer:** This model was trained with the dabl library as a baseline. For better results, use [AutoTrain](https://huggingface.co/autotrain).

**Logs of training**, including the models tried in the process, can be found in logs.txt.
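
To compare the candidate models programmatically, the log can be parsed with a few lines of Python. This is a sketch that assumes logs.txt consists of `Running <estimator>` lines, each followed by a score line of the form `accuracy: ... recall_macro: ... precision_macro: ... f1_macro: ...`. The function names `parse_training_log` and `best_by` are hypothetical helpers, not part of dabl.

```python
import re

# Matches "name: value" pairs such as "recall_macro: 0.749".
SCORE_RE = re.compile(r"(\w+): ([0-9.]+)")


def parse_training_log(text):
    """Return a list of (estimator_repr, scores_dict), one per 'Running' entry."""
    results = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("Running "):
            current = line[len("Running "):]
        elif current is not None and line.startswith("accuracy:"):
            scores = {name: float(value) for name, value in SCORE_RE.findall(line)}
            results.append((current, scores))
            # Ignore the repeated score line that follows "=== new best".
            current = None
    return results


def best_by(results, metric="recall_macro"):
    """Pick the entry with the highest value of the given metric."""
    return max(results, key=lambda item: item[1][metric])


# Short excerpt in the assumed log layout.
sample = """\
Running DummyClassifier()
accuracy: 0.495 recall_macro: 0.100 precision_macro: 0.050 f1_macro: 0.066
=== new best DummyClassifier() (using recall_macro):
accuracy: 0.495 recall_macro: 0.100 precision_macro: 0.050 f1_macro: 0.066
Running DecisionTreeClassifier(class_weight='balanced', max_depth=1)
accuracy: 0.400 recall_macro: 0.200 precision_macro: 0.113 f1_macro: 0.124
Running LogisticRegression(C=1, class_weight='balanced', max_iter=1000)
accuracy: 0.948 recall_macro: 0.749 precision_macro: 0.623 f1_macro: 0.657
"""

results = parse_training_log(sample)
best_name, best_scores = best_by(results)
```

Selecting on `recall_macro` mirrors the "(using recall_macro)" note in the log. Passing a different `metric` ranks the same entries by another score.
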
version https://git-lfs.github.com/spec/v1
oid sha256:2a3c7dc88a769ace755f21d3d59a8b03406e5651ba6d5f11e75c9ffa103ccf6f
size 13656
Logging training
Running DummyClassifier()
accuracy: 0.495 recall_macro: 0.100 precision_macro: 0.050 f1_macro: 0.066
=== new best DummyClassifier() (using recall_macro):
accuracy: 0.495 recall_macro: 0.100 precision_macro: 0.050 f1_macro: 0.066
Running GaussianNB()
accuracy: 0.830 recall_macro: 0.537 precision_macro: 0.432 f1_macro: 0.400
=== new best GaussianNB() (using recall_macro):
accuracy: 0.830 recall_macro: 0.537 precision_macro: 0.432 f1_macro: 0.400
Running MultinomialNB()
accuracy: 0.900 recall_macro: 0.579 precision_macro: 0.510 f1_macro: 0.514
=== new best MultinomialNB() (using recall_macro):
accuracy: 0.900 recall_macro: 0.579 precision_macro: 0.510 f1_macro: 0.514
Running DecisionTreeClassifier(class_weight='balanced', max_depth=1)
accuracy: 0.400 recall_macro: 0.200 precision_macro: 0.113 f1_macro: 0.124
Running DecisionTreeClassifier(class_weight='balanced', max_depth=10)
accuracy: 0.909 recall_macro: 0.641 precision_macro: 0.531 f1_macro: 0.520
=== new best DecisionTreeClassifier(class_weight='balanced', max_depth=10) (using recall_macro):
accuracy: 0.909 recall_macro: 0.641 precision_macro: 0.531 f1_macro: 0.520
Running DecisionTreeClassifier(class_weight='balanced', min_impurity_decrease=0.01)
accuracy: 0.931 recall_macro: 0.723 precision_macro: 0.563 f1_macro: 0.595
=== new best DecisionTreeClassifier(class_weight='balanced', min_impurity_decrease=0.01) (using recall_macro):
accuracy: 0.931 recall_macro: 0.723 precision_macro: 0.563 f1_macro: 0.595
Running LogisticRegression(C=0.1, class_weight='balanced', max_iter=1000)
accuracy: 0.946 recall_macro: 0.739 precision_macro: 0.614 f1_macro: 0.647
=== new best LogisticRegression(C=0.1, class_weight='balanced', max_iter=1000) (using recall_macro):
accuracy: 0.946 recall_macro: 0.739 precision_macro: 0.614 f1_macro: 0.647
Running LogisticRegression(C=1, class_weight='balanced', max_iter=1000)
accuracy: 0.948 recall_macro: 0.749 precision_macro: 0.623 f1_macro: 0.657
=== new best LogisticRegression(C=1, class_weight='balanced', max_iter=1000) (using recall_macro):
accuracy: 0.948 recall_macro: 0.749 precision_macro: 0.623 f1_macro: 0.657
Best model:
LogisticRegression(C=1, class_weight='balanced', max_iter=1000)
Best Scores:
accuracy: 0.948 recall_macro: 0.749 precision_macro: 0.623 f1_macro: 0.657