teknofest_ner_mammo / README.md
Lagadro's picture
Update README.md
95afead verified
|
raw
history blame
2.21 kB
metadata
license: mit
base_model: dbmdz/bert-base-turkish-cased
tags:
  - generated_from_keras_callback
model-index:
  - name: Lagadro/teknofest_ner_mammo
    results: []

Lagadro/teknofest_ner_mammo

This model is a fine-tuned version of dbmdz/bert-base-turkish-cased on an unknown dataset. It achieves the following results on the evaluation set:

  • Train Loss: 0.0812
  • Validation Loss: 0.0961
  • Epoch: 4
  • Overall Accuracy: 0.968

Model description

This model is trained to extract entity names from reports of mammographic images.

Training and evaluation data

The data has been provided by Teknofest.

Training hyperparameters

The following hyperparameters were used during training:

  • optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 260, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, 'registered_name': None}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}

  • training_precision: float32

  • Name: AdamWeightDecay

  • Learning Rate:

    • Module: keras.optimizers.schedules
    • Class: PolynomialDecay
    • Config:
      • Initial Learning Rate: 2e-05
      • Decay Steps: 260
      • End Learning Rate: 0.0
      • Power: 1.0
      • Cycle: False
      • Name: None
    • Registered Name: None
  • Decay: 0.0

  • Beta 1: 0.9

  • Beta 2: 0.999

  • Epsilon: 1e-08

  • Amsgrad: False

  • Weight Decay Rate: 0.01

  • Training Precision: float32

Training results

Train Loss Validation Loss Epoch
0.5769 0.2226 0
0.1650 0.1344 1
0.1119 0.1110 2
0.0907 0.0994 3
0.0812 0.0961 4

Framework versions

  • Transformers 4.41.2
  • TensorFlow 2.15.0
  • Datasets 2.19.2
  • Tokenizers 0.19.1