metadata

license: mit
base_model: dbmdz/bert-base-turkish-cased
tags:
  - generated_from_keras_callback
model-index:
  - name: Lagadro/teknofest_ner_mammo
    results: []

Lagadro/teknofest_ner_mammo

This model is a fine-tuned version of dbmdz/bert-base-turkish-cased, trained on data provided by Teknofest. It achieves the following results at the final epoch (epochs are counted from 0):

Train Loss: 0.0812
Validation Loss: 0.0961
Epoch: 4
Overall Accuracy: 0.968

Model description

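This model performs named entity recognition (NER) on mammography reports: given the text of a report, it tags the tokens that name entities. Because the base model is a Turkish BERT, the input reports are expected to be in Turkish.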
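The description above can be turned into a short inference sketch. It assumes the repository ships TensorFlow weights (suggested by the Keras callback tag, but not confirmed here) and that the generic `token-classification` pipeline in Transformers fits this checkpoint. The helper names `load_ner_pipeline`, `summarize_entities`, and `demo`, the `min_score` threshold, and the sample sentence are illustrative and not part of the original card. The label set is not documented, so the code does not rely on specific label names.

```python
MODEL_ID = "Lagadro/teknofest_ner_mammo"


def load_ner_pipeline(model_id=MODEL_ID):
    """Build a token-classification pipeline (downloads weights on first call)."""
    # Imported lazily so summarize_entities can be used without transformers installed.
    from transformers import pipeline

    return pipeline(
        "token-classification",
        model=model_id,
        framework="tf",  # assumption: the repository contains TensorFlow weights
        aggregation_strategy="simple",  # merge word pieces into whole entity spans
    )


def summarize_entities(predictions, min_score=0.5):
    """Reduce aggregated pipeline output to (label, text) pairs at or above min_score."""
    return [
        (p["entity_group"], p["word"])
        for p in predictions
        if p["score"] >= min_score
    ]


def demo():
    """Run the model on one made-up report sentence and print the entities found."""
    ner = load_ner_pipeline()
    # Hypothetical sentence: "A 12 mm well-circumscribed nodule was seen in the right breast."
    report = "Sağ memede 12 mm çaplı düzgün sınırlı nodül izlendi."
    print(summarize_entities(ner(report)))
```

Calling `demo()` requires network access and a TensorFlow installation. `summarize_entities` works on any list of dictionaries that follows the aggregated pipeline output format.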

Training and evaluation data

The training and evaluation data were provided by Teknofest.

Training hyperparameters

The following hyperparameters were used during training:

Optimizer: AdamWeightDecay
Learning Rate:
- Module: keras.optimizers.schedules
- Class: PolynomialDecay
- Config:
  - Initial Learning Rate: 2e-05
  - Decay Steps: 260
  - End Learning Rate: 0.0
  - Power: 1.0
  - Cycle: False
  - Name: None
- Registered Name: None
Decay: 0.0
Beta 1: 0.9
Beta 2: 0.999
Epsilon: 1e-08
Amsgrad: False
Weight Decay Rate: 0.01
Training Precision: float32
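With `Power: 1.0`, `Cycle: False`, and an end rate of 0.0, the listed PolynomialDecay schedule reduces to a linear ramp from 2e-05 down to zero over 260 steps. The sketch below reimplements that formula in plain Python to make the shape concrete. The function name `polynomial_decay` is illustrative, and the code is a stand-in for the Keras schedule object, not the code used in training.

```python
def polynomial_decay(step, initial_lr=2e-05, decay_steps=260, end_lr=0.0, power=1.0):
    """Learning rate at `step` for a non-cycling polynomial decay schedule."""
    # With cycle=False the step is clamped, so the rate stays at end_lr
    # once decay_steps has been reached.
    clamped = min(step, decay_steps)
    fraction_left = 1.0 - clamped / decay_steps
    return (initial_lr - end_lr) * fraction_left ** power + end_lr


if __name__ == "__main__":
    # Print the rate at the start, the midpoint, the end, and past the end.
    for step in (0, 130, 260, 300):
        print(step, polynomial_decay(step))
```

The rate is halved at step 130 and reaches zero at step 260, after which it stays at zero.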

Training results

Train Loss	Validation Loss	Epoch
0.5769	0.2226	0
0.1650	0.1344	1
0.1119	0.1110	2
0.0907	0.0994	3
0.0812	0.0961	4

Framework versions

Transformers 4.41.2
TensorFlow 2.15.0
Datasets 2.19.2
Tokenizers 0.19.1