	---
	language:
	- eng
	license: wtfpl
	tags:
	- multilabel-image-classification
	- multilabel
	- generated_from_trainer
	base_model: microsoft/resnet-50
	model-index:
	- name: resnet-50-2024_09_13-batch-size32_epochs150_freeze
	results: []
	---

	DinoVd'eau is a fine-tuned version of [microsoft/resnet-50](https://huggingface.co/microsoft/resnet-50). It achieves the following results on the test set:

	- Loss: nan
	- F1 Micro: 0.0002
	- F1 Macro: 0.0002
	- ROC AUC: 0.4995
	- Accuracy: 0.0003
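
For readers unfamiliar with the two F1 variants reported above, the following is a minimal pure-Python sketch of how micro- and macro-averaged F1 are commonly computed for multilabel predictions. The toy `y_true` and `y_pred` matrices are made up for illustration and are not taken from this model's test set; the training code may compute these metrics differently (for example, through a metrics library).

```python
def f1(tp, fp, fn):
    """F1 from raw counts; defined as 0.0 when all three counts are zero."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def multilabel_f1(y_true, y_pred):
    """Return (micro_f1, macro_f1) for two equally shaped 0/1 matrices."""
    n_classes = len(y_true[0])
    counts = []  # one (tp, fp, fn) tuple per class
    for c in range(n_classes):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t[c] == 1 and p[c] == 1)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t[c] == 0 and p[c] == 1)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t[c] == 1 and p[c] == 0)
        counts.append((tp, fp, fn))
    # Micro: pool the counts over all classes, then compute a single F1.
    micro = f1(*(sum(col) for col in zip(*counts)))
    # Macro: compute F1 per class, then take the unweighted mean.
    macro = sum(f1(*c) for c in counts) / n_classes
    return micro, macro


# Toy example: 3 images, 3 classes (hypothetical values).
y_true = [[1, 0, 1], [0, 1, 0], [1, 1, 0]]
y_pred = [[1, 0, 0], [0, 1, 0], [1, 0, 0]]
micro, macro = multilabel_f1(y_true, y_pred)
print(f"micro={micro:.4f} macro={macro:.4f}")
```

Micro averaging is dominated by frequent classes, while macro averaging weights every class equally, so the two can diverge on imbalanced data.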

	---

	# Model description
	DinoVd'eau is a model built on top of the DINOv2 model for underwater multilabel image classification. The classification head is a combination of linear, ReLU, batch normalization, and dropout layers.

	The source code for training the model can be found in this [Git repository](https://github.com/SeatizenDOI/DinoVdeau).

	- Developed by: [lombardata](https://huggingface.co/lombardata), credits to [César Leblanc](https://huggingface.co/CesarLeblanc) and [Victor Illien](https://huggingface.co/groderg)

	---

	# Intended uses & limitations
	You can use the raw model to classify diverse marine species, including coral morphotype classes taken from the Global Coral Reef Monitoring Network (GCRMN), habitat classes, and seagrass species.
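
Because the task is multilabel, each class is typically scored independently rather than through a softmax over all classes. The sketch below shows one common way to turn raw per-class outputs into a set of labels. The logits, the 0.5 threshold, and the three-class subset are illustrative assumptions, not values taken from the released model.

```python
import math

# Subset of the class names from the training data, for illustration only.
CLASSES = ["Sand", "Rock", "Fish"]


def sigmoid(x):
    """Logistic function mapping a raw score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def predict_labels(logits, threshold=0.5):
    """Keep every class whose independent sigmoid probability reaches the threshold."""
    return [name for name, z in zip(CLASSES, logits) if sigmoid(z) >= threshold]


# Hypothetical logits for one image.
labels = predict_labels([2.0, -1.0, 0.3])
print(labels)
```

An image can receive several labels, or none at all, depending on how many classes clear the threshold.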

	---

	# Training and evaluation data
	Details on the number of images for each class are given in the following table:

	| Class | Train | Val | Test | Total |
	|:-------------------------|--------:|------:|-------:|--------:|
	| Acropore_branched | 1469 | 464 | 475 | 2408 |
	| Acropore_digitised | 568 | 160 | 160 | 888 |
	| Acropore_sub_massive | 150 | 50 | 43 | 243 |
	| Acropore_tabular | 999 | 297 | 293 | 1589 |
	| Algae_assembly | 2546 | 847 | 845 | 4238 |
	| Algae_drawn_up | 367 | 126 | 127 | 620 |
	| Algae_limestone | 1652 | 557 | 563 | 2772 |
	| Algae_sodding | 3148 | 984 | 985 | 5117 |
	| Atra/Leucospilota | 1084 | 348 | 360 | 1792 |
	| Bleached_coral | 219 | 71 | 70 | 360 |
	| Blurred | 191 | 67 | 62 | 320 |
	| Dead_coral | 1979 | 642 | 643 | 3264 |
	| Fish | 2018 | 656 | 647 | 3321 |
	| Homo_sapiens | 161 | 62 | 59 | 282 |
	| Human_object | 157 | 58 | 55 | 270 |
	| Living_coral | 406 | 154 | 141 | 701 |
	| Millepore | 385 | 127 | 125 | 637 |
	| No_acropore_encrusting | 441 | 130 | 154 | 725 |
	| No_acropore_foliaceous | 204 | 36 | 46 | 286 |
	| No_acropore_massive | 1031 | 336 | 338 | 1705 |
	| No_acropore_solitary | 202 | 53 | 48 | 303 |
	| No_acropore_sub_massive | 1401 | 433 | 422 | 2256 |
	| Rock | 4489 | 1495 | 1473 | 7457 |
	| Rubble | 3092 | 1030 | 1001 | 5123 |
	| Sand | 5842 | 1939 | 1938 | 9719 |
	| Sea_cucumber | 1408 | 439 | 447 | 2294 |
	| Sea_urchins | 327 | 107 | 111 | 545 |
	| Sponge | 269 | 96 | 105 | 470 |
	| Syringodium_isoetifolium | 1212 | 392 | 391 | 1995 |
	| Thalassodendron_ciliatum | 782 | 261 | 260 | 1303 |
	| Useless | 579 | 193 | 193 | 965 |
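
As a quick consistency check on the table, the sketch below recomputes the totals and the per-split shares for a few rows. The counts are copied from the table; the helper name `split_shares` is introduced here purely for illustration.

```python
# (train, val, test, total) copied from a few rows of the table above.
COUNTS = {
    "Acropore_branched": (1469, 464, 475, 2408),
    "Rock": (4489, 1495, 1473, 7457),
    "Sand": (5842, 1939, 1938, 9719),
}


def split_shares(train, val, test):
    """Return the fraction of a class's images that falls in each split."""
    total = train + val + test
    return tuple(round(n / total, 3) for n in (train, val, test))


for name, (train, val, test, total) in COUNTS.items():
    # Each reported total should equal the sum of the three splits.
    assert train + val + test == total, name
    print(name, split_shares(train, val, test))
```

For these three rows the shares come out close to 60/20/20; other rows in the table deviate from that ratio.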

	---

	# Training procedure

	## Training hyperparameters

	The following hyperparameters were used during training:

	- Number of Epochs: 150
	- Learning Rate: 0.001
	- Train Batch Size: 32
	- Eval Batch Size: 32
	- Optimizer: Adam
	- LR Scheduler Type: ReduceLROnPlateau with a patience of 5 epochs and a factor of 0.1
	- Freeze Encoder: Yes
	- Data Augmentation: Yes


	## Data Augmentation
	Data were augmented using the following transformations:

	Train Transforms
	- PreProcess: No additional parameters
	- Resize: probability=1.00
	- RandomHorizontalFlip: probability=0.25
	- RandomVerticalFlip: probability=0.25
	- ColorJiggle: probability=0.25
	- RandomPerspective: probability=0.25
	- Normalize: probability=1.00

	Val Transforms
	- PreProcess: No additional parameters
	- Resize: probability=1.00
	- Normalize: probability=1.00
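
The probabilities above determine how often a training image is actually altered. Assuming the four random transforms are sampled independently (an assumption; the card does not say so), the chance that an image passes through with none of them applied can be worked out as follows.

```python
# Application probabilities of the random train transforms listed above.
RANDOM_TRANSFORMS = {
    "RandomHorizontalFlip": 0.25,
    "RandomVerticalFlip": 0.25,
    "ColorJiggle": 0.25,
    "RandomPerspective": 0.25,
}


def prob_unaltered(probs):
    """Probability that none of the transforms fires, assuming independence."""
    result = 1.0
    for p in probs:
        result *= 1.0 - p
    return result


p_none = prob_unaltered(RANDOM_TRANSFORMS.values())
print(f"P(no random augmentation) = {p_none:.4f}")  # 0.75 ** 4
```

Under that assumption, roughly 32% of training images would receive only the deterministic resize and normalization steps.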



	## Training results
	| Epoch | Validation Loss | Accuracy | F1 Macro | F1 Micro | Learning Rate |
	|---|---|---|---|---|---|
	| 1 | nan | 0.0 | 0.0 | 0.0 | 0.001 |
	| 2 | nan | 0.000693000693000693 | 0.00031409501374165687 | 0.00040576181781294376 | 0.001 |
	| 3 | nan | 0.0017325017325017325 | 0.0007850525985241011 | 0.0010049241282283187 | 0.001 |
	| 4 | nan | 0.0 | 0.0 | 0.0 | 0.001 |
	| 5 | nan | 0.0010395010395010396 | 0.00047177229124076113 | 0.0006430178973314757 | 0.001 |
	| 6 | nan | 0.0003465003465003465 | 0.00015712153350616704 | 0.000206782464846981 | 0.001 |
	| 7 | nan | 0.0 | 0.0 | 0.0 | 0.0001 |
	| 8 | nan | 0.0003465003465003465 | 0.00015710919088766695 | 0.0002061218179944347 | 0.0001 |
	| 9 | nan | 0.0 | 0.0 | 0.0 | 0.0001 |
	| 10 | nan | 0.000693000693000693 | 0.00031441597233139445 | 0.0004230565838180856 | 0.0001 |
	| 11 | nan | 0.0 | 0.0 | 0.0 | 0.0001 |


	---

	# CO2 Emissions

	The estimated CO2 emissions for training this model are documented below:

	- Emissions: 0.12280230273705112 grams of CO2
	- Source: Code Carbon
	- Training Type: fine-tuning
	- Geographical Location: Brest, France
	- Hardware Used: NVIDIA Tesla V100 PCIe 32 GB


	---

	# Framework Versions

	- Transformers: 4.41.1
	- PyTorch: 2.3.0+cu121
	- Datasets: 2.19.1
	- Tokenizers: 0.19.1

