Ahmed235
/

roberta-base-topic_classification_simple

Text Classification

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

roberta-base-topic_classification_simple / README.md

Ahmed235's picture

End of training

f16e9c5 verified 8 months ago

|

history blame contribute delete

No virus

3.79 kB

	---
	license: mit
	base_model: roberta-base
	tags:
	- generated_from_trainer
	metrics:
	- accuracy
	- f1
	model-index:
	- name: roberta-base-topic_classification_simple
	results: []
	---

	<!-- This model card has been generated automatically according to the information the Trainer had access to. You
	should probably proofread and complete it, then remove this comment. -->

	# roberta-base-topic_classification_simple

	This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on the None dataset.
	It achieves the following results on the evaluation set:
	- Loss: 1.3253
	- Accuracy: {'accuracy': 0.8445839874411303}
	- F1: {'f1': 0.8435559601445874}

	## Model description

	More information needed

	## Intended uses & limitations

	More information needed

	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- learning_rate: 2e-05
	- train_batch_size: 32
	- eval_batch_size: 32
	- seed: 42
	- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
	- lr_scheduler_type: linear
	- num_epochs: 20

	### Training results

	\| Training Loss \| Epoch \| Step \| Validation Loss \| Accuracy \| F1 \|
	\|:-------------:\|:-----:\|:----:\|:---------------:\|:--------------------------------:\|:--------------------------:\|
	\| No log \| 1.0 \| 353 \| 0.6772 \| {'accuracy': 0.7905359946176272} \| {'f1': 0.7881026657042776} \|
	\| 0.8304 \| 2.0 \| 706 \| 0.6028 \| {'accuracy': 0.8187934514465127} \| {'f1': 0.8207294945978928} \|
	\| 0.3839 \| 3.0 \| 1059 \| 0.5942 \| {'accuracy': 0.8344920385736713} \| {'f1': 0.8333019225828988} \|
	\| 0.3839 \| 4.0 \| 1412 \| 0.6904 \| {'accuracy': 0.8340435075128952} \| {'f1': 0.8330992428789376} \|
	\| 0.2015 \| 5.0 \| 1765 \| 0.8314 \| {'accuracy': 0.8264184794797039} \| {'f1': 0.82429813311833} \|
	\| 0.118 \| 6.0 \| 2118 \| 0.8572 \| {'accuracy': 0.8356133662256111} \| {'f1': 0.8349736274018552} \|
	\| 0.118 \| 7.0 \| 2471 \| 0.9742 \| {'accuracy': 0.8383045525902669} \| {'f1': 0.8376600364979794} \|
	\| 0.0804 \| 8.0 \| 2824 \| 1.0628 \| {'accuracy': 0.8333707109217313} \| {'f1': 0.8313400577604307} \|
	\| 0.0508 \| 9.0 \| 3177 \| 1.0866 \| {'accuracy': 0.8333707109217313} \| {'f1': 0.832415418717587} \|
	\| 0.0406 \| 10.0 \| 3530 \| 1.1633 \| {'accuracy': 0.8432383942588024} \| {'f1': 0.8425868379595812} \|
	\| 0.0406 \| 11.0 \| 3883 \| 1.2132 \| {'accuracy': 0.8400986768333707} \| {'f1': 0.8388873470699977} \|
	\| 0.0245 \| 12.0 \| 4236 \| 1.2799 \| {'accuracy': 0.836958959407939} \| {'f1': 0.8378019487138132} \|
	\| 0.0139 \| 13.0 \| 4589 \| 1.2379 \| {'accuracy': 0.8434626597891904} \| {'f1': 0.8429633731503271} \|
	\| 0.0139 \| 14.0 \| 4942 \| 1.2578 \| {'accuracy': 0.8445839874411303} \| {'f1': 0.8439974594663667} \|
	\| 0.014 \| 15.0 \| 5295 \| 1.3392 \| {'accuracy': 0.8407714734245346} \| {'f1': 0.8405188286141088} \|
	\| 0.0111 \| 16.0 \| 5648 \| 1.2977 \| {'accuracy': 0.8443597219107423} \| {'f1': 0.8438293082262649} \|
	\| 0.0099 \| 17.0 \| 6001 \| 1.3405 \| {'accuracy': 0.8412200044853106} \| {'f1': 0.8400992068548403} \|
	\| 0.0099 \| 18.0 \| 6354 \| 1.3433 \| {'accuracy': 0.8405472078941467} \| {'f1': 0.839917724407298} \|
	\| 0.0041 \| 19.0 \| 6707 \| 1.3269 \| {'accuracy': 0.8445839874411303} \| {'f1': 0.8434224071770644} \|
	\| 0.0041 \| 20.0 \| 7060 \| 1.3253 \| {'accuracy': 0.8445839874411303} \| {'f1': 0.8435559601445874} \|


	### Framework versions

	- Transformers 4.35.2
	- Pytorch 2.1.0+cu121
	- Datasets 2.16.1
	- Tokenizers 0.15.1