Commit 400bbb0 (1 parent: 49e961f) by AmelieSchreiber
Update README.md

README.md CHANGED
@@ -26,7 +26,9 @@ and [here](https://huggingface.co/docs/transformers/model_doc/esm) for more deta
 the binay token classification task of predicting binding sites (and active sites) of protein sequences based on sequence alone.
 The model may be underfit and undertrained, however it still achieved better performance on the test set in terms of loss, accuracy,
 precision, recall, F1 score, ROC_AUC, and Matthews Correlation Coefficient (MCC) compared to the models trained on the smaller
-dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sites_random_split_by_family) of ~209K protein sequences.
+dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sites_random_split_by_family) of ~209K protein sequences. Note,
+this model has a high recall, meaning it is likely to detect binding sites, but it has a low precision, meaning the model will likely return
+false positives as well.
 
 ## Training procedure
 
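
The note added in this change contrasts high recall with low precision. As a rough illustration of what that means for per-residue binding site labels, the following minimal sketch computes precision, recall, F1, and MCC from a toy example. The function name `binary_metrics` and the label lists are hypothetical and are not taken from the model's evaluation code or its reported results.

```python
import math


def binary_metrics(y_true, y_pred):
    """Compute precision, recall, F1, and MCC for binary labels (1 = binding site)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1, "mcc": mcc}


# Hypothetical toy labels for a 10-residue sequence (not real model output).
y_true = [0, 0, 1, 0, 0, 0, 1, 0, 0, 0]
y_pred = [0, 1, 1, 1, 0, 0, 1, 1, 0, 0]

metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

In this toy case every true binding site is found (recall 1.0), but only two of the five predicted sites are correct (precision 0.4), which is the kind of behavior the note describes.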