AmelieSchreiber committed on
Commit
400bbb0
1 Parent(s): 49e961f

Update README.md

Files changed (1)
  1. README.md +3 -1
README.md CHANGED
@@ -26,7 +26,9 @@ and [here](https://huggingface.co/docs/transformers/model_doc/esm) for more deta
  the binary token classification task of predicting binding sites (and active sites) of protein sequences based on sequence alone.
  The model may be underfit and undertrained, however it still achieved better performance on the test set in terms of loss, accuracy,
  precision, recall, F1 score, ROC_AUC, and Matthews Correlation Coefficient (MCC) compared to the models trained on the smaller
- dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sites_random_split_by_family) of ~209K protein sequences.
+ dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sites_random_split_by_family) of ~209K protein sequences. Note,
+ this model has a high recall, meaning it is likely to detect binding sites, but it has a low precision, meaning the model will likely return
+ false positives as well.
 
  ## Training procedure
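The note added in this commit (high recall, low precision) can be illustrated with a minimal sketch. The helper `binary_metrics` and the toy per-residue labels below are hypothetical and invented for illustration; they are not taken from the model's evaluation code or its test set. The sketch only shows how a classifier that finds every true binding site can still score low on precision when it also flags many non-binding residues.

```python
from math import sqrt


def binary_metrics(y_true, y_pred):
    """Compute precision, recall, F1, and MCC for binary labels (1 = binding site)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1, "mcc": mcc}


# Toy per-residue labels (assumed values, for illustration only).
y_true = [0, 0, 1, 1, 0, 0, 0, 1, 0, 0]
# Every true site is predicted, but three extra residues are flagged as well.
y_pred = [0, 1, 1, 1, 1, 0, 1, 1, 0, 0]

metrics = binary_metrics(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

In this toy case all three true sites are recovered, so recall is 1.0, while only half of the predicted sites are real, so precision is 0.5. In practice, predictions from a model with this profile may be best treated as candidate sites to be filtered or verified downstream.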