Commit
·
4b595e2
1
Parent(s):
575416b
Update README.md
Browse files
README.md
CHANGED
@@ -25,8 +25,9 @@ trained on `166` protein sequences in the [RNA binding sites dataset](https://hu
|
|
25 |
using a `85/15` train/test split. This model was trained with class weighting due to the imbalanced nature
|
26 |
of the RNA binding site dataset (fewer binding sites than non-binding sites). This model has slightly improved
|
27 |
precision, recall, and F1 score over [AmelieSchreiber/esm2_t12_35M_weighted_lora_rna_binding](https://huggingface.co/AmelieSchreiber/esm2_t12_35M_weighted_lora_rna_binding)
|
28 |
-
but may suffer from mild overfitting, as indicated by the training loss being slightly lower than the eval loss
|
29 |
-
binding sites and aren't worried about false positives, the higher recall may make this model
|
|
|
30 |
|
31 |
You can train your own version
|
32 |
using [this notebook](https://huggingface.co/AmelieSchreiber/esm2_t6_8M_weighted_lora_rna_binding/blob/main/LoRA_binding_sites_no_sweeps_v2.ipynb)!
|
|
|
25 |
using a `85/15` train/test split. This model was trained with class weighting due to the imbalanced nature
|
26 |
of the RNA binding site dataset (fewer binding sites than non-binding sites). This model has slightly improved
|
27 |
precision, recall, and F1 score over [AmelieSchreiber/esm2_t12_35M_weighted_lora_rna_binding](https://huggingface.co/AmelieSchreiber/esm2_t12_35M_weighted_lora_rna_binding)
|
28 |
+
but may suffer from mild overfitting, as indicated by the training loss being slightly lower than the eval loss (see metrics below).
|
29 |
+
If you are searching for binding sites and aren't worried about false positives, the higher recall may make this model
|
30 |
+
preferable to the other RNA binding site predictors.
|
31 |
|
32 |
You can train your own version
|
33 |
using [this notebook](https://huggingface.co/AmelieSchreiber/esm2_t6_8M_weighted_lora_rna_binding/blob/main/LoRA_binding_sites_no_sweeps_v2.ipynb)!
|