yarongef
/

DistilProtBert

protein language model

Inference Endpoints

Model card Files Files and versions Community

yarongef commited on Mar 30, 2022

Commit

020cba7

•

1 Parent(s): d88d839

Update README.md

Files changed (1) hide show

README.md +20 -1

README.md CHANGED Viewed

@@ -36,4 +36,23 @@ DistilProtBert model was pretrained on [Uniref50](https://www.uniprot.org/downlo
 # Pretraining procedure
 Preprocessing was done using ProtBert's tokenizer.
-The details of the masking procedure for each sequence followed the original Bert (as mentioned in [ProtBert](https://huggingface.co/Rostlab/prot_bert)).

 # Pretraining procedure
 Preprocessing was done using ProtBert's tokenizer.
+The details of the masking procedure for each sequence followed the original Bert (as mentioned in [ProtBert](https://huggingface.co/Rostlab/prot_bert)).
+The model was pretrained on a single DGX cluster 3 epochs in total. local batch size was 16, the optimizer used was AdamW with a learning rate of 5e-5 and mixed precision settings.
+## Evaluation results
+When fine-tuned on downstream tasks, this model achieves the following results:
+Test results :
+| Task/Dataset | secondary structure (3-states) | Membrane  |
+|:-----:|:-----:|:-----:|:-----:|
+|   CASP12  | 72 |    |
+|   TS115   | 81 |    |
+|   CB513   | 79 |    |
+|  DeepLoc  |    | 86 |
+Distinguish between:
+### BibTeX entry and citation info