oohtmeel's picture
Update README.md
9119cd4 verified
metadata
tags:
  - proteins
  - Biology
  - classifier

Model info

A finetuned Bert-Base-Uncased model for a multilabel classificaton task involving predicting protein functions based on their amino acid sequences. The model takes sequence data and protein class name as input and outputs probability scores. (How likely is it that this sequence belongs to this group)

Model Usage

The way you use this model for a demo is, you paste a protein sequence into the inference box and it outputs the relevant probabilities that certain GO terms are associated with that sequence. For example MMSTTHLLVFLLGVVTLTTPTFGTYESPNYGKPPTPVFKPPKVKPPPYEPKPPVYEPPKKEKPEPKPPVYAPPKKEKHGPKPTMYEPPKKEKPEPKPPVYTPPKKEVPKPKPPVYEPPKKEKPEPKPPIYTPPKKEKPEPKPPVYEPPKKEKPEPKPPVYTPPKKEKPEPKPPVYEPPKKPPMYEPKPPKPPVYTPPKKEKPEPKPPMYEPPKKPPMYEPKPPKPPVYTPPKKEKPEPKPPMYQPPNNPPIYEPKPPKPPVYAPPKEEKPKPKPPVYEPPAHEPPYGHYPGHPPLGKPQ

outputs the following score

[
  [
    {
      "label": "GO:0000122",
      "score": 0.29775485396385193
    },
    {
      "label": "GO:0000070",
      "score": 0.10477513074874878
    },
    {
      "label": "GO:0000075",
      "score": 0.08593793958425522
    },
    {
      "label": "GO:0000118",
      "score": 0.05860009789466858
    },
    {
      "label": "GO:0000082",
      "score": 0.05373986065387726
    },
    {
      "label": "GO:0000077",
      "score": 0.03928716108202934
    },
    {
      "label": "GO:0000096",
      "score": 0.03705739229917526
    },
    {
      "label": "GO:0000079",
      "score": 0.02797592058777809
    },
    {
      "label": "GO:0000045",
      "score": 0.026528609916567802
    },
    {
      "label": "GO:0000097",
      "score": 0.026119187474250793
    },
    {
      "label": "GO:0000086",
      "score": 0.019697198644280434
    },
    {
      "label": "GO:0000049",
      "score": 0.018551582470536232
    },
    {
      "label": "GO:0000041",
      "score": 0.016929756850004196
    },
    {
      "label": "GO:0000054",
      "score": 0.015105823054909706
    },
    {
      "label": "GO:0000083",
      "score": 0.01434631273150444
    },
    {
      "label": "GO:0000105",
      "score": 0.013960960321128368
    },
    {
      "label": "GO:0000076",
      "score": 0.013064960949122906
    },
    {
      "label": "GO:0000109",
      "score": 0.012523632496595383
    },
    {
      "label": "GO:0000113",
      "score": 0.012152223847806454
    },
    {
      "label": "GO:0000062",
      "score": 0.01127714291214943
    },
    {
      "label": "GO:0000101",
      "score": 0.011041304096579552
    },