regel-corpus commited on
Commit
19e407c
1 Parent(s): dc6e5f4

Upload folder using huggingface_hub

Browse files
Files changed (3) hide show
  1. .gitattributes +1 -0
  2. README.md +50 -0
  3. pytorch_model.bin +3 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+
README.md ADDED
@@ -0,0 +1,50 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - flair
4
+ - hunflair
5
+ - token-classification
6
+ - sequence-tagger-model
7
+ language: en
8
+ widget:
9
+ - text: It contains a functional GCGGCGGCG Egr-1-binding site
10
+ ---
11
+
12
+ ## HunFlair2 model for TFBS
13
+
14
+ [HunFlair](https://github.com/flairNLP/flair/blob/master/resources/docs/HUNFLAIR2.md) (biomedical flair) for enhancer entity:
15
+
16
+ - pre-trained language model: michiyasunaga/BioLinkBERT-base
17
+ - fine-tuned on RegEl corpus for `Tfbs` entity type
18
+
19
+ Predicts 1 tag:
20
+
21
+ | **tag** | **meaning** |
22
+ | ------- | ---------------------------------------- |
23
+ | Tfbs | DNA region bound by transcription factor |
24
+
25
+ ______________________________________________________________________
26
+
27
+ ## Info
28
+
29
+ ### Demo: How to use in Flair
30
+
31
+ Requires:
32
+
33
+ - **[Flair](https://github.com/flairNLP/flair/)>=0.14.0** (`pip install flair` or `pip install git+https://github.com/flairNLP/flair.git`)
34
+
35
+ ```python
36
+ from flair.data import Sentence
37
+ from flair.nn import Classifier
38
+ from flair.tokenization import SciSpacyTokenizer
39
+
40
+ text = "We found that Egr-1 specifically binds to the PTEN 5' untranslated region, which contains a functional GCGGCGGCG Egr-1-binding site."
41
+ sentence = Sentence(text, use_tokenizer=SciSpacyTokenizer())
42
+
43
+ tagger = Classifier.load("regel-corpus/hunflair2-regel-tfbs")
44
+ tagger.predict(sentence)
45
+
46
+ print('The following NER tags are found:')
47
+ # iterate over entities and print
48
+ for entity in sentence.get_spans('ner'):
49
+ print(entity)
50
+ ```
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:91bfda15d2c8b4c1a55b27b6bce4d0bb939abc108673103f3d252ea4837d9fad
3
+ size 433710127