IDEA-CCNL/Erlangshen-UniEX-RoBERTa-110M-Chinese

@@ -34,26 +34,6 @@ The core idea of UniEX is to transform information extraction into token-pair ta
 Because UniEX unifies all extraction tasks, it shows strong few-shot and zero-shot performance after pre-training. We use the structured data of Baidu Encyclopedia to build a weakly supervised dataset; after cleaning, we obtain about 600M data. In addition, we collected 16 entity recognition, 7 relation extraction, 6 event extraction, and 11 reading comprehension datasets. We mix this data and feed it to the model for pre-training.
-### Downstream Performance
-|         Task type         |    Dataset    | TANL(t5-base) | UniEX(roberta-base) | UIE(t5-large) | UniEX(roberta-large) |
-|:-------------------------:|:-------------:|:-------------:|:-------------------:|:-------------:|:--------------------:|
-|    Relation Extraction    |    CoNLL04    |      71.4     |        71.79        |     73.07     |         73.4         |
-|                           |     SciERC    |       -       |          -          |     33.36     |          38          |
-|                           |     ACE05     |      63.7     |        63.64        |     64.68     |         64.9         |
-|                           |      ADE      |      80.6     |        83.81        |       -       |           -          |
-| Named Entity  Recognition |    CoNLL03    |      91.7     |        92.13        |     92.17     |         92.65        |
-|                           |     ACE04     |       -       |          -          |     86.52     |         87.12        |
-|                           |     ACE05     |      84.9     |        85.96        |     85.52     |         87.02        |
-|                           |     GENIA     |      76.4     |        76.69        |       -       |           -          |
-|   Sentiment  Extraction   |     14lap     |       -       |          -          |     63.15     |         65.23        |
-|                           |     14res     |       -       |          -          |     73.78     |         74.77        |
-|                           |     15res     |       -       |          -          |      66.1     |         68.58        |
-|                           |     16res     |       -       |          -          |     73.87     |         76.02        |
-|     Event  Extraction     | ACE05-Trigger |      68.4     |        70.86        |     72.63     |         74.08        |
-|                           |   ACE05-Role  |      47.6     |        50.67        |     54.67     |         53.92        |
-|                           | CASIE-Trigger |       -       |          -          |     68.98     |         71.46        |
-|                           |   CASIE-Role  |       -       |          -          |     60.37     |         62.91        |
 ## Usage
 ```shell
 git clone https://github.com/IDEA-CCNL/Fengshenbang-LM.git
