toastynews
/

xlnet-hongkongese-base

Text Generation

Inference Endpoints

Model card Files Files and versions Community

system HF staff commited on Jul 7, 2020

Commit

e4a8655

•

1 Parent(s): ea3bbd2

Update README.md

Files changed (1) hide show

README.md +4 -0

README.md CHANGED Viewed

@@ -31,6 +31,7 @@ For text generation, like other XLNet models, a longer context will help generat
 ## Training data
 The following is the list of data sources. Total characters is about 507M.
 | Data                                              |   % |
 | ------------------------------------------------- | --: |
 | News Articles / Blogs                             | 58% |
@@ -40,6 +41,7 @@ The following is the list of data sources. Total characters is about 507M.
 | Online Fiction                                    |  1% |
 The following is the distribution of different languages within the corpus.
 | Language                                          |   % |
 | ------------------------------------------------- | --: |
 | Standard Chinese                                  | 62% |
@@ -49,6 +51,7 @@ The following is the distribution of different languages within the corpus.
 ## Training procedure
 Model was trained on a single TPUv3 from the official repo with the default parameters.
 | Parameter                                        | Value |
 | ------------------------------------------------ | ----: |
 | Batch Size                                       | 32    |
@@ -60,6 +63,7 @@ Model was trained on a single TPUv3 from the official repo with the default para
 ## Eval results
 Average evaluation task results over 10 runs. Comparison using the original repo model and code. Chinese models are available from [Joint Laboratory of HIT and iFLYTEK Research (HFL)](https://huggingface.co/hfl)
 | Model       | DRCD (EM/F1) | openrice-senti | lihkg-cat | wordshk-sem |
 |:-----------:|:------------:|:--------------:|:---------:|:-----------:|
 | Chinese     | 82.8 / 91.8  | 79.8           | 70.7      | 72.0 / 78.9*|

 ## Training data
 The following is the list of data sources. Total characters is about 507M.
 | Data                                              |   % |
 | ------------------------------------------------- | --: |
 | News Articles / Blogs                             | 58% |
 | Online Fiction                                    |  1% |
 The following is the distribution of different languages within the corpus.
 | Language                                          |   % |
 | ------------------------------------------------- | --: |
 | Standard Chinese                                  | 62% |
 ## Training procedure
 Model was trained on a single TPUv3 from the official repo with the default parameters.
 | Parameter                                        | Value |
 | ------------------------------------------------ | ----: |
 | Batch Size                                       | 32    |
 ## Eval results
 Average evaluation task results over 10 runs. Comparison using the original repo model and code. Chinese models are available from [Joint Laboratory of HIT and iFLYTEK Research (HFL)](https://huggingface.co/hfl)
 | Model       | DRCD (EM/F1) | openrice-senti | lihkg-cat | wordshk-sem |
 |:-----------:|:------------:|:--------------:|:---------:|:-----------:|
 | Chinese     | 82.8 / 91.8  | 79.8           | 70.7      | 72.0 / 78.9*|