Spaces:
Runtime error
Runtime error
# Checkpoints (OFA-CN) | |
We provide checkpoints of OFA-CN, which is the Chinese version of OFA. We provide Base-size and Large-size models, including pretrained and finetuned models on image captioning and referring expression comprehension. Note that we translated the texts in the RefCOCO(-/+/g) datasets and finetuned OFA-CN on them. We plan to release the related new datasets in the near future. | |
<br> | |
## Checkpoints | |
Below we provide the links for downloading the Chinese OFA checkpoints. | |
### Pretraining | |
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/ofa_cn_large.pt"> Pretrained checkpoint (OFA-CN-Large) </a> (~443M parameters) | |
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/ofa_cn_base.pt "> Pretrained checkpoint (OFA-CN-Base) </a> (~160M parameters) | |
### Finetuning (OFA-Large) | |
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/caption_cn_large.pt"> Finetuned checkpoint for MUGE Caption (Stage 1) </a> | |
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/refcoco_cn_large.pt"> Finetuned checkpoint for RefCOCO-CN </a> | |
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/refcocoplus_cn_large.pt"> Finetuned checkpoint for RefCOCO+-CN </a> | |
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/refcocog_cn_large.pt"> Finetuned checkpoint for RefCOCOg-CN </a> | |
### Finetuning (OFA-Base) | |
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/caption_cn_base.pt"> Finetuned checkpoint for MUGE Caption (Stage 1) </a> | |
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/refcoco_cn_base.pt"> Finetuned checkpoint for RefCOCO-CN </a> | |
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/refcocoplus_cn_base.pt"> Finetuned checkpoint for RefCOCO+-CN </a> | |
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/refcocog_cn_base.pt"> Finetuned checkpoint for RefCOCOg-CN </a> | |
<br> | |
## Model Card | |
Below we provide the basic information of the base-size and large-size OFA-CN. | |
<table border="1" width="100%"> | |
<tr align="center"> | |
<th>Model</th><th>#Params</th><th>Backbone</th><th>Hidden Size</th><th>Intermediate Size</th><th>#Heads</th><th>#Enc. Layers</th><th>#Dec. Layers</th> | |
</tr> | |
<tr align="center"> | |
<td>OFA<sub>Base</sub><td>160M</td><td>ResNet101</td><td>768</td></td><td>3072</td><td>12</td><td>6</td><td>6</td> | |
</tr> | |
<tr align="center"> | |
<td>OFA<sub>Large</sub></td><td>443M</td><td>ResNet152</td><td>1024</td></td><td>4096</td><td>16</td><td>12</td><td>12</td> | |
</tr> | |
</tr> | |
</table> | |
<br> | |
## Results | |
Below we provide the results of OFA-CN and the baselines for comparison. | |
### [MUGE Caption]("https://tianchi.aliyun.com/muge") | |
<table border="1" width="100%"> | |
<tr align="center"> | |
<td>Model</td><td>BLEU@4</td><td>ROUGE-L</td><td>CIDEr-D</td> | |
</tr> | |
<tr align="center"> | |
<td>Trm </td><td>7.33</td><td>51.51</td><td>11.00</td> | |
</tr> | |
<tr align="center"> | |
<td>M6</td><td>16.19</td><td>55.06</td><td>30.75</td> | |
</tr> | |
<tr align="center"> | |
<td>OFA<sub>Base</sub></td><td>26.23</td><td>58.95</td><td>50.70</td> | |
</tr> | |
<tr align="center"> | |
<td>OFA<sub>Large</sub></td><td><b>27.32</b></td><td><b>59.20</b></td><td><b>53.51</b></td> | |
</tr> | |
</table> | |
### RefCOCO-CN Series | |
<table border="1" width="100%"> | |
<tr align="center"> | |
<td>Model</td><td>RefCOCO(val/testA/testB)</td><td>RefCOCO+(val/testA/testB)</td><td>RefCOCOg(val/test-u)</td> | |
</tr> | |
<tr align="center"> | |
<td>OFA<sub>Base</sub>(random-init)</td><td>30.13/35.07/25.03</td><td>17.89/20.90/15.83</td><td>20.30/20.45</td> | |
</tr> | |
<tr align="center"> | |
<td>OFA<sub>Base</sub></td><td>82.18/86.07/<b>76.68</b></td><td>69.38/77.26/60.14</td><td><b>73.57/72.53</b></td> | |
</tr> | |
<tr align="center"> | |
<td>OFA<sub>Large</sub></td><td><b>82.84/86.54</b>/76.50</td><td><b>71.30/78.56/61.85</b></td><td>71.96/71.30</td> | |
</tr> | |
</table> | |
<br> | |