metadata
language:
  - ca
tags:
  - punctuation prediction
  - punctuation
datasets: softcatala/Europarl-catalan
widget:
  - text: >-
      Els investigadors suggereixen que tot i que es tracta de la cua d'un
      dinosaure jove la mostra revela un plomatge adult i no pas plomissol
    example_title: Catalan
metrics:
  - f1

This model predicts punctuation for Catalan text.

The model restores the following punctuation markers: "." "," "?" "-" ":"

It is based on the work at https://github.com/oliverguhr/fullstop-deep-punctuation-prediction
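To illustrate what restoring punctuation means in practice, here is a minimal sketch of the post-processing step. It assumes the model emits one label per word, naming the marker that follows that word, with "0" standing for no punctuation (this matches the label set in the results, but the exact output format is an assumption). The function `apply_punctuation` and the hand-written labels are hypothetical and are not model output.

```python
# Markers listed in this model card; "0" is assumed to mean
# "no punctuation after this word".
LABELS = {"0", ".", ",", "?", "-", ":"}


def apply_punctuation(words, labels):
    """Attach each predicted marker to the word it follows."""
    if len(words) != len(labels):
        raise ValueError("expected exactly one label per word")
    restored = []
    for word, label in zip(words, labels):
        if label not in LABELS:
            raise ValueError(f"unknown label: {label!r}")
        # "0" leaves the word untouched; any other label is appended.
        restored.append(word if label == "0" else word + label)
    return " ".join(restored)


# Hypothetical labels written by hand for illustration only.
words = ["dinosaure", "jove", "la", "mostra"]
labels = ["0", ",", "0", "."]
print(apply_punctuation(words, labels))
```

Keeping the labels separate from the text makes it easy to compare predictions against a reference, which is how per-marker F1 scores are usually computed.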

Results

Performance differs across the individual punctuation markers because hyphens and colons are, in many cases, optional and can be replaced by either a comma or a full stop. The model achieves the following F1 scores for Catalan:

| Label         | CA   |
| ------------- | ---- |
| 0             | 0.99 |
| .             | 0.93 |
| ,             | 0.82 |
| ?             | 0.76 |
| -             | 0.89 |
| :             | 0.64 |
| macro average | 0.84 |
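The macro average is the unweighted mean of the per-label F1 scores, so rare markers such as ":" count as much as frequent ones. A short check using the scores listed above (the variable names are illustrative):

```python
# Per-label F1 scores for Catalan, copied from the results above.
f1_scores = {
    "0": 0.99,
    ".": 0.93,
    ",": 0.82,
    "?": 0.76,
    "-": 0.89,
    ":": 0.64,
}

# Macro average: every label contributes equally, regardless of frequency.
macro_f1 = sum(f1_scores.values()) / len(f1_scores)
print(f"{macro_f1:.2f}")
```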

Contact

Jordi Mas jmas@softcatala.org