fr_solipcysme / tokenizer
thjbdvlt's picture
Update spaCy pipeline
ef5c8ef verified
raw
history blame contribute delete
184 Bytes
{"abbrev": ["(?:p\\.)?ex\\.", "env", "etc", "cf\\.", "pp", "chap", "c\\.?-?[\u00e0a]\\.?-?d", "r[e\u00e9]f", "[e\u00e9]ds?", "trads?", "m[a\u00e0]j", "mme?s?", "mrs?", "mlles?", "dr"]}