Spaces:

ml6team
/

post-processing-summarization

Running

File size: 696 Bytes

305fb83

As you can see we have 2 unmatched entities: "January 18" and "U.S". The first one is a hallucinated entity in the summary, that does not exist in the article.
Deep learning based generation is [prone to hallucinate](https://arxiv.org/pdf/2202.03629.pdf) unintended text. These hallucinations degrade
system performance and fail to meet user expectations in many real-world scenarios. By applying entity matching, we can improve this problem
for the downstream task of summary generation. U.S. **does** occur in the article, but as "US" instead of "U.S.". This could be solved
by comparing to a list of abbreviations or with a specific embedder for abbreviations but is currently not implemented.