Update README.md
Browse files
README.md
CHANGED
@@ -81,7 +81,7 @@ Obviously, this section is quite general as there are many things one can use "g
|
|
81 |
- To be investigated further, depending on what model/system is used it _might_ be worth it to apply this after OCR on typed characters.
|
82 |
2. Correcting/infilling text generated by text generation models to be cohesive/remove obvious errors that break the conversation immersion. I use this on the outputs of [this OPT 2.7B chatbot-esque model of myself](https://huggingface.co/pszemraj/opt-peter-2.7B).
|
83 |
> TODO add an example
|
84 |
-
3. Somewhat related to #2 above, fixing/correcting so-called [tortured-phrases](https://arxiv.org/abs/2107.06751) that are dead giveaways text was generated by a language model.
|
85 |
|
86 |
|
87 |
## Training and evaluation data
|
|
|
81 |
- To be investigated further, depending on what model/system is used it _might_ be worth it to apply this after OCR on typed characters.
|
82 |
2. Correcting/infilling text generated by text generation models to be cohesive/remove obvious errors that break the conversation immersion. I use this on the outputs of [this OPT 2.7B chatbot-esque model of myself](https://huggingface.co/pszemraj/opt-peter-2.7B).
|
83 |
> TODO add an example
|
84 |
+
3. Somewhat related to #2 above, fixing/correcting so-called [tortured-phrases](https://arxiv.org/abs/2107.06751) that are dead giveaways text was generated by a language model. _Note that _SOME_ of these are not fixed, especially as they venture into domain-specific terminology (i.e. irregular timberland instead of Random Forest)._
|
85 |
|
86 |
|
87 |
## Training and evaluation data
|