commit-rewriting-visualization / dataset_statistics.py

Commit History

Fix the synthetic data generation pipeline
347f566

Petr Tsvetkov commited on

Use FUS logs (not uploaded to repo) to compare length difference and edit distance distributions in FUS and in our dataset (resulting charts are not included).
5bd86a2

Petr Tsvetkov commited on

Generate charts for the presentation & diploma;some refactoring; add (commented) Student's t-test
7ab7be2

Petr Tsvetkov commited on