Length comparison template notebook; grazie token is needed to run
bb44b5c
Petr Tsvetkovcommited on
Fix the synthetic data generation pipeline
347f566
Petr Tsvetkovcommited on
Use FUS logs (not uploaded to repo) to compare length difference and edit distance distributions in FUS and in our dataset (resulting charts are not included).
5bd86a2
Petr Tsvetkovcommited on
Added some description to the README.md
3907263
Petr Tsvetkovcommited on
Latest version of the code; config updated to JetBrains-Research
a7bba68
Petr Tsvetkovcommited on
Generate charts for the presentation & diploma;some refactoring; add (commented) Student's t-test
7ab7be2
Petr Tsvetkovcommited on
Remove the histograms
827777f
Petr Tsvetkovcommited on
Add distribution charts; add more detailed statistics; compute multi-reference TER as mean of TERs for each reference to improve the performance
303303b
Petr Tsvetkovcommited on
Add the aggregated correlations table
ff76f88
Petr Tsvetkovcommited on
Visualizer bugs fixed; added normalized editdist
aef1dbe
Petr Tsvetkovcommited on
Fix the bug
9e1ff19
Petr Tsvetkovcommited on
Fix the visualization
39950c9
Petr Tsvetkovcommited on
Update requirements.txt
ca11b66
Petr Tsvetkovcommited on
Update requirements.txt
ac712df
Petr Tsvetkovcommited on
Revert "Update requirements.txt"
1a6c40b
Petr Tsvetkovcommited on
Update requirements.txt
9df2f4c
Petr Tsvetkovcommited on
Pretty-print all the correlations in the visualization app
a01d3ba
Petr Tsvetkovcommited on
Add noref gpt-eval to the pipeline
073db2c
Petr Tsvetkovcommited on
Add edit distance and edit time metrics; add GPT-based metric
f5faae7
Petr Tsvetkovcommited on
Generate a dataset for the labeling app
6676c5a
Petr Tsvetkovcommited on
Update the parameters
2d03034
Petr Tsvetkovcommited on
Add checkpoints
e027012
Petr Tsvetkovcommited on
Compute & compare metrics
f1b08a8
Petr Tsvetkovcommited on
Prompts & params adjusted
9d943c1
Petr Tsvetkovcommited on
Default value for "start_to_end"
fbb73cc
Petr Tsvetkovcommited on
Start-to-end generation
13e3243
Petr Tsvetkovcommited on
Fix the bug
d7e2287
Petr Tsvetkovcommited on
Default value for "end_to_start" for manual df
b7f7a57
Petr Tsvetkovcommited on
Display the end-to-start field
ea84073
Petr Tsvetkovcommited on
Keep the session column
02ebb6e
Petr Tsvetkovcommited on
Add returns
5ae823f
Petr Tsvetkovcommited on
Fix the dfs in visualization
b6ae739
Petr Tsvetkovcommited on
Fix the statistics in visualization
c151bb0
Petr Tsvetkovcommited on
Fix the visualization
0b259d2
Petr Tsvetkovcommited on
- New version of the end->start synthetics samples generation
a8a595d
Petr Tsvetkovcommited on
Fix
e2a35c0
Petr Tsvetkovcommited on
# of deletions rel to initial message length
4017643
Petr Tsvetkovcommited on
Add datasets comparison
f26a894
Petr Tsvetkovcommited on
Force the light theme
642fae1
Petr Tsvetkovcommited on
Full dataset generation
34d6af9
Petr Tsvetkovcommited on
Add some examples to the synthetic ds generation prompt
574fdf5
Petr Tsvetkovcommited on
Synthetic dataset generation for the first 5 samples; visualization fixed