add some space and format
app.py CHANGED
```diff
@@ -132,7 +132,7 @@ The dataset primarily comprises data gathered from nytimes.com and GitHub.com, s
 
 - P.S. I tested ChatGPT-4.0, and the results were highly discouraging for a chunk of data consisting of 100 text field values.
 
--In the future, we should aim to increase the dataset size to at least 10,000-15,000 samples and improve the train/test/validation split methodology.
+- In the future, we should aim to increase the dataset size to at least 10,000-15,000 samples and improve the train/test/validation split methodology.
 """, unsafe_allow_html=False, help=None)
 
 st.info("I crafted this dataset using a more powerful LLM and scripts, no need for boring manual labeling. The idea is to eliminate human labeling.", icon="ℹ️")
@@ -183,7 +183,9 @@ Given the substantial volume of data, training a model from scratch was deemed i
 - Add more language in the dataset
 ### Access to the Models
 `https://huggingface.co/wgcv/tidy-tab-model-t5-small`
+
 `https://huggingface.co/wgcv/tidy-tab-model-pegasus-xsum`
+
 `https://huggingface.co/wgcv/tidy-tab-model-bart-large-cnn`
 
 ## co2_eq_emissions
```
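Both hunks are whitespace and Markdown formatting fixes inside Streamlit calls: `st.markdown` renders a Markdown string (with `unsafe_allow_html=False`, the default, raw HTML stays escaped; `help=None` omits the hover tooltip), and `st.info` draws a callout with an optional emoji icon. A minimal runnable sketch of the same pattern, with the page text shortened here rather than the app's full copy:

```python
# sketch.py -- minimal sketch of the st.markdown / st.info pattern this
# commit reformats; run with `streamlit run sketch.py`.
import streamlit as st

# unsafe_allow_html=False (the default) keeps raw HTML escaped; help=None
# simply omits the tooltip next to the rendered block.
st.markdown("""
- P.S. I tested ChatGPT-4.0, and the results were highly discouraging
  for a chunk of data consisting of 100 text field values.

- In the future, we should aim to increase the dataset size to at least
  10,000-15,000 samples and improve the train/test/validation split
  methodology.
""", unsafe_allow_html=False, help=None)

# st.info renders an informational callout; icon accepts a single emoji.
st.info("I crafted this dataset using a more powerful LLM and scripts, "
        "no need for boring manual labeling.", icon="ℹ️")
```

Note that without the space after the hyphen, Markdown does not treat `-In the future...` as a list item, which is exactly what the first hunk fixes.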
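The three links in the second hunk point at checkpoints published on the Hugging Face Hub. Assuming they are ordinary seq2seq summarization fine-tunes of their base models (t5-small, pegasus-xsum, and bart-large-cnn respectively), any of them should load through the stock `transformers` pipeline; a hedged sketch:

```python
# Hedged sketch: loading one of the linked checkpoints with the standard
# transformers API. Assumes the repos are plain seq2seq summarization
# fine-tunes; either of the other two repo ids can be swapped in the same way.
from transformers import pipeline

summarizer = pipeline("summarization", model="wgcv/tidy-tab-model-t5-small")

page_text = "Streamlit turns data scripts into shareable web apps in minutes."
result = summarizer(page_text, max_length=12, min_length=2)
print(result[0]["summary_text"])
```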