Update app.py
Browse files
app.py
CHANGED
@@ -10,9 +10,10 @@ st.title("FW clusters inspection (free topics)")
|
|
10 |
st.markdown("""
|
11 |
We clustered 100k FineWeb samples using [text-clustering](https://github.com/huggingface/text-clustering).
|
12 |
|
13 |
-
Our approach involved prompting Mixtral to evaluate whether the topics in each cluster are educational or could be considered college material using a score from 1 to 10.
|
|
|
14 |
|
15 |
-
Additionally, the model was tasked with finding the topic of each cluster.
|
16 |
""")
|
17 |
|
18 |
|
|
|
10 |
st.markdown("""
|
11 |
We clustered 100k FineWeb samples using [text-clustering](https://github.com/huggingface/text-clustering).
|
12 |
|
13 |
+
Our approach involved prompting Mixtral to evaluate whether the topics in each cluster are educational or could be considered college material using a score from 1 to 10. \
|
14 |
+
Technically, we provide it with 10 random examples from the cluster in the prompt and ask it to judge their topics.
|
15 |
|
16 |
+
Additionally, the model was tasked with finding the topic of each cluster (based on the 10 random examples).
|
17 |
""")
|
18 |
|
19 |
|