loubnabnl HF staff committed on
Commit
bc78067
1 Parent(s): 2858b59

Update app.py

Files changed (1)
  1. app.py +3 -2
app.py CHANGED
@@ -10,9 +10,10 @@ st.title("FW clusters inspection (free topics)")
  st.markdown("""
  We clustered 100k FineWeb samples using [text-clustering](https://github.com/huggingface/text-clustering).
 
- Our approach involved prompting Mixtral to evaluate whether the topics in each cluster are educational or could be considered college material using a score from 1 to 10.
+ Our approach involved prompting Mixtral to evaluate whether the topics in each cluster are educational or could be considered college material using a score from 1 to 10. \
+ Technically, we provide it with 10 random examples from the cluster in the prompt and ask it to judge their topics.
 
- Additionally, the model was tasked with finding the topic of each cluster.
+ Additionally, the model was tasked with finding the topic of each cluster (based on the 10 random examples).
  """)
 
 
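
The description added in this commit (10 random examples per cluster, one prompt asking for a topic and a 1 to 10 educational score) could be sketched roughly as below. This is only an illustration: `build_cluster_prompt`, `parse_response`, the prompt wording, the `max_chars` truncation, and the answer format are all assumptions, since the actual prompt sent to Mixtral is not part of this diff.

```python
import random
import re


def build_cluster_prompt(cluster_texts, n_examples=10, max_chars=500, seed=0):
    """Sample up to n_examples texts from one cluster and format a judging prompt.

    Hypothetical helper: the wording and the truncation length are assumptions.
    """
    rng = random.Random(seed)  # fixed seed so the sampled examples are reproducible
    k = min(n_examples, len(cluster_texts))  # small clusters may hold fewer than 10
    examples = rng.sample(cluster_texts, k)
    numbered = "\n\n".join(
        f"Example {i}: {text[:max_chars]}"
        for i, text in enumerate(examples, start=1)
    )
    return (
        "Here are samples of web pages taken from the same cluster.\n\n"
        f"{numbered}\n\n"
        "Name the common topic of these samples, then rate from 1 to 10 how "
        "educational the topic is (could it be considered college material?).\n"
        "Answer in the form:\n"
        "Topic: <topic>\n"
        "Educational score: <1-10>"
    )


def parse_response(response):
    """Extract (topic, score) from a reply in the assumed answer format."""
    topic = re.search(r"Topic:\s*(.+)", response)
    score = re.search(r"Educational score:\s*(\d+)", response)
    return (
        topic.group(1).strip() if topic else None,
        int(score.group(1)) if score else None,
    )
```

The returned prompt string would then be sent to whichever Mixtral endpoint is in use, and the reply passed to `parse_response`. Sampling with a fixed seed keeps the 10 examples stable between runs, which makes the per-cluster scores easier to compare.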