Spaces:
Running
Running
Commit History
App now retains original index following cleaning to allow for referring back to original data
90553eb
Only aggregate topics not 'other', allowed for minimum sentence length, default max_topics now will auto aggregate topics. Added Cognito Auth functionality (boto3 with AWS).
1e2bb3e
Can split passages into sentences. Improved embedding, LLM representation models, improved zero shot capabilities
55f0ce3
Updated packages. Improve hierarchy vis. Better models - mixedbread and phi3. Now option to split texts into sentences before modelling.
04a15c5
Should now parse custom regex correctly. Will now wipe previously created embeddings if 'low resource mode' option switched.
0a543a0
Sean-Case
commited on
Allowed for uploading custom regex for cleaning. Fixed calculate all probabilities, reduce outliers. Added text tree for hierarchical modelling.
381f959
Upgraded to Gradio 4.16.0. Guide for converting to exe added.
0a177ca
Fixed file load with files including capital letters
9c6425d
Added clean data options, improved re-representation options and visualisation. General format changes
4effac0
Model save now checks and makes a folder before writing the model
356791c
Lots of general fixes. New visualisations, fixed hierarchical vis for zero shot. Added calc all probabilities.
b4510a6
Split off LLM representation, visualisation, and reduce outliers from main function. Added hierarchical visualisation and logs
5d87c3c
App should now check if embeddings are loaded before topic modelling. And will save only once.
9eeba1e
Model export changed to safetensors. Improved representational model function. Got zero shot topic modelling working
4cfed8e
Sean-Case
commited on