Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Spaces:
seanpedrickcase
/
topic_modelling
like
10
App
Files
Files
Community
main
topic_modelling
/
funcs
3 contributors
History:
37 commits
seanpedrickcase
App now retains original index following cleaning to allow for referring back to original data
90553eb
about 1 month ago
__init__.py
0 Bytes
first commit
9 months ago
anonymiser.py
Safe
10.6 kB
App now retains original index following cleaning to allow for referring back to original data
about 1 month ago
auth.py
Safe
1.88 kB
Only aggregate topics not 'other', allowed for minimum sentence length, default max_topics now will auto aggregate topics. Added Cognito Auth functionality (boto3 with AWS).
3 months ago
bertopic_vis_documents.py
Safe
47.6 kB
Can split passages into sentences. Improved embedding, LLM representation models, improved zero shot capabilities
4 months ago
clean_funcs.py
Safe
4.86 kB
Only aggregate topics not 'other', allowed for minimum sentence length, default max_topics now will auto aggregate topics. Added Cognito Auth functionality (boto3 with AWS).
3 months ago
embeddings.py
Safe
3.37 kB
App now retains original index following cleaning to allow for referring back to original data
about 1 month ago
helper_functions.py
Safe
18.3 kB
App now retains original index following cleaning to allow for referring back to original data
about 1 month ago
presidio_analyzer_custom.py
Safe
4.18 kB
Added clean data options, improved re-representation options and visualisation. General format changes
9 months ago
prompts.py
Safe
6.24 kB
Updated packages. Improve hierarchy vis. Better models - mixedbread and phi3. Now option to split texts into sentences before modelling.
4 months ago
representation_model.py
Safe
7.83 kB
Removed some requirements from Dockerfile for AWS deployment to reduce container size
3 months ago
topic_core_funcs.py
Safe
38.4 kB
App now retains original index following cleaning to allow for referring back to original data
about 1 month ago