Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
seanpedrickcase
/
topic_modelling
like
14
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
89c4d20
topic_modelling
/
funcs
145 kB
4 contributors
History:
38 commits
seanpedrickcase
Improved initial clean options. Now has option to return embeddings only.
89c4d20
about 1 year ago
__init__.py
Safe
0 Bytes
first commit
almost 2 years ago
anonymiser.py
Safe
10.6 kB
App now retains original index following cleaning to allow for referring back to original data
about 1 year ago
auth.py
Safe
1.88 kB
Only aggregate topics not 'other', allowed for minimum sentence length, default max_topics now will auto aggregate topics. Added Cognito Auth functionality (boto3 with AWS).
over 1 year ago
bertopic_vis_documents.py
Safe
47.6 kB
Can split passages into sentences. Improved embedding, LLM representation models, improved zero shot capabilities
over 1 year ago
clean_funcs.py
Safe
6.54 kB
Improved initial clean options. Now has option to return embeddings only.
about 1 year ago
embeddings.py
Safe
3.37 kB
App now retains original index following cleaning to allow for referring back to original data
about 1 year ago
helper_functions.py
Safe
18.3 kB
App now retains original index following cleaning to allow for referring back to original data
about 1 year ago
presidio_analyzer_custom.py
Safe
4.18 kB
Added clean data options, improved re-representation options and visualisation. General format changes
almost 2 years ago
prompts.py
Safe
6.24 kB
Updated packages. Improve hierarchy vis. Better models - mixedbread and phi3. Now option to split texts into sentences before modelling.
over 1 year ago
representation_model.py
Safe
7.83 kB
Removed some requirements from Dockerfile for AWS deployment to reduce container size
over 1 year ago
topic_core_funcs.py
Safe
38.9 kB
Improved initial clean options. Now has option to return embeddings only.
about 1 year ago