Edit model card

C2-Topic-Model-100

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("AlexanderHolmes0/C2-Topic-Model-100")

topic_model.get_topic_info()

An example of the Chat GPT - 3.5 Turbo representations: "multiaspect.png"

Topic overview

  • Number of topics: 100
  • Number of training documents: 828299
Click here for an overview of all topics.
Topic Representation Count ChatGPT
-1 ['dumps', 'td', 'social', 'online', 'pin', 'like', 'new', 'make', 'card', 'time'] 34030 ['Home design tips and mobile betting information']
0 ['price', 'value', 'index', 'market', '31', 'total', '30', 'shares', 'years', 'assets'] 102457 ['NYSE end of day stock repo and market update for MDC']
1 ['card', 'credit', 'cards', 'account', 'rewards', 'travel', 'cash', 'points', 'purchases', 'earn'] 171217 ['Credit card application and travel tips for smoother experiences']
2 ['rating', 'trust', 'estate', 'quaer', 'shares', 'stock', 'real', 'realty', 'investment', '00'] 30538 ['Real Estate Investment Trust Shareholder Activity and Performance in the First and Fourth Quarters']
3 ['game', 'washington', 'wizards', 'season', 'bowl', 'games', 'capitals', 'team', 'spos', 'play'] 57205 ['Washington Wizards season and Capitals games']
4 ['app', 'easy', 'love', 'great', 'use', 'credit', 'payments', 'apps', 'good', 'make'] 27829 ['Easy-to-use app with great features and love for its simplicity']
5 ['2022', 'workforce', 'employees', 'announced', 'layoffs', 'million', 'business', 'cash', 'company', 'income'] 67826 ['Layoffs and restructuring announcements in 2022 by large companies listed on NASDAQ, NYSE, and other stock exchanges']
6 ['deals', 'save', 'black', 'deal', 'friday', 'walma', 'cyber', 'shopping', 'browser', 'best'] 23384 ['Early Black Friday Deals on Monitors, Soundbars, Apple Watch Series, and 60/58 Inch 4K TVs']
7 ['app', 'account', 'update', 'log', 'phone', 'login', 'password', 'use', 'spark', 'quicken'] 25012 ['Issues with App Account Transactions and Security']
8 ['easy', 'use', 'convenient', 'navigate', 'friendly', 'fast', 'simple', 'works', 'quick', 'great'] 20637 ['easy to use and very convenient']
9 ['ticketmaster', 'swift', 'presale', 'taylor', 'ticket', 'fans', 'tour', 'eras', 'verified', 'sale'] 14825 ["Ticketmaster's Chaos with Taylor Swift Ticket Sales"]
10 ['work', 'cons', 'pros', 'people', 'interview', 'great', 'good', 'like', 'tech', 'remote'] 16734 ['Software engineering degree apprenticeship and business planning']
11 ['rating', 'stock', '00', 'shares', 'evgo', 'transocean', 'research', 'petroleum', 'quaer', 'company'] 9483 ['Stock Rating of Occidental Petroleum and ConocoPhillips']
12 ['patent', 'virginia', 'alexandria', 'inventors', 'assigned', 'initially', 'developed', 'filed', 'application', 'b2'] 5562 ['Patents awarded to Virginia inventors in Alexandria']
13 ['en', 'la', 'el', 'que', 'los', 'del', 'para', 'por', 'una', 'las'] 7589 ['Mexican fintech startup reaches unicorn status amid global crisis']
14 ['wiki', 'pirates', 'like', 'piece', 'luffy', 'manga', 'anime', 'lol', 'chapter', 'vol'] 29399 ['Characters and events in various series and performances involving pirates and music']
15 ['biden', 'capitol', 'trump', 'president', 'election', 'said', 'democracy', 'ukraine', 'people', 'house'] 15650 ["President Biden's Speech Commemorating the Capitol Riot Anniversary and Accusations Against Trump"]
16 ['great', 'good', 'excellent', 'awesome', 'nice', 'ok', 'wonderful', 'thanks', 'service', 'amazing'] 9391 ['High Satisfaction Service']
17 ['hi', 'hey', 'hello', 'hu', 'bush', 'yuh', 'howdy', 'chiasson', 'kira', 'yoe'] 3496 ['Greetings and Names']
18 ['cou', 'county', 'case', 'plaintiff', 'filed', 'notice', 'attorney', 'judgment', 'civil', 'said'] 9937 ['Foreclosure Auction Notices in Suffolk County Courts']
19 ['arena', 'center', 'tour', 'music', 'dates', 'tx', 'ca', 'album', 'garden', 'band'] 16963 ["Madonna's Global Arena Tour Dates and Ticket Information"]
20 ['ihearadio', 'jingle', 'ball', 'photo', 'presented', 'tour', 'lizzo', 'lovato', 'demi', 'stage'] 7139 ['ihearadio jingle ball 2022 performances featuring dua lipa, lizzo, charlie puth, the kid laroi, ajr, demi lovato']
21 ['lounge', 'airpo', 'new', 'food', 'city', 'restaurant', 'lounges', 'hotel', 'like', 'park'] 13708 ['Pebblecreek Retirement Community and Amenities']
22 ['garcia', 'davis', 'fight', 'gervonta', 'hector', 'ennis', 'boxing', 'luis', 'tank', 'wwe'] 3245 ['Gervonta Davis defends title against Hector Luis Garcia in boxing match']
23 ['ihearadio', 'photo', 'festival', 'music', 'getty', 'images', 'ego', 'alter', 'fans', 'chili'] 4962 ['2023 iHeartRadio Alter Ego Music Festival Highlights']
24 ['farm', 'farmers', 'loans', 'mogage', 'loan', 'agricultural', 'usda', 'agriculture', 'program', 'land'] 2692 ['Farm Loans and Financial Assistance for Farmers']
25 ['banks', 'bank', 'zelle', 'overdraft', 'fees', 'customers', 'said', 'fraud', 'money', 'silicon'] 8970 ["Impact of Bank of America's Overdraft Fee Reduction on the Banking Industry"]
26 ['covid', 'nyc', 'bar', 'search', 'map', 'detroit', 'educational', 'recognition', 'missing', 'delta'] 1398 ['Search for COVID information in NYC and Detroit']
27 ['helix', 'solutions', 'aris', 'energy', 'water', 'rating', 'hlx', 'group', 'research', 'stock'] 1635 ['Helix Energy Solutions Group Stock and Ratings Analysis']
28 ['bs', 'blt', 'bue', 'crap', 'bib', 'ugly', 'null', 'honestly', 'bachelor', 'coupon'] 1286 ['bs, blt, bue, crap, bib, ugly, null, honestly, bachelor, coupon']
29 ['matador', 'ironwood', 'resources', 'mtdr', 'pharmaceuticals', 'company', 'rating', 'irwd', 'shares', 'stock'] 1336 ['matador resources stock acquisitions and analyst ratings']
30 ['doubleverify', 'dv', 'shopify', 'rating', 'stock', 'shares', '00', 'quaer', 'research', 'shop'] 1184 ['DoubleVerify Holdings Inc (NYSE: DV) Stock and Investor Activity']
31 ['__', 'add', 'date', 'correct', 'poor', 'location', 'pm', 'a1', 'aqesome', 'interested'] 1583 ['Data Quality Enhancement - Location Correction and Date Addition']
32 ['amphastar', 'crowdstrike', 'pharmaceuticals', 'amph', 'rating', 'stock', 'shares', 'crwd', 'sold', 'company'] 1087 ['Analysis of recent developments in the stock ratings and insider activities of Amphastar Pharmaceuticals']
33 ['boy', 'wentz', 'fob', 'band', 'ego', 'alter', 'cryptic', 'stump', 'album', 'guitar'] 1233 ['Fall Out Boy announces new era with cryptic ad and upcoming album, guitarist Joe Trohman discusses evolving sound']
34 ['nabors', 'industries', 'arvinas', 'drilling', 'rating', 'nbr', '00', 'research', 'target', 'stock'] 1215 ['Nabors Industries stock ratings and financial performance']
35 ['arena', 'blink', 'center', '182', 'tour', 'jul', 'delonge', 'oct', 'festival', 'band'] 1573 ['Blink 182 World Tour Announcement with Tom DeLonge Reunion and New Album']
36 ['biosciences', 'akoya', 'biopharmaceuticals', 'ideaya', 'stock', 'kodiak', 'shares', 'sciences', 'akya', 'price'] 1795 ['Nasdaq trading update for Akoya Biosciences Inc.']
37 ['therapeutics', 'rapt', 'chimerix', 'cmrx', 'rating', 'stock', 'repare', 'shares', '00', 'adc'] 1170 ['Rapt Therapeutics Stock Rating Analysis']
38 ['written', 'episode', 'news', 'video', 'season', 'new', 'wiki', 'january', 'december', 'live'] 5740 ["NYE events scaled back, Jo Koy's comedy journey, booster shots at Boston first night, Chicago 2021 review"]
39 ['like', 'commercial', 'know', 'wallet', 'got', 'hesitation', 'love', 'don', 've', 'dot'] 13044 ['Capital One Financial Services']
40 ['wizards', 'homebody', 'predictions', 'odds', 'picks', 'et', 'tip', 'spurs', 'nba', 'expe'] 2093 ['New York Knicks vs Washington Wizards NBA Predictions and Odds']
41 ['sweepstakes', 'sponsor', 'prize', 'entry', 'winner', 'station', 'text', 'edt', 'designated', 'entrant'] 852 ['Sweepstakes rules and eligibility for 2022 iheacountry festival flyaway sweepstakes']
42 ['cameron', 'getty', 'images', 'photo', 'song', 'singer', 'sultry', 'stage', 'dove', 'noh'] 785 ["Dove Cameron's Performance at iHeartRadio Jingle Ball 2022"]
43 ['arena', 'center', 'aug', 'drake', 'tour', 'jul', 'ca', 'sat', 'lamar', 'blur'] 1120 ['Drake "It All Blur" 2023 Tour Dates']
44 ['eur1', 'eur', 'price', 'director', 'change', 'assets', 'past', 'section', 'net', 'year'] 2216 ['Belgian diversified holding company stock price movements in EUR']
45 ['puth', 'like', 'love', 'song', 'music', 'jojo', 'charlie', 'loser', 'arpeggios', 'jungkook'] 1798 ["Charlie Puth's new album release and bromance with Jungkook from BTS"]
46 ['easy', 'peasy', 'fast', 'quick', 'simple', 'super', 'smooth', 'thanks', 'pretty', 'love'] 2191 ['Topic: Easy and Quick Solutions']
47 ['immersive', 'vr', 'tech', 'forward', 'looking', 'xr', 'cse', 'uncontained', 'information', 'synthesisvr'] 1611 ['XR Immersive Tech and SynthesisVR Announcement and Updates']
48 ['chargepoint', 'chpt', 'rating', 'stock', '00', 'shares', 'research', 'sold', 'quaer', 'target'] 592 ['ChargePoint Holdings Inc Stock Analysis and Investor Activity']
49 ['chvrches', 'hott', 'stranding', 'mayberry', 'ego', 'alter', 'death', 'lauren', 'band', 'followed'] 563 ['Chvrches performance at iHeartRadio Alter Ego and potential involvement in Death Stranding soundtrack']
50 ['tkk', 'tk', 'tzsarbreaux', '20se', 'ttds', 'lmk', 'tdp', 'demko', 'vols', 'tcl'] 630 ['tkk, tk, tzsarbreaux, 20se, ttds, lmk, tdp, demko, vols, tcl']
51 ['morello', 'white', 'belasco', 'untamable', 'unpredictable', 'wild', 'pa', 'theater', 'awesome', 'måneskin'] 561 ['Jack White and Tom Morello Rock Out at Belasco Theater']
52 ['peapack', 'gladstone', 'pgc', 'ffo', 'expenses', 'management', 'development', 'financial', 'product', 'candidates'] 677 ['NYSE stock performance of Plymouth Industrial REIT']
53 ['white', 'footage', 'ihearadio', 'beck', 'stripes', 'shaped', 'jack', 'solo', 'fans', 'army'] 668 ["Jack White's Asia Tour and Fan Engagement"]
54 ['libey', 'energy', 'lb', 'oilfield', 'stock', 'rating', 'shares', '00', 'services', 'company'] 853 ['Libey Energy Inc Stock Performance and Analyst Ratings']
55 ['trump', 'investigation', 'james', 'attorney', 'organization', 'donald', 'office', 'said', 'evidence', 'cou'] 1024 ["Investigation into Trump Organization's Alleged Misleading Asset Valuations led by Attorney General James"]
56 ['adr', 'adrs', 'mellon', 'depository', 'composite', 'york', 'llc', 'years', 'past', 'price'] 3156 ['topic stenka: adr adrs composite price york years']
57 ['energy', 'rating', 'chesapeake', 'oil', 'enerplus', 'gas', 'research', '00', 'estimates', 'stock'] 2762 ['Energy Stock Ratings and Earnings Forecasts']
58 ['center', 'arena', 'tso', '12', 'ghosts', 'tour', 'eve', 'oh', 'matinee', 'siberian'] 484 ['Trans Siberian Orchestra 2022 Winter Tour - The Ghosts of Christmas Eve Tour Dates']
59 ['socure', 'identity', 'verification', 'fraud', 'ventures', 'customers', 'digital', 'identities', 'mend', 'sass'] 567 ['Socure - Leader in Digital Identity Verification and Fraud Solutions']
60 ['dua', 'pop', 'lipa', 'ihearadio', 'nostalgia', 'album', 'dance', 'dancers', 'photo', 'jingle'] 747 ["Dua Lipa's Pop Success and Dance Performances at iHeartRadio Jingle Ball"]
61 ['laroi', 'kid', 'song', 'unreleased', 'acoustic', 'ihearadio', 'biggest', 'stage', 'jingle', 'glowed'] 413 ["The Kid Laroi's Unreleased Acoustic Songs at iHeartRadio Jingle Ball"]
62 ['fennec', 'pharmaceuticals', 'fenc', 'rating', '00', 'frx', 'stock', 'research', 'analysts', 'company'] 466 ['Fennec Pharmaceuticals Stock Analysis & Ratings']
63 ['lizzo', 'dance', 'kardashian', 'noh', 'ones', 'cw', 'fun', 'ryan', 'conce', 'ihearadio'] 454 ["Lizzo and Noh's Fun Dance Collaboration with Kardashian and Ryan at Concert"]
64 ['saul', 'centers', 'bfs', 'rating', 'zacks', 'research', 'stock', 'shares', 'estate', 'buy'] 316 ['Saul Centers Inc (NYSE: BFS) Earnings and Stock Analysis']
65 ['ryan', 'brothers', 'jingle', 'ajr', 'ihearadio', 'ball', 'jack', 'inspired', 'trio', 'weak'] 536 ['AJR brothers performance at iHeartRadio Jingle Ball with inspired song "Weak"']
66 ['food', 'wine', 'festival', 'chef', 'beach', 'miami', 'sobewff', 'chefs', 'beard', 'restaurant'] 1107 ['South Beach Wine and Food Festival in Miami Beach']
67 ['arena', 'spos', 'pumpkins', 'smashing', 'center', 'betting', 'addiction', 'corgan', '10', 'jane'] 636 ['Smashing Pumpkins' Epic 33-Track Album "Atum" Release and Tour Dates']
68 ['max', 'ava', 'jingle', 'ihearadio', 'ball', 'changed', 'perception', 'breakup', 'weapons', 'album'] 420 ["Ava Max's Performance at the 2022 iHeartRadio Jingle Ball and Breakup-Inspired Album"]
69 ['montrose', 'environmental', 'meg', 'group', 'rating', 'permitting', 'response', 'projects', 'shares', 'stock'] 347 ['Montrose Environmental Group Stock Analysis and Ratings']
70 ['lauv', 'song', 'jingle', 'photo', 'getty', 'images', 'loneliness', 'ball', 'ihearadio', 'tired'] 344 ["Lauv's Emotional Performance at iHeartRadio Jingle Ball"]
71 ['niall', 'harry', 'album', 'circus', 'narry', 'direction', 'horan', 'liam', 'thank', 'fan'] 396 ["Potential Harry Styles Collaboration on Niall Horan's Upcoming Circus-Themed Album"]
72 ['flute', 'lizzo', 'library', 'crystal', 'congress', 'madison', 'history', 'james', 'played', 'instrument'] 1246 ["Lizzo Playing James Madison's Crystal Flute at Library of Congress"]
73 ['alamos', 'agi', 'gold', 'newswire', 'globe', 'creighton', 'tsx', 'toronto', 'production', 'georgetown'] 516 ['Alamos Gold Inc Financial Results Q3 2022']
74 ['capitals', 'sharks', 'nhl', 'expe', 'odds', 'puck', 'lines', 'analyze', 'scheduled', 'tipico'] 571 ['NHL game analysis between San Jose Sharks and Washington Capitals']
75 ['07', '06', 'arena', 'center', 'paramore', 'album', '05', 'ca', 'kia', 'fieldhouse'] 698 ["Paramore's 2023 Tour Dates at Various Arenas"]
76 ['dcfc', 'tritium', 'chargers', 'rating', 'limited', 'research', 'stock', '00', 'repo', 'shares'] 288 ['Tritium DCFC Stock Ratings and Institutional Trading']
77 ['groove', 'engagement', 'sales', 'g2', 'salesforce', 'platform', 'enterprises', 'leading', 'trustradius', 'customer'] 354 ['Groove - Leading Salesforce Sales Engagement Platform for Enterprises on G2']
78 ['album', 'billboard', 'songs', 'cha', 'swift', 'trending', 'hot', 'bts', 'midnights', 'song'] 962 ['Billboard Hot Trending Songs - Swift's "Midnights" Album and BTS Songs']
79 ['google', 'viual', 'card', 'cards', 'pay', 'wallet', 'android', 'use', 'credit', 'payment'] 1194 ['Virtual Credit Cards for Secure Online Shopping']
80 ['train', 'album', 'gold', 'ihearadio', 'release', 'exclusive', 'love', 'songs', 'tour', 'jewel'] 540 ['Train's "Am Gold" Album Release Event with iHeartRadio']
81 ['ihearadio', 'chili', 'peppers', 'hair', 'hot', 'shared', 'red', 'double', 'toothed', 'bukowski'] 381 ['Red Hot Chili Peppers bassist Flea and iHeartRadio events']
82 ['dj', 'baseball', 'ice', 'hockey', 'soccer', 'football', 'basketball', 'videos', 'prnewswire', 'token'] 749 ['sports entertainment and news']
83 ['dhabi', 'abu', 'adcb', 'aed1', 'al', 'aed', 'arab', 'bank', 'commercial', 'mohamed'] 360 ['Abu Dhabi Commercial Bank (ADCB) Stock Analysis']
84 ['salle', 'saint', 'la', 'joseph', 'explorers', 'odds', 'hawks', 'tournament', 'st', 'atlantic'] 254 ['Saint Joseph vs La Salle Atlantic 10 Tournament Odds Prediction']
85 ['holiday', 'day', 'open', 'closed', 'holidays', 'bank', 'banks', 'christmas', 'hours', 'federal'] 976 ['Bank Holiday Schedule for 2022 - Are Banks Open or Closed on MLK Day and Other Holidays?']
86 ['iheacountry', 'festival', 'country', 'ihearadio', 'austin', 'pt', 'ram', 'stage', 'performances', 'moody'] 1490 ['iheacountry festival lineup in Austin, May 13th']
87 ['foods', 'simply', 'smpl', 'kilts', 'good', 'atkins', 'mr', 'growth', 'scalzo', 'nasdaq'] 387 ['Simply Good Foods Company (NASDAQ: SMPL) Stock Analysis - End of Day and End of Week Reports']
88 ['parking', 'howard', 'city', 'divisons', 'washingtondc', 'citycenterdc', 'mpd', 'reliably', 'nats', 'auxiliary'] 373 ['Parking Resources in Washington, DC']
89 ['bellamy', 'delonge', 'extraterrestrial', 'ego', 'alter', 'alien', 'disappointment', 'muse', 'hunting', 'tom'] 289 ["Tom DeLonge's Alien Hunting Invitation to Matt Bellamy at iHeartRadio Alter Ego"]
90 ['rhett', 'country', 'thomas', 'iheacountry', 'album', 'akins', 'underwood', 'ihearadio', 'festival', 'song'] 4136 ["Thomas Rhett's Exclusive iHeartRadio Album Release Party featuring Collaborations and New Songs"]
91 ['lgbt', 'prnewswire', 'national', 'nglcc', 'wbenc', 'nbic', 'council', 'saturday', 'business', 'corporations'] 912 ['National LGBT Chamber of Commerce Annual CoHo Announcement - Washington, Oct 27 2022']
92 ['tires', 'tire', 'goodyear', 'save', 'walma', 'snow', 'deals', 'firestone', 'michelin', 'terrain'] 453 ['Cyber Monday and Black Friday Tire Deals: Goodyear, Michelin, Firestone, Walmart (Walma)']
93 ['customer', 'marketing', 'customers', 'business', 'b2b', 'digital', 'brand', 'transformation', 'centric', 'brands'] 844 ['The Role of Email Marketing in Driving Customer-Centric Digital Transformation']
94 ['musk', 'elon', 'tiktok', 'social', 'media', 'users', 'said', 'tech', 'content', 'google'] 1539 ["Twitter chaos and Musk's influence on the app economy"]
95 ['cincinnati', 'fhlb', 'results', 'unaudited', 'prnewswire', 'released', 'federal', 'loan', 'home', 'ended'] 581 ['Federal Home Loan Bank of Cincinnati Unauidted Financial Results - Q3 2022']
96 ['accurate', 'complete', 'securities', 'reader', 'necessarily', 'assume', 'reviewed', 'determined', 'information', 'commission'] 383 ['securities and exchange commission filing review and accuracy']
97 ['22', 'arena', 'center', 'vengeance', 'viva', 'las', 'panic', '10', 'disco', 'tx'] 733 ["Panic at the Disco's Viva Las Vengeance Tour Dates"]
98 ['button', 'mobile', 'marketers', 'commerce', 'prnewswire', 'platform', 'cookieless', 'optimized', 'surpassed', 'leading'] 546 ['mobile commerce platform surpasses billion in revenue']

Training hyperparameters

  • calculate_probabilities: False
  • language: None
  • low_memory: False
  • min_topic_size: 500
  • n_gram_range: (1, 1)
  • nr_topics: 100
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: True

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.33
  • UMAP: 0.5.5
  • Pandas: 2.0.3
  • Scikit-Learn: 1.4.1.post1
  • Sentence-transformers: 2.5.1
  • Transformers: 4.39.1
  • Numba: 0.59.1
  • Plotly: 5.20.0
  • Python: 3.11.8
Downloads last month
9
Inference API
This model can be loaded on Inference API (serverless).

Space using AlexanderHolmes0/C2-Topic-Model-100 1

Collection including AlexanderHolmes0/C2-Topic-Model-100