There is 1 repository under the bertopic topic.
KoBERTopic is code that modifies the tokenizer and BERT so that BERTopic can be applied to Korean data.
HDBSCAN Tuning for BERTopic Models
We created a topic modeling pipeline to evaluate different topic modeling algorithms, including their performance on short and long text, preprocessed and not preprocessed datasets, and with different embedding models. Finally, we summarized the results and suggested how to choose algorithms based on the task.
Topic modeling for NYT articles.
We present our concept of a new type of Active Learning for deep-learning NLP text classification and experimentally compare its performance against Random Sampling, as well as its runtime performance, on the Security Threat dataset from CySecAlert. These new Active Learning algorithms are based on Sentence-BERT and BERTopic clustering algorithms, which allow us to generate fixed-length embeddings for whole sentences so that they can be compared with each other. The embeddings are then clustered using K-Means or HDBSCAN to obtain diverse clusters from which the samples are picked.
Slides, Notebook and Data for Presentation: DataHour: Harnessing ML and NLP for Elevated Customer Experiences
Natural Language Processing with Twitter data
Text Mining Final Project about Twitter Topic Modeling with different models
A content-based recommendation system for Hacker News using topic modeling (BERTopic)
BERT-based Topic Modeling on New York Times Headlines (160k rows)
Topic modelling and analysis of different UK newspapers, primarily using BERTopic
This is my final-year project, "Customer reviews classification and analysis system using data mining and NLP". It analyzes customer reviews and classifies them by fakeness, sentiment, context, and topics discussed. The reviews are taken from various e-commerce platforms such as Daraz and Amazon.
Aspect Based Sentiment Analysis (ABSA) of Customer Reviews
Forecasting Private Capital Market using published research and patents. Project developed at Michigan State University under the guidance of Dr. Mohammed Ghassemi for JP Morgan Chase.
Hierarchical Topic Modeling
BoardTopic is a friendly way to understand your big data. BoardTopic uses state-of-the-art frameworks for topic modeling (BERTopic) and language models to help you analyze and make sense of your data, no coding required.
A topic modeling pipeline using the BERTopic model and state-of-the-art technologies
Summarize App Reviews with NLP
This project contains an NLP pipeline for topic labelling with BERTopic
NLP Topic Modeling Techniques (LDA, LSA & BERTopic)
Pipeline leveraging UMAP and HDBSCAN with BERTopic for large datasets.
This project analyses a collected dataset about people experiencing insomnia. This repository contains the code to construct the dataset using the Twitter API. Part of the data is then annotated with sentiment labels, which are used for training transformer models. The data can also be clustered into topics with BERTopic for further analysis.
Comparison of Topic Modeling Approaches on Complaints Related to the E-Commerce Industry
Use Siebert and BERTopic Model on Persian Dataset
Tool that allows characterizing territorial issues through news processing. It performs topic clustering, sentiment analysis, and data analysis using BERTopic & ChatGPT.
Understanding cancer's impact is vital for mental and physical health. Communities like r/cancer offer crucial support, sharing experiences and information, transcending boundaries to empower those affected, fostering unity amid life-altering diagnoses.
A data visualization dashboard powered by Dash framework & Python for Higher Education Institution (HEI) Online Reputation System (Final Year Project)
Unveiling Sentiments and Topics in COVID-19 Vaccine Comments on YouTube Over Time: from the First Vaccine Approval to the Post-Pandemic Era
The electronic theses and dissertations topic modeling project was conducted by the Chinese University of Hong Kong Library.
Topic modeling on Shopee's 1-star reviews to uncover insights and prevalent topics within the reviews.
Research Project: Analysis of Chinese Financial Discourse Based on Topic Clustering and Emotional Evolution | Fall 2023 - Spring 2024
Extract and process academic paper data (authors, citation counts, influential citation counts, references) from any search query, cluster the papers based on their abstracts, and analyze their features using R and RShiny
Identify the freedom of a local news outlet by comparing sentiment and stance of published news against international outlets.
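The Active Learning entry above describes clustering sentence embeddings and then picking samples from the resulting clusters so that the selection is diverse. A minimal sketch of that idea follows. It is not taken from that repository: the toy 2-D vectors stand in for Sentence-BERT embeddings, a plain standard-library k-means stands in for a library K-Means or HDBSCAN, and the function names `kmeans` and `pick_diverse_samples` are hypothetical.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, iterations=20, seed=0):
    """Plain Lloyd's k-means. Returns (labels, centroids)."""
    rng = random.Random(seed)
    # Initialise centroids from k distinct input vectors.
    centroids = [list(v) for v in rng.sample(vectors, k)]
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assignment step: attach each vector to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: squared_distance(v, centroids[c]))
            for v in vectors
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


def pick_diverse_samples(vectors, k, seed=0):
    """Pick one index per non-empty cluster: the member nearest its centroid."""
    labels, centroids = kmeans(vectors, k, seed=seed)
    picks = []
    for c in range(k):
        members = [i for i, lab in enumerate(labels) if lab == c]
        if members:
            picks.append(
                min(members, key=lambda i: squared_distance(vectors[i], centroids[c]))
            )
    return picks


# Toy stand-ins for sentence embeddings (assumption: real ones would come
# from a sentence-embedding model and have hundreds of dimensions).
toy_embeddings = [
    (0.0, 0.1), (0.1, 0.0), (0.05, 0.05),
    (5.0, 5.1), (5.1, 5.0), (5.05, 5.05),
    (10.0, 0.1), (10.1, 0.0), (10.05, 0.05),
]
picks = pick_diverse_samples(toy_embeddings, k=3)
print("indices selected for labelling:", picks)
```

Taking the member nearest each centroid is only one possible selection rule. Because k-means depends on its initialisation, the clusters found, and therefore the picks, can differ between seeds.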