There are 0 repository under count-vectorizer topic.
This repository contains Ipython notebooks and datasets for the data analytics youtube tutorials on The Semicolon.
It is a Natural Language Processing Problem where Sentiment Analysis is done by Classifying the Positive tweets from negative tweets by machine learning models for classification, text mining, text analysis, data analysis and data visualization
This project employs emotion detection in textual data, specifically trained on Twitter data comprising tweets labeled with corresponding emotions. It seamlessly takes text inputs and provides the most fitting emotion assigned to it.
NLP based Classification Model that predicts a person's personality type as one of the 16 Myers Briggs personality types. Extremely challenging project dealing with correlation between human psychology and casual writing styles and handling heavily imbalanced classes. Check the app here - https://mb-predictor-motetuzs5q-uc.a.run.app/
Engaged in research to help improve to boost text sentiment analysis using facial features from video using machine learning.
A Natural Language Processing with SMS Data to predict whether the SMS is Spam/Ham with various ML Algorithms like multinomial-naive-bayes,logistic regression,svm,decision trees to compare accuracy and using various data cleaning and processing techniques like PorterStemmer,CountVectorizer,TFIDF Vetorizer,WordnetLemmatizer. It is implemented using LSTM and Word Embeddings to gain accuracy of 97.84%.
Fake News Detection System for detecting whether news is fake or not. The model is trained using "Liar, Liar Pants on Fire": A New Benchmark Dataset for Fake News Detection. Link for dataset: https://arxiv.org/abs/1705.00648.
Twitter Sentiment Analysis Using InSet (Indonesia Sentiment Lexicon) and Random Forest Classifier
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
Built a movie recommender system with Streamlit and deploy in Heroku Platform.
Twitter Sentiment Analysis Using Vader Lexicon and Random Forest Classifier
This project involves detecting fake news using a decision tree classifier in Jupyter Notebook. Fake news detection is an important task in the field of natural language processing and machine learning, as it helps identify and filter out misleading or false information.
:syringe: Vaccine Sentiment Classifier is a deep learning classifier trained on real world twitter data, that distinguishes 3 types of tweets: Neutral, Anti-vax & Pro-vax.
Text Mining project about Sentiment Analysis of Drugs Reviews.
Scrapped tweets using twitter API (for keyword ‘Netflix’) on an AWS EC2 instance, ingested data into S3 via kinesis firehose. Used Spark ML on databricks to build a pipeline for sentiment classification model and Athena & QuickSight to build a dashboard
Spam Classifier project for my end-of-semester project for Intro to AI class. We were a group of four people. I worked on all the Naive Bayes models.
A machine learning model that predicts tags for a given question and body.
Application of Machine Learning Techniques for Text Classification and Topic Modelling on CrisisLexT26 dataset.
Natural Language Processing Recipes
This is a restaurant reviewer model which was bulit using the concept of NLP. It was built Jupyter notebook on python version 3.10.
A simple Sklearn based example to demonstrate the working of TF-IDF.
Kaggle Competition - Natural Language Processing with Disaster Tweets
MediaEval challenge 2019 - to predict the memorability of the Videos
my exercises of course natural language processing datacamp
Movie Recommendation - provides user with the top choices of movie he/she wanted to watch based on their current choice
Short Stories Recommendations.
A Natural Language Processing with SMS Data to predict whether the SMS is Spam/Ham with various ML Algorithms like multinomialNB & GaussianNB to compare accuracy and using various data cleaning and processing techniques like PorterStemmer,CountVectorizer. It is implemented using LSTM and Word Embeddings to gain accuracy of 97.70% .
AI-powered classifier mobile app using NLP to spot fake job ads and protect users from online scams. Our system analyzes language patterns and leverages algorithms to create a safe and trustworthy job search experience.
The movie recommendation system is implemented using content based filtering
The scope of this project is to classify fake and true news. After performing an analysis on the dataset using two different vectorizers and two machine learning algorithms, the results are conveyed in the form of accuracy score and confusion matrices.
This simple project detects spam content using NLP. It is further powered by MLOps consisting of Docker and Github CI/CD.
Movie recommendation system uses the user input and generate similar kind of movies using cosine similarity and countvectorizer techniques
This project uses machine learning to classify messages as spam or ham based on text analysis. It includes data preprocessing, feature extraction (TF-IDF), and classification models like Logistic Regression and Naive Bayes for accurate spam detection. Built with Python and Scikit-Learn. 🚀
Movie Recommender System
ViewWise is a recommendation system project that suggests TV shows based on cosine similarity between their metadata. By analyzing aggregated textual data of TV shows, the system provides users with personalized recommendations from a curated list of popular shows.
Machine learning project to identify semantically duplicate questions