Repositories under the countvectorizer topic.
Grocery Recommendation on Instacart Data
A web app that recommends series and movies. Given a list of movies or series you enter, it returns a list of suggested media in your preferred language. The backend uses Python and Flask; the frontend uses HTML, CSS and JavaScript.
A Naive Bayes spam/ham classifier based on Bayes' Theorem. A set of email subjects is first used to train the classifier, and then a previously unseen email subject is fed in to predict whether it is spam or ham.
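The train-then-predict flow described above can be sketched with a small multinomial Naive Bayes written against the standard library only. This is a minimal illustration, not that repository's code: the training subjects, the whitespace tokenizer and the function names `train` and `predict` are all assumptions made up for the example, and add-one (Laplace) smoothing is one common choice among several.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: lowercase + whitespace split stands in for real tokenization.
    return text.lower().split()


def train(samples):
    """Count documents per class and word occurrences per class."""
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in samples:
        class_counts[label] += 1
        for tok in tokenize(text):
            word_counts[label][tok] += 1
            vocab.add(tok)
    return class_counts, word_counts, vocab


def predict(model, text):
    """Return the label with the highest log posterior (add-one smoothing)."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_docs in class_counts.items():
        score = math.log(n_docs / total_docs)  # log prior
        total_words = sum(word_counts[label].values())
        for tok in tokenize(text):
            if tok not in vocab:
                continue  # ignore words never seen in training
            likelihood = (word_counts[label][tok] + 1) / (total_words + len(vocab))
            score += math.log(likelihood)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Made-up email subjects, for illustration only.
TRAIN = [
    ("win a free prize now", "spam"),
    ("free money claim your prize", "spam"),
    ("meeting agenda for monday", "ham"),
    ("project status and meeting notes", "ham"),
]
model = train(TRAIN)
print(predict(model, "claim your free prize"))
print(predict(model, "monday project meeting"))
```

Working in log space avoids underflow when many small word probabilities are multiplied together.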
Hire the Perfect candidate. HackerEarth Competitions solution.
Spam message detection using classifier
Graduation Project/Sentiment Analysis in Turkish Film Reviews
A hate speech detection model built with Count Vectorizer and an XGBoost classifier, with an accuracy of up to 0.9471. It can be used to predict whether tweets are hate or non-hate.
Assignment-11-Text-Mining-01-Elon-Musk: perform sentiment analysis on the Elon Musk tweets (Exlon-musk.csv). Text preprocessing: strip leading and trailing characters; remove empty strings (Python treats them as False); join the list into one string; remove Twitter username handles (@usernames); join the list into one string again; remove punctuation; remove https links and other URLs; tokenize; remove stopwords; normalize the data; stemming (optional); lemmatization. Feature extraction: BoW CountVectorizer, CountVectorizer with n-grams (bigrams and trigrams), TF-IDF Vectorizer. Further steps: generate a word cloud, named entity recognition (NER), and emotion mining / sentiment analysis.
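A few of the preprocessing steps listed above (stripping, removing handles, URLs, punctuation and stopwords, then building n-grams) can be sketched with the standard library. The stopword set, the sample text and the helper names `clean_tweet` and `ngrams` are hypothetical, and stemming and lemmatization are left out because they need an external library such as NLTK.

```python
import re
import string

# Assumption: a tiny illustrative stopword list, not a real one.
STOPWORDS = {"the", "a", "an", "is", "to", "and", "of", "in", "on", "for"}


def clean_tweet(text):
    """Apply the cleaning steps in an order that keeps each pattern intact."""
    text = text.strip()                       # leading/trailing whitespace
    text = re.sub(r"@\w+", "", text)          # Twitter handles
    text = re.sub(r"https?://\S+", "", text)  # URLs, before punctuation removal
    text = text.translate(str.maketrans("", "", string.punctuation))
    tokens = text.lower().split()             # split() never yields empty strings
    return [t for t in tokens if t not in STOPWORDS]


def ngrams(tokens, n):
    """Return contiguous n-grams joined by a single space."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


# Made-up text, for illustration only.
tokens = clean_tweet("  @SpaceX The Starship launch is a big success! https://example.com/launch  ")
bigrams = ngrams(tokens, 2)
trigrams = ngrams(tokens, 3)
print(tokens)
print(bigrams)
print(trigrams)
```

Handles and URLs are removed before punctuation here because stripping `@`, `:` and `/` first would leave fragments that the two regular expressions no longer match.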
This is the Movie Recommendation System project using a Content-Based recommender system trained on more than 5000 movies for generating movie recommendations based on user search.
Uses a content-based approach to suggest films. Films are recommended based on user feedback, and the machine learning model suggests related and similar films to the user.
Classification of emails received on a mass distribution group
This project suggests a list of movies based on the movie title you enter. It uses Count Vectorizer (a text feature extraction tool) to find relations between similar movies.
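One common way to turn count vectors into recommendations is to rank titles by cosine similarity between their word-count vectors. The sketch below assumes that approach; the description does not say which similarity measure the project uses, and the movie titles, tag strings and function names are invented for illustration.

```python
import math
from collections import Counter

# Hypothetical titles and tag strings, for illustration only.
MOVIES = {
    "Space Quest": "space adventure alien crew",
    "Galaxy Run": "space alien chase adventure",
    "Love in Paris": "romance paris drama",
}


def count_vector(text):
    """Bag-of-words counts for one document."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend(title, k=2):
    """Return the k titles whose tags are most similar to the given title's."""
    query = count_vector(MOVIES[title])
    scores = [
        (other, cosine(query, count_vector(tags)))
        for other, tags in MOVIES.items()
        if other != title
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return [name for name, _ in scores[:k]]


print(recommend("Space Quest", k=1))
```

Cosine similarity depends only on the angle between vectors, so a long description does not outrank a short one merely by containing more words.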
What is the difference between a data scientist and a data analyst? An NLP approach.
Silicon Valley (TV Show on HBO) language analysis
Malware classification using Extreme Gradient Boosting - XGBoost, CountVectorizer, TruncatedSVD
Semantic Analysis of Restaurant Reviews (NLP Use Case)
Predicting Tags for Stack Overflow
This case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
This competition is hosted by Kaggle https://www.kaggle.com/c/nlp-getting-started/overview. I participated in the competition to try my hand at Natural Language Processing, a field of Artificial Intelligence.
Spam Detection – Cluster SMS messages to “Spam” and “Ham” (Kaggle Challenge)
The aim is to develop a model that gives accurate predictions for the customer's test sample, but the training sample is not given. It should be collected by parsing
This project compares the F1 scores of several methods for sentiment analysis on reviews. We test the efficiency of TfidfVectorizer and CountVectorizer when used with Multinomial Naive Bayes and SVC respectively.
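For reference, the F1 score used in such a comparison is the harmonic mean of precision and recall. The sketch below computes it for one positive class from scratch; the labels and the two prediction lists are made up and do not come from the project, and `binary_f1` is a hypothetical helper name.

```python
def binary_f1(y_true, y_pred, positive):
    """F1 for a single positive class: 2 * P * R / (P + R)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # precision and recall are both zero or undefined
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Made-up labels and predictions from two hypothetical pipelines.
y_true = ["pos", "pos", "neg", "neg", "pos"]
preds_a = ["pos", "neg", "neg", "neg", "pos"]
preds_b = ["pos", "pos", "pos", "neg", "neg"]

f1_a = binary_f1(y_true, preds_a, positive="pos")
f1_b = binary_f1(y_true, preds_b, positive="pos")
print(f1_a, f1_b)
```

Both pipelines here get the same number of positives right, but the first makes no false-positive errors, so its F1 is higher.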
A Machine Learning Model that detects different language syntax.
SMS SPAM FILTERING
This is a machine learning project that focuses on detecting spam messages from regular messages. The project includes data cleaning and preprocessing, creating a bag of words model, and training the model using the Naive Bayes classifier. The final accuracy of the model is 98.39%.
I have done some Natural Language Processing on the Twitter US Airline Sentiment Dataset, which contains data for over 14000 tweets. Then I have used several classifiers namely, Support Vector Machine, Multinomial Naive Bayes, Random Forest and Decision Trees to predict the sentiment of the tweet i.e. positive, negative or neutral.
Used the NLTK library for text pre-processing; data visualisation and analysis done with matplotlib; used sklearn CountVectorizer and Tfidf transformer for feature extraction from text, then trained the ML model with the Linear SVC algorithm. Got 99% accuracy.
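The two-stage feature extraction mentioned above (raw counts first, then TF-IDF weighting) can be illustrated without any dependencies. This sketch assumes a smoothed IDF of the form ln((1 + n) / (1 + df)) + 1 followed by L2 normalization of each row, which is a widely used variant; the documents and the helper names `fit_counts` and `tfidf` are invented for the example.

```python
import math
from collections import Counter


def fit_counts(docs):
    """Build a sorted vocabulary and a dense document-term count matrix."""
    vocab = sorted({tok for doc in docs for tok in doc.lower().split()})
    rows = []
    for doc in docs:
        counts = Counter(doc.lower().split())
        rows.append([counts[term] for term in vocab])
    return vocab, rows


def tfidf(rows):
    """Reweight counts by smoothed IDF, then L2-normalize each row."""
    n_docs = len(rows)
    n_terms = len(rows[0]) if rows else 0
    df = [sum(1 for row in rows if row[j] > 0) for j in range(n_terms)]
    idf = [math.log((1 + n_docs) / (1 + d)) + 1 for d in df]
    result = []
    for row in rows:
        weighted = [count * weight for count, weight in zip(row, idf)]
        norm = math.sqrt(sum(x * x for x in weighted))
        result.append([x / norm if norm else 0.0 for x in weighted])
    return result


# Made-up review snippets, for illustration only.
DOCS = ["good movie", "bad movie", "good good plot"]
vocab, counts = fit_counts(DOCS)
weights = tfidf(counts)
print(vocab)
print(counts)
```

In the second document, "bad" appears in fewer documents than "movie", so it ends up with the larger weight even though both occur once.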
A movie recommender system based on Content-Based Filtering using tmdb dataset
linebot messengerbot @mango by fastapi
project 02 | classification model | mail/sms/text spam classifier | nlp | end to end project | machine learning nlp project
Unlock Your Next Favorite Film! Our NLP-powered Movie Recommendation Web App delivers tailored suggestions based on cast, genres, and production companies. Explore a seamless Streamlit interface, where you can also see the description of the selected movie and the full list of movies.
This is the repository for different Natural Language Processing (NLP) projects using Hugging Face, Gensim, NLTK, spaCy and other libraries.
The provided code snippet performs movie recommendation based on movie metadata using the TMDB Movie Metadata dataset from Kaggle.
Text classification using various techniques, including Naive Bayes and Passive Aggressive classifiers, along with different vectorization methods such as Count Vectorization