There are 0 repository under countvectorizer topic.
Grocery Recommendation on Instacart Data
Unlock Your Next Favorite Film! Our NLP-powered Movie Recommendation Web App delivers tailored suggestions based on cast, genres, and production companies. Explore a seamless Streamlit interface, also, you can see the description of selected movie. and all movies list.
A web-app which can be used to get recommendations for a series/movie, the app recommends a list of media according to list of entered choices of movies/series in your preferred language using Python and Flask for backend and HTML, CSS and JavaScript for frontend.
A Naive Bayes spam/ham classifier based on Bayes' Theorem. A bunch of email subject is first used to train the classifier and then a previously unseen email subject is fed to predict whether it is Spam or Ham.
This case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Hire the Perfect candidate. HackerEarth Competitions solution.
Spam message detection using classifier
Graduation Project/Sentiment Analysis in Turkish Film Reviews
Created Hate speech detection model using Count Vectorizer & XGBoost Classifier with an Accuracy upto 0.9471, which can be used to predict tweets which are hate or non-hate.
Assignment-11-Text-Mining-01-Elon-Musk, Perform sentimental analysis on the Elon-musk tweets (Exlon-musk.csv), Text Preprocessing: remove both the leading and the trailing characters, removes empty strings, because they are considered in Python as False, Joining the list into one string/text, Remove Twitter username handles from a given twitter text. (Removes @usernames), Again Joining the list into one string/text, Remove Punctuation, Remove https or url within text, Converting into Text Tokens, Tokenization, Remove Stopwords, Normalize the data, Stemming (Optional), Lemmatization, Feature Extraction, Using BoW CountVectorizer, CountVectorizer with N-grams (Bigrams & Trigrams), TF-IDF Vectorizer, Generate Word Cloud, Named Entity Recognition (NER), Emotion Mining - Sentiment Analysis.
Using content-based approach to construct a suggestion for films. Films based on user feedback are recommended. By the machine learning model, all connected and equivalent films are suggested for the consumer.
This is the Movie Recommendation System project using a Content-Based recommender system trained on more than 5000 movies for generating movie recommendations based on user search.
Classification of emails received on a mass distribution group
This project suggests you the list of movies based on the movie title that you have entered. It uses Count Vectorizer (Text-Feature Extraction tool) to find the relation between similar movies.
What is the difference between a data scientist and a data analyst? An NLP approach.
Used NLTK library from text pre-processing, Data Visualisation and Analysis done with matplotlib, used sklearn CountVectorizer and Tfidf transformer for feature extraction from text, then used Linear SVC algorithm to train the ML model. Got 99% accuracy.
Silicon Valley (TV Show on HBO) language analysis
Malware classification using Extreme Gradient Boosting - XGBoost, CountVectorizer, TruncatedSVD
Semantic Analysis of Restaurant Reviews (NLP Use Case)
This is the Repository for different Natural language Processing(NLP) projects using Hugging face,Gensim, NLTK,Spacy and other Libraries
Predicting Tags for Stack Overflow
Amazon Product Reviews: Sentiment Analysis with NLP
This competition is hosted by Kaggle https://www.kaggle.com/c/nlp-getting-started/overview. I participated in the competition in order to try my hands on the field of Artificial Intelligence known as Natural Language Processing.
Spam Detection – Cluster SMS messages to “Spam” and “Ham” (Kaggle Challenge)
The aim - is to develop a model that will give accurate predictions for the customer's test sample, but the training sample for is not given. It should be collected by parsing
This project is to compare the F1 scores on performing sentiment analysis on reviews using various methods. We test the efficeintcy of TfidfVectorizer and CountVectrorizers when used with Multinomial Naive Bayes and SVC respectively.
This project aims to develop a machine learning algorithm that can accurately detect and filter out spam comments on YouTube videos.
CraveCrafters is an AI-powered food ordering web application that seamlessly integrates a chatbot to assist users with menu browsing, order placement, and customer service. The system features a Node.js backend, a FastAPI-based chatbot, and an interactive frontend built with HTML, CSS, and JavaScript. It utilizes MongoDB as its database.
A Machine Learning Model that detects different language syntax.
SMS SPAM FILTERING
This is a machine learning project that focuses on detecting spam messages from regular messages. The project includes data cleaning and preprocessing, creating a bag of words model, and training the model using the Naive Bayes classifier. The final accuracy of the model is 98.39%.
I have done some Natural Language Processing on the Twitter US Airline Sentiment Dataset, which contains data for over 14000 tweets. Then I have used several classifiers namely, Support Vector Machine, Multinomial Naive Bayes, Random Forest and Decision Trees to predict the sentiment of the tweet i.e. positive, negative or neutral.
A movie recommender system based on Content-Based Filtering using tmdb dataset
linebot messengerbot @mango by fastapi