tfidf-vectorizer

There are 0 repository under tfidf-vectorizer topic.

MLWithPytorch
Mayurji / MLWithPytorch
Objective of the repository is to learn and build machine learning models using Pytorch. 30DaysofML Using Pytorch
pytorch-implementation pytorch machine-learning-algorithms python3 kmeans-clustering linear-regression logistic-regression decision-trees naive-bayes-classifier svm-classifier pca-analysis tfidf-vectorizer naive-bayes-algorithm knn-classification gaussian-mixture-models lasso-regression ridge-regression machine-learning adaboost-algorithm
Language:Python 129
zayedrais / DocumentSearchEngine
Document Search Engine project with TF-IDF abd Google universal sentence encoder model
machine-learning tensorflow-models tensorflow-tutorials tensorflow universal-sentence-encoder tfidf-vectorizer tfidf tfidf-text-analysis deep-learning text-analysis text-semantic-similarity text-search document-similarity document-search semantic-search-engine semantic-search data-science python-text-analysis python juypter
Language:Jupyter Notebook 54
soumyajit4419 / AI_For_Social_Good
Using natural language processing to analyze the sentiments of people and detect suicidal ideation on online social content.
natural-language-processing lstm tfidf-vectorizer web-scraping random-forest
Language:Jupyter Notebook 38
anime-recommendation-system
Sajid030 / anime-recommendation-system
Personalized anime recommendations based on collaborative filtering. Discover your next favorite anime!
keras neural-networks numpy pandas sklearn tensorflow tfidf-vectorizer wordcloud-visualization data-preprocessing data-visualization h5py langdetect anime plotly recommendation-system data-science datetime deep-learning flask tailwind-css
Language:Jupyter Notebook 24
ksdkamesh99 / Spam-Classifier
A Natural Language Processing with SMS Data to predict whether the SMS is Spam/Ham with various ML Algorithms like multinomial-naive-bayes,logistic regression,svm,decision trees to compare accuracy and using various data cleaning and processing techniques like PorterStemmer,CountVectorizer,TFIDF Vetorizer,WordnetLemmatizer. It is implemented using LSTM and Word Embeddings to gain accuracy of 97.84%.
sms-spam-detection tfidf-vectorizer count-vectorizer bag-of-words naive-bayes-classifier multinomial-naive-bayes logistic-regression wordnetlemmatizer porter-stemmer decision-tree-classifier support-vector-machines lstm-neural-networks embeddings
Language:Jupyter Notebook 15
Shubha23 / Text-processing-NLP
This notebook contains entire text preprocessing pipeline for NLP problems. The ready-to-use functions require NLTK and SKlearn package installations. It also contains some prominent text classification models.
nlp python nltk sklearn tfidf-vectorizer textpreprocessing
Language:Jupyter Notebook 15
tamanna18 / ML-NLP-DL
For learning Purposes
ml nlp-machine-learning deep-learning machine-learning reinforcement-learning information-retrieval data-mining bag-of-words natural-language-processing nlp classification classification-algorithm countvectorizer tfidf-vectorizer tokenizer
Language:Jupyter Notebook 15
rjarman / Bus-Mama
The Bus-Mama is a bus tracking mobile application for the transportation of the students of BSMRSTU. It helps the students of our university by showing the available route, bus, and their exact location. This app includes real-time bus tracking which is going to solve a problem that university students have been facing for many years. Students are often seen missing their buses. Often they can't maintain the bus time. Since there are many buses in our university, students can easily catch a bus if they know where and when it will pass by. My goal is to track the buses and make hardware, mobile application, and machine learning solution to solve the issue. This way the students can get relief from missing the bus and use the buses efficiently. The main idea is to track the buses. GPS trackers will be attached to every bus that will give the current position of them and automatically sync on the server. The Bus-Mama mobile application will show every real-time position of those buses. This application will be installed on students' mobile phones and in this way the students can easily maintain their transportation. In this application, the current location of the bus can be seen through Google map. Every bus will have a specific marker on Google map and all the details about a specific bus will be shown by clicking on the marker. There will be seen about how far the bus is, from which direction it will come, how much time to reach the bus, how much time it will take if there is any traffic on road, etc. There is also a search option to know about any specific bus details. There is also a list of all buses with sufficient details that will help students to know about all the details. Every student will have an account through which they can access bus data. Another main objective is the Bus-Mama Chatbot in the Bengali language so that the students can communicate to know about the bus easily. For now, they can make conversation only about bus-related information. The Chatbot is not yet able to make conversation except bus-related questions. If anyone asks anything except bus-related questions, it cannot reply to the question rather it will give a tag to the question as a reply. As the Chatbot is created in the Bengali language, it has used the "trie" data structure in lemmatization. A library has been designed to lemmatize the Bengali words. Almost 63,205 Bengali words have been lemmatized by using the library to train the SVM machine learning model.
typescript python javascript angular scss gps googlemap trie lemmatization tfidf-vectorizer nodejs mongodb iot distancematrixservice nosql socket svm machine-learning chatbot bangla
Language:TypeScript 14
SauravPattnaikCS60 / Weighted-Class-Tfidf
Weighted Class TFIDF technique to deal with imbalanced datasets
machine-learning nlp tfidf tfidf-vectorizer
Language:Python 14
raj1603chdry / Fake-News-Detection-System
Fake News Detection System for detecting whether news is fake or not. The model is trained using "Liar, Liar Pants on Fire": A New Benchmark Dataset for Fake News Detection. Link for dataset: https://arxiv.org/abs/1705.00648.
svm-classifier logistic-regression random-forest naive-bayes-classifier pipeline gridsearchcv voting-classifier custom-transformer tfidf-vectorizer count-vectorizer word-cloud nltk scikit-learn pos-tagging data-preprocessing machine-learning fake-news pattern-library
Language:Jupyter Notebook 11
Ankit152 / IMDB-sentiment-analysis
Sentiment analysis of IMDB dataset.
sentiment-analysis imdb-dataset classification imdb tf-idf logistic-regression logistic-regression-algorithm tfidf-vectorizer text-classification text lstm keras tf2 tensorflow2
Language:Jupyter Notebook 10
VipinJain1 / VIP-Machine-Learning-Exercises-and-Practices
VIP Machine Learning Exercises and Practices
machine-learning-exercises tsne bag-of-words bagofwords pca pca-analysis pandas matplotlib tfidf tfidf-vectorizer tfidf-matrix text-preprocessing python dimensionality-reduction
Language:Jupyter Notebook 10
Transformer-BERT-SMS-Spam-Detection
Tejas-TA / Transformer-BERT-SMS-Spam-Detection
Spam SMS Detection Project implemented using NLP & Transformers. DistilBERT - a hugging face Transformer model for text classification is used to fine-tune to best suit data to achieve the best results. Multinomial Naive Bayes achieved an F1 score of 0.94, the model was deployed on the Flask server. Application deployed in Google Cloud Platform
nlp-machine-learning natural-language-processing machine-learning matplotlib joblib flask-api lightgbm-classifier multinomial-naive-bayes scikitlearn-machine-learning stemming lemmatization bag-of-words tfidf-vectorizer spam-sms-detection huggingface-transformers bert-model tensorflow2 attention-is-all-you-need confusion-matrix roc-auc-curve
Language:Jupyter Notebook 9
pemagrg1 / Magic-Of-TFIDF
TFIDF being the most basic and simple topic in NLP, there's alot that can be done using TFIDF only! So, in this repo, I'll be adding the blog, TFIDF basics, wonders done using tfidf etc.
tfidf python tfidf-vectorizer tfidfvectorizer textclassification text-clustering nlp text-similarity
Language:Jupyter Notebook 8
sasivatsal7122 / Go_Screen-CineMatrix-ML-MODEL
This repo contains a machine learning model made using advanced and enhanced algos like KNN,SVD and also concepts like vectorization ,cosine similarity which predicts the similar movies for a given fav movie of user. So no more time wasting on searching for a good of you're choice
machine-learning movie-database movie-recommendation movies-api frontend-app responsive-web-design knn-algorithm svd-matrix-factorisation cosine-similarity tfidf-vectorizer linearkernel
Language:Jupyter Notebook 8
ryukaizen / marai
Conversational AI designed specifically for the Marathi language using Rasa.
conversational-ai marathi marathi-language rasa tfidf-vectorizer
Language:Python 7
VuBacktracking / Deep-Neural-Network-Vietnamese-Student-Feedback-Sentiment-Analysis
Vietnamese Student Feedback Sentiment Analysis
nlp vietnamese-nlp word2vec-model tfidf-vectorizer deep-learning lstm-model lstm-sentiment-analysis sentiment-analysis lstm-neural-networks word2vec streamlit-webapp keras-tensorflow tensforflow streamlit
Language:Jupyter Notebook 7
abhishtagatya / text2meme
🖼️ Text2Meme is a Meme Classification Experiment based on Caption Text (Implemented as a Discord Bot)
discord-bot kaggle linear-svc meme-generator tfidf-vectorizer
Language:Jupyter Notebook 6
faizann24 / Authorship-Attribution
Authorship Attribution with Machine Learning
machine-learning random-forest authorship-attribution tfidf-vectorizer scikit-learn cybersecurity authorship
Language:Python 6
ksopyla / scikit-learn-tutorial
Scikit-learn tutorial for beginniers. How to perform classification, regression. How to measure machine learning model performacne acuuracy, presiccion, recall, ROC.
roc-curve scikit-learn text-classification tfidf-vectorizer tutorial
Language:Python 6
ozlemekici / detecting_fake_news
TfidfVectorizer & PassiveAggressiveClassifier
python sklearn tfidf-vectorizer passive-aggressive-classifier advanced-python scikit-learn machine-learning numpy pandas itertools
Language:Jupyter Notebook 6
sherincheah / amz-ecom-recommender
E-Commerce Recommendation System
data cleaning eda tfidf-vectorizer tfidf-text-analysis cosine-similarity
Language:Jupyter Notebook 6
Vishwa22 / Multi-Label-Text-Classification
A text can be assigned more than one label
multi-label-problem beginners-friendly logistic-regression text-processing bag-of-words tfidf-vectorizer
Language:Jupyter Notebook 6
adarshpalaskar1 / Movie-Recommender-System
Recommendation system built using multiple ML models that aim to predict users' interests based on their past behavior and preferences.
cosine-similarity knn machine-learning matrix-factorization movie-recommendation pearson-correlation python3 recommendation-system recommender-system similarity-score streamlit tfidf-vectorizer
Language:Python 5
chiraag-kakar / FUND
An NLP model to detect fake news and accurately classify a piece of news as REAL or FAKE trained on dataset provided by Kaggle.
sklearn tf-idf tfidf-vectorizer tfidf-text-analysis passive-aggressive-classifier confusion-matrix machine-learning-algorithms project fake-news tfidfvectorizer news-article
Language:Jupyter Notebook 5
Nikoletos-K / Entity-resolution-SIGMOD-2020
📷🎥 Entity resolution system for SIGMOD 2020 programming contest
sigmod-programming-contest 2020 entity-resolution logistic-regression tfidf-vectorizer machine-learning c unit-testing university-of-athens bash-script gradient-descent acm
Language:C 5
sidharth178 / Natural-Language-Processing-Tutorial
This repo contains code files of all the important topics of NLP.
nlp tokenization stemming lemmatization tfidf-vectorizer
Language:Jupyter Notebook 5
vaitybharati / Assignment-11-Text-Mining-01-Elon-Musk
Assignment-11-Text-Mining-01-Elon-Musk, Perform sentimental analysis on the Elon-musk tweets (Exlon-musk.csv), Text Preprocessing: remove both the leading and the trailing characters, removes empty strings, because they are considered in Python as False, Joining the list into one string/text, Remove Twitter username handles from a given twitter text. (Removes @usernames), Again Joining the list into one string/text, Remove Punctuation, Remove https or url within text, Converting into Text Tokens, Tokenization, Remove Stopwords, Normalize the data, Stemming (Optional), Lemmatization, Feature Extraction, Using BoW CountVectorizer, CountVectorizer with N-grams (Bigrams & Trigrams), TF-IDF Vectorizer, Generate Word Cloud, Named Entity Recognition (NER), Emotion Mining - Sentiment Analysis.
text-mining tweets-dataset sentiment-analysis correlation-analysis sentiment-value sentiment-score affinity-scores emotion-lexicon emotion-mining named-entity-recognition pos-tagging word-cloud tfidf-vectorizer n-grams countvectorizer feature-extraction text-preprocessing tokenization clean-tweets lemmatization
Language:Jupyter Notebook 5
alexaapo / Feed-Forward-Neural-Network
Feed Forward Neural Network for Twitter Sentiment Analysis Dataset
feedforward-neural-network twitter-sentiment-analysis glove-embeddings tfidf-vectorizer deep-neural-networks pytorch
Language:Jupyter Notebook 4
Denis-Mukhanov / english-score
Practicum Workshop
catboost nltk optuna python streamlit tfidf-vectorizer
Language:Jupyter Notebook 4
engares / KNN-Based-Telegram-Chatbot-hosted-in-ESP32
A lightweight, customizable chatbot for Telegram running on an ESP32 microcontroller. It's optimized for low-resource environments and embedded systems projects.
chatbot cosine-similarity esp32 esp32-s3 knn knn-algorithm nlp-machine-learning telegram-bot tf-idf tfidf-vectorizer
Language:C++ 4
puskal-khadka / MovieRecommendationSystem
Content-based movie recommendation engine
nlp-machine-learning django movierecommender cosine-similarity tfidf-vectorizer
Language:Jupyter Notebook 4
Rishabbh-Sahu / information_retrieval
Given a document, identifying the closest documents within the list of documents using tf-idf matrix and cosine similarity
tfidf-vectorizer text-vectorization information-retrieval matrix-multiplication similarity-search similar-patterns root-cause-analysis lookalike-queries
Language:Python 4
Saket046 / course-recommender
This is a recommendation engine that recommends 10 courses related to course you search.
css exploratory-data-analysis flask heroku heroku-deployment html natural-language-processing nltk pandas python recommendation-system tfidf tfidf-vectorizer
Language:Jupyter Notebook 4
VipinJain1 / VIP-PCA_tSNE
pca-analysis pca tsne-algorithm tsne tsne-plot bag-of-words tfidf tfidf-vectorizer tfidf-text-analysis tfidf-weighted-w2v
Language:Jupyter Notebook 4
Al-Hasib / NoCodeTextClassifier
A Python package for automatically training, evaluation, inference of Text Classification task with Low code/No Code
machine-learning text-analysis text-classification tfidf-vectorizer
Language:Jupyter Notebook 3

tfidf-vectorizer

Mayurji / MLWithPytorch

zayedrais / DocumentSearchEngine

soumyajit4419 / AI_For_Social_Good

Sajid030 / anime-recommendation-system

ksdkamesh99 / Spam-Classifier

Shubha23 / Text-processing-NLP

tamanna18 / ML-NLP-DL

rjarman / Bus-Mama

SauravPattnaikCS60 / Weighted-Class-Tfidf

raj1603chdry / Fake-News-Detection-System

Ankit152 / IMDB-sentiment-analysis

VipinJain1 / VIP-Machine-Learning-Exercises-and-Practices

Tejas-TA / Transformer-BERT-SMS-Spam-Detection

pemagrg1 / Magic-Of-TFIDF

sasivatsal7122 / Go_Screen-CineMatrix-ML-MODEL

ryukaizen / marai

VuBacktracking / Deep-Neural-Network-Vietnamese-Student-Feedback-Sentiment-Analysis

abhishtagatya / text2meme

faizann24 / Authorship-Attribution

ksopyla / scikit-learn-tutorial

ozlemekici / detecting_fake_news

sherincheah / amz-ecom-recommender

Vishwa22 / Multi-Label-Text-Classification

adarshpalaskar1 / Movie-Recommender-System

chiraag-kakar / FUND

Nikoletos-K / Entity-resolution-SIGMOD-2020

sidharth178 / Natural-Language-Processing-Tutorial

vaitybharati / Assignment-11-Text-Mining-01-Elon-Musk

alexaapo / Feed-Forward-Neural-Network

Denis-Mukhanov / english-score

engares / KNN-Based-Telegram-Chatbot-hosted-in-ESP32

puskal-khadka / MovieRecommendationSystem

Rishabbh-Sahu / information_retrieval

Saket046 / course-recommender

VipinJain1 / VIP-PCA_tSNE

Al-Hasib / NoCodeTextClassifier