Joshua Jin's repositories
IJCAI_2018_Alimama_CTR
:fuelpump: CVR Prediction Competition Solution of Taobao Search Ads
Opinio-Extraction
Opinion Extraction based on Amazon Reviews
2018Wayfair_Datathon_3rd_Solution
This is the 3rd solution for Datathon host by Wayfair, a ecommerce company in Boston.
Fuzzy-match-on-listed-companies
:boat: Are you public, dear company?
leetcode_py
:whale: Leetcode Practice with Python3
sentiment-analysis
Multilingual sentiment analysis for English, German, French and Italian.
Sentiment-Analysis-Twitter
:mortar_board:RESEARCH [NLP :thought_balloon:] We use different feature sets and machine learning classifiers to determine the best combination for sentiment analysis of twitter.
Times-Series-Analysis-of-Stock
moneybag Time Series Analysis to Predict Amazon Stock Price https://finance.yahoo.com/quote/AMZN/
FeatureX-Amazon-Feature-Opinion-Mining
This project crawls Amazon reviews and extracts features and opinions to calculate a feature based rating of every product (mainly smartphones) Done with python, pyqt5
finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
GraphEmbedding
Implementation and experiments of graph embedding algorithms.deep walk,LINE(Large-scale Information Network Embedding),node2vec,SDNE(Structural Deep Network Embedding),struc2vec
gspread
Google Spreadsheets Python API
gt-nlp-class
Course materials for Georgia Tech CS 4650 and 7650, "Natural Language"
hadoop-cluster-docker
Run Hadoop Custer within Docker Containers
LearnBasicBigDataTech
:rocket:Some projects on Big Data Analysis like Spark, Hive, Presto and Data Visualization like Superset
MachineLearning
Machine learning resources,including algorithm, paper, dataset, example and so on.
ML-Tutorial-Experiment
Coding the Machine Learning Tutorial for Learning to Learn
MovieLens-RecSys
基于MovieLens-1M数据集实现的协同过滤算法demo
Review_Summarization-Aspect_based_opinion_mining
Summarize opinions of users about a product from a set of reviews. Extract the most common product features mentioned, most common opinion words used for a feature and the corresponding positive and negitive opinions about related to the feature and opinions. This way it becomes extremely easy to identify the most prominent positive and negitive features and opinions about a product.
SparkInternals
Notes talking about the design and implementation of Apache Spark
tensorflow-DSMM
Tensorflow implementations of various Deep Semantic Matching Models
yellowbrick
Visual analysis and diagnostic tools to facilitate machine learning model selection.
Yelp_Challenge
Yelp dataset challenge: NLP & sentiment analysis