Macintoshxz's repositories
PKUCourse
北大计算机课程大作业
magic_google
Google search results crawler, get google search results that you need
missingno
Missing data visualization module for Python.
predicting-customer-churn
A general-purpose framework for solving problems with machine learning applied to predicting customer churn
InvestopediaHistoricalStockQuotes
Investopedia data, better than yahoo finance
investopedia-terms
Scrape financial terms from Investopedia
tabula
Tabula is a tool for liberating data tables trapped inside PDF files
tabula-java
Extract tables from PDF files
pdftabextract
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
awesome-machine-learning-on-source-code
Cool links & research papers related to Machine Learning applied to source code (MLonCode)
OpenJudge
北大百练题解(C/C++)
scrapy-finance
scrapy spiders to crawl the financial text data :books: :scroll: pertinent to train word vectors :rocket:
AutoenCODE
AutoenCODE is a Deep Learning infrastructure that allows to encode source code fragments into vector representations, which can be used to learn similarities.
bhtsne
Barnes-Hut t-SNE
Hadoop
A repository for Hadoop MapReduce toy examples
cc150
《程序员面试金典》(cc150)
AppCrawler
Android应用市场网络爬虫
deepdetect
Deep Learning API and Server in C++11 with Python bindings and support for Caffe, Tensorflow, XGBoost and TSNE
Hadoop-Project-Establishment
This file contains three main projects. 1), MapReduce Project - Google Search Auto Complete. 2), MapReduce Project - PageRank. 3), MapReduce Project - Recommender System
mutpy
MutPy is a mutation testing tool for Python 3.x source code
stat-nlp-book
Interactive Lecture Notes, Slides and Exercises for Statistical NLP
Sentiment-Analysis-Twitter
:mortar_board:RESEARCH [NLP :thought_balloon:] We use different feature sets and machine learning classifiers to determine the best combination for sentiment analysis of twitter.
CoreNLP
Stanford CoreNLP: A Java suite of core NLP tools.
code-docstring-corpus
Preprocessed Python functions and docstrings for automated code documentation (code2doc) and automated code generation (doc2code) tasks.
deep-learning-book
《Deep Learning》《深度学习》 by Ian Goodfellow, Yoshua Bengio and Aaron Courville
TextBlob
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
mit-deep-learning-book-pdf
MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville