Johann Hamel-Akré's starred repositories
openwebtext
Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed.
Python-Elasticsearch
An example program that scrapes data from AllRecipes.com and store in Elasticsearch
Web-page-classification
Classifies webpages into categories defined in DMOZ dataset
python-readability
fast python port of arc90's readability tool, updated to match latest readability.js!
grammarVAE
Code for the "Grammar Variational Autoencoder" https://arxiv.org/abs/1703.01925
Deep-Reinforcement-Learning-Hands-On
Hands-on Deep Reinforcement Learning, published by Packt
dragnet_data
Training/test data for Dragnet
dl4ir-webnav
WebNav: A New Large-Scale Task for Natural Language based Sequential Decision Making
word-cloud-world
Dash app for creating word clouds
word_cloud
A little word cloud generator in Python
ML-From-Scratch
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
practicalDataAnalysisCookbook
A collection of data and codes to supplement the practicalDataAnalysisCookbook (in preparation)
awesome-python
An opinionated list of awesome Python frameworks, libraries, software and resources.
pyspark.test
Example unit tests for Apache Spark Python scripts using the py.test framework