Johann Hamel-Akré (johann-ha)

johann-ha

Geek Repo

Company:Aikan

Location:Caen, FR

Github PK Tool:Github PK Tool

Johann Hamel-Akré's starred repositories

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonLicense:NOASSERTIONStargazers:22067Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:454Issues:0Issues:0

openwebtext

Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed.

Language:PythonLicense:GPL-3.0Stargazers:703Issues:0Issues:0

textacy

NLP, before and after spaCy

Language:PythonLicense:NOASSERTIONStargazers:2193Issues:0Issues:0

pattern

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

Language:PythonLicense:BSD-3-ClauseStargazers:4Issues:0Issues:0

textpipe

Textpipe: clean and extract metadata from text

Language:PythonLicense:MITStargazers:300Issues:0Issues:0

Python-Elasticsearch

An example program that scrapes data from AllRecipes.com and store in Elasticsearch

Language:PythonLicense:MITStargazers:98Issues:0Issues:0

Web-page-classification

Classifies webpages into categories defined in DMOZ dataset

Language:ShellLicense:MITStargazers:41Issues:0Issues:0

spider

Web Content Extraction Through Machine Learning

Language:TeXLicense:MITStargazers:185Issues:0Issues:0

rlntm

An implementation of the RL-NTM from http://arxiv.org/abs/1505.00521

Language:LuaLicense:NOASSERTIONStargazers:155Issues:0Issues:0

eqnet

Code related to "Learning Continuous Semantic Representations of Symbolic Expressions" project.

Language:PythonLicense:BSD-3-ClauseStargazers:36Issues:0Issues:0

python-readability

fast python port of arc90's readability tool, updated to match latest readability.js!

Language:PythonLicense:Apache-2.0Stargazers:2614Issues:0Issues:0

grammarVAE

Code for the "Grammar Variational Autoencoder" https://arxiv.org/abs/1703.01925

Language:PythonStargazers:269Issues:0Issues:0

Deep-Reinforcement-Learning-Hands-On

Hands-on Deep Reinforcement Learning, published by Packt

Language:PythonLicense:MITStargazers:2815Issues:0Issues:0

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonLicense:Apache-2.0Stargazers:29360Issues:0Issues:0

yago3

YAGO is a large semantic knowledge base, derived from Wikipedia, WordNet, WikiData, GeoNames, and other data sources

Language:JavaLicense:GPL-3.0Stargazers:724Issues:0Issues:0

Cookbook

The Data Engineering Cookbook

License:Apache-2.0Stargazers:13418Issues:0Issues:0

dragnet

Just the facts -- web page content extraction

Language:PythonLicense:MITStargazers:1238Issues:0Issues:0

dragnet_data

Training/test data for Dragnet

Language:ShellLicense:AGPL-3.0Stargazers:41Issues:0Issues:0

frontera

A scalable frontier for web crawlers

Language:PythonLicense:BSD-3-ClauseStargazers:1287Issues:0Issues:0

dl4ir-webnav

WebNav: A New Large-Scale Task for Natural Language based Sequential Decision Making

Language:PythonLicense:BSD-3-ClauseStargazers:81Issues:0Issues:0

deep-deep

Adaptive crawler which uses Reinforcement Learning methods

Language:Jupyter NotebookStargazers:170Issues:0Issues:0

word-cloud-world

Dash app for creating word clouds

Language:PythonStargazers:6Issues:0Issues:0

word_cloud

A little word cloud generator in Python

Language:PythonLicense:MITStargazers:10038Issues:0Issues:0

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Language:PythonLicense:MITStargazers:74232Issues:0Issues:0

ML-From-Scratch

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

Language:PythonLicense:MITStargazers:23661Issues:0Issues:0

practicalDataAnalysisCookbook

A collection of data and codes to supplement the practicalDataAnalysisCookbook (in preparation)

Language:PythonLicense:GPL-2.0Stargazers:21Issues:0Issues:0

awesome-python

An opinionated list of awesome Python frameworks, libraries, software and resources.

Language:PythonLicense:NOASSERTIONStargazers:214393Issues:0Issues:0

pyspark.test

Example unit tests for Apache Spark Python scripts using the py.test framework

License:NOASSERTIONStargazers:84Issues:0Issues:0