thomasO's repositories
RapportiveLookup
discrete __rapportivelookup__
wikipedia-spark
Spark app to parse wikipedia xml dump using json-wikipedia parser
airtable-python-wrapper
Python Airtable Client Wrapper
charts
Helm Charts
docker-events-notifier
Receive a Slack notifications when a container dies
elastic2-doc-manager
Mongo-Connector doc manager for elasticsearch 2.x
gpn
Genomic Pre-trained Network
InferSent
Sentence embeddings (InferSent) and training code for NLI.
json-wikipedia
Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby
mongo-backup-s3
Docker image that periodically uploads MongoDB backups to Amazon S3
mongo-connector
Data replication from MongoDB to MongoDB, Elasticsearch, Solr, and more!
neuralcoref
✨Fast Coreference Resolution in spaCy with Neural Networks
PatentPublicData
Utility tools to help download and parse patent data made available to the public
scrape-linkedin-selenium
`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
scrapy-pyppeteer
Pyppeteer integration for Scrapy
scrapy-selenium
Scrapy middleware to handle javascript pages using selenium
slate
Beautiful static documentation for your API
spaCy
💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython
tangermeme
Biological sequence analysis for the modern age.
tfmodisco-lite
A lite implementation of tfmodisco, a motif discovery algorithm for genomics experiments.
wikidata-parser
Parsing Wikidata Dumps