peter (pezon)

pezon

Geek Repo

Location:Washington, DC

Github PK Tool:Github PK Tool

peter's starred repositories

numerical-linear-algebra

Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course

Language:Jupyter NotebookStargazers:10117Issues:367Issues:15

tpot

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Language:PythonLicense:LGPL-3.0Stargazers:9603Issues:288Issues:918

kedro

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.

Language:PythonLicense:Apache-2.0Stargazers:9512Issues:106Issues:1865

pipeline

PipelineAI

Language:JsonnetLicense:Apache-2.0Stargazers:4165Issues:346Issues:254

missingno

Missing data visualization module for Python.

Language:PythonLicense:MITStargazers:3866Issues:76Issues:134

vim-anywhere

Use Vim everywhere you've always wanted to

Language:ShellLicense:MITStargazers:3625Issues:42Issues:73

arctic

High performance datastore for time series and tick data

Language:PythonLicense:LGPL-2.1Stargazers:3040Issues:173Issues:556

nlp_tasks

Natural Language Processing Tasks and References

License:Apache-2.0Stargazers:3018Issues:237Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2979Issues:24Issues:75

mathematics_dataset

This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.

Language:PythonLicense:Apache-2.0Stargazers:1761Issues:67Issues:13

livy

Livy is an open source REST interface for interacting with Apache Spark from anywhere

Language:ScalaStargazers:1006Issues:91Issues:0

daff

align and compare tables

Language:JavaLicense:MITStargazers:786Issues:25Issues:111

PyHamcrest

Hamcrest matchers for Python

Language:PythonLicense:NOASSERTIONStargazers:756Issues:25Issues:82

pynlpl

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).

Language:PythonLicense:GPL-3.0Stargazers:477Issues:31Issues:25

spark-structured-streaming-internals

The Internals of Spark Structured Streaming

PySpark-Boilerplate

A boilerplate for writing PySpark Jobs

adam_qas

ADAM - A Question Answering System. Inspired from IBM Watson

Language:PythonLicense:GPL-3.0Stargazers:358Issues:30Issues:26

spark-style-guide

Spark style guide

Language:Jupyter NotebookStargazers:250Issues:18Issues:6

spaczz

Fuzzy matching and more functionality for spaCy.

Language:PythonLicense:MITStargazers:246Issues:10Issues:37

RUL-Net

Deep learning approach for estimation of Remaining Useful Life (RUL) of an engine

Language:PythonLicense:MITStargazers:215Issues:6Issues:5

followthemoney

Data model and processing tools for investigative entity data

Language:PythonLicense:MITStargazers:204Issues:21Issues:75

machine-failure-detection

PCA and DBSCAN based anomaly and outlier detection method for time series data.

Language:PythonStargazers:43Issues:4Issues:0

synonames

Trying to generate name synonyms from wikidata

Language:PythonStargazers:33Issues:18Issues:0

azure-apim-deployment-utils

Python utilities to extract, update and deploy to and from Azure API Management instances

Language:PythonLicense:Apache-2.0Stargazers:14Issues:5Issues:2

greenbutton-python

Python parser for ESPI ("Green Button") files.

Language:PythonLicense:NOASSERTIONStargazers:10Issues:3Issues:2

docker-aci-workshop

Docker and Azure Container Instances workshop

Language:HTMLLicense:MITStargazers:3Issues:2Issues:0

adv-diagnostics

Course repository for XBUS-511 - Diagnostics for More Informed Machine Learning

Language:Jupyter NotebookLicense:MITStargazers:3Issues:0Issues:0

intro-to-dl

Course repository for XBUS-512 - Introduction to AI and Deep Learning

Language:Jupyter NotebookLicense:MITStargazers:2Issues:2Issues:9

greenbutton-python

Python parser for ESPI ("Green Button") files.

Language:PythonLicense:NOASSERTIONStargazers:1Issues:8Issues:0

music-mining

Datasets and analysis for recordings that have charted globally and been nominated for a Grammy.

Language:Jupyter NotebookStargazers:1Issues:1Issues:0