Srivatsan Ramanujam's repositories
gp-ark-tweet-nlp
A PL/Java Wrapper on Ark-Tweet-NLP (http://www.ark.cs.cmu.edu/TweetNLP/) - Twitter Parts-of-speech tagger in Postgres/Greenplum
text_analytics_on_mpp
Collection of tutorials on text analytics/NLP, including vector space models, neural language models and topic models on the Pivotal MPP platform (Greenplum/HAWQ).
pandas_via_psql
Invoke Pandas plotting by piping in SQL output via PSQL (Can be used with Postgres or Greenplum or any SQL engine).
postgresopen-2017
Scalable in-database machine learning with PL/Python: Postgres Open SV 2017 talk
gp_xgboost_gridsearch
In-database parallel grid-search for XGBoost on Greenplum
dspcfboilerplate
Boilerplate code for flask apps on PCF that interact with a backend environment (ex: Pivotal BDS or ElephantSQL).
gp-sql-snippets
Temporary home for data processing/machine learning SQL snippets on Greenplum/HAWQ
gp_jupyter_notebook_templates
Collection of Jupyter notebook templates to work with Greenplum/HAWQ/PostgreSQL
conda-buildpack
Buildpack for Conda.
gpdb
Pivotal Greenplum Database
incubator-madlib
Mirror of Apache MADlib (Incubating)
ipythonnbsamples
Ipython Notebook samples
Meta
Python Meta Programming
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
PDLTools
PDL Tools is a library of reusable tools used and developed by the Pivotal Data Science and Data Engineering teams.
shap
A unified approach to explain the output of any machine learning model.
spaCy
Industrial-strength Natural Language Processing with Python and Cython
vatsan.github.io
Personal website based on Jekyll Chirpy