LJ's repositories
neuspell
NeuSpell: A Neural Spelling Correction Toolkit
docker-hadoop-spark-workbench
[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside Docker containers. It also includes spark-notebook and HDFS FileBrowser.
pyspark_config
Some configuration scripts for pyspark
spark
Apache Spark - A unified analytics engine for large-scale data processing
scala-exercises
The easy way to learn Scala.
fpinscala
Code, exercises, answers, and hints to go along with the book "Functional Programming in Scala"
sbt-native-packager-examples
A set of sbt-native-packager examples
data-validator
A tool to validate data built around Apache Spark.
pyox
A simple REST client library for Hadoop / Knox in Python
pragmatic-sbt
A pragmatic introduction to sbt
sbt-getting-started
Learn to use sbt with Scala
facebook-business-sdk-codegen
Codegen project for our business SDKs
cookiecutter-pypackage
Cookiecutter template for a Python package.
pyspark-example-project
Example project and best practices for Python-based Spark ETL jobs and applications.
project-layout
Standard Go Project Layout
embedmd
embedmd: embed code into markdown and keep everything in sync
voluptuous
Voluptuous, despite the name, is a Python data validation library.
protobuild
Build protobufs in Go, easily
sparkMeasure
SparkMeasure is a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.
go-tooling-workshop
A workshop covering all the tools gophers use in their day to day life
todo-grpc
An example todo app using gRPC/REST and PostgreSQL in a few lines of code
quinn
PySpark methods to enhance developer productivity 📣 👯 🎉
pyspark.test
Example unit tests for Apache Spark Python scripts using the py.test framework
shapeless-type-class-derivation-2015-demo
Example code from my presentation on shapeless type class derivation
util
General utilities and tools