Youngbin Kim's repositories
aws-glue-data-catalog-client-for-apache-hive-metastore
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore-compatible metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-the-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms, such as other Hadoop and Apache Spark distributions.
bespin
Reference implementations of "big data" algorithms in MapReduce and Spark
bigdata-2018w
CS 451/651 431/631 Data-Intensive Distributed Computing (Winter 2018) at the University of Waterloo
cassovary
Cassovary is a simple big graph processing library for the JVM
Cassovary-vs-GraphJet
Performance comparison between Cassovary and GraphJet
cnn-text-classification-tf
Convolutional Neural Network for Text Classification in TensorFlow
CS224D-Assignments
My answers to the assignments for Stanford's NLP course CS 224D
DeepLearningZeroToAll
TensorFlow Basic Tutorial Labs
googletest
Google Test
jsonresume-theme-short
Boilerplate theme for JSON Resume
koalas
Koalas: Pandas API on Apache Spark
libgo
Go-style concurrency in C++11
lucene-solr
Mirror of Apache Lucene + Solr
models
Models built with TensorFlow
oltpbench
OLTP Benchmark Framework
PyHive
Python interface to Hive and Presto. 🐝
reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, TensorFlow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
resume-1
Software developer resume in LaTeX
sentiment_analysis
Sentiment analysis using a CNN (TensorFlow)
spark
Mirror of Apache Spark
tensorflow
Computation using data flow graphs for scalable machine learning
vel
Velocity in deep-learning research