Catherine Shen's repositories

Git-Influencer

Insight Data Engineering project: A platform built in HDFS, Spark and Airflow to help you to find social influencers from GitHub Network.

Language:PythonStargazers:15Issues:1Issues:0

Realtime-Stock-Monitoring

Real Time Stock Data Monitoring Platform - A practice project using Kafka, Cassandra and Spark.

Language:PythonStargazers:6Issues:1Issues:0

AWS_SageMaker

Personal guide and examples to learn and use AWS SageMaker to deploy your ML model at scale.

Language:Jupyter NotebookStargazers:2Issues:1Issues:0

Multithreading_python

Tutorials and collections on multithreading and async in python

Language:PythonStargazers:2Issues:1Issues:0

Scala-Spark

Spark Streaming and Machine Learning with Scala.

Language:ScalaStargazers:2Issues:0Issues:0

algorithms-1

Minimal examples of data structures and algorithms in Python

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

awesome-python

A curated list of awesome Python frameworks, libraries, software and resources

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

AWS_solutionArchitecture

preparing materials for AWS solution Architecture exams

Stargazers:1Issues:0Issues:0

chalice

Python Serverless Microframework for AWS

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

twitter_sentimentClassification_pipeline

A twitter sentiment classification pipeline, generated sentiment score based on model trained on twitter140 dataset.

Language:PythonStargazers:1Issues:0Issues:0

ETL_with_airflow

Self-edited Airflow tutorial based on the ETL Best practices with airflow repository.

Language:ShellStargazers:0Issues:0Issues:0

Presto_Hands_on_tutorials

Collections and sample code for learning PrestoDB.

Language:PythonStargazers:0Issues:0Issues:0

Airflow_Datapipeline

Airflow cheatsheet and tips for work schedulling

Stargazers:0Issues:0Issues:0

cloud-bigtable-examples

Examples of how to use Cloud Bigtable both with GCE map/reduce as well as stand alone applications.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DataEngineering-Daily-Reading

Great Tech post collections from daily reading.

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0

examples

A repository to host extended examples and tutorials

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Flask_blog

simple blog written in python

Stargazers:0Issues:0Issues:0
Language:ShellStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

istio

Connect, secure, control, and observe services.

Language:GoLicense:Apache-2.0Stargazers:0Issues:0Issues:0

kafka-streams-course

Learn Kafka Streams with several examples!

Language:JavaStargazers:0Issues:0Issues:0

mongo-python-driver

PyMongo - the Python driver for MongoDB

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

nosqlclient

Cross-platform and self hosted, easy to use mongodb management tool - Formerly Mongoclient

Language:JavaScriptLicense:AGPL-3.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

pubsub

This repository contains open-source projects managed by the owners of Google Cloud Pub/Sub.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PyGithub

Typed interactions with the GitHub API v3

Language:PythonLicense:LGPL-3.0Stargazers:0Issues:0Issues:0

SQL-Python-api

SQL practice from Leetcode and other sources

Language:PythonStargazers:0Issues:1Issues:0