Tom (ladylazy9x)

ladylazy9x

Geek Repo

Github PK Tool:Github PK Tool

Tom's starred repositories

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language:ScalaLicense:Apache-2.0Stargazers:39168Issues:2025Issues:0

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language:PythonLicense:Apache-2.0Stargazers:35951Issues:759Issues:9468

the-incredible-pytorch

The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.

pymc

Bayesian Modeling and Probabilistic Programming in Python

Language:PythonLicense:NOASSERTIONStargazers:8586Issues:225Issues:3287

awesome-apache-airflow

Curated list of resources about Apache Airflow

machine-learning-book

Code Repository for Machine Learning with PyTorch and Scikit-Learn

Language:Jupyter NotebookLicense:MITStargazers:3237Issues:53Issues:94

Spark-The-Definitive-Guide

Spark: The Definitive Guide's Code Repository

Language:ScalaLicense:NOASSERTIONStargazers:2814Issues:187Issues:49

practical-statistics-for-data-scientists

Code repository for O'Reilly book

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:2664Issues:70Issues:26

LearningSparkV2

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Language:ScalaLicense:Apache-2.0Stargazers:1154Issues:40Issues:18

pyspark-examples

Pyspark RDD, DataFrame and Dataset Examples in Python language

kylo

Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.

Language:JavaLicense:Apache-2.0Stargazers:1104Issues:114Issues:0

airflow-tutorial

Apache Airflow tutorial

Language:Jupyter NotebookLicense:MITStargazers:919Issues:17Issues:45

Python-for-Probability-Statistics-and-Machine-Learning

Jupyter Notebooks for Springer book "Python for Probability, Statistics, and Machine Learning"

Language:Jupyter NotebookLicense:MITStargazers:725Issues:38Issues:2

data-pipelines-with-apache-airflow

Code for Data Pipelines with Apache Airflow

Language:PythonLicense:NOASSERTIONStargazers:689Issues:17Issues:23

algorithmic_trading_book

2 books and related source codes for algorithmic trading.

Language:PythonStargazers:380Issues:10Issues:0

Python-for-Finance-Second-Edition

Python for Finance – Second Edition, published by Packt

Language:PythonLicense:MITStargazers:258Issues:31Issues:2

Python-for-Finance-Cookbook-2E

The repository of "Python for Finance Cookbook" 2nd edition

Language:Jupyter NotebookLicense:MITStargazers:128Issues:8Issues:11

Algorithmic-Trading

Algorithmic trading using machine learning.

Language:PythonLicense:GPL-3.0Stargazers:112Issues:14Issues:0

Practical-Data-Science-with-Python

Practical Data Science with Python, published by Packt

Language:Jupyter NotebookLicense:MITStargazers:109Issues:10Issues:4

Python-for-Finance-Cookbook-2E

The repository of "Python for Finance Cookbook" 2nd edition

Language:Jupyter NotebookLicense:MITStargazers:95Issues:6Issues:0

hypothesis-testing-with-python

True difference or noise? 📊

Language:Jupyter NotebookStargazers:70Issues:8Issues:0

Data-Pipelines-with-Airflow

This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging the data, filling the data warehouse, and running checks on the data quality as the final step. Automate the ETL pipeline and creation of data warehouse using Apache Airflow. Skills include: Using Airflow to automate ETL pipelines using Airflow, Python, Amazon Redshift. Writing custom operators to perform tasks such as staging data, filling the data warehouse, and validation through data quality checks. Transforming data from various sources into a star schema optimized for the analytics team’s use cases. Technologies used: Apache Airflow, S3, Amazon Redshift, Python.

Language:PythonStargazers:66Issues:1Issues:0

Advanced-Algorithmic-Trading

The book <Advanced Algorithmic Trading> and its source code

Language:PythonStargazers:51Issues:1Issues:0

WQU-data-science-challenges

I successfully completed a 2-unit, 16-week and 6 mini-projects of the Data Science module at WorldQuant University. The mini-projects included scientific computing, data wrangling, machine learning and natural language processing with Python.

Language:Jupyter NotebookStargazers:35Issues:2Issues:0

Learning-Apache-Spark-2

Code repository for Learning Apache Spark 2, published by Packt

Language:Jupyter NotebookLicense:MITStargazers:20Issues:7Issues:0

financial-engineering-wqu

Repository for MScFE, WQU.

Language:Jupyter NotebookStargazers:18Issues:1Issues:0

A-B-testing---experimentation-project

#A/B testing: A step-by-step guide in Python This is a walkthrough of how to design and analyse an A/B test using Python.

Language:Jupyter NotebookLicense:MITStargazers:13Issues:1Issues:0

sample-parquet-files

A repo hosting sample parquet files.

SAS-code

SAS code from "Credit Risk Analytics"

Language:SASStargazers:3Issues:3Issues:0

WQU-Applied-data-science-lab

The WQU Applied Data Science Lab is a sequence of eight projects, where you solve real-world problems using data science tools. Each project consists of four lessons and one assignment.

Language:Jupyter NotebookStargazers:2Issues:1Issues:1