Daviid_Du's repositories
Analytical_skills-tips
Includes learning notes and reviews from online tutorial materials.
E-Commence
Develops a marketing program targeting dormant one-time buyers on the platform to incentivize repeat purchases. Machine learning models improved the program by at least 80%, with over 2 GB of training data processed on a single machine. Implements classification models, including Lasso logistic regression and Random Forest, tuned with cross-validation. Uses R and SQL for data import, cleaning, and preprocessing, as well as model testing and optimization.
Anomaly_Detection
Stores my project and research on anomaly detection for network data.
Databricks-DavidDu
Previous functions and runnable scripts written on the Databricks platform.
Home_Default_Risk
Contains code, a summary, and a flowchart from work on the Home Default Risk competition.
awesome-chatgpt-prompts
A curated collection of ChatGPT prompts for using ChatGPT more effectively.
Data-science
Collection of useful data science topics along with code and articles
databricks
Repository of sample Databricks notebooks
DataScienceCourse
This holds iPython notebooks and lecture slides for the Intro to Data Science Master's course I teach at NYU.
Delivery_Analysis_Jumpman23
Take-home data challenge
DS-GA-1007-python-for-data-science
A course on Python for data science.
DS-GA-3001-Advanced-Python
This is the advanced Python course at NYU. It covers performance optimization, multithreaded programming, CUDA, and other advanced Python topics.
ml-basics
Exercise notebooks for Machine Learning modules on Microsoft Learn
mlflow-example
This repository provides an example of dataset preprocessing, GBRT (Gradient Boosted Regression Tree) model training and evaluation, model tuning and finally model serving (REST API) in a containerized environment using MLflow tracking, projects and models modules.
prometheus-anomaly-detector
A newer, updated version of the Prometheus anomaly detector (https://github.com/AICoE/prometheus-anomaly-detector-legacy)
pyod
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
Python
All Algorithms implemented in Python
Spark_projects
Analyzing large-scale datasets in Python with PySpark.
stanford-cs-230-deep-learning
VIP cheatsheets for Stanford's CS 230 Deep Learning
Web_Scraping_projects
Uses Python (Requests, BeautifulSoup) to scrape and analyze an eBay dataset.