Zhengxuan (Sheeran) Yan's repositories
birds-migration
This repo is for the final project of course DS-GA 1007 at NYU CDS
NYU-MFE-Projects
Projects for NYU MFE Program
NYU-Tandon-LeetCode-Bootcamp
NYU Tandon Career Hub LeetCode Bootcamp Fall 2022
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
SDV
Synthetic Data Generation for tabular, relational and time series data.
shap
A game theoretic approach to explain the output of any machine learning model.
PiML-Toolbox
PiML (Python Interpretable Machine Learning) toolbox for model development and validation
ML_Spring_2022
NYU Tandon Machine Learning and Finance Spring 2022
pomegranate
Fast, flexible and easy to use probabilistic modelling in Python.
d6tflow
Python library for building highly effective data science workflows
ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
intro-to-active-learning
Notebook accompanying the blog post Intro to Active Learning
Coursera_Capstone
This is for IBM Data Science Specialization Capstone
Machine-Learning-for-Algorithmic-Trading-Second-Edition_Original
Machine Learning for Algorithmic Trading, Second Edition - published by Packt
TensorFlow
Project containig related material for my TensorFlow articles
Deep-Clustering-Network
PyTorch Implementation of "Towards K-Means-Friendly Spaces: Simultaneous Deep Learning and Clustering," Bo Yang et al., ICML'2017.
MLSMOTE
Multi label Synthetic Minority Over-sampling Technique (MLSMOTE)
markdown-here
Google Chrome, Firefox, and Thunderbird extension that lets you write email in Markdown and render it before sending.
BARRA_risk
A risk evaluation program that follows BARRA's CNE6 and USE4 risk model to predict the risk and distribution of factors in a portfolio. Created by Rosemary He Sept. 2019, under Zhiqiang Zhang.
latexify_py
Generates LaTeX math description from Python functions.
pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
IBM-Data-Science-Professional-Certification
This repository contains all the resources and solution to quizzes given and asked in IBM Data Science Professional Certification.
PythonDataScienceHandbook
Python Data Science Handbook: full text in Jupyter Notebooks
Barra_CNE6
Barra CNE6 因子构建
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
py-faster-rcnn
Faster R-CNN (Python implementation) -- see https://github.com/ShaoqingRen/faster_rcnn for the official MATLAB version
DeepLearnToolbox
Matlab/Octave toolbox for deep learning. Includes Deep Belief Nets, Stacked Autoencoders, Convolutional Neural Nets, Convolutional Autoencoders and vanilla Neural Nets. Each method has examples to get you started.
IntroToDataScience
GitHub Repository to accompany my YouTube series of videos on Introductory Data Science using R.