Ya Xiao's starred repositories

developer-roadmap

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

Language:TypeScriptLicense:NOASSERTIONStargazers:283338Issues:6811Issues:2061

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:129520Issues:1121Issues:15256

bert

TensorFlow code and pre-trained models for BERT

Language:PythonLicense:Apache-2.0Stargazers:37517Issues:997Issues:1142

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonLicense:NOASSERTIONStargazers:34276Issues:1062Issues:1816

gensim

Topic Modelling for Humans

Language:PythonLicense:LGPL-2.1Stargazers:15455Issues:432Issues:1846

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language:C++License:Apache-2.0Stargazers:9861Issues:124Issues:733

machine-learning-systems-design

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"

machine-learning-interview

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

generative-models

Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.

Language:PythonLicense:UnlicenseStargazers:7291Issues:297Issues:66

course-v3

The 3rd edition of course.fast.ai

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4905Issues:228Issues:161

soot

Soot - A Java optimization framework

Language:JavaLicense:LGPL-2.1Stargazers:2833Issues:103Issues:1183

phasar

A LLVM-based static analysis framework.

Language:C++License:NOASSERTIONStargazers:920Issues:30Issues:184

mimic3-benchmarks

Python suite to construct benchmark machine learning datasets from the MIMIC-III 💊 clinical database.

Language:PythonLicense:MITStargazers:785Issues:39Issues:116

javalang

Pure Python Java parser and tools

Language:PythonLicense:MITStargazers:716Issues:19Issues:105

samples

DARPA Cyber Grand Challenge Sample Challenges

learning-to-reweight-examples

Code for paper "Learning to Reweight Examples for Robust Deep Learning"

Language:PythonLicense:NOASSERTIONStargazers:269Issues:10Issues:12

heros

IFDS/IDE Solver for Soot and other frameworks

Language:JavaLicense:LGPL-2.1Stargazers:227Issues:22Issues:23

sometimes_deep_sometimes_learning

A collection of DL experiments and notes

Language:Jupyter NotebookLicense:MITStargazers:135Issues:8Issues:3

EN-FR-MLT-tensorflow

English-French Machine Language Translation in Tensorflow

Language:JavaLicense:GPL-3.0Stargazers:103Issues:7Issues:9

genprog-code

GenProg: heuristic, GP-based automatic program repair for C.

systemCallAnomalyDetectionLSTM

system call-based anomaly detection with LSTM

Language:Jupyter NotebookStargazers:15Issues:0Issues:0

FrameHanger

A tool to extract statically and dynamically injected iframes. The information are feed as features into a machine-learning based classifier for malicious iframe detection.

Language:HTMLLicense:Apache-2.0Stargazers:13Issues:2Issues:1
Language:JavaLicense:MITStargazers:1Issues:0Issues:0