Hetian Bai (hb1500)

hb1500

Geek Repo

Company:@NYU @BuzzFeed @VoxMedia @HBOMax

Location:New York

Github PK Tool:Github PK Tool

Hetian Bai's starred repositories

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Language:PythonLicense:MITStargazers:29498Issues:561Issues:5620

handson-ml

⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:25144Issues:1086Issues:562

pydata-book

Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:21868Issues:1482Issues:122

stanford-cs-229-machine-learning

VIP cheatsheets for Stanford's CS 229 Machine Learning

vision

Datasets, Transforms and Models specific to Computer Vision

Language:PythonLicense:BSD-3-ClauseStargazers:15878Issues:428Issues:3217

gensim

Topic Modelling for Humans

Language:PythonLicense:LGPL-2.1Stargazers:15523Issues:432Issues:1847

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language:PythonLicense:NOASSERTIONStargazers:13779Issues:201Issues:2308

lime

Lime: Explaining the predictions of any machine learning classifier

Language:JavaScriptLicense:BSD-2-ClauseStargazers:11458Issues:263Issues:634

turicreate

Turi Create simplifies the development of custom machine learning models.

Language:C++License:BSD-3-ClauseStargazers:11189Issues:338Issues:1796

state-of-the-art-result-for-machine-learning-problems

This repository provides state of the art (SoTA) results for all machine learning problems. We do our best to keep this repository up to date. If you do find a problem's SoTA result is out of date or missing, please raise this as an issue or submit Google form (with this information: research paper name, dataset, metric, source code and year). We will fix it immediately.

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language:PythonLicense:MITStargazers:8681Issues:94Issues:181

dowhy

DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.

Language:PythonLicense:MITStargazers:6954Issues:136Issues:465

OpenNMT-py

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Language:PythonLicense:MITStargazers:6695Issues:176Issues:1443

StarSpace

Learning embeddings for classification, retrieval and ranking.

leetcode

👏🏻 leetcode solutions for Humans™

Transformer

Transformer seq2seq model, program that can build a language translator from parallel corpus

Language:PythonLicense:Apache-2.0Stargazers:1327Issues:19Issues:34

sparkit-learn

PySpark + Scikit-learn = Sparkit-learn

Language:PythonLicense:Apache-2.0Stargazers:1152Issues:89Issues:61

RecNN

Reinforced Recommendation toolkit built around pytorch 1.7

Language:PythonLicense:Apache-2.0Stargazers:575Issues:29Issues:25

DME

Dynamic Meta-Embeddings for Improved Sentence Representations

Language:PythonLicense:NOASSERTIONStargazers:332Issues:19Issues:7

PyTorch-Batch-Attention-Seq2seq

PyTorch implementation of batched bi-RNN encoder and attention-decoder.

NATS

Neural Abstractive Text Summarization with Sequence-to-Sequence Models

Language:PythonLicense:GPL-3.0Stargazers:154Issues:5Issues:13

ML-AI-experiments

All my experiments with AI and ML

Language:Jupyter NotebookStargazers:117Issues:17Issues:4

googler

:octocat: Get into Google for Humans™

ir18

Inference and Representation (DS-GA-1005, CSCI-GA.2569), fall 18

Language:TeXStargazers:62Issues:6Issues:0

trcrpm

Temporally-reweighted Chinese restaurant process mixture models for multivariate time series

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:37Issues:8Issues:5

pTSAFall2018

DS-GA 3001.001/.002 Probabilistic time series analysis Fall 2018

Language:Jupyter NotebookStargazers:15Issues:4Issues:0

translate_machine_translation

Vietnamese and Chinese to English

Language:PythonStargazers:15Issues:0Issues:0

DS-GA-1004-Big-Data-Correlation-Discovery-Cross-Multiple-Datasets

Big Data Term Project: Manipulate on very large datasets with Hadoop map-reduce, Spark

Language:PythonStargazers:3Issues:0Issues:0

Plated_Recipe_Tags_Predict

Capstone project--collaborated with Plated.com

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3Issues:4Issues:0
Language:Jupyter NotebookStargazers:1Issues:1Issues:0