Beast code in Giters

mtanghu's repositories

Transformer-Trader

Investigation into whether Transformers and self-supervised learning could be used to trade currency markets

Language:Jupyter Notebook5 1 2

LEAP

LEAP: Linear Explainable Attention in Parallel for causal language modeling with O(1) path length, and O(1) inference

Language:Jupyter NotebookCC0-1.04 1 16

DNI-RNN

Decoupled Neural Interfaces (Jaderberg et al. 2017) mini-package for easy integration with pytorch RNNs

Language:PythonMIT2 10

Active-Passive-Losses

[ICML2020] Normalized Loss Functions for Deep Learning with Noisy Labels

Language:PythonMIT000

Attention-Advice

Transformers with learned advice vectors

CC0-1.0010

awd-lstm-lm

LSTM and QRNN Language Model Toolkit for PyTorch

Language:PythonBSD-3-Clause000

blockchain_video

COM 217 video presentation code for an explainer on how blockchain works using Manim

Language:Jupyter Notebook000

Citadel-Central-Datathon-Fall21

2nd place winning analysis of smoking data for the Citatdel Central Datathon of Fall 2021 (final report included)

Language:Jupyter Notebook010

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language:PythonApache-2.0000

Rethinking-Neural-Computation

Draft & experiments for an alternative approach to neuro-symbolic AI that allows for "thinking fast and slow"

Language:Jupyter NotebookCC0-1.0010

URF

URF: Unsupervised Random Forest fork that uses scikit learn instead of pycluster for ~100x speed up

Language:PythonApache-2.0000

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonMIT000

dni-pytorch

Decoupled Neural Interfaces using Synthetic Gradients for PyTorch

Language:PythonMIT000

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.

Language:Jupyter NotebookNOASSERTION000

Fastformer

A pytorch &keras implementation and demo of Fastformer.

Language:Jupyter Notebook000

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause000

flops-profiler

pytorch-profiler

Language:PythonMIT000

hccpy

A Python implementation of Hierarchical Condition Categories

Language:PythonApache-2.0000

martingale

quick simulation to see how martingale betting would work with realistic conditions (i.e. finite but large money), as well as removing finite stopping condition

Language:Jupyter NotebookCC0-1.0010

Mega-pytorch

Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena

Language:PythonMIT000

parallelizing_linear_rnns

MIT000

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:C++NOASSERTION000

RWKV-CUDA

The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )

Language:Cuda000

RWKV-LM

RWKV is a RNN with transformer-level performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonApache-2.0000

mtanghu

mtanghu's repositories

Transformer-Trader

LEAP

DNI-RNN

Active-Passive-Losses

Attention-Advice

awd-lstm-lm

blockchain_video

Citadel-Central-Datathon-Fall21

datasets

Rethinking-Neural-Computation

URF

DeepSpeed

dni-pytorch

EconML

Fastformer

flash-attention

flops-profiler

hccpy

martingale

Mega-pytorch

parallelizing_linear_rnns

pytorch

RWKV-CUDA

RWKV-LM

SGConv

smart-on-fhir-tutorial

sru

tinygrad