Phil Wang's repositories
musiclm-pytorch
Implementation of MusicLM, Google's state-of-the-art model for music generation using attention networks, in Pytorch
reformer-pytorch
Reformer, the efficient Transformer, in Pytorch
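The core idea in Reformer is to replace full attention with locality-sensitive hashing, so that only queries and keys falling in the same hash bucket attend to each other. Below is a minimal sketch of just the hashing step, assuming the angular LSH scheme from the paper (random rotations, bucket by argmax over the concatenated positive and negative projections); `lsh_hash` is an illustrative name, not this repo's API, and the full method additionally sorts by bucket and attends within chunks:

```python
import torch

def lsh_hash(x, num_buckets, num_rounds=4):
    # angular LSH as described in the Reformer paper: project onto random
    # rotations, then bucket each vector by the argmax over [xR; -xR]
    dim = x.shape[-1]
    rotations = torch.randn(dim, num_rounds, num_buckets // 2, device=x.device)
    rotated = torch.einsum('...d,drb->...rb', x, rotations)
    buckets = torch.cat([rotated, -rotated], dim=-1).argmax(dim=-1)
    return buckets  # (..., num_rounds) bucket ids in [0, num_buckets)
```

Multiple hash rounds reduce the chance that two similar vectors are unluckily separated by a single hash.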
make-a-video-pytorch
Implementation of Make-A-Video, the state-of-the-art text-to-video generator from Meta AI, in Pytorch
perceiver-pytorch
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
memorizing-transformers-pytorch
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
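The retrieval step can be illustrated compactly. In this sketch, an exact top-k dot-product lookup stands in for the paper's approximate nearest-neighbor index, and `knn_memory_attend` is a hypothetical function name; the real method also gates between local attention and memory attention:

```python
import torch

def knn_memory_attend(q, mem_k, mem_v, topk=32):
    # q: (batch, heads, n, dim); mem_k, mem_v: (num_memories, dim)
    # retrieve the topk most similar stored keys per query, then attend
    # over just those retrieved memories
    sims = torch.einsum('bhnd,md->bhnm', q, mem_k)
    scores, idx = sims.topk(topk, dim=-1)
    attn = scores.softmax(dim=-1)
    vals = mem_v[idx]  # (batch, heads, n, topk, dim)
    return torch.einsum('bhnk,bhnkd->bhnd', attn, vals)
```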
MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
siren-pytorch
Pytorch implementation of SIREN - Implicit Neural Representations with Periodic Activation Functions
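A SIREN layer is simply a linear map followed by a scaled sine, with a specific initialization that keeps activations well distributed through depth. A minimal sketch of one layer, following the paper's initialization scheme (`SirenLayer` is an illustrative class name):

```python
import math
import torch
from torch import nn

class SirenLayer(nn.Module):
    # one SIREN layer: sin(w0 * Wx + b), with the paper's init scheme
    def __init__(self, dim_in, dim_out, w0=30., is_first=False):
        super().__init__()
        self.w0 = w0
        self.linear = nn.Linear(dim_in, dim_out)
        # first layer uses uniform(-1/fan_in, 1/fan_in); later layers scale by w0
        bound = (1 / dim_in) if is_first else (math.sqrt(6 / dim_in) / w0)
        nn.init.uniform_(self.linear.weight, -bound, bound)
        nn.init.uniform_(self.linear.bias, -bound, bound)

    def forward(self, x):
        return torch.sin(self.w0 * self.linear(x))
```

Stacking such layers yields a network that maps coordinates (e.g. pixel positions) to signal values, which is the implicit-representation use case.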
memory-efficient-attention-pytorch
Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"
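The trick in the paper is to process keys and values in chunks while maintaining a running max and running sum, so softmax attention is computed exactly without ever materializing the full (seq × seq) score matrix. A minimal non-causal sketch of that streaming computation (`chunked_attention` is an illustrative name):

```python
import torch

def chunked_attention(q, k, v, chunk_size=1024):
    # q, k, v: (batch, heads, seq, dim). Keys/values are visited chunk by
    # chunk; a running max and running sum keep the softmax numerically
    # stable while only one chunk of scores is ever in memory.
    q = q * (q.shape[-1] ** -0.5)
    out = torch.zeros_like(q)
    row_max = torch.full((*q.shape[:-1], 1), float('-inf'), device=q.device)
    row_sum = torch.zeros_like(row_max)
    for start in range(0, k.shape[-2], chunk_size):
        kc = k[..., start:start + chunk_size, :]
        vc = v[..., start:start + chunk_size, :]
        scores = q @ kc.transpose(-2, -1)
        new_max = torch.maximum(row_max, scores.amax(dim=-1, keepdim=True))
        correction = (row_max - new_max).exp()  # rescale previous accumulators
        exp_scores = (scores - new_max).exp()
        out = out * correction + exp_scores @ vc
        row_sum = row_sum * correction + exp_scores.sum(dim=-1, keepdim=True)
        row_max = new_max
    return out / row_sum
```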
deformable-attention
Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deformable Attention"
electra-pytorch
A simple and working implementation of ELECTRA, a compute-efficient method for pretraining language models from scratch, in Pytorch
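ELECTRA trains a small generator with masked language modeling, then has a discriminator classify, per position, whether a token was replaced by a generator sample. A hedged sketch of the combined loss, ignoring special-token handling; `generator` and `discriminator` here are hypothetical modules returning (batch, seq, vocab) and (batch, seq) logits respectively:

```python
import torch
import torch.nn.functional as F

def electra_loss(generator, discriminator, tokens, mask_token_id,
                 mask_prob=0.15, disc_weight=50.):
    # tokens: (batch, seq) of token ids
    mask = torch.rand_like(tokens, dtype=torch.float) < mask_prob
    masked_tokens = tokens.masked_fill(mask, mask_token_id)

    # 1. the generator is trained with ordinary masked language modeling
    gen_logits = generator(masked_tokens)
    mlm_loss = F.cross_entropy(gen_logits[mask], tokens[mask])

    # 2. sample replacements from the generator to build the corrupted input
    with torch.no_grad():
        sampled = torch.multinomial(gen_logits[mask].softmax(dim=-1), 1).squeeze(-1)
    corrupted = tokens.clone()
    corrupted[mask] = sampled

    # 3. the discriminator predicts which positions hold replaced tokens
    replaced = (corrupted != tokens).float()
    disc_loss = F.binary_cross_entropy_with_logits(discriminator(corrupted), replaced)

    return mlm_loss + disc_weight * disc_loss
```

The efficiency gain comes from the discriminator receiving a learning signal at every position, not just the masked ones.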
block-recurrent-transformer-pytorch
Implementation of Block Recurrent Transformer, in Pytorch
Mega-pytorch
Implementation of Mega, the single-head attention with multi-headed EMA architecture that achieved state of the art on the Long Range Arena benchmark
mlm-pytorch
An implementation of masked language modeling for Pytorch, made as concise and simple as possible
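The standard BERT-style corruption scheme is small enough to show in full: select a fraction of positions, replace most with a mask token, some with random tokens, and leave the rest unchanged, computing the loss only at selected positions. A minimal sketch (`mask_tokens` is an illustrative name, and special-token handling beyond padding is omitted):

```python
import torch

def mask_tokens(tokens, mask_token_id, vocab_size, mask_prob=0.15, pad_token_id=0):
    # BERT-style corruption: pick mask_prob of the non-pad positions; of those,
    # 80% become [MASK], 10% become a random token, 10% are left unchanged
    labels = tokens.clone()
    prob = torch.rand_like(tokens, dtype=torch.float)
    selected = (tokens != pad_token_id) & (prob < mask_prob)
    labels[~selected] = -100  # the default ignore_index of F.cross_entropy

    roll = torch.rand_like(tokens, dtype=torch.float)
    corrupted = tokens.clone()
    corrupted[selected & (roll < 0.8)] = mask_token_id
    random_pos = selected & (roll >= 0.8) & (roll < 0.9)
    corrupted[random_pos] = torch.randint(
        vocab_size, (int(random_pos.sum()),), device=tokens.device)
    return corrupted, labels
```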
graph-transformer-pytorch
Implementation of Graph Transformer in Pytorch, for potential use in replicating AlphaFold2
ETSformer-pytorch
Implementation of ETSformer, a state-of-the-art time-series Transformer, in Pytorch
mixture-of-attention
Some personal experiments around routing tokens to different autoregressive attention branches, akin to mixture-of-experts
discrete-key-value-bottleneck-pytorch
Implementation of Discrete Key / Value Bottleneck, in Pytorch
VN-transformer
A Transformer made of Rotation-equivariant Attention using Vector Neurons
rvq-vae-gpt
My attempts at applying the SoundStream design to learned tokenization of text, followed by hierarchical attention for text generation
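The SoundStream ingredient being borrowed is residual vector quantization: quantize against a codebook, subtract the chosen code from the residual, and repeat with the next codebook, so each stage refines the last. A minimal sketch, omitting the commitment losses and straight-through gradients a trainable version needs (`ResidualVQ` is an illustrative class name):

```python
import torch
from torch import nn

class ResidualVQ(nn.Module):
    # residual vector quantization: each codebook quantizes what the
    # previous codebooks left unexplained
    def __init__(self, dim, codebook_size, num_quantizers):
        super().__init__()
        self.codebooks = nn.ModuleList(
            nn.Embedding(codebook_size, dim) for _ in range(num_quantizers))

    def forward(self, x):                       # x: (batch, seq, dim)
        residual, quantized, indices = x, torch.zeros_like(x), []
        for codebook in self.codebooks:
            # squared euclidean distance to every code
            dists = ((residual.unsqueeze(-2) - codebook.weight) ** 2).sum(dim=-1)
            idx = dists.argmin(dim=-1)          # (batch, seq)
            chosen = codebook(idx)
            quantized = quantized + chosen
            residual = residual - chosen
            indices.append(idx)
        return quantized, torch.stack(indices, dim=-1)
```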
product-key-memory
Standalone Product Key Memory module in Pytorch - for augmenting Transformer models
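Product key memory makes a huge key-value store cheap to query: the query is split in half, each half is scored against only sqrt(N) sub-keys, and the Cartesian product of the two top-k sets recovers the global top-k over all N keys. A simplified sketch omitting the paper's query network and multi-head queries (`ProductKeyMemory` is an illustrative name):

```python
import torch
from torch import nn

class ProductKeyMemory(nn.Module):
    def __init__(self, dim, num_keys=256, topk=8):
        super().__init__()
        self.topk = topk
        self.num_keys = num_keys
        # two sub-key sets, each scored against half the query dimension
        self.sub_keys = nn.Parameter(torch.randn(2, num_keys, dim // 2))
        # num_keys ** 2 values, addressed by pairs of sub-key indices
        self.values = nn.Embedding(num_keys ** 2, dim)

    def forward(self, x):                            # x: (batch, seq, dim)
        q1, q2 = x.chunk(2, dim=-1)
        s1, i1 = (q1 @ self.sub_keys[0].t()).topk(self.topk, dim=-1)
        s2, i2 = (q2 @ self.sub_keys[1].t()).topk(self.topk, dim=-1)
        # Cartesian product of the two top-k sets: topk^2 candidates
        cand = (s1.unsqueeze(-1) + s2.unsqueeze(-2)).flatten(-2)
        cand_idx = (i1.unsqueeze(-1) * self.num_keys + i2.unsqueeze(-2)).flatten(-2)
        scores, pos = cand.topk(self.topk, dim=-1)
        idx = cand_idx.gather(-1, pos)               # global indices of top-k keys
        attn = scores.softmax(dim=-1)
        vals = self.values(idx)                      # (batch, seq, topk, dim)
        return (attn.unsqueeze(-1) * vals).sum(dim=-2)
```

Only 2·sqrt(N) sub-key scores are computed per query, yet the memory holds N = num_keys² addressable values.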
flash-genomics-model
My own attempt at a long-context genomics model, leveraging recent advances in long-context attention modeling (Flash Attention plus other hierarchical methods)
coordinate-descent-attention
Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick the top-k tokens
autoregressive-linear-attention-cuda
CUDA implementation of autoregressive linear attention, incorporating recent research findings
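For reference, causal linear attention itself fits in a few lines of plain Pytorch: with a positive feature map, the causal output reduces to running sums of k ⊗ v and of k. A sketch assuming the common elu+1 feature map; the (n, d, d) running sum materialized below is exactly what a dedicated CUDA kernel avoids:

```python
import torch
import torch.nn.functional as F

def causal_linear_attention(q, k, v, eps=1e-6):
    # q, k, v: (batch, heads, seq, dim); linear in sequence length,
    # since each position only needs prefix sums over k ⊗ v and k
    q, k = F.elu(q) + 1, F.elu(k) + 1
    kv = torch.einsum('bhnd,bhne->bhnde', k, v).cumsum(dim=2)
    k_sum = k.cumsum(dim=2)
    num = torch.einsum('bhnd,bhnde->bhne', q, kv)
    den = torch.einsum('bhnd,bhnd->bhn', q, k_sum).clamp(min=eps)
    return num / den.unsqueeze(-1)
```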
nim-tokenizer
Implementation of a simple BPE tokenizer, but in Nim
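The repo itself is in Nim, but the BPE training loop is compact enough to sketch here in Python, the language of the rest of this listing. Words are kept as space-separated symbols, and each merge fuses the most frequent adjacent symbol pair across the corpus (`learn_bpe` is an illustrative name):

```python
import re
from collections import Counter

def learn_bpe(corpus, num_merges):
    # start from characters; each merge creates one new vocabulary symbol
    words = Counter(' '.join(word) for word in corpus.split())
    merges = []
    for _ in range(num_merges):
        pairs = Counter()
        for word, freq in words.items():
            symbols = word.split()
            for pair in zip(symbols, symbols[1:]):
                pairs[pair] += freq
        if not pairs:
            break
        best = max(pairs, key=pairs.get)
        # merge only whole symbols, not substrings inside other symbols
        pattern = re.compile(r'(?<!\S)' + re.escape(' '.join(best)) + r'(?!\S)')
        new_words = Counter()
        for word, freq in words.items():
            new_words[pattern.sub(''.join(best), word)] += freq
        words = new_words
        merges.append(best)
    return merges
```

Tokenizing new text then means replaying the learned merges in order, which is why the merge list, not the final vocabulary, is the tokenizer's core artifact.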