Beast code in Giters

DDanlov's starred repositories

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonApache-2.03083300

video-occupancy-models

Language:Python500

hyperlight

Modular and intuitive Hypernetworks in Pytorch

Language:PythonApache-2.03000

ShiftAddLLM

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Language:PythonApache-2.04200

pam

Language:PythonApache-2.01400

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT1384300

self-reasoning-tokens-pytorch

Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto

Language:PythonMIT5000

relu_kan

Language:Python5400

DIY-Astra

Language:Python14000

LLM_Tree_Search

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Language:Python15000

init2winit

Language:PythonApache-2.07000

mad-lab

A MAD laboratory to improve AI architecture designs 🧪

Language:PythonMIT7700

grok-1

Grok open release

Language:PythonApache-2.04918300

CALM-pytorch

Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind

Language:PythonMIT14600

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language:PythonApache-2.0134900

seq_icl

Language:Jupyter NotebookApache-2.04800

goodai-ltm-benchmark

A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you need to evaluate your own agents. See more in the blogpost:

Language:HTMLNOASSERTION4900

diplomacy_cicero

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Language:PythonNOASSERTION126300

Pushdown-Layers

Code for Pushdown Layers from our EMNLP 2023 paper

Language:Python2600

dnc

A TensorFlow implementation of the Differentiable Neural Computer.

Language:PythonApache-2.0249200

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonMIT5201300

DDanlov

DDanlov's starred repositories

pytorch-image-models

video-occupancy-models

hyperlight

ShiftAddLLM

pam

pykan

self-reasoning-tokens-pytorch

relu_kan

DIY-Astra

LLM_Tree_Search

init2winit

mad-lab

grok-1

CALM-pytorch

OpenDiT

seq_icl

goodai-ltm-benchmark

diplomacy_cicero

Pushdown-Layers

dnc

annotated_deep_learning_paper_implementations

ponder-transformer

dataset-generator

neuralstruct

quip-sharp

QuIP

Qwen-Audio

mirasol-pytorch

DT_Mem

Perceiver-Music-Transformer