DDanlov's starred repositories

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonLicense:Apache-2.0Stargazers:30833Issues:0Issues:0
Language:PythonStargazers:5Issues:0Issues:0

hyperlight

Modular and intuitive Hypernetworks in Pytorch

Language:PythonLicense:Apache-2.0Stargazers:30Issues:0Issues:0

ShiftAddLLM

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Language:PythonLicense:Apache-2.0Stargazers:42Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:14Issues:0Issues:0

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:13843Issues:0Issues:0

self-reasoning-tokens-pytorch

Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto

Language:PythonLicense:MITStargazers:50Issues:0Issues:0
Language:PythonStargazers:54Issues:0Issues:0
Language:PythonStargazers:140Issues:0Issues:0

LLM_Tree_Search

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Language:PythonStargazers:150Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:70Issues:0Issues:0

mad-lab

A MAD laboratory to improve AI architecture designs ๐Ÿงช

Language:PythonLicense:MITStargazers:77Issues:0Issues:0

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49183Issues:0Issues:0

CALM-pytorch

Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind

Language:PythonLicense:MITStargazers:146Issues:0Issues:0

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language:PythonLicense:Apache-2.0Stargazers:1349Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:48Issues:0Issues:0

goodai-ltm-benchmark

A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you need to evaluate your own agents. See more in the blogpost:

Language:HTMLLicense:NOASSERTIONStargazers:49Issues:0Issues:0

diplomacy_cicero

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Language:PythonLicense:NOASSERTIONStargazers:1263Issues:0Issues:0

Pushdown-Layers

Code for Pushdown Layers from our EMNLP 2023 paper

Language:PythonStargazers:26Issues:0Issues:0

dnc

A TensorFlow implementation of the Differentiable Neural Computer.

Language:PythonLicense:Apache-2.0Stargazers:2492Issues:0Issues:0

annotated_deep_learning_paper_implementations

๐Ÿง‘โ€๐Ÿซ 60 Implementations/tutorials of deep learning papers with side-by-side notes ๐Ÿ“; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ๐ŸŽฎ reinforcement learning (ppo, dqn), capsnet, distillation, ... ๐Ÿง 

Language:PythonLicense:MITStargazers:52013Issues:0Issues:0

ponder-transformer

Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper

Language:PythonLicense:MITStargazers:78Issues:0Issues:0

dataset-generator

A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of the cost of prompting LLMs directly.

Language:Jupyter NotebookLicense:MITStargazers:18Issues:0Issues:0

neuralstruct

Differentiable data structures for neural nets

Language:GoStargazers:9Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:451Issues:0Issues:0
Stargazers:2Issues:0Issues:0

Qwen-Audio

The official repo of Qwen-Audio (้€šไน‰ๅƒ้—ฎ-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:1273Issues:0Issues:0

mirasol-pytorch

Implementation of ๐ŸŒป Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch

Language:PythonLicense:MITStargazers:85Issues:0Issues:0

DT_Mem

Thisi is the official code base for paper "Think Before You Act: Decision Transformers with Internal Working Memory"

Language:PythonLicense:MITStargazers:13Issues:0Issues:0

Perceiver-Music-Transformer

SOTA Google's Perceiver-AR Music Transformer Implementation and Model

Language:PythonLicense:Apache-2.0Stargazers:91Issues:0Issues:0