Kye Gomez's repositories
MultiModalMamba
A novel implementation fusing ViT with Mamba into a fast, agile, and high-performance multi-modal model. Powered by Zeta, the simplest AI framework ever.
VisionMamba
Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model". It's 2.8x faster than DeiT and saves 86.8% GPU memory when performing batch inference to extract features on high-resolution images.
awesome-multi-agent-papers
A compilation of the best multi-agent papers
Python-Package-Template
An easy, reliable, fluid template for Python packages, complete with docs, testing suites, READMEs, GitHub workflows, linting, and much more.
Mixture-of-Depths
Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTORCH
Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
NeoSapiens
The next evolution of Agents
Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
swarms-cloud
Deploy your autonomous agents to production-grade environments with a 99% uptime guarantee, infinite scalability, and self-healing.
MambaFormer
Implementation of MambaFormer in PyTorch + Zeta, from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks".
MLXTransformer
A simple implementation of a Transformer in MLX, Apple's new framework.
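For flavor, here is a hypothetical minimal self-attention block in MLX, assuming the `mlx.core` / `mlx.nn` APIs; it is a sketch under those assumptions, not this repo's implementation:

```python
# Hypothetical minimal self-attention block in MLX (assumed API; see the repo
# for the actual implementation).
import mlx.core as mx
import mlx.nn as nn


class TinyAttention(nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        self.qkv = nn.Linear(dim, 3 * dim)  # fused query/key/value projection
        self.out = nn.Linear(dim, dim)
        self.norm = nn.LayerNorm(dim)

    def __call__(self, x: mx.array) -> mx.array:
        d = x.shape[-1]
        q, k, v = mx.split(self.qkv(self.norm(x)), 3, axis=-1)
        scores = (q @ mx.transpose(k, (0, 2, 1))) / d ** 0.5
        return x + self.out(mx.softmax(scores, axis=-1) @ v)  # pre-norm residual


y = TinyAttention(64)(mx.random.normal((2, 8, 64)))  # -> (2, 8, 64)
```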
BRAVE-ViT-Swarm
Implementation of the paper: "BRAVE : Broadening the visual encoding of vision-language models"
SimplifiedTransformers
SimplifiedTransformers simplifies the transformer block without affecting training: skip connections, projection parameters, sequential sub-blocks, and normalization layers are removed. Experimental results confirm similar training speed and performance.
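A minimal sketch of what such a stripped-down block could look like, assuming removal of the value/output projections, LayerNorm, and the residual skip, with the attention and MLP paths computed in parallel; the `SimplifiedBlock` name and details are illustrative, not the repo's code:

```python
# Hypothetical stripped-down block (illustrative, not the repo's code): no
# skips, no LayerNorm, no value/output projections, and the attention and MLP
# paths run in parallel rather than sequentially.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SimplifiedBlock(nn.Module):
    def __init__(self, dim: int, mlp_ratio: int = 4):
        super().__init__()
        self.wq = nn.Linear(dim, dim, bias=False)
        self.wk = nn.Linear(dim, dim, bias=False)
        # No W_V or output projection: attention mixes the raw token states.
        self.mlp = nn.Sequential(
            nn.Linear(dim, dim * mlp_ratio), nn.GELU(),
            nn.Linear(dim * mlp_ratio, dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        d = x.shape[-1]
        attn = F.softmax(self.wq(x) @ self.wk(x).transpose(-2, -1) / d ** 0.5, dim=-1)
        return attn @ x + self.mlp(x)  # parallel sub-blocks, no residual skip


y = SimplifiedBlock(64)(torch.randn(2, 16, 64))  # -> (2, 16, 64)
```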
CELESTIAL-1
Omni-Modality Processing, Understanding, and Generation
GiediPrime
An experimental architecture using a Mixture of Attentions with sandwiched Macaron feedforwards and other modules.