Simu (simudt)

Location: New Atlantis

Home Page: simudt.xyz

Simu's repositories

Griffin-Jax

JAX implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 10 | Issues: 0 | Issues: 0

mpi-ds

MPI Operator DeepSpeed Base Configuration for CIFAR-10

Language: Dockerfile | Stargazers: 4 | Issues: 0 | Issues: 0

miniF2F-code

Dataset of formal Olympiad-level mathematics problems solved with Python code instructions.

Language: Shell | Stargazers: 3 | Issues: 1 | Issues: 0

Tri-RMSNorm

Efficient kernel for RMS normalization with fused operations, including both forward and backward passes and PyTorch compatibility.

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 0 | Issues: 0

LongConv-Jax

JAX/Flax/Linen implementation of "Simple Hardware-Efficient Long Convolutions for Sequence Modeling"

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0

triton-activations

Collection of neural network activation function kernels for OpenAI's Triton language and compiler

Language: Python | Stargazers: 2 | Issues: 0 | Issues: 0

GradientAscent-Jax

Custom gradient ascent solver (optimizer) for JAX/Flax models

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

kmeansops

PyKeOps-powered K-means clustering module that runs on both CPU and GPU

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

lmppl-cli-csv-wrapper

A tiny CLI wrapper around lmppl for calculating the perplexity of pre-trained language models on CSV files

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

Mixture-of-Depths-Jax

JAX module for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

Ring-Attention-Jax

Packaged implementation of "Ring Attention with Blockwise Transformers for Near-Infinite Context" in JAX + Flax.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

Python-Template

Python Package Template is all you need

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

simudt.github.io

blog for the AI era

Language: SCSS | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Composable-Datasets

Transform JSONL Q&A datasets to instruct format with ease

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

jax-triton

jax-triton contains integrations between JAX and OpenAI Triton

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

MEGABYTE-pytorch-DS

Fork of MEGABYTE - PyTorch by lucidrains ("Predicting Million-byte Sequences with Multiscale Transformers", in PyTorch) with a modified DeepSpeed training setup

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

PaLM-rlhf-pytorch-DS

Fork of lucidrains' implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture, with a modified DeepSpeed training setup. Basically ChatGPT but with PaLM

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Simba

A simpler PyTorch + Zeta implementation of the paper "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

zeta

Build high-performance AI models with modular building blocks

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0