Rui Wang's repositories
Deep-Approximate-Shapley-Propagation
A PyTorch implementation of the DASP algorithm from the paper "Explaining Deep Neural Networks with a Polynomial Time Algorithm for Shapley Value Approximation"
apex
A PyTorch extension: tools for easy mixed-precision and distributed training in PyTorch
bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
flash-attention
Fast and memory-efficient exact attention
google-research
Fork of the Google Research repository
Megatron-LM
Ongoing research training transformer models at scale
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including 8-bit floating point (FP8) precision on Hopper and Ada GPUs, providing better performance with lower memory utilization in both training and inference.