Taowen (Tony)'s repositories

ICHaskellStyleGuide

A Haskell style guide that follows the conventions of Imperial College's 40009 Computing Practical.

Language: Haskell | Stargazers: 1 | Issues: 0

bigscience

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Language: Shell | License: NOASSERTION | Stargazers: 0 | Issues: 0

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

cpufp

A CPU tool for benchmarking peak floating-point performance

Language: Assembly | License: GPL-3.0 | Stargazers: 0 | Issues: 0

dotfiles

my dotfiles

Language: Shell | Stargazers: 0 | Issues: 0

FasterTransformer

Transformer-related optimization, including BERT and GPT

License: Apache-2.0 | Stargazers: 0 | Issues: 0

How_to_optimize_in_GPU

A series on GPU optimization that explains in detail how to optimize CUDA kernels. It covers several basic kernel optimizations, including elementwise, reduce, sgemv, and sgemm. The performance of these kernels is at or near the theoretical limit.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

LLM_Tree_Search

The official implementation of the paper "Alphazero-like Tree-Search can guide large language model decoding and training"

Language: Python | Stargazers: 0 | Issues: 0

Megatron-LLaMA

Best practice for training LLaMA models in Megatron-LM

License: NOASSERTION | Stargazers: 0 | Issues: 0

memory-efficient-attention-pytorch

Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

microxcaling

PyTorch emulation library for Microscaling (MX)-compatible data formats

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

omnisafe

OmniSafe is an infrastructural framework for accelerating SafeRL research.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

paper_reading

A shared paper reading repository for people in the group

Stargazers: 0 | Issues: 0

please

A command-line copilot

Language: Python | Stargazers: 0 | Issues: 0

Retriever

Retriever-0.1B

Language: Python | Stargazers: 0 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

License: Apache-2.0 | Stargazers: 0 | Issues: 0