Weigao Sun's repositories
DataDriven-POPF
Data driven probabilistic optimal power flow with Probabilistic Methods
DeepSpeed-LASP
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:PythonApache-2.0000
Language:PythonApache-2.0000
fairscale-CO2
The Fairscale framework with CO2 integrated.
Language:PythonMIT000
fairseq-CO2
Example of using CO2 within Fairseq.
Language:PythonMIT000
Hard
The exercise code for <learn python the hard way>
Language:PythonApache-2.0000
TNL-MoE
TNL-MoE: Building Mixture-of-Experts from TransNormerLLM (TNL) with Continual Pre-training
Language:PythonApache-2.0000
Language:PythonMIT000
Language:PythonMIT000
Language:PythonNOASSERTION000
Megatron-LM
Ongoing research training transformer models at scale
Language:PythonNOASSERTION000
ring-attention-pytorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
Language:PythonMIT000
Language:Jupyter Notebook000