Zhen Zhang (zarzen)

zarzen

Geek Repo

Location:Baltimore, MD

Github PK Tool:Github PK Tool


Organizations
ZJUT

Zhen Zhang's repositories

dt-autorun

autorun distributed training experiments and gathering logs

Language:PythonStargazers:1Issues:2Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:C++License:NOASSERTIONStargazers:1Issues:1Issues:0

alpa

Training and serving large-scale neural networks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

byteps

A high performance and generic framework for distributed DNN training

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

d2l-tvm

Dive into Deep Learning Compiler

Language:PythonStargazers:0Issues:1Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:2Issues:0

dlrm

An implementation of a deep learning recommendation model (DLRM)

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

doom.d

doom emacs config

Language:Emacs LispStargazers:0Issues:1Issues:0
Language:C++Stargazers:0Issues:2Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

grace

GRACE - GRAdient ComprEssion for distributed deep learning

Language:PythonLicense:BSD-2-ClauseStargazers:0Issues:0Issues:0

horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:2Issues:0

kickstart.nvim

A launch point for your personal nvim configuration

Language:LuaLicense:MITStargazers:0Issues:0Issues:0

Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

model-prepare

generate models to serve

Language:PythonStargazers:0Issues:2Issues:0

nccl

adding timers for NCCL

Language:C++License:NOASSERTIONStargazers:0Issues:2Issues:0

nccl-fastsocket

NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

nccl-tests

NCCL Tests

Language:CudaLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

ratex

Yuan's fork of Ratex

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

slapo

A schedule language for progressive optimization of large deep learning model training

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

split-annotations

Source code for the split annotations project.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

UGATIT-pytorch

Official PyTorch implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Language:HTMLStargazers:0Issues:2Issues:0