zhenghh04

followers

following

stars

Argonne National Laboratory

Lemont, IL

https://scholar.google.com/citations?user=dd7fUtEAAAAJ&hl=en

Organizations

argonne-lcf

hpc-io

Huihuo Zheng's repositories

dlio_ml_workloads

Reference workloads for DLIO Benchmark

Language:Python000

horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Language:PythonNOASSERTION000

dlio_benchmark

An I/O benchmark for deep Learning applications

Language:PythonApache-2.0000

test_dali

Language:Python100

dl_scaling_hang

000

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonNOASSERTION000

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.0000

E3SM-IO

Benchmark programs using the I/O pattern of E3SM

Language:C++NOASSERTION000

vol-log-based

000

exahdf5_sdk

ExaHDF5 project build scripts

Language:Shell000

pyutils

This is a set of utils that I created throughout the years

100

dlio-profiler

A low-level profiler for capture I/O calls from deep learning applications.

Language:C++MIT000

vol-cache

HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O overhead.

Language:CBSD-3-Clause000

exahdf5

Language:Shell000

MLPerf_training

Reference implementations of MLPerf™ training benchmarks

Language:PythonApache-2.0100

dlio_microbenchmark

Language:C++000

E4S-Documenter

A tool to generate documentation for a project based on project metadata (README, Changelog, License, etc.) stored in a yaml file.

Language:PythonMIT000

mlperf_storage

Language:ShellNOASSERTION000

h5bench

A benchmark suite for measuring HDF5 performance.

Language:CNOASSERTION000

ai-science-training-series

Language:Jupyter Notebook000

user-guides

ALCF Systems User Documentation

Language:HTML000

incubator-mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Apache-2.0000

dlio_profiling

This repo demonstrate how to profile I/O for deep learning applications. This is based on VaniDL

Language:Python000

training_results_v1.1

NOASSERTION000

amrex

AMReX: Software Framework for Block Structured AMR

NOASSERTION000

scorpio

A high-level Parallel I/O Library for structured grid applications

000

vanidl

VaniDL is an tool for analyzing I/O patterns and behavior with Deep Learning Applications.

Language:PythonMIT000

vol-async

HDF5 Asynchronous I/O VOL connector that enables asynchronous I/O for HDF5 applications

Language:CNOASSERTION000

UnifyFS

UnifyFS: A file system for burst buffers

NOASSERTION000

vol-external-passthrough

Language:C000