hwpeng

Huwan Peng's starred repositories

UWThesis

Class file for University of Washington thesis formatting with LaTeX.

Language: TeX, License: NOASSERTION, Stars: 69

ramulator2

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It supports agile implementation and evaluation of new memory system designs (e.g., new DRAM standards, emerging RowHammer mitigation techniques). Described in our paper: https://people.inf.ethz.ch/omutlu/pub/Ramulator2_arxiv23.pdf

Language: C++, License: MIT, Stars: 193

Surelog

SystemVerilog 2017 pre-processor, parser, elaborator, and UHDM compiler. Provides IEEE Design/TB C/C++ VPI and Python AST and UHDM APIs. Compiles on Linux (gcc), Windows (msys2-gcc and msvc), and OS X.

Language: C++, License: Apache-2.0, Stars: 343

llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference

Language: Python, License: Apache-2.0, Stars: 314

trax

Trax — Deep Learning with Clear Code and Speed

Language: Python, License: Apache-2.0, Stars: 8020

haoel.github.io

Language: Shell, Stars: 12636

parallelformers

Parallelformers: An Efficient Model Parallelization Toolkit for Deployment

Language: Python, License: Apache-2.0, Stars: 766

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python, License: Apache-2.0, Stars: 34058

slidev

Presentation Slides for Developers

Language: TypeScript, License: MIT, Stars: 32189

circuit_training

Language: Python, License: Apache-2.0, Stars: 712

hedgehog-lab

Run, compile, and execute JavaScript for scientific computing and data visualization entirely in your browser. An open-source scientific computing environment for JavaScript that runs in the browser, with GPU-accelerated matrix operations, TeX support, data visualization, and symbolic computation.

Language: TypeScript, License: Apache-2.0, Stars: 2363