Huwan Peng (hwpeng)

Company: University of Washington

Location: Seattle

Home Page: huwan.org

Huwan Peng's starred repositories

UWThesis

Class file for University of Washington thesis formatting with LaTeX.

Language: TeX | License: NOASSERTION | Stargazers: 69 | Issues: 0

DAC2022

Cost Model

Language: Python | Stargazers: 6 | Issues: 0

ramulator2

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM standards, emerging RowHammer mitigation techniques). Described in our paper https://people.inf.ethz.ch/omutlu/pub/Ramulator2_arxiv23.pdf

Language: C++ | License: MIT | Stargazers: 193 | Issues: 0

Surelog

SystemVerilog 2017 Pre-processor, Parser, Elaborator, UHDM Compiler. Provides IEEE Design/TB C/C++ VPI and Python AST & UHDM APIs. Compiles on Linux gcc, Windows msys2-gcc & msvc, OsX

Language: C++ | License: Apache-2.0 | Stargazers: 343 | Issues: 0

llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference

Language: Python | License: Apache-2.0 | Stargazers: 314 | Issues: 0

trax

Trax — Deep Learning with Clear Code and Speed

Language: Python | License: Apache-2.0 | Stargazers: 8020 | Issues: 0

Language: Shell | Stargazers: 12636 | Issues: 0

parallelformers

Parallelformers: An Efficient Model Parallelization Toolkit for Deployment

Language: Python | License: Apache-2.0 | Stargazers: 766 | Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 34058 | Issues: 0

slidev

Presentation Slides for Developers

Language: TypeScript | License: MIT | Stargazers: 32189 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 712 | Issues: 0

hedgehog-lab

Run, compile and execute JavaScript for Scientific Computing and Data Visualization TOTALLY TOTALLY TOTALLY in your BROWSER! An open source scientific computing environment for JavaScript TOTALLY in your browser, matrix operations with GPU acceleration, TeX support, data visualization and symbolic computation.

Language: TypeScript | License: Apache-2.0 | Stargazers: 2363 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 91 | Issues: 0

running_page

Make your own running home page

Language: Python | License: MIT | Stargazers: 3426 | Issues: 0

tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Language: Python | License: MIT | Stargazers: 690 | Issues: 0

KataGo

GTP engine and self-play learning in Go

Language: C++ | License: NOASSERTION | Stargazers: 3388 | Issues: 0

bolt

10x faster matrix and vector operations

Language: C++ | License: MPL-2.0 | Stargazers: 2465 | Issues: 0

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language: Python | License: Apache-2.0 | Stargazers: 29314 | Issues: 0

Language: Python | License: MIT | Stargazers: 2426 | Issues: 0

Language: Python | License: MIT | Stargazers: 20 | Issues: 0

WU-UCT

A novel parallel UCT algorithm with linear speedup and negligible performance loss.

Language: Python | License: MIT | Stargazers: 99 | Issues: 0

awesome-monte-carlo-tree-search-papers

A curated list of Monte Carlo tree search papers with implementations.

Language: Python | License: CC0-1.0 | Stargazers: 616 | Issues: 0

alphafold

Open source code for AlphaFold.

Language: Python | License: Apache-2.0 | Stargazers: 12158 | Issues: 0

rlpyt

Reinforcement Learning in PyTorch

Language: Python | License: MIT | Stargazers: 2212 | Issues: 0

snntorch

Deep and online learning with spiking neural networks in Python

Language: Python | License: MIT | Stargazers: 1207 | Issues: 0

PyProf

A GPU performance profiling tool for PyTorch models

Language: Python | License: Apache-2.0 | Stargazers: 490 | Issues: 0

distributedRL

A framework for easy prototyping of distributed reinforcement learning algorithms

Language: Python | License: MIT | Stargazers: 94 | Issues: 0

Awesome-Federated-Learning

FedML - The Research and Production Integrated Federated Learning Library: https://fedml.ai

Stargazers: 1903 | Issues: 0

Ryujinx

Experimental Nintendo Switch Emulator written in C#

Language: C# | License: MIT | Stargazers: 33800 | Issues: 0

Arcade-Learning-Environment

The Arcade Learning Environment (ALE) -- a platform for AI research.

Language: C++ | License: GPL-2.0 | Stargazers: 2106 | Issues: 0