Yu-gyoung-Yun

Yu-gyoung-Yun

Geek Repo

Github PK Tool:Github PK Tool

Yu-gyoung-Yun's repositories

alpa

Training and serving large-scale neural networks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ase_riscv_gem5_sim

RISCV Gem5 simulator flow for Architetture dei Sistemi di Elaborazione

Language:PythonLicense:GPL-2.0Stargazers:0Issues:0Issues:0

awesome-distributed-ml

A curated list of awesome projects and papers for distributed training or inference

Stargazers:0Issues:0Issues:0

awesome-emdl

Embedded and mobile deep learning research resources

License:MITStargazers:0Issues:0Issues:0

awesome-machine-learning-in-compilers

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

License:CC0-1.0Stargazers:0Issues:0Issues:0

awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

Stargazers:0Issues:0Issues:0

code-samples

Source code examples from the Parallel Forall Blog

Language:HTMLLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

cuptisamples

NVIDIA CUPTI samples mirror.

License:NOASSERTIONStargazers:0Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

License:NOASSERTIONStargazers:0Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

Hands-On-GPU-Programming-with-Python-and-CUDA

Hands-On GPU Programming with Python and CUDA, published by Packt

License:MITStargazers:0Issues:0Issues:0

iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.

License:Apache-2.0Stargazers:0Issues:0Issues:0

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LLMSys-PaperList

LLM Systems Paper List

Stargazers:0Issues:0Issues:0

lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ML-Hardware-Collections

News and Paper Collections for Machine Learning Hardware

License:CC0-1.0Stargazers:0Issues:0Issues:0

ml4se

A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering

Stargazers:0Issues:0Issues:0

scale-sim-v2

Repository to host and maintain scale-sim-v2 code

Stargazers:0Issues:0Issues:0

tensor_parallel

Automatically split your PyTorch models on multiple GPUs for training & inference

License:MITStargazers:0Issues:0Issues:0

tensorflow

An Open Source Machine Learning Framework for Everyone

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

TensorNVMe

A Python library transfers PyTorch tensors between CPU and NVMe

Stargazers:0Issues:0Issues:0

torch-ccl

oneCCL Bindings for Pytorch*

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License:NOASSERTIONStargazers:0Issues:0Issues:0

tutorial-multi-gpu

Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial

License:MITStargazers:0Issues:0Issues:0

xla

A machine learning compiler for GPUs, CPUs, and ML accelerators

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:CSSStargazers:0Issues:0Issues:0