Junseo Park's starred repositories
FlameGraph
Stack trace visualizer
TensorRT-Model-Optimizer
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques, including quantization, pruning, and distillation. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
awesome-knowledge-distillation
Awesome Knowledge Distillation
jetson-containers
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
KakaoPostCodeWeb
For hosting KakaoPostCodeWeb
tensorrtllm_backend
The Triton TensorRT-LLM Backend
2023-MatKor-Rust-Interpreter
2023 Korea University MatKor study group - Rust fundamentals + building an interpreter
Tensorrt-Deformable-Detr
Tensorrt-Deformable-Detr
Awesome-Pruning
A curated list of neural network pruning resources.
poly-match
Source for the "Making Python 100x faster with less than 100 lines of Rust" blog post
awesome-actions
A curated list of awesome actions to use on GitHub