Junesoo Kang (junesookang)

junesookang

Geek Repo

Company:UNIST

Location:Republic of Korea

Github PK Tool:Github PK Tool

Junesoo Kang's starred repositories

Language:PythonStargazers:21Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:6Issues:0Issues:0

ringattention

Transformers with Arbitrarily Large Context

Language:PythonLicense:Apache-2.0Stargazers:577Issues:0Issues:0
Language:CudaLicense:BSD-2-ClauseStargazers:109Issues:0Issues:0

gds-nvidia-fs

NVIDIA GPUDirect Storage Driver

Language:CLicense:NOASSERTIONStargazers:177Issues:0Issues:0

DeepGNN

DeepGNN is a framework for training machine learning models on large scale graph data.

Language:PythonLicense:MITStargazers:110Issues:0Issues:0

ptgnn

A PyTorch Graph Neural Network Library

Language:PythonLicense:MITStargazers:374Issues:0Issues:0

DeepPlan

Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)

Language:C++License:MITStargazers:50Issues:0Issues:0

pytorch-direct_dgl

PyTorch-Direct code on top of PyTorch-1.8.0nightly (e152ca5) for Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture (accepted by PVLDB)

Stargazers:46Issues:0Issues:0

ogb

Benchmark datasets, data loaders, and evaluators for graph machine learning

Language:PythonLicense:MITStargazers:1894Issues:0Issues:0

IGB-Datasets

Largest realworld open-source graph dataset - Worked done under IBM-Illinois Discovery Accelerator Institute and Amazon Research Awards and in collaboration with NVIDIA Research.

Language:PythonLicense:NOASSERTIONStargazers:74Issues:0Issues:0

gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

Language:C++License:MITStargazers:821Issues:0Issues:0

mscclpp

MSCCL++: A GPU-driven communication stack for scalable AI applications

Language:C++License:MITStargazers:185Issues:0Issues:0

ark

A GPU-driven system framework for scalable AI applications

Language:C++License:MITStargazers:96Issues:0Issues:0

cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

Language:CLicense:NOASSERTIONStargazers:5808Issues:0Issues:0

MoA

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Language:PythonLicense:Apache-2.0Stargazers:2226Issues:0Issues:0

graph-based-deep-learning-literature

links to conference publications in graph-based deep learning

Language:Jupyter NotebookLicense:MITStargazers:4688Issues:0Issues:0

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:11741Issues:0Issues:0

DUCATI_SIGMOD

Accepted paper of SIGMOD 2023, DUCATI: A Dual-Cache Training System for Graph Neural Networks on Giant Graphs with the GPU

Language:PythonStargazers:13Issues:0Issues:0

matmulfreellm

Implementation for MatMul-free LM.

Language:PythonLicense:Apache-2.0Stargazers:2675Issues:0Issues:0
Language:PythonStargazers:1Issues:0Issues:0

LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Language:PythonLicense:MITStargazers:559Issues:0Issues:0

dgSPARSE-Lib

PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity

Language:CudaLicense:MITStargazers:94Issues:0Issues:0

ThunderKittens

Tile primitives for speedy kernels

Language:CudaLicense:MITStargazers:1391Issues:0Issues:0
Language:PythonStargazers:4Issues:0Issues:0

backprop

Backpropagation in Python, C++, and Cuda

Language:C++License:MITStargazers:42Issues:0Issues:0

nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

Language:C++License:MITStargazers:936Issues:0Issues:0

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Language:C++License:MITStargazers:13608Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:4925Issues:0Issues:0

cudnn.torch

Torch-7 FFI bindings for NVIDIA CuDNN

Language:LuaLicense:BSD-2-ClauseStargazers:401Issues:0Issues:0