Yunsong Wang's repositories
benchmark
A microbenchmark support library
Catch2
A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++03 on the Catch1.x branch)
cccl
CUDA C++ Core Libraries
CppCoreGuidelines
The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++
cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
cugraph
cuGraph - RAPIDS Graph Analytics Library
cutlass
CUDA Templates for Linear Algebra Subroutines
fastparquet
python implementation of the parquet columnar file format.
Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
hpx
The C++ Standard Library for Parallelism and Concurrency
identify
File identification library for Python
integration
RAPIDS - combined conda package & integration tests for all of RAPIDS libraries
kokkos
Kokkos C++ Performance Portability Programming EcoSystem: The Programming Model - Parallel Execution and Memory Abstraction
nccl
Optimized primitives for collective multi-GPU communication
parquet-format
Apache Parquet
pre-commit
A framework for managing and maintaining multi-language pre-commit hooks.
pycuda
CUDA integration for Python, plus shiny features
raft
Rapids Analytics Framework Toolset to share building blocks between cuGraph and cuML
stdexec
`std::execution`, the proposed C++ framework for asynchronous and parallel programming.
velox
A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.