acceleration

There are 3 repositories under the acceleration topic.

linearmouse / linearmouse
The mouse and trackpad utility for Mac.
mac macos mouse scrolling utility acceleration productivity sensitivity cursor
Language:Swift 3366
gkjohnson / three-mesh-bvh
A BVH implementation to speed up raycasting and enable spatial queries against three.js meshes.
graphics raycast tree bounds threejs three-js bounds-hierarchy performance geometry mesh distance intersection acceleration bvh webvr point-cloud pointcloud raytracing pathtracing three-mesh-bvh
Language:JavaScript 2300
mit-han-lab / temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
acceleration low-latency temporal-modeling video-understanding efficient-model nvidia-jetson-nano tsm
Language:Python 2021
mit-han-lab / once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
tinyml edge-ai efficient-model acceleration nas automl
Language:Python 1841
mit-han-lab / proxylessnas
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
acceleration automl efficient-model hardware-aware on-device-ai specialization
Language:C++ 1412
mit-han-lab / torchsparse
[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
acceleration pytorch
Language:Cuda 1121
ethanhe42 / channel-pruning
Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)
image-recognition model-compression acceleration object-detection image-classification channel-pruning deep-neural-networks
Language:Python 1065
react-native-sensors / react-native-sensors
A developer friendly approach for sensors in React Native
react-native sensor gyroscope acceleration rxjs magnetometer barometer
Language:Objective-C 885
polygonplanet / chillout
Reduce CPU usage with a non-blocking async loop and improve perceived speed in JavaScript
javascript cpu async cpu-load async-await acceleration cpu-usage cpu-utilization performance optimization lightweight-javascript-library user-experience speedup async-functions non-blocking
Language:JavaScript 592
staticallyio / statically
The CDN for developers.
images acceleration compress-images sponsors image-processing minification cdn optimization javascript css zap
Language:JavaScript 571
mayankk2308 / set-egpu
Display-agnostic acceleration of macOS applications using external GPUs.
egpu macos mojave high-sierra nvidia amd displays acceleration graphics
Language:Shell 481
Syncleus / aparapi
The New Official Aparapi: a framework for executing native Java and Scala code on the GPU.
aparapi gpu opencl java java-library native-components jni gpgpu acceleration accelerator
Language:Java 461
mit-han-lab / distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
acceleration diffusion-models generative-ai generative-model parallelism
Language:Python 445
microsoft / hyperspace
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
acceleration analytics big-data databases indexing spark
Language:Scala 421
wenwei202 / caffe
Caffe for Sparse and Low-rank Deep Neural Networks
deep-neural-networks sparsity acceleration compression low-rank-approximation caffe sparse-convolution
Language:C++ 374
Media-Smart / volksdep
volksdep is an open-source toolbox for deploying and accelerating PyTorch, ONNX and TensorFlow models with TensorRT.
pytorch onnx tensorrt deploy tensorflow jetson-nano jetson-tx2 jetson-xavier inference python keras acceleration
Language:Python 285
gin66 / FastAccelStepper
A high speed stepper library for Atmega 168/328p (nano), Atmega32u4, Atmega 2560, ESP32, ESP32S2, ESP32S3, ESP32C3 and Atmel SAM Due
arduino atmega328 nano acceleration driver-ic highspeed tested stepper motor a4988 esp32-arduino stepper-motor avr delay platformio esp32 sam
Language:C++ 268
lmxyy / sige
[NeurIPS 2022] Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
acceleration conditional-gan conditional-gans ddim ddpm diffusion-models gans gaugan image-editing progressive-distillation sparse sparse-convolution
Language:Python 247
lmbxmu / HRank
PyTorch implementation of our paper accepted by CVPR 2020 (Oral) -- HRank: Filter Pruning using High-Rank Feature Map
acceleration compression pruning
Language:Python 246
jingwood / d2dlib
A .NET library for hardware-accelerated, high performance, immediate mode rendering via Direct2D.
direct2d dotnet library hardward acceleration drawing rendering gpu performance immediate draw bitmap direct2d-api memory-bitmap graphics-context direct2d-bitmap
Language:C# 230
BUAA-CI-LAB / Literatures-on-GNN-Acceleration
A reading list for deep graph learning acceleration.
acceleration accelerator deep-learning gcn gnn graph graph-algorithms graph-computing graph-convolutional-networks graph-neural-networks literature paper-list reading-list
198
mit-han-lab / inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
inference-optimization cnn parallelism acceleration
Language:C++ 187
ros-acceleration / robotic_processing_unit
A robot-specific processing unit. Contains CPUs, FPGAs and GPUs and maps ROS efficiently to them for best performance.
acceleration cpu fpga gpu hardware hardware-acceleration hardwareaccelerated hardwareaccelerator robotics robots ros ros2 rpu robotic-processing-unit
134
robotperf / benchmarks
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
acceleration benchmarking cpu fpga gpu performance robotics ros2 tpu
Language:Python 131
Cultrarius / Swarmz
A free, header-only C++ swarming (flocking) library for real-time applications
boids swarm velocity position acceleration game public-domain library flocking algorithm unreal-engine
Language:C++ 130
mbroemme / vdi-stream-client
VDI Stream Client is a very tiny, low latency and GPU accelerated client to connect to Windows running Parsec Host.
parsec desktop vdi nvidia amd intel gpu acceleration 3d gaming low-latency sdl2 microsoft windows stream vdi-stream-client
Language:C 120
firebuild / firebuild
Automatic build accelerator cache for Linux
acceleration build cache cmake make java ninja parallel scala cpp fortran python clang rust cargo compile rustc
Language:C++ 118
Infini-AI-Lab / TriForce
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
acceleration efficiency inference llm llm-inference long-context speculative-decoding
Language:Python 114
nebuly-ai / exploring-AI-optimization
Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient 🚀
acceleration artificial-intelligence deep-learning inference machine-learning neural-network
108
juliagusak / model-compression-and-acceleration-progress
Repository to track the progress in model compression and acceleration
neural-network compression acceleration tensor-decomposition pruning architecture-search knowledge-distillation sparsification low-rank
102
obss / BIOBSS
A package for processing signals recorded using wearable sensors, such as Electrocardiogram (ECG), Photoplethysmogram (PPG), Electrodermal activity (EDA) and 3-axis acceleration (ACC).
acceleration ecg eda electrocardiography electrodermal-activity feature-extraction galvanic-skin-response heart-rate-variability hrv photoplethysmography ppg signal-processing
Language:Python 96
intel / hexl-fpga
Intel Homomorphic Encryption Acceleration Library for FPGAs, including open source implementation of FPGA kernels for accelerating NTT, INTT, Keyswitch and Dyadic Multiplication modular arithmetic operations, FPGA runtime, and host APIs for connecting to third-party homomorphic encryption libraries.
cryptography privacy homomorphic-encryption fpga acceleration
Language:C++ 84
xtknight / vdpau-va-driver-vp9
Experimental VP9 codec support for vdpau-va-driver (NVIDIA VDPAU-VAAPI wrapper) and chromium-vaapi
vdpau-va-driver chromium-vaapi va-api hardware video acceleration vp9 nvidia chromium vdpau vaapi 4k gpu nvdec
Language:C 76
GitSquared / rinzler
An autonomous parallel processing engine for the browser.
acceleration webworkers
Language:TypeScript 56
whitelok / tvm-lesson
A hands-on tutorial on the core principles of TVM
tvm deep-learning high-performance-computing gpu-acceleration acceleration inference
Language:Python 56
ghamerly / fast-kmeans
Code to speed up k-means clustering. Originally at BaylorCS/baylorml.
k-means clustering unsupervised-learning geometric bounds acceleration
Language:C++ 54
