gpu

There are 112 repositories under gpu topic.

pytorch / pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
neural-network autograd gpu numpy deep-learning tensor python machine-learning
Language:Python 80767
alacritty
alacritty / alacritty
A cross-platform, OpenGL terminal emulator.
terminal-emulators opengl gpu rust vte terminal linux macos windows bsd
Language:Rust 54883
microsoft / DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
deep-learning pytorch gpu machine-learning billion-parameters data-parallelism model-parallelism inference pipeline-parallelism compression mixture-of-experts trillion-parameters zero
Language:Python 34024
fastai
fastai / fastai
The fastai deep learning library
colab deep-learning fastai gpu machine-learning notebooks python pytorch
Language:Jupyter Notebook 25925
taichi-dev / taichi
Productive, portable, and performant GPU programming in Python.
computer-graphics differentiable-programming gpu gpu-programming sparse-computation taichi
Language:C++ 25147
stats
exelban / stats
macOS system monitor in your menu bar
battery bluetooth clock cpu disk fans gpu macos menubar monitor network sensors stats temperature
Language:Swift 23622
NVIDIA / nvidia-docker
Build and run Docker containers leveraging NVIDIA GPUs
cuda docker gpu nvidia-docker
17170
gpujs / gpu.js
GPU Accelerated JavaScript
gpu webgl javascript math gpgpu glsl nodejs
Language:JavaScript 15042
WebGL-Fluid-Simulation
PavelDoGreat / WebGL-Fluid-Simulation
Play with fluids in your browser (works even on mobile)
fluid gpu navier-stokes simulation webgl
Language:JavaScript 14459
deeplearning4j / deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...
artificial-intelligence clojure deeplearning deeplearning4j dl4j gpu hadoop intellij java linear-algebra matrix-library neural-nets python scala spark
Language:Java 13556
Rem0o / FanControl.Releases
This is the release repository for Fan Control, a highly customizable fan controlling software for Windows.
control cpu curves fan fancontrol gpu pwm speed temperature
13329
neovide / neovide
No Nonsense Neovim Client in Rust
gpu neovim neovim-guis rust skia
Language:Rust 12446
wgpu
gfx-rs / wgpu
A cross-platform, safe, pure-Rust graphics API.
webgpu rust gpu metal opengl vulkan hacktoberfest d3d12
Language:Rust 11732
apache / tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
compiler tensor deep-learning gpu opencl metal performance javascript rocm tvm vulkan spirv machine-learning
Language:Python 11456
scalene
plasma-umass / scalene
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
cpu cpu-profiling gpu gpu-programming memory-allocation memory-consumption performance-analysis performance-cpu profiler profiles-memory profiling python python-profilers scalene
Language:Python 11435
Open3D
isl-org / Open3D
Open3D: A Modern Library for 3D Data Processing
mesh-processing computer-graphics opengl cpp python reconstruction odometry visualization registration machine-learning 3d pointcloud rendering gui 3d-perception gpu arm cuda pytorch tensorflow
Language:C++ 10924
openwall / john
John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
john c ripper password jtr opencl gpgpu assembler cracker crypt hash simd openmp gpu fpga mpi
Language:C 9722
pycaret / pycaret
An open-source, low-code machine learning library in Python
data-science citizen-data-scientists python machine-learning pycaret ml gpu time-series regression classification anomaly-detection clustering
Language:Jupyter Notebook 8695
OlafenwaMoses / ImageAI
A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
artificial-intelligence machine-learning prediction image-prediction python python3 offline-capable imageai artificial-neural-networks algorithm image-recognition object-detection squeezenet densenet video inceptionv3 detection gpu ai-practice-recommendations
Language:Python 8506
rapidsai / cudf
cuDF - GPU DataFrame Library
arrow cpp cuda cudf dask data-analysis data-science dataframe gpu pandas pydata python rapids
Language:C++ 8081
cupy / cupy
NumPy & SciPy for GPU
cuda cudnn cublas cusolver nccl python numpy cupy curand cusparse gpu cutensor scipy nvtx nvrtc tensor cusparselt rocm
Language:Python 8005
catboost / catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
machine-learning decision-trees gradient-boosting gbm gbdt python r kaggle gpu-computing catboost tutorial categorical-features gpu coreml data-science big-data cuda data-mining
Language:Python 7932
MVIG-SJTU / AlphaPose
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
accurate alpha-pose alphapose crowdpose full-body gpu human-computer-interaction human-joints human-pose-estimation human-pose-tracking human-tracking keypoints person-pose-estimation pose-estimation posetracking pytorch realtime skeleton tracking whole-body
Language:Python 7866
triton-inference-server / server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
cloud datacenter deep-learning edge gpu inference machine-learning
Language:Python 7825
Syllo / nvtop
GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm
adreno amd apple ascend command-line-tool gpu huawei intel linux monitoring ncurses nvidia
Language:C 7793
h2oai / h2o-3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
h2o machine-learning data-science deep-learning big-data ensemble-learning gbm random-forest naive-bayes pca opensource distributed java python r hadoop spark gpu automl h2o-automl
Language:Jupyter Notebook 6823
gyroflow / gyroflow
Video stabilization using gyroscope data
video-processing stabilization gyroscope rust gpu-computing rolling-shutter-undistortion gopro sony-alpha-cameras insta360 fpv video gpu
Language:Rust 6401
skypilot-org / skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
cloud-computing data-science deep-learning gpu hyperparameter-tuning machine-learning tpu job-queue job-scheduler cloud-management distributed-training ml-infrastructure multicloud spot-instances ml-platform cost-management cost-optimization finops llm-serving llm-training
Language:Python 6323
intel-analytics / ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, GraphRAG, DeepSpeed, vLLM, FastChat, Axolotl, etc.
pytorch llm transformers gpu
Language:Python 6317
g-helper
seerge / g-helper
Lightweight Armoury Crate alternative for Asus laptops and ROG Ally. Control tool for ROG Zephyrus G14, G15, G16, M16, Flow X13, Flow X16, TUF, Strix, Scar and other models
ally amd armoury armoury-crate asus aura cpu fan g-helper g14 g16 gpu intel mux nvidia overclock power rog strix tuf
Language:C# 6100
chainer / chainer
A flexible framework of neural networks for deep learning
chainer cuda cudnn cupy deep-learning gpu machine-learning neural-network neural-networks numpy python
Language:Python 5882
halide / Halide
a language for fast, portable data-parallel computation
compiler dsl gpu halide hexagon image-processing performance
Language:C++ 5796
mviereck / x11docker
Run GUI applications and desktops in docker and podman containers. Focus on security.
containers desktop docker gpu gui html5 nerdctl podman printer pulseaudio sandbox sound ssh vnc wayland webcam x x11 xorg xpra
Language:Shell 5523
zeux / meshoptimizer
Mesh optimization library that makes meshes smaller and faster to render
compression gltf gpu mesh-processing optimization simplification
Language:C++ 5372
gfx
gfx-rs / gfx
[maintenance mode] A low-overhead Vulkan-like GPU API for Rust.
graphics-apis rust gfx opengl vulkan metal dx12 dx11 graphics gpu
Language:Rust 5348
DALI
NVIDIA / DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch
Language:C++ 5015

gpu

pytorch / pytorch

alacritty / alacritty

microsoft / DeepSpeed

fastai / fastai

taichi-dev / taichi

exelban / stats

NVIDIA / nvidia-docker

gpujs / gpu.js

PavelDoGreat / WebGL-Fluid-Simulation

deeplearning4j / deeplearning4j

Rem0o / FanControl.Releases

neovide / neovide

gfx-rs / wgpu

apache / tvm

plasma-umass / scalene

isl-org / Open3D

openwall / john

pycaret / pycaret

OlafenwaMoses / ImageAI

rapidsai / cudf

cupy / cupy

catboost / catboost

MVIG-SJTU / AlphaPose

triton-inference-server / server

Syllo / nvtop

h2oai / h2o-3

gyroflow / gyroflow

skypilot-org / skypilot

intel-analytics / ipex-llm

seerge / g-helper

chainer / chainer

halide / Halide

mviereck / x11docker

zeux / meshoptimizer

gfx-rs / gfx

NVIDIA / DALI