There are 3 repositories under nvcc topic.
Tutorial to install NVIDIA Drivers, CUDA 11.4 and cuDNN for deep learning programming on Ubuntu 20.04.
Example Makefile for CUDA and C++ source files in a standard project layout.
bilibili视频【CUDA 12.1 并行编程入门(C++语言版)】配套代码
Case studies constitute a modern interdisciplinary and valuable teaching practice which plays a critical and fundamental role in the development of new skills and the formation of new knowledge. This research studies the behavior and performance of two interdisciplinary and widely adopted scientific kernels, a Fast Fourier Transform and Matrix Multiplication. Both routines are implemented in the two current most popular many-core programming models CUDA and OpenACC. A Fast Fourier Transform (FFT) samples a signal over a period of time and divides it into its frequency components, computing the Discrete Fourier Transform (DFT) of a sequence. Unlike the traditional approach to computing a DFT, FFT algorithms reduce the complexity of the problem from O(n2) to O(nLog2n). Matrix multiplication is a cornerstone routine in Mathematics, Artificial Intelligence and Machine Learning. This research also shows that the nature of the problem plays a crucial role in determining what many-core model will provide the highest benefit in performance.
Autotuning NVCC Compiler Parameters, published @ CCPE Journal
This repository contains various CUDA C programs demonstrating parallel computing techniques using NVIDIA's CUDA platform.
bilibili视频【CUDA 12.1 并行编程入门(Python语言版)】配套代码
creating a personal neural network library in cuda by as a learning stage side by side project
bilibili视频【CUDA 12.1 并行编程入门(Rust语言版)】配套代码
Resources for autotuning CUDA compiler parameters
Nvidia NVCC CUDA programs for begineers.
Hungarian Algorithm for Linear Assignment Problem implemented using CUDA.
this repository contains the various programs that can written using CUDA Toolkit.
Gradle plugin for integrating Cuda's nvcc tool
Personal libraries for deep learning with C++
Parallel Heterogeneous CPU/GPU computing
This is my thesis work for the Bachelor's degree in Physics. / Este es mi trabajo de titulación para la Licenciatura en Física.
Implementação simples do Perceptron Multicamadas em CUDA.
Problems of June day to day challenge in Leetcode
A plugin for Jupyter Notebook to run CUDA C/C++ code
Solutions to assignment given in the class of CO316
vector calculation with GPU acceleration using CUDA
A python script which helps visualize the sorting routine of bitonic sort (executed in parallel using nvcc).