Repositories under the gpu-parallelization topic:
Scaling Unet in Pytorch
This is an academic experiment comparing CPU and GPU performance using CUDA and OpenMP. It involves implementing three algorithms: Standard Deviation Calculation, Image Convolution, and a Histogram-Based Data Structure, each optimised for parallel execution to demonstrate performance improvements on different hardware architectures.
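For instance, the histogram workload maps naturally onto a GPU by letting many threads count elements concurrently. Below is a minimal, illustrative CUDA sketch (not taken from the repository; the kernel, bin count, and variable names are assumptions) of a 256-bin histogram built with atomic increments:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

#define NUM_BINS 256

// Each thread walks the input with a grid-stride loop and increments
// the matching bin with an atomic add, so concurrent updates stay correct.
__global__ void histogramKernel(const unsigned char *data, int n,
                                unsigned int *bins) {
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    int stride = gridDim.x * blockDim.x;
    for (int i = idx; i < n; i += stride) {
        atomicAdd(&bins[data[i]], 1u);
    }
}

int main() {
    const int n = 1 << 20;
    unsigned char *d_data;
    unsigned int *d_bins;
    cudaMalloc(&d_data, n);
    cudaMalloc(&d_bins, NUM_BINS * sizeof(unsigned int));
    cudaMemset(d_data, 7, n);  // dummy input: every byte set to 7
    cudaMemset(d_bins, 0, NUM_BINS * sizeof(unsigned int));

    histogramKernel<<<256, 256>>>(d_data, n, d_bins);

    unsigned int h_bins[NUM_BINS];
    cudaMemcpy(h_bins, d_bins, sizeof(h_bins), cudaMemcpyDeviceToHost);
    printf("bin 7 count: %u (expected %d)\n", h_bins[7], n);

    cudaFree(d_data);
    cudaFree(d_bins);
    return 0;
}
```

The same pattern (a grid-stride loop plus atomic updates) carries over to a standard deviation kernel, where each thread would accumulate partial sums and sums of squares instead of bin counts.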
Co-occurrence matrices act as the input to many unsupervised learning algorithms, including those that learn word embeddings and modern spectral topic models. However, computing these inputs often takes longer than the inference itself. While much thought has been given to implementing fast learning algorithms, the co-occurrence matrix computation is itself well suited to GPU parallelization, yet GPUs and other specialized hardware have never been used to explicitly compute word-to-word co-occurrence matrices.
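As an illustration of why this task parallelizes well, the sketch below (hypothetical; it is not from the listed repository, and the token layout, window size, and dense-matrix storage are assumptions) assigns one CUDA thread per token position and lets each thread atomically increment the matrix cells for every context word inside its window:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// One thread per token position: for each neighbour within the context
// window, atomically increment the corresponding cell of the dense
// vocab x vocab co-occurrence matrix.
__global__ void cooccurrenceKernel(const int *tokens, int numTokens,
                                   int vocabSize, int window,
                                   unsigned int *matrix) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= numTokens) return;
    int center = tokens[i];
    for (int offset = 1; offset <= window; ++offset) {
        int j = i + offset;
        if (j >= numTokens) break;
        int context = tokens[j];
        // Count the pair symmetrically so matrix[a][b] == matrix[b][a].
        atomicAdd(&matrix[center * vocabSize + context], 1u);
        atomicAdd(&matrix[context * vocabSize + center], 1u);
    }
}

int main() {
    const int numTokens = 8, vocabSize = 4, window = 2;
    int h_tokens[numTokens] = {0, 1, 2, 1, 3, 0, 2, 1};  // toy token-ID sequence

    int *d_tokens;
    unsigned int *d_matrix;
    cudaMalloc(&d_tokens, numTokens * sizeof(int));
    cudaMalloc(&d_matrix, vocabSize * vocabSize * sizeof(unsigned int));
    cudaMemcpy(d_tokens, h_tokens, numTokens * sizeof(int), cudaMemcpyHostToDevice);
    cudaMemset(d_matrix, 0, vocabSize * vocabSize * sizeof(unsigned int));

    cooccurrenceKernel<<<(numTokens + 255) / 256, 256>>>(d_tokens, numTokens,
                                                         vocabSize, window, d_matrix);

    unsigned int h_matrix[vocabSize * vocabSize];
    cudaMemcpy(h_matrix, d_matrix, sizeof(h_matrix), cudaMemcpyDeviceToHost);
    printf("co-occurrences of word 1 with word 2: %u\n",
           h_matrix[1 * vocabSize + 2]);

    cudaFree(d_tokens);
    cudaFree(d_matrix);
    return 0;
}
```

A dense vocabSize x vocabSize matrix is only practical for small vocabularies; a real implementation would more likely accumulate into a sparse structure, but the one-thread-per-position, atomic-update pattern stays the same.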