Repositories under the distributed-deep-learning topic:
Accelerate local LLM inference and fine-tuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., a local PC with an iGPU, or a discrete GPU such as Arc, Flex, or Max). A PyTorch LLM library that seamlessly integrates with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, ModelScope, etc.
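For context, a minimal sketch of how this kind of library is typically used, assuming ipex-llm's drop-in transformers-style API with 4-bit weight loading (the model id and prompt below are placeholders, not from the repo):

```python
# Hedged sketch: load an LLM with ipex-llm's low-bit weight quantization.
# Assumes the ipex_llm.transformers drop-in API; model id is a placeholder.
from ipex_llm.transformers import AutoModelForCausalLM
from transformers import AutoTokenizer

model_id = "meta-llama/Llama-2-7b-chat-hf"  # placeholder model
tokenizer = AutoTokenizer.from_pretrained(model_id)

# load_in_4bit quantizes weights to INT4 for faster CPU/iGPU inference
model = AutoModelForCausalLM.from_pretrained(model_id, load_in_4bit=True)

inputs = tokenizer("What is distributed deep learning?", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```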
BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
Distributed Keras engine: make Keras faster with only one line of code.
Learn applied deep learning from zero to deployment using TensorFlow 1.8+
A Portable C Library for Distributed CNN Inference on IoT Edge Clusters
sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data
Distributed training of DNNs • C++/MPI Proxies (GPT-2, GPT-3, CosmoFlow, DLRM)
RocketML Deep Neural Networks
SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training
Ok-Topk is a scheme for distributed training with sparse gradients. It integrates a novel sparse allreduce algorithm (less than 6k communication volume, which is asymptotically optimal) with the decentralized parallel Stochastic Gradient Descent (SGD) optimizer, and its convergence is proven both theoretically and empirically.
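To illustrate the underlying idea, here is a generic top-k gradient sparsification sketch. This is NOT the paper's O(k) sparse allreduce; it falls back to a dense allreduce for simplicity, and assumes torch.distributed is already initialized (e.g., via torchrun):

```python
# Generic top-k gradient sparsification sketch (the idea Ok-Topk builds on),
# not the paper's sparse allreduce algorithm.
import torch
import torch.distributed as dist

def topk_allreduce(grad: torch.Tensor, k: int) -> torch.Tensor:
    """Each rank keeps its k largest-magnitude gradient entries; the sparse
    contributions are then summed across ranks (via a dense allreduce here,
    for simplicity) and averaged."""
    flat = grad.flatten()
    _, idx = torch.topk(flat.abs(), k)
    sparse = torch.zeros_like(flat)
    sparse[idx] = flat[idx]
    dist.all_reduce(sparse, op=dist.ReduceOp.SUM)
    return (sparse / dist.get_world_size()).view_as(grad)
```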
TensorFlow (1.8+) Datasets, Feature Columns, Estimators and Distributed Training using Google Cloud Machine Learning Engine
WAGMA-SGD is a decentralized asynchronous SGD scheme based on wait-avoiding group model averaging. Synchronization is relaxed by making the collectives externally triggerable, i.e., a collective can be initiated without requiring that all processes enter it. It partially reduces the data within non-overlapping groups of processes, improving parallel scalability.
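A synchronous stand-in for the group-averaging step, to show the structure only: parameters are averaged within a small process group rather than globally. The actual scheme makes this collective wait-avoiding (externally triggerable), and the group layout below is purely illustrative:

```python
# Synchronous sketch of group model averaging; WAGMA-SGD relaxes the
# blocking collective used here. Assumes torch.distributed is initialized.
import torch
import torch.distributed as dist

def average_within_group(model: torch.nn.Module, group) -> None:
    """Average parameters across only the ranks in one group."""
    size = dist.get_world_size(group=group)
    for p in model.parameters():
        dist.all_reduce(p.data, op=dist.ReduceOp.SUM, group=group)
        p.data /= size

# Illustrative layout: split 8 ranks into two non-overlapping groups of 4
# (dist.new_group must be called collectively by all processes).
# groups = [dist.new_group([0, 1, 2, 3]), dist.new_group([4, 5, 6, 7])]
```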
Scalable NLP model fine-tuning and batch inference with Ray and Anyscale
Java based Convolutional Neural Network package running on Apache Spark framework
This repository contains implementations of a wide variety of deep learning projects across computer vision, NLP, federated learning, and distributed learning, including both university projects and projects built out of personal interest in deep learning.
Distributed deep learning framework based on PyTorch, Numba, NCCL, and ZeroMQ.
Algorithm internship at SHUKUN Technology Co., Ltd (2020/12–2021/5): multi-GPU, multi-node training for deep learning models using Horovod and the NVIDIA Clara Train SDK, with configuration tutorials and performance testing.
Collection of resources for automatic deployment of distributed deep learning jobs on a Kubernetes cluster
PyTorch Examples for Beginners
Horovod tutorial for PyTorch using NVIDIA-Docker.
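The core Horovod/PyTorch setup such a tutorial covers looks roughly like this (a sketch assuming a CUDA-capable node; the model and learning rate are placeholders):

```python
import torch
import horovod.torch as hvd

hvd.init()                               # one process per GPU
torch.cuda.set_device(hvd.local_rank())  # pin each process to its GPU

model = torch.nn.Linear(10, 1).cuda()    # placeholder model
optimizer = torch.optim.SGD(model.parameters(), lr=0.01 * hvd.size())

# Wrap the optimizer: gradients are averaged across workers via allreduce.
optimizer = hvd.DistributedOptimizer(
    optimizer, named_parameters=model.named_parameters())

# Start all workers from identical weights and optimizer state.
hvd.broadcast_parameters(model.state_dict(), root_rank=0)
hvd.broadcast_optimizer_state(optimizer, root_rank=0)
```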
Distributed TensorFlow, Keras, and BigDL on Apache Spark.
A blockchain-based neural architecture search project.
Distributed Deep Learning experiments with the BigDL framework over Databricks
Yelp review classification using a CNN model with Horovod on an HPC cluster.
Simultaneous Multi-Party Learning Framework
Comparison of distributed machine learning techniques applied to openly available datasets
Implemented training strategies to alleviate bottlenecks and improve training speed while maintaining the quality of our GANs.
An implementation of a distributed ResNet model for classifying the CIFAR-10 and MNIST datasets.
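The usual pattern for this kind of distributed ResNet training is PyTorch DistributedDataParallel; a sketch of that pattern (not necessarily this repository's approach), launched with `torchrun --nproc_per_node=<gpus> train.py`:

```python
# DDP sketch: shard CIFAR-10 across ranks and replicate a ResNet per GPU.
import torch
import torch.distributed as dist
import torchvision
from torch.nn.parallel import DistributedDataParallel as DDP

dist.init_process_group(backend="nccl")
local_rank = dist.get_rank() % torch.cuda.device_count()
torch.cuda.set_device(local_rank)

model = torchvision.models.resnet18(num_classes=10).cuda()  # CIFAR-10 classes
model = DDP(model, device_ids=[local_rank])

# DistributedSampler gives each rank a distinct shard of the dataset.
dataset = torchvision.datasets.CIFAR10(
    root="./data", train=True, download=True,
    transform=torchvision.transforms.ToTensor())
sampler = torch.utils.data.distributed.DistributedSampler(dataset)
loader = torch.utils.data.DataLoader(dataset, batch_size=128, sampler=sampler)
```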