Intel® Gaudi® AI Accelerator

Intel® Gaudi® AI Accelerator 's repositories

Model-References

Reference models for Intel(R) Gaudi(R) AI Accelerator

Language:Python147 33 31

Gaudi-tutorials

Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://developer.habana.ai/

Language:Jupyter Notebook34 7 2

SynapseAI_Core

SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi

Language:CNOASSERTION33 12 2

vllm-fork

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.023 20

Setup_and_Install

Setup and Installation Instructions for Habana binaries, docker image creation

Language:PythonApache-2.020 11 6

Habana_Custom_Kernel

Provides the examples to write and build Habana custom kernels using the HabanaTools

Language:C++14 5 4

hccl_demo

Language:C++Apache-2.011 14 1

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.010 20

Gaudi-solutions

Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases

Language:Jupyter NotebookApache-2.08 90

Gaudi2-Workshop

Language:Jupyter NotebookApache-2.08 90

deepspeed_old

Language:PythonMIT6 60

hl-thunk-open

Thunk library for HabanaLabs kernel driver

Language:CNOASSERTION5 460

Megatron-DeepSpeed

Intel Gaudi's Megatron DeepSpeed Large Language Models for training

Language:PythonNOASSERTION5 20

habana-container-runtime

Habana container runtime

Language:GoApache-2.04 1 1

gohlml

HABANA Management Library bindings for Go

Language:GoApache-2.0300

habanalabs-k8s-device-plugin

HABANA device plugin for Kubernetes

Language:GoApache-2.03 20

Fairseq

Language:PythonMIT2 30

optimum-habana-fork

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

Language:PythonApache-2.02 40

pytorch-lightning

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Language:PythonApache-2.02 10

DL1-Workshop

Language:Jupyter Notebook1 20

drivers.accel.habanalabs.kernel

Language:CNOASSERTION1 20

hccl_ofi_wrapper

Language:C++BSD-3-Clause1 160

papers

Academic papers by Habana research team

1 100

slurm

Slurm: A Highly Scalable Workload Manager

Language:CNOASSERTION100

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonMIT000

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonApache-2.0010

drivers.gpu.linux-nic.kernel

NIC drivers (Ethernet, IBverbs and common) for the NIC IP that is inside Intel's data-center GPU

Language:CNOASSERTION000

Intel_Gaudi3_Software

Intel® Gaudi® Software is an implementation of the runtime and graph compiler for Gaudi3

Language:C++NOASSERTION000

neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Language:PythonApache-2.0010

rdma-core

RDMA core userspace libraries and daemons

Language:CNOASSERTION000