Intel® Gaudi® AI Accelerator (HabanaAI)

Home Page: habana.ai

Intel® Gaudi® AI Accelerator's repositories

Model-References

Reference models for Intel® Gaudi® AI Accelerator

Gaudi-tutorials

Tutorials for running models on first-gen Gaudi and Gaudi2 for training and inference. These are the source files for the tutorials on https://developer.habana.ai/

Language: Jupyter Notebook | Stargazers: 34 | Issues: 7 | Issues: 2

SynapseAI_Core

SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi

Language: C | License: NOASSERTION | Stargazers: 33 | Issues: 12 | Issues: 2

vllm-fork

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 23 | Issues: 2 | Issues: 0

Setup_and_Install

Setup and installation instructions for Habana binaries and Docker image creation

Language: Python | License: Apache-2.0 | Stargazers: 20 | Issues: 11 | Issues: 6

Habana_Custom_Kernel

Provides examples of how to write and build Habana custom kernels using the HabanaTools

Language: C++ | License: Apache-2.0 | Stargazers: 11 | Issues: 14 | Issues: 1

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 10 | Issues: 2 | Issues: 0

Gaudi-solutions

Full end-to-end examples showing how to use first-gen Gaudi and Gaudi2 in common use cases

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 8 | Issues: 9 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 8 | Issues: 9 | Issues: 0

Language: Python | License: MIT | Stargazers: 6 | Issues: 6 | Issues: 0

hl-thunk-open

Thunk library for HabanaLabs kernel driver

Language: C | License: NOASSERTION | Stargazers: 5 | Issues: 46 | Issues: 0

Megatron-DeepSpeed

Intel Gaudi's Megatron DeepSpeed Large Language Models for training

Language: Python | License: NOASSERTION | Stargazers: 5 | Issues: 2 | Issues: 0

habana-container-runtime

Habana container runtime

Language: Go | License: Apache-2.0 | Stargazers: 4 | Issues: 1 | Issues: 1

gohlml

HABANA Management Library bindings for Go

Language: Go | License: Apache-2.0 | Stargazers: 3 | Issues: 0 | Issues: 0

habanalabs-k8s-device-plugin

HABANA device plugin for Kubernetes

Language: Go | License: Apache-2.0 | Stargazers: 3 | Issues: 2 | Issues: 0

Language: Python | License: MIT | Stargazers: 2 | Issues: 3 | Issues: 0

optimum-habana-fork

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 4 | Issues: 0

pytorch-lightning

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 1 | Issues: 0

Language: Jupyter Notebook | Stargazers: 1 | Issues: 2 | Issues: 0

Language: C | License: NOASSERTION | Stargazers: 1 | Issues: 2 | Issues: 0

Language: C++ | License: BSD-3-Clause | Stargazers: 1 | Issues: 16 | Issues: 0

papers

Academic papers by the Habana research team

slurm

Slurm: A Highly Scalable Workload Manager

Language: C | License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0

AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

drivers.gpu.linux-nic.kernel

NIC drivers (Ethernet, IBverbs and common) for the NIC IP that is inside Intel's data-center GPU

Language: C | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

Intel_Gaudi3_Software

Intel® Gaudi® Software is an implementation of the runtime and graph compiler for Gaudi3

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

rdma-core

RDMA core userspace libraries and daemons

Language: C | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0