tpu

There are 11 repositories under tpu topic.

vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
amd cuda deepseek gpt hpu inference inferentia llama llm llm-serving llmops mlops model-serving pytorch qwen rocm tpu trainium transformer xpu
Language:Python 41242
tensorflow / tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
deep-learning machine-learning machine-translation reinforcement-learning tpu
Language:Python 15929
skypilot-org / skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 15+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
cloud-computing cloud-management cost-management cost-optimization data-science deep-learning distributed-training finops gpu hyperparameter-tuning job-queue job-scheduler llm-serving llm-training machine-learning ml-infrastructure ml-platform multicloud spot-instances tpu
Language:Python 7502
tensorflow / adanet
Fast and flexible AutoML with learning guarantees.
automl deep-learning distributed-training ensemble gpu learning-theory machine-learning neural-architecture-search python tensorflow tpu
Language:Jupyter Notebook 3462
hollance / neural-engine
Everything we actually know about the Apple Neural Engine (ANE)
ane coreml ios iphone neural-engine neural-network tpu
2169
imcaspar / gpt2-ml
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
gpt-2 tpu bert pretrained-models chinese nlp tensorflow text-generation colab
Language:Python 1710
aphrodite-engine / aphrodite-engine
Large-scale LLM inference engine
api-rest inference-engine machine-learning cuda inferentia rocm intel lora speculative-decoding tpu
Language:C++ 1328
ayaka14732 / tpu-starter
Everything you want to know about Google Cloud TPU
tpu deep-learning cloud-tpu google-cloud-platform gcp machine-learning jax
Language:Python 518
chrisbutner / ChessCoach
Neural network-based chess engine capable of natural language commentary
alphazero chess chess-engine commentary-generation cpp gpu keras lichess-bot nlg tensorflow tpu uci
Language:C++ 484
jofrfu / tinyTPU
Implementation of a Tensor Processing Unit for embedded systems and the IoT.
fpga fpga-accelerator tensorflow tensor tpu vhdl zynq xilinx vivado assembly hardware-description-language hardware-designs hardware-acceleration hardware-architectures verilog ip-core embedded-systems linux internet-of-things iot
Language:VHDL 436
tumaer / JAXFLUIDS
Differentiable Fluid Dynamics Package
automatic-differentiation cfd compressible-flows fluid-dynamics gpu gpu-computing high-performance hpc jax machine-learning tpu turbulence jaxfluids deep-learning multi-phase-flows cuda computational-fluid-dynamics
Language:Python 377
magic-blue-smoke / Dual-Edge-TPU-Adapter
Dual Edge TPU Adapter to use it on a system with single PCIe port on m.2 A/B/E/M slot
coral-tpu edge-ai tpu-acceleration tpu-benchmarks tpu pcie-card pcie-interface m2-module m2 coral edge-tpu tensorflow-lite edgetpu home-assistant
332
AI-Hypercomputer / JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
gemma gpt gpu inference jax large-language-models llama llama2 llm model-serving pytorch tpu llm-inference llmops mlops transformer
Language:Python 290
embedeep / Free-TPU
Free TPU for FPGA with compiler supporting Pytorch/Caffe/Darknet/NCNN. An AI processor for using Xilinx FPGA to solve image classification, detection, and segmentation problem.
free tpu npu cnn-accelerator zynq fpga lstm pytorch rnn npu-compiler caffe darknet deep-learning hardware
Language:Shell 246
DECIMER-Image_Transformer
Kohulan / DECIMER-Image_Transformer
DECIMER Image Transformer is a deep-learning-based tool designed for automated recognition of chemical structure images. Leveraging transformer architectures, the model converts chemical images into SMILES strings, enabling the digitization of chemical data from scanned documents, literature, and patents.
chemical-image-recognition decimer deep-learning image-data-mining python tensorflow tpu transformers
Language:Python 238
JuliaGPU / XLA.jl
Julia on TPUs
deep-learning going-faster julia-language machine-learning peanut-butter tpu xla
Language:Julia 223
robotperf / benchmarks
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
acceleration benchmarking cpu fpga gpu performance robotics ros2 tpu
Language:Python 162
cameronshinn / tiny-tpu
Small-scale Tensor Processing Unit built on an FPGA
tpu tpu-acceleration fpga fpga-accelerator neural-network convolutional-neural-networks
Language:Verilog 157
hhk7734 / tensorflow-yolov4
YOLOv4 Implemented in Tensorflow 2.
yolov4 tensorflow tflite tpu coral edgetpu
Language:Python 136
cea-wind / SimpleTPU
A FPGA Based CNN accelerator, following Google's TPU V1.
accelerator cnn cnn-accelerator fpga hls synthesis tpu xilinx
Language:C++ 135
stylegan2-flax-tpu
nyx-ai / stylegan2-flax-tpu
🖼 Training StyleGAN2 on TPUs in JAX
artificial-intelligence generative-adversarial-network generative-model jax tpu
Language:Python 129
embedeep / FREE-TPU-V3plus-for-FPGA
FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference
accelerator ai npu npu-compiler tpu tpu-acceleration transformer ai-processor zynq transformer-accelerator ai-compiler caffe fpga lstm pytorch
Language:V 127
rwightman / efficientnet-jax
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
jax objax flax efficientnet mobilenetv3 mobilenetv2 mixnet tpu flax-linen
Language:Python 127
yapay-ogrenme / googlecodelabs
TPU ile Yapay Sinir Ağlarınızı Çok Daha Hızlı Eğitin
tpu codelabs deeplearning colab-notebook
Language:Jupyter Notebook 127
HomebrewML / revlib
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
pytorch deep-learning revnet deepspeed xla tpu momentumnet
Language:Python 126
AI-Hypercomputer / xpk
xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
gcloud gke tpu
Language:Python 106
koshian2 / OctConv-TFKeras
Unofficial implementation of Octave Convolutions (OctConv) in TensorFlow / Keras.
octconv tensorflow-keras keras tpu
Language:Jupyter Notebook 100
sayakpaul / FunMatch-Distillation
TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
vision bit-resnet transfer-learning knowledge-distillation tpu keras tensorflow image-classification
Language:Jupyter Notebook 87
wmcnally / evopose2d
EvoPose2D is a two-stage human pose estimation model that was designed using neuroevolution. It achieves state-of-the-art accuracy on COCO.
deep-learning human-pose-estimation pose-estimation tensorflow tensorflow2 tpu
Language:Python 84
PINTO0309 / TPU-MobilenetSSD
Edge TPU Accelerator / Multi-TPU + MobileNet-SSD v2 + Python + Async + LattePandaAlpha/RaspberryPi3/LaptopPC
colaboratory google lattepanda mobilenetssd mobilenetv2 opencv python raspberrypi tensorflow-lite tensorflowlite tpu
Language:Python 82
rickiepark / deep-learning-with-python-2nd
<케라스 창시자에게 배우는 딥러닝 2판> 도서의 코드 저장소
cnn deep-learning gan image-classification image-style-transfer keras neural-network rnn tensorflow text-classification text-generation transformer image-augmentation image-segmentation keras-tuner machine-translation mixed-precision multi-gpu tpu time-series
Language:Jupyter Notebook 71
GSOC
captain-pool / GSOC
Repository for Google Summer of Code 2019 https://summerofcode.withgoogle.com/projects/#4662790671826944
tensorflow googlesummerofcode keras tensorflow-2 tpu tf-hub tensorflow-datasets tensorflow-2-sample onnx super-resolution enhanced-super-resolution
Language:Python 68
GoogleCloudPlatform / ml-testing-accelerators
Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
testing-accelerators machine-learning tpu gpu
Language:Jsonnet 64
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
tpu gcp tpu-vm huggingface transformers t5 text-to-text-transfer-transformer seq2seq
Language:Python 58
instadeepai / sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
deep-learning jax machine-learning reinforcement-learning tpu ai hpc podracer ppo sebulba
Language:Python 57
lmvdz / tpu-client
Solana TpuClient Typescript Implementation
cryptocurrency solana tpu web3js
Language:TypeScript 56

tpu

vllm-project / vllm

tensorflow / tensor2tensor

skypilot-org / skypilot

tensorflow / adanet

hollance / neural-engine

imcaspar / gpt2-ml

aphrodite-engine / aphrodite-engine

ayaka14732 / tpu-starter

chrisbutner / ChessCoach

jofrfu / tinyTPU

tumaer / JAXFLUIDS

magic-blue-smoke / Dual-Edge-TPU-Adapter

AI-Hypercomputer / JetStream

embedeep / Free-TPU

Kohulan / DECIMER-Image_Transformer

JuliaGPU / XLA.jl

robotperf / benchmarks

cameronshinn / tiny-tpu

hhk7734 / tensorflow-yolov4

cea-wind / SimpleTPU

nyx-ai / stylegan2-flax-tpu

embedeep / FREE-TPU-V3plus-for-FPGA

rwightman / efficientnet-jax

yapay-ogrenme / googlecodelabs

HomebrewML / revlib

AI-Hypercomputer / xpk

koshian2 / OctConv-TFKeras

sayakpaul / FunMatch-Distillation

wmcnally / evopose2d

PINTO0309 / TPU-MobilenetSSD

rickiepark / deep-learning-with-python-2nd

captain-pool / GSOC

GoogleCloudPlatform / ml-testing-accelerators

gsarti / t5-flax-gcp

instadeepai / sebulba

lmvdz / tpu-client