AI-Hypercomputer / JetStream

JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future -- PRs welcome).

Repository from GitHub: https://github.com/AI-Hypercomputer/JetStream
