SriKrishna Paparaju's repositories
AITemplate
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It specializes in FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
apex
A PyTorch extension: tools for easy mixed-precision and distributed training in PyTorch
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools so that you can focus on what matters.
chroma
the AI-native open-source embedding database
composer
Supercharge Your Model Training
dcgm-exporter
NVIDIA GPU metrics exporter for Prometheus leveraging DCGM
faiss
A library for efficient similarity search and clustering of dense vectors.
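At its core, the operation faiss accelerates is k-nearest-neighbor search over dense vectors. Below is a hedged, plain-Python sketch of exact L2 search (what faiss's flat indexes compute, minus the SIMD/GPU kernels and approximate index structures); the function name and data are illustrative, not faiss's API:

```python
import math

def search(index_vectors, query, k):
    # Brute-force L2 nearest-neighbor search for illustration only:
    # score every stored vector against the query, then keep the k
    # closest. faiss performs this (and approximate variants) with
    # optimized kernels over millions of vectors.
    scored = [
        (sum((q - v) ** 2 for q, v in zip(query, vec)), i)
        for i, vec in enumerate(index_vectors)
    ]
    scored.sort()
    return [(i, math.sqrt(d2)) for d2, i in scored[:k]]

vectors = [[0.0, 0.0], [1.0, 0.0], [0.0, 2.0]]
print(search(vectors, [0.9, 0.1], k=2))  # nearest is vector 1, then vector 0
```

Real workloads would use faiss's compiled indexes rather than this O(n·d) loop; the sketch only pins down the semantics being optimized.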
flash-attention
Fast and memory-efficient exact attention
gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
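The idea behind a semantic cache is to reuse a stored LLM answer when a new prompt is close enough, by some similarity measure, to one already answered. The sketch below is a toy stand-in, not GPTCache's API: it uses word-set Jaccard overlap where GPTCache uses real embedding models and a vector store, and the class name and threshold are assumptions for illustration:

```python
class SemanticCache:
    """Toy semantic cache: return a cached answer when a new prompt's
    word-set Jaccard similarity to a stored prompt clears a threshold."""

    def __init__(self, threshold=0.6):
        self.threshold = threshold
        self.entries = []  # list of (word_set, cached_answer)

    @staticmethod
    def _words(text):
        return set(text.lower().split())

    def get(self, prompt):
        words = self._words(prompt)
        best_answer, best_sim = None, 0.0
        for stored, answer in self.entries:
            sim = len(words & stored) / len(words | stored)
            if sim > best_sim:
                best_answer, best_sim = answer, sim
        return best_answer if best_sim >= self.threshold else None

    def put(self, prompt, answer):
        self.entries.append((self._words(prompt), answer))

cache = SemanticCache()
cache.put("what is the capital of France", "Paris")
print(cache.get("what is the capital of France?"))  # cache hit despite wording drift
```

A production cache replaces the similarity function with embedding distance, which is exactly where the faiss/vector-database integrations mentioned in GPTCache's description come in.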
HIP
HIP: C++ Heterogeneous-Compute Interface for Portability
llama
Inference code for LLaMA models
llama_index
LlamaIndex (formerly GPT Index) is a data framework for your LLM applications
llm-foundry
LLM training code for MosaicML foundation models
material-dashboard
Material Dashboard - Open Source Bootstrap 5 Material Design Admin
NeMo
NeMo: a toolkit for conversational AI
Python-Algorithms
All Algorithms implemented in Python
pytorch-lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
safetensors
Simple, safe way to store and distribute tensors
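The safetensors file layout itself is simple: an 8-byte little-endian header length, a JSON header mapping tensor names to dtype, shape, and byte offsets, then the raw tensor bytes. The sketch below reproduces that layout in stdlib Python to show why the format is safe to parse (no code execution, unlike pickle); it is an illustration of the layout, not a replacement for the safetensors library, and the helper names are invented:

```python
import json
import struct

def write_safetensors(path, tensors):
    # tensors: name -> (dtype_string, shape_list, raw_bytes).
    # Layout: [8-byte LE header size][JSON header][concatenated tensor bytes].
    header, payload, offset = {}, b"", 0
    for name, (dtype, shape, data) in tensors.items():
        header[name] = {"dtype": dtype, "shape": shape,
                        "data_offsets": [offset, offset + len(data)]}
        payload += data
        offset += len(data)
    header_bytes = json.dumps(header).encode("utf-8")
    with open(path, "wb") as f:
        f.write(struct.pack("<Q", len(header_bytes)))
        f.write(header_bytes)
        f.write(payload)

def read_safetensors(path):
    # Reading needs only json.loads plus byte slicing -- no arbitrary
    # code runs, which is the format's safety argument.
    with open(path, "rb") as f:
        (n,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(n))
        data = f.read()
    return {name: data[meta["data_offsets"][0]:meta["data_offsets"][1]]
            for name, meta in header.items()}

import os, tempfile
path = os.path.join(tempfile.mkdtemp(), "toy.safetensors")
write_safetensors(path, {"w": ("F32", [2], struct.pack("<2f", 1.0, 2.0))})
print(read_safetensors(path))
```

Real checkpoints should be written with the safetensors library, which also validates offsets and supports zero-copy/lazy loading.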
semantic-kernel
Integrate cutting-edge LLM technology quickly and easily into your apps
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
thanos
Highly available Prometheus setup with long term storage capabilities. CNCF Sandbox project.
tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
torchfix
TorchFix - a linter for PyTorch-using code with autofix support
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch, TensorFlow, and JAX.
triton
Development repository for the Triton language and compiler
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs