Sihan Chen's starred repositories

prometheus-fastapi-instrumentator

Instrument your FastAPI with Prometheus metrics.

Language: Python | License: ISC | Stars: 882

GenAIEval

Evaluation, benchmarking, and scorecards targeting throughput and latency performance, accuracy on popular evaluation harnesses, safety, and hallucination

Language: Python | License: Apache-2.0 | Stars: 14

jaeger

CNCF Jaeger, a Distributed Tracing Platform

Language: Go | License: Apache-2.0 | Stars: 19914

GenAIComps

GenAI components at micro-service level; GenAI service composer to create mega-service

Language: Python | License: Apache-2.0 | Stars: 24

docarray

Represent, send, store and search multimodal data

Language: Python | License: Apache-2.0 | Stars: 2863

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stars: 23414

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stars: 29492

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language: Python | License: Apache-2.0 | Stars: 5179

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | Stars: 5375

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language: Python | License: MIT | Stars: 9858

kserve

Standardized Serverless ML Inference Platform on Kubernetes

Language: Python | License: Apache-2.0 | Stars: 3310

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stars: 7614

FasterTransformer

Transformer-related optimization, including BERT and GPT

Language: C++ | License: Apache-2.0 | Stars: 5670

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language: Python | License: BSD-3-Clause | Stars: 7796

noisereduce

Noise reduction in Python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Language: Jupyter Notebook | License: MIT | Stars: 1358

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python | License: Apache-2.0 | Stars: 10990