SheSung's starred repositories
one-api
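OpenAI key management and redistribution system that exposes a single API for all LLMs. Supports Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot, iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys, ships as a single executable with a prebuilt Docker image, deploys in one click, and works out of the box. Features an English UI.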
TensorRT-Model-Optimizer
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
CH9329_COMM
A Python package that provides convenient communication methods for the CH9329 chip.
pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
text-generation-inference
Large Language Model Text Generation Inference
ComfyUI-Custom-Scripts
Enhancements & experiments for ComfyUI, mostly focusing on UI features
comfyui_bmad_nodes
Utility nodes for ComfyUI
ComfyUI_Comfyroll_CustomNodes
Custom nodes for SDXL and SD1.5 including Multi-ControlNet, LoRA, Aspect Ratio, Process Switches, and many more nodes.
pydegensac
Advanced RANSAC (DEGENSAC) with bells and whistles for homography (H) and fundamental matrix (F) estimation
awesome-chatgpt-prompts
A curated collection of ChatGPT prompts for getting better results from ChatGPT.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
api-for-open-llm
OpenAI-style API for open large language models, so you can use LLMs just like ChatGPT. Supports LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3, etc. A unified backend interface for open-source large language models.
LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
flash-attention
Fast and memory-efficient exact attention