Azure99's starred repositories
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
dlssg-to-fsr3
Adds AMD FSR 3 Frame Generation to games by replacing Nvidia DLSS-G Frame Generation (nvngx_dlssg).
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
distributions
NodeSource Node.js Binary Distributions
inshellisense
IDE style command line auto complete
modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
clash-verge
A Clash GUI based on tauri. Supports Windows, macOS and Linux.
ChatGPT-AutoExpert
🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).
bce-qianfan-sdk
Provide best practices for LMOps, as well as elegant and convenient access to the features of the Qianfan MaaS Platform. (提供大模型工具链最佳实践,以及优雅且便捷地访问千帆大模型平台)
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
package-manager-proxy-settings
记录各个包管理器代理设置坑点。
llama-cpp-python
Python bindings for llama.cpp
readability
A standalone version of the readability lib
ReadabiliPy
A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's Readability.js package or in pure-python mode.