gannn's starred repositories
StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
golang-design-pattern
设计模式 Golang实现-《研磨设计模式》读书笔记
interview-go
golang面试题集合
system-design-101
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
go-design-pattern
golang design pattern go 设计模式实现,包含 23 种常见的设计模式实现,同时这也是极客时间-设计模式之美 的笔记
gobyexample
Go by Example 通过例子学 Golang
apiserver-gin
生产级可用golang api服务端(基于gin)
learning_tools
Go 学习、Go 进阶、Go 实用工具类、Go DDD 项目落地、Go-kit 、Go-Micro 、Go 推送平台、微服务实践
IncarnaMind
Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMs
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
nvidia_gpu_exporter
Nvidia GPU exporter for prometheus using nvidia-smi binary
transpeeder
train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism
feishu-openai
🎒 飞书 ×(GPT-4 + GPT-4V + DALL·E-3 + Whisper)= 飞一般的工作体验 🚀 语音对话、角色扮演、多话题讨论、图片创作、表格分析、文档导出 🚀