LiHao217

followers

following

stars

中国科学院大学（University of Chinese Academy of Sciences）

Beijing

https://blog.csdn.net/l919898756

Hao Li's starred repositories

llama.cpp

LLM inference in C/C++

Language:C++MIT66092 548 3849

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonGPL-3.064675 274 1592

generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookMIT63681 538 122

llama

Inference code for Llama models

Language:PythonNOASSERTION55938 523 966

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonApache-2.038715 383 1653

WeChatMsg

提取微信聊天记录，将其导出成HTML、Word、Excel文档永久保存，对聊天记录进行分析生成年度聊天报告，用聊天数据训练专属于个人的AI聊天助手

Language:PythonGPL-3.033770 171 405

Flowise

Drag & drop UI to build your customized LLM flow

Language:TypeScriptApache-2.030490 248 1372

Mr.-Ranedeer-AI-Tutor

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

roop

one-click face swap

Language:PythonGPL-3.028247 2550

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.028117 230 4772

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonAGPL-3.025580 177 130

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonApache-2.018845 172 1369

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookApache-2.018582 153 469

llama2.c

Inference Llama 2 in one file of pure C

Language:CMIT17286 193 220

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonNOASSERTION14797 262 208

triton

Development repository for the Triton language and compiler

Language:C++MIT12976 193 1440

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonApache-2.011681 376 3381

ShiArthur03

Language:MATLABGPL-3.010369 32 1357

trl

Train transformer language models with reinforcement learning.

Language:PythonApache-2.09688 74 1152

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language:PythonBSD-3-Clause8159 139 3742

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:C++MIT7914 77 162

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Language:PythonMIT2520 51 281

Olive

Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.

Language:PythonMIT1546 30 186

trt-samples-for-hackathon-cn

Simple samples for TensorRT programming

Language:PythonApache-2.01489 20 91

onnxruntime-inference-examples

Examples for using ONNX Runtime for machine learning inferencing.

Language:C++MIT1163 38 158

Llama-2-Onnx

Language:PythonNOASSERTION1020 337 26

alphadev

Language:PythonApache-2.0689 13 11

MST-plus-plus-TensorRT

:poodle: :poodle: :poodle: TensorRT 2022复赛方案：首个基于Transformer的图像重建模型MST++的TensorRT模型推断优化

Language:PythonApache-2.0135 2 7

ocolos-public

Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.

Language:C++BSD-2-Clause52 8 6

PerFlow

Domain-specific framework for performance analysis of parallel programs

Language:C++11 2 5