Hao Li (LiHao217)

LiHao217

Geek Repo

Company:中国科学院大学(University of Chinese Academy of Sciences)

Location:Beijing

Home Page:https://blog.csdn.net/l919898756

Github PK Tool:Github PK Tool

Hao Li's starred repositories

llama.cpp

LLM inference in C/C++

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonLicense:GPL-3.0Stargazers:64675Issues:274Issues:1592

generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookLicense:MITStargazers:63681Issues:538Issues:122

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:55938Issues:523Issues:966

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38715Issues:383Issues:1653

WeChatMsg

提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手

Language:PythonLicense:GPL-3.0Stargazers:33770Issues:171Issues:405

Flowise

Drag & drop UI to build your customized LLM flow

Language:TypeScriptLicense:Apache-2.0Stargazers:30490Issues:248Issues:1372

Mr.-Ranedeer-AI-Tutor

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

roop

one-click face swap

Language:PythonLicense:GPL-3.0Stargazers:28247Issues:255Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:28117Issues:230Issues:4772

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonLicense:AGPL-3.0Stargazers:25580Issues:177Issues:130

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonLicense:Apache-2.0Stargazers:18845Issues:172Issues:1369

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18582Issues:153Issues:469

llama2.c

Inference Llama 2 in one file of pure C

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonLicense:NOASSERTIONStargazers:14797Issues:262Issues:208

triton

Development repository for the Triton language and compiler

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:11681Issues:376Issues:3381

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:9688Issues:74Issues:1152

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language:PythonLicense:BSD-3-ClauseStargazers:8159Issues:139Issues:3742

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:C++License:MITStargazers:7914Issues:77Issues:162

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Language:PythonLicense:MITStargazers:2520Issues:51Issues:281

Olive

Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.

Language:PythonLicense:MITStargazers:1546Issues:30Issues:186

trt-samples-for-hackathon-cn

Simple samples for TensorRT programming

Language:PythonLicense:Apache-2.0Stargazers:1489Issues:20Issues:91

onnxruntime-inference-examples

Examples for using ONNX Runtime for machine learning inferencing.

Language:C++License:MITStargazers:1163Issues:38Issues:158
Language:PythonLicense:NOASSERTIONStargazers:1020Issues:337Issues:26
Language:PythonLicense:Apache-2.0Stargazers:689Issues:13Issues:11

MST-plus-plus-TensorRT

:poodle: :poodle: :poodle: TensorRT 2022复赛方案: 首个基于Transformer的图像重建模型MST++的TensorRT模型推断优化

Language:PythonLicense:Apache-2.0Stargazers:135Issues:2Issues:7

ocolos-public

Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.

Language:C++License:BSD-2-ClauseStargazers:52Issues:8Issues:6

PerFlow

Domain-specific framework for performance analysis of parallel programs