Misby's repositories

llm_kvcache_sparsity

Implement some method of LLM KV Cache Sparsity

Stargazers:0Issues:0Issues:0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level MLLM on Your Phone

License:Apache-2.0Stargazers:0Issues:0Issues:0

libpfm4

This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branch for developing patches.

License:NOASSERTIONStargazers:0Issues:0Issues:0

catapult

Deprecated Catapult GitHub. Please instead use http://crbug.com "Speed>Benchmarks" component for bugs and https://chromium.googlesource.com/catapult for downloading and editing source code..

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

llamafile

Distribute and run LLMs with a single file.

License:NOASSERTIONStargazers:0Issues:0Issues:0

LLM-Viewer

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

License:MITStargazers:0Issues:0Issues:0

ccf-deadlines

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

License:MITStargazers:0Issues:0Issues:0

aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

License:NOASSERTIONStargazers:0Issues:0Issues:0

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

License:BSD-3-ClauseStargazers:1Issues:0Issues:0

Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding

License:Apache-2.0Stargazers:0Issues:0Issues:0

FastGPT

FastGPT is a knowledge-based platform built on the LLM, offers out-of-the-box data processing and model invocation capabilities, allows for workflow orchestration through Flow visualization!

License:NOASSERTIONStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0
License:BSD-3-Clause-ClearStargazers:0Issues:0Issues:0

MCSD

Multi-Candidate Speculative Decoding

License:MITStargazers:0Issues:0Issues:0

hpipm

High-performance interior-point-method QP and QCQP solvers

License:NOASSERTIONStargazers:0Issues:0Issues:0

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:CLicense:MITStargazers:0Issues:0Issues:0

agi

Android GPU Inspector

License:Apache-2.0Stargazers:0Issues:0Issues:0

llama2.c

Inference Llama 2 in one file of pure C

License:MITStargazers:0Issues:0Issues:0

GPy

Gaussian processes framework in python

License:BSD-3-ClauseStargazers:0Issues:0Issues:0
Stargazers:1Issues:0Issues:0

LLMSpeculativeSampling

Fast inference from large lauguage models via speculative decoding

Stargazers:0Issues:0Issues:0

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

License:Apache-2.0Stargazers:0Issues:0Issues:0

MiniGPT-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

License:MITStargazers:0Issues:0Issues:0

MegEngine

MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架

License:Apache-2.0Stargazers:0Issues:0Issues:0

examples

TensorFlow examples

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

TNN

TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and per

License:NOASSERTIONStargazers:0Issues:0Issues:0

coder-kung-fu

开发内功修炼

License:Apache-2.0Stargazers:0Issues:0Issues:0

transformers-android-demo

📲 Transformers android examples (Tensorflow Lite & Pytorch Mobile)

License:Apache-2.0Stargazers:0Issues:0Issues:0