liujuncheng

Juncheng's starred repositories

LLocalSearch

LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.

Language:SvelteApache-2.0481400

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookMIT1728700

algebraic-nnhw

AI acceleration using matrix multiplication with half the multiplications

Language:Python25200

axolotl

Go ahead and axolotl questions

Language:PythonApache-2.0572500

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonApache-2.0249500

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonGPL-3.0802600

FriendsDontLetFriends

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

Language:RMIT568200

screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Language:PythonMIT4928200

lm-format-enforcer

Enforce the output format (JSON Schema, Regex etc) of a language model

Language:PythonMIT74200

cilium

eBPF-based Networking, Security, and Observability

Language:GoApache-2.01854800

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.0653300

pulsar

A modular and blazing fast runtime security tool for the IoT, powered by eBPF.

Language:RustNOASSERTION82500

text-embeddings-inference

A blazing fast inference solution for text embeddings models

Language:RustApache-2.0199400

uptime-kuma

A fancy self-hosted monitoring tool

Language:JavaScriptMIT4941100

DocsGPT

GPT-powered chat for documentation, chat with your documents

Language:PythonMIT1416200

EETQ

Easy and Efficient Quantization for Transformers

Language:C++13200

openstatus

🏓 The open-source synthetic & real user monitoring platform 🏓

Language:TypeScriptAGPL-3.0424000

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookApache-2.0185300

KVM-Opencore

OpenCore disk image for running macOS VMs on Proxmox/QEMU

Language:MakefileGPL-3.0113900

alaz

Alaz: Advanced eBPF Agent for Kubernetes Observability – Effortlessly monitor K8s service interactions and performance metrics in your K8s environment. Gain in-depth insights with service maps, metrics, distributed tracing, and more, while staying alert to crucial system anomalies 🐝

Language:CAGPL-3.058400

netbird

Connect your devices into a single secure private WireGuard®-based mesh network with SSO/MFA and simple access controls.

Language:GoBSD-3-Clause887900

whisper.api

This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.

Language:PythonMIT83900

Olive

Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.

Language:PythonMIT121500

llm-engine

Scale LLM Engine public repository

Language:PythonApache-2.074600

mosec

A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

Language:PythonApache-2.070300

inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Language:PythonApache-2.0251600