Juncheng (liujuncheng)

liujuncheng

Geek Repo

Company:OneFlow

Location:Beijing

Github PK Tool:Github PK Tool

Juncheng's starred repositories

LLocalSearch

LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.

Language:SvelteLicense:Apache-2.0Stargazers:4814Issues:0Issues:0

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookLicense:MITStargazers:17287Issues:0Issues:0

algebraic-nnhw

AI acceleration using matrix multiplication with half the multiplications

Language:PythonStargazers:252Issues:0Issues:0

axolotl

Go ahead and axolotl questions

Language:PythonLicense:Apache-2.0Stargazers:5725Issues:0Issues:0

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:2495Issues:0Issues:0

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:8026Issues:0Issues:0

FriendsDontLetFriends

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

Language:RLicense:MITStargazers:5682Issues:0Issues:0

screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Language:PythonLicense:MITStargazers:49282Issues:0Issues:0

lm-format-enforcer

Enforce the output format (JSON Schema, Regex etc) of a language model

Language:PythonLicense:MITStargazers:742Issues:0Issues:0

cilium

eBPF-based Networking, Security, and Observability

Language:GoLicense:Apache-2.0Stargazers:18548Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:6533Issues:0Issues:0

pulsar

A modular and blazing fast runtime security tool for the IoT, powered by eBPF.

Language:RustLicense:NOASSERTIONStargazers:825Issues:0Issues:0

text-embeddings-inference

A blazing fast inference solution for text embeddings models

Language:RustLicense:Apache-2.0Stargazers:1994Issues:0Issues:0

uptime-kuma

A fancy self-hosted monitoring tool

Language:JavaScriptLicense:MITStargazers:49411Issues:0Issues:0

DocsGPT

GPT-powered chat for documentation, chat with your documents

Language:PythonLicense:MITStargazers:14162Issues:0Issues:0

EETQ

Easy and Efficient Quantization for Transformers

Language:C++Stargazers:132Issues:0Issues:0

openstatus

🏓 The open-source synthetic & real user monitoring platform 🏓

Language:TypeScriptLicense:AGPL-3.0Stargazers:4240Issues:0Issues:0

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1853Issues:0Issues:0

KVM-Opencore

OpenCore disk image for running macOS VMs on Proxmox/QEMU

Language:MakefileLicense:GPL-3.0Stargazers:1139Issues:0Issues:0

alaz

Alaz: Advanced eBPF Agent for Kubernetes Observability – Effortlessly monitor K8s service interactions and performance metrics in your K8s environment. Gain in-depth insights with service maps, metrics, distributed tracing, and more, while staying alert to crucial system anomalies 🐝

Language:CLicense:AGPL-3.0Stargazers:584Issues:0Issues:0

netbird

Connect your devices into a single secure private WireGuard®-based mesh network with SSO/MFA and simple access controls.

Language:GoLicense:BSD-3-ClauseStargazers:8879Issues:0Issues:0

whisper.api

This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.

Language:PythonLicense:MITStargazers:839Issues:0Issues:0

Olive

Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.

Language:PythonLicense:MITStargazers:1215Issues:0Issues:0

llm-engine

Scale LLM Engine public repository

Language:PythonLicense:Apache-2.0Stargazers:746Issues:0Issues:0

mosec

A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

Language:PythonLicense:Apache-2.0Stargazers:703Issues:0Issues:0

inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Language:PythonLicense:Apache-2.0Stargazers:2516Issues:0Issues:0

netmaker

Netmaker makes networks with WireGuard. Netmaker automates fast, secure, and distributed virtual networks.

Language:GoLicense:NOASSERTIONStargazers:8953Issues:0Issues:0

llm-awq

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonLicense:MITStargazers:1786Issues:0Issues:0

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonLicense:MITStargazers:3767Issues:0Issues:0

openmlsys-zh

《Machine Learning Systems: Design and Implementation》- Chinese Version

Language:TeXStargazers:3666Issues:0Issues:0