kebe7jun

Kebe's starred repositories

UTM

Virtual machines for iOS and macOS

Language:SwiftApache-2.02550600

MInference

To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 while maintaining accuracy.

Language:PythonMIT55400

s3fs-fuse

FUSE-based file system backed by Amazon S3

Language:C++GPL-2.0831900

🧊 The next generation Package Manager for Kubernetes 📦 Featuring a GUI and a CLI. Glasskube packages are dependency aware, GitOps ready and can get automatic updates via a central public package repository.

Language:GoApache-2.0245900

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language:PythonApache-2.03109300

gpu-optimization-workshop

Slides, notes, and materials for the workshop

28200

AIOS

AIOS: LLM Agent Operating System

Language:PythonMIT304200

gogo

面向红队的, 高度可控可拓展的自动化引擎

Language:GoGPL-3.0124700

llama-fs

A self-organizing file system with llama 3

Language:Jupyter NotebookMIT463200

LlamaEdge

The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge

Language:RustApache-2.084700

amber

💎 Amber the programming language compiled to bash

Language:RustGPL-3.0367000

ydb

YDB is an open source Distributed SQL Database that combines high availability and scalability with strong consistency and ACID transactions

Language:C++Apache-2.0371100

stable-diffusion-webui-distributed

Chains stable-diffusion-webui instances together to facilitate faster image generation.

Language:Python17300

textbee

textbee.dev is an opensource and free sms-gatway for sending SMS messages through API or dashboard web interface.

Language:TypeScriptMIT30200

triton

Development repository for the Triton language and compiler

Language:C++MIT1203000

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookMIT1132400

LLMBook-zh.github.io

《大语言模型》作者：赵鑫，李军毅，周昆，唐天一，文继荣

194300

continue

⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains

Language:TypeScriptApache-2.01360300

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.01287900

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonNOASSERTION282700

nut.js

Native UI testing / controlling with node

Language:TypeScript214200

inpaint-web

A free and open-source inpainting & image-upscaling tool powered by webgpu and wasm on the browser。| 基于 Webgpu 技术和 wasm 技术的免费开源 inpainting & image-upscaling 工具, 纯浏览器端实现。

Language:TypeScriptGPL-3.0474200

it-tools

Collection of handy online tools for developers, with great UX.

Language:VueGPL-3.01887200

hai-platform

一种任务级GPU算力分时调度的高性能深度学习训练平台

Language:PythonLGPL-3.027100

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonBSD-3-Clause363300

zed

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Language:RustNOASSERTION4208600

lkl-js

Run Linux kernel in your web browser directly

Language:HTML10300

Webpilot

Language:VueGPL-3.0174200

pyelftools

Parsing ELF and DWARF in Python

Language:PythonNOASSERTION194300