Jun's repositories
k8s-learning
kubernetes learning
mqtt-rabbit
rule engine for mqtt
BentoML
The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!
certmagic
Automatic HTTPS for any Go program: fully-managed TLS certificate issuance and renewal
chatglm.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs
dubbo-kubernetes
The Dubbo Kubernetes integration.
ekuiper
Lightweight data stream processing engine for IoT edge
higress-console
higress console
higress-group.github.io
Higress Official Website
istio
Connect, secure, control, and observe services.
khoj
Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our cloud instance. Access from Obsidian, Emacs, Desktop app, Web or Whatsapp.
LLaMA-Factory
Unify Efficient Fine-tuning of 100+ LLMs
llama.cpp
LLM inference in C/C++
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
magistrala
Industrial IoT Messaging and Device Management Platform
neuron
Open source industrial IoT connectivity server
ollama
Get up and running with Llama 2, Mistral, Gemma, and other large language models.
Open-Sora-Plan
This project aim to reproducing Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
QAnything
Question and Answer based on Anything.
ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Time-LLM
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"