Sean Zhong's repositories
ahocorasick
A faster and more efficient Golang implement of Aho-Corasick algorithm using Double Array Trie
aidea
AIdea 是一款支持 GPT 以及国产大语言模型通义千问、文心一言等,支持 Stable Diffusion 文生图、图生图、 SDXL1.0、超分辨率、图片上色的全能型 APP。
awesome-prometheus-alerts
🚨 Collection of Prometheus alerting rules
baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
chatbot-ui
A ChatGPT clone for running locally in your browser.
codeshell-vscode
An intelligent coding assistant plugin for Visual Studio Code, developed based on CodeShell
dify
An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine.
filebrowser
📂 Web File Browser
gitea
Git with a cup of tea! Painless self-hosted all-in-one software development service, including Git hosting, code review, team collaboration, package registry and CI/CD
google-webfonts-helper
A Hassle-Free Way to Self-Host Google Fonts. Get eot, ttf, svg, woff and woff2 files + CSS snippets
gotenberg
A developer-friendly API for converting numerous document formats into PDF files, and more!
gptscript
Build AI assistants that interact with your systems
lanarky
Open-source framework to deploy LLM applications in production. Built on top of FastAPI
llama_index
LlamaIndex (GPT Index) is a data framework for your LLM applications
llm-foundry
LLM training code for MosaicML foundation models
LocalAI
:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs ggml, gguf, GPTQ, onnx, TF compatible models: llama, llama2, rwkv, whisper, vicuna, koala, cerebras, falcon, dolly, starcoder, and many others
lunary
The production toolkit for LLMs. Observability, prompt management and evaluations.
MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
protocompile
A parsing/linking engine for protobuf; the guts for a pure Go replacement of protoc.
search2ai
Help your LLMs online
search_with_ai
🤖 Free Search with AI, 💡 Open-Source Perplexity, 📍 Support Ollama/SearXNG, Support Docker deployment. 让AI大模型和搜索引擎回答你的问题,支持本地大模型(Ollama)、聚合搜索引擎SearXNG,支持Docker一键部署。
starlarky
VGS edition of Google's safe and hermetically sealed Starlark language - a non-Turing complete subset of Python 3.
tencent-sensitive-words
腾讯的离线敏感词库
toolbox
The multi-purpose utility command-line tool for web services including Dropbox, Figma, Google, GitHub, etc.
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
uptime-kuma
A fancy self-hosted monitoring tool
xlsx
Go library for reading and writing XLSX files.