zhanghua's repositories
apitable
🚀🎉📚 APITable, an API-oriented low-code platform for building collaborative apps and better than all other Airtable open-source alternatives. [WIP]
aria2
aria2 is a lightweight multi-protocol & multi-source, cross-platform download utility operated from the command line. It supports HTTP/HTTPS, FTP, SFTP, BitTorrent and Metalink.
chromium
The official GitHub mirror of the Chromium source
clif
Binding generator to wrap C++ for Python using LLVM.
Cloudreve
🌩 Self-hosted file management and sharing system, supports multiple storage providers
CPM
Easy-to-use CPM for Chinese text generation
cubefs
CubeFS is a cloud native distributed storage platform.
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
duckdb
DuckDB is an in-process SQL OLAP Database Management System
highway
Performance-portable, length-agnostic SIMD with runtime dispatch
html2image
A package acting as a wrapper around the headless mode of existing web browsers to generate images from URLs and from HTML+CSS strings or files.
imagebot
A web bot to crawl websites and scrape images.
kuzu
An in-process property graph database management system built for query speed and scalability.
langchain
⚡ Building applications with LLMs through composability ⚡
lucene
Apache Lucene open-source search software
LucenePlusPlus
Lucene++ is an up-to-date C++ port of the popular Java Lucene library, a high-performance, full-featured text search engine.
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
pipelinedb
High-performance time-series aggregation for PostgreSQL
redpanda
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
repository-doc
Documentation repository.
searx
Privacy-respecting metasearch engine
terarkdb
A RocksDB compatible KV storage engine with better performance
vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀
vast
🔮 Visibility Across Space and Time – The network telemetry engine for data-driven security investigations.
vespa
The open big data serving engine. https://vespa.ai
videodl
Videodl: A lightweight video downloader written in pure Python.
webvideo-downloader
Website video downloader; supports videos on Bilibili, iQIYI, Tencent Video, MGTV, WeTV, and the iQIYI Taiwan site.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
YoutubeDownloader
Downloads videos and playlists from YouTube
YoutubeExplode
Library for exploiting YouTube's internal API