Ren Xuancheng's starred repositories
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
hugo-PaperMod
A fast, clean, responsive Hugo theme.
mediawiki-services-parsoid
This is a mirror from https://gerrit.wikimedia.org/g/mediawiki/services/parsoid/. See https://www.mediawiki.org/wiki/Developer_access for contributing.
crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
python-magic
A python wrapper for libmagic
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
internetarchive
A Python and Command-Line Interface to Archive.org
ia-download
Internet archive downloader
llama-cpp-python
Python bindings for llama.cpp
dash-cookbook
Receipts for creating AI Applications with APIs from DashScope (and friends)!
python-markdownify
Convert HTML to Markdown
the-stack-v2
Code for the curation of The Stack v2 and StarCoder2 training data
CodeQwen1.5
CodeQwen1.5 is the code version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.
web-content-extraction-benchmark
Web Content Extraction Benchmark