drunkpig's repositories
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
BaiduPCS-Go
Re-upload of iikira/BaiduPCS-Go
Language:GoApache-2.0000
Magic-Doc
conversion doc(doc/docx/ppt/pptx)to markdown
Language:PythonApache-2.0000
MinerU
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
Language:PythonAGPL-3.0000
PDF-Explained
《PDF 解析》
MIT000
pdf_toolbox
pdf 解析基础函数
shadowsocks-auth-go
A auth front for shadowsocks,which support rfc1929 username/password authentication method
terminal-stock
终端下看股票
twitter-video-dl
Download twitter videos as mp4 files
Language:PythonUnlicense000
UniMERNet
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
Language:Jupyter NotebookApache-2.0000