Xu Wenhao (xuwenhao)

xuwenhao

Geek Repo

Company:bothub & abukito

Location:Singapore

Home Page:http://www.xuwenhao.com

Github PK Tool:Github PK Tool

Xu Wenhao's starred repositories

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:NOASSERTIONStargazers:25782Issues:0Issues:0

crawloop

基于PlayWright和xvfb实现对js渲染的动态网页进行抓取,包含网页源码、截图、网站入口发现、网页交互过程、Web 指纹信息等等,支持优先级任务调度。

Language:PythonLicense:Apache-2.0Stargazers:43Issues:0Issues:0

DrissionPage

基于python的网页自动化工具。既能控制浏览器,也能收发数据包。可兼顾浏览器自动化的便利性和requests的高效率。功能强大,内置无数人性化设计和便捷功能。语法简洁而优雅,代码量少。

Language:PythonLicense:BSD-3-ClauseStargazers:5873Issues:0Issues:0

Infini-attention

Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTORCH

Language:PythonLicense:MITStargazers:28Issues:0Issues:0

DreamLLM

[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation

Language:PythonLicense:Apache-2.0Stargazers:341Issues:0Issues:0
Language:PythonLicense:MITStargazers:3955Issues:0Issues:0

fish-speech

Brand new TTS solution

Language:PythonLicense:NOASSERTIONStargazers:2452Issues:0Issues:0

curl-impersonate

curl-impersonate: A special build of curl that can impersonate Chrome & Firefox

Language:PythonLicense:MITStargazers:3440Issues:0Issues:0

curl_cffi

Python binding for curl-impersonate via cffi. A http client that can impersonate browser tls/ja3/http2 fingerprints.

Language:PythonLicense:MITStargazers:1589Issues:0Issues:0

praw

PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.

Language:PythonLicense:BSD-2-ClauseStargazers:3376Issues:0Issues:0

crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

Language:TypeScriptLicense:Apache-2.0Stargazers:12825Issues:0Issues:0

PulsarRPA

Automate webpages at scale, scrape web data completely and accurately with high performance, distributed RPA.

Language:KotlinLicense:AGPL-3.0Stargazers:680Issues:0Issues:0

NeuScraper

[ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".

Language:PythonLicense:MITStargazers:194Issues:0Issues:0

Agora-Python-SDK

Use Agora RTC SDK with Python!

Language:C++License:MITStargazers:63Issues:0Issues:0

py-spy

Sampling profiler for Python programs

Language:RustLicense:MITStargazers:12092Issues:0Issues:0

pisa

PISA: Performant Indexes and Search for Academia

Language:C++License:Apache-2.0Stargazers:889Issues:0Issues:0

tantivy

Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust

Language:RustLicense:MITStargazers:11223Issues:0Issues:0

SimXNS

SimXNS is a research project for information retrieval. This repo contains official implementations by MSRA NLC team.

Language:PythonLicense:MITStargazers:104Issues:0Issues:0

silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:4666Issues:0Issues:0

setup-ffmpeg

Set up your GitHub Actions workflow with ffmpeg

Language:JavaScriptLicense:MITStargazers:106Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:3129Issues:0Issues:0
Language:PythonStargazers:339Issues:0Issues:0

q

q - Run SQL directly on delimited files and multi-file sqlite databases

Language:PythonLicense:GPL-3.0Stargazers:10143Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:4341Issues:0Issues:0

RealChar

🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖

Language:JavaScriptLicense:MITStargazers:5870Issues:0Issues:0

visiondk

A powerful baseline for image classification and face recognition with Pytorch

Language:PythonLicense:GPL-3.0Stargazers:533Issues:0Issues:0

vosk-browser

A speech recognition library running in the browser thanks to a WebAssembly build of Vosk

Language:JavaScriptLicense:Apache-2.0Stargazers:342Issues:0Issues:0

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7317Issues:0Issues:0

librosa

Python library for audio and music analysis

Language:PythonLicense:ISCStargazers:6806Issues:0Issues:0

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language:Jupyter NotebookLicense:MITStargazers:5281Issues:0Issues:0