Harry Chen's repositories
text2knowledge
Extract entities and relationships from biomedical text and build a knowledge graph.
Seeing-is-Believing
Official Implementation of "Seeing is Believing: A Novel Approach to Voice Synthesis from Facial Images Using Zero-shot TTS"
OpenVoice
Instant voice cloning by MyShell.
whisperXX
Use WhisperX for speaker diarization and other models for denoising.
Spider_XHS
Xiaohongshu crawler: scrapes Xiaohongshu notes, user home pages, and search results.
MediaCrawler
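Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, and Weibo posts and comments.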
ChatINDUS
LLM4IE testing
WaveNet
Unofficial implementation of "WaveNet: A Generative Model for Raw Audio"
plip
Pathology Language and Image Pre-Training (PLIP) is the first vision and language foundation model for Pathology AI. PLIP is a large-scale pre-trained model that can be used to extract visual and language features from pathology images and text descriptions. The model is a fine-tuned version of the original CLIP model.
spaceship-section-cuda
Add CUDA status to the zsh prompt
WaveMixSR
Single Image Super Resolution Using WaveMix
OLI2MSI
A dataset for remote sensing super-resolution
End-to-end-ASR-Pytorch
This is an open-source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with PyTorch, the well-known deep learning toolkit.