hellocym

Harry Chen's repositories

text2knowledge

Extract entities and relationships from biomedical text and build a knowledge graph.

Language:Jupyter NotebookMIT000

Seeing-is-Believing

Official Implementation of "Seeing is Believing: A Novel Approach to Voice Synthesis from Facial Images Using Zero-shot TTS"

MIT000

whisperXX

Use WhisperX to Diarization and other model to denoise.

MIT000

MediaCrawler

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频｜评论爬虫、微博帖子｜评论爬虫

Apache-2.0000

WaveNet

Unofficial Implementation of WAVENET: A GENERATIVE MODEL FOR RAW AUDIO

MIT000

Pathology Language and Image Pre-Training (PLIP) is the first vision and language foundation model for Pathology AI. PLIP is a large-scale pre-trained model that can be used to extract visual and language features from pathology images and text description. The model is a fine-tuned version of the original CLIP model.

Language:Python000

Optimization

Language:PythonMIT200

CPPPP

MIT000

spaceship-section-cuda

Add CUDA status on zsh

Language:ShellMIT100

BDTK-Download

000

TSP

Language:PythonMIT100

WaveMixSR

Single Image Super Resolution Using WaveMix

Language:Jupyter NotebookMIT000

OLI2MSI

a dataset for remote sensing super-resolution

Language:Python000

LLaVA-Med-Notes

100

BERT

MIT100

End-to-end-ASR-Pytorch

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.

Language:PythonMIT000

hellocym

Harry Chen's repositories

CV

ZHNewsSum

SentiBERT

text2knowledge

Seeing-is-Believing

PatternRecognition

r00t2024

OpenVoice

Transformer-Seq2Seq

whisperXX

Spider_XHS

Mamba

kawaii-python

MediaCrawler

ChatINDUS

WaveNet

Pitt-Radar

copilot-gpt4-service

01Knapsack

plip

Optimization

CPPPP

spaceship-section-cuda

BDTK-Download

TSP

WaveMixSR

OLI2MSI

LLaVA-Med-Notes

BERT

End-to-end-ASR-Pytorch