MiyazonoKaori137's repositories
Bert-VITS2
vits2 backbone with multilingual-bert
Language:PythonAGPL-3.0000
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:PythonMIT000
Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter NotebookApache-2.0000
GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:PythonApache-2.0000
so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:PythonAGPL-3.0000
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:PythonMIT000