CZ26

CZ26's repositories

CycleTransGAN-EVC

CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer

Language:Python3100

FaceSwapping

Face swapping function based on the paper "Motion Representations for Articulated Animation"

Language:Python1 10

AudioCLIP

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Language:PythonMIT000

avatarface_implement

Face Swapping

Language:PythonNOASSERTION000

character-mining

Mining individual characters in multiparty dialogue

Language:PythonNOASSERTION000

controllable_evc_code

Code for a controllable EVC framework for seen and unseen emotion generation.

Language:Python000

CZ-HP

NOASSERTION000

dataset_medical

An Index for Medical Imaging Datasets

000

DemoPage-C-CycleTransGAN-VoiceConversion

Language:HTML000

DemoPage-CycleTransGAN-EmotionalSpeechConversion

Language:HTML000

Depression_FAU-guided

Depression_FAU-guided

Language:Python000

dl-for-emo-tts

A summary of our attempts at using Deep Learning approaches for Emotional Text to Speech

MIT000

facial-landmark-frontalization

Function to frontalize non-frontal 2D facial landmarks generated from the DLIB library

MIT000

HierarchicalFusionMER

Language:Python000

icassp2021-emotion-tts

Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/

000

ICE-Talk

Interface for Controllable Expressive Talking Machine

Apache-2.0000

Learning-Graph-Representation-of-Person-specific-Cognitive-Processes-from-Audio-visual-Behaviours-fo

000

nonparaSeq2seqVC_code

Implementation code of non-parallel sequence-to-sequence VC

MIT000

phonemizer

Simple text to phones converter for multiple languages

GPL-3.0000

PythonPark

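Python open-source project "The Road to Self-Taught Programming", a step-by-step tutorial: AI lab, curated videos, data structures, study guides, machine learning in practice, deep learning in practice, web crawlers, big-tech interview experiences, programmer life, resource sharing.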

000

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

NOASSERTION000

remote-opencv-streaming-live-video

A remote live video streaming connection with Flask

MIT000

segmentation-kit

Speech Segmentation Toolkit using Julius

MIT000

seq2seq-EVC

000

SKAIG-ERC

The code for "Past, Present, and Future: Conversational Emotion Recognition through Structural Modeling of Psychological Commonsense Knowledge" plus the code of models in "A Hierarchical Transformer with Speaker Modeling for Emotion Recognition in Conversations"

000

statisticbooks

000

Transformer-TTS

A PyTorch implementation of "Neural Speech Synthesis with Transformer Network"

MIT000

video_features

Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.

GPL-3.0000

VisualGLM-6B

Chinese and English multimodal conversational language model

Apache-2.0000

XrayGLM

🩺 The first Chinese multimodal medical large model that can read and summarize chest radiographs.

NOASSERTION000