CZ26

CZ26

Geek Repo

Github PK Tool:Github PK Tool

CZ26's repositories

License:NOASSERTIONStargazers:0Issues:0Issues:0

himallgg

himallgg

Stargazers:0Issues:0Issues:0

EmoLLM

心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

VisualGLM-6B

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

License:Apache-2.0Stargazers:0Issues:0Issues:0

XrayGLM

🩺 首个会看胸部X光片的中文多模态医学大模型 | The first Chinese Medical Multimodal Model that Chest Radiographs Summarization.

License:NOASSERTIONStargazers:0Issues:0Issues:0

Depression_FAU-guided

Depression_FAU-guided

Language:PythonStargazers:0Issues:0Issues:0

w2v2-vad

A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

facial-expression-analysis

Dimensional estimation of emotions (Arousal, Valence, Intensity) from facial landmarks extracted by DLIB.

License:MITStargazers:0Issues:0Issues:0

character-mining

Mining individual characters in multiparty dialogue

License:NOASSERTIONStargazers:0Issues:0Issues:0

FaceSwapping

Face swapping function with Paper: Motion Representations for Articulated Animation

Language:PythonStargazers:1Issues:0Issues:0

video_features

Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.

License:GPL-3.0Stargazers:0Issues:0Issues:0

dataset_medical

医学影像数据集列表 『An Index for Medical Imaging Datasets』

Stargazers:0Issues:0Issues:0

PythonPark

Python 开源项目之「自学编程之路」,保姆级教程:AI实验室、宝藏视频、数据结构、学习指南、机器学习实战、深度学习实战、网络爬虫、大厂面经、程序人生、资源分享。

Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

remote-opencv-streaming-live-video

A remote live video streaming connection with Flask

License:MITStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

SKAIG-ERC

The code for "Past, Present, and Future: Conversational Emotion Recognition through Structural Modeling of Psychological Commonsense Knowledge" plus the code of models in "A Hierarchical Transformer with Speaker Modeling for Emotion Recognition in Conversations"

Stargazers:0Issues:0Issues:0

CycleTransGAN-EVC

CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer

Language:PythonStargazers:31Issues:0Issues:0

AudioCLIP

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

License:MITStargazers:0Issues:0Issues:0

phonemizer

Simple text to phones converter for multiple languages

License:GPL-3.0Stargazers:0Issues:0Issues:0

icassp2021-emotion-tts

Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/

Stargazers:0Issues:0Issues:0

nonparaSeq2seqVC_code

Implementation code of non-parallel sequence-to-sequence VC

License:MITStargazers:0Issues:0Issues:0

facial-landmark-frontalization

Function to frontalize non-frontal 2D facial landmarks generated from the DLIB library

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

ICE-Talk

Interface for Controllable Expressive Talking Machine

License:Apache-2.0Stargazers:0Issues:0Issues:0

Transformer-TTS

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

License:MITStargazers:0Issues:0Issues:0