CaptainPrice2023

CaptainPrice2023's starred repositories

ConsistencyVC-voive-conversion

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Language: Python | License: MIT | Stargazers: 122 | Issues: 0
Language: Python | License: MIT | Stargazers: 32 | Issues: 0

3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language: Python | License: Apache-2.0 | Stargazers: 865 | Issues: 0

Speech-Resources

Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome

Stargazers: 441 | Issues: 0

PromptKD

[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 137 | Issues: 0

VMamba

VMamba: Visual State Space Models; the code is based on Mamba

Language: Python | License: MIT | Stargazers: 1780 | Issues: 0

speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Language: Python | License: Apache-2.0 | Stargazers: 328 | Issues: 0

PETL_AST

This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture of Adapters".

Language: Python | Stargazers: 30 | Issues: 0

DePT

[ICLR 2024] This is the repository for the paper titled "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning"

Language: Python | License: MIT | Stargazers: 85 | Issues: 0

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

Stargazers: 792 | Issues: 0

Meta-voicebox

Implementation of Meta-Voicebox: the first generative AI model for speech to generalize across tasks with state-of-the-art performance.

License: MIT | Stargazers: 542 | Issues: 0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python | License: Apache-2.0 | Stargazers: 14701 | Issues: 0

SLT22_MultiHead-Factorized-Attentive-Pooling

An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification

Language: Python | Stargazers: 9 | Issues: 0

Voice-Privacy-Challenge-2022

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

Language: Python | Stargazers: 60 | Issues: 0

PyTSMod

An open-source Python library for audio time-scale modification.

Language: Python | License: GPL-3.0 | Stargazers: 185 | Issues: 0

sslsv

Framework for training and evaluating self-supervised learning methods for speaker verification.

Language: Jupyter Notebook | License: MIT | Stargazers: 18 | Issues: 0

ATTEMPT

This is the official repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)

Language: Python | License: MIT | Stargazers: 97 | Issues: 0

UniPELT

Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022

Language: Python | Stargazers: 58 | Issues: 0

PLOT

[ICLR 2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models

Language: Python | License: MIT | Stargazers: 122 | Issues: 0
Language: Python | License: MIT | Stargazers: 798 | Issues: 0

PromptingWhisper

Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation

Language: Python | Stargazers: 128 | Issues: 0

SAN

Open-vocabulary Semantic Segmentation

Language: Python | License: MIT | Stargazers: 279 | Issues: 0

awesome-neural-reprogramming-prompting

A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022

Language: Python | License: Apache-2.0 | Stargazers: 34 | Issues: 0

Great-Deep-Learning-Tutorials

A Great Collection of Deep Learning Tutorials and Repositories

License: MIT | Stargazers: 175 | Issues: 0

speech-adapters

Code and datasets for our ICASSP 2023 paper, "Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding"

Language: Python | License: MIT | Stargazers: 39 | Issues: 0

Speech-Prompts-Adapters

This repository surveys papers on prompting and adapters for speech processing.

Stargazers: 97 | Issues: 0

HorNet

[NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions

Language: Python | License: MIT | Stargazers: 309 | Issues: 0

ChatReviewer

ChatReviewer: uses ChatGPT to analyze the strengths and weaknesses of a paper and suggest improvements

Language: Python | License: NOASSERTION | Stargazers: 1232 | Issues: 0

AVCleanse

ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'

Language: Python | Stargazers: 26 | Issues: 0

Loss-Gated-Learning

ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'

Language: Python | License: MIT | Stargazers: 83 | Issues: 0