Alexanda's repositories

Voice-Recognition-to-Text-Tool-

Voice Recognition to Text Tool / An offline, locally run speech-to-text service that outputs JSON, timestamped SRT subtitles, and plain text

Language: Python | License: GPL-3.0 | Stargazers: 1 | Issues: 0

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

License: Apache-2.0 | Stargazers: 0 | Issues: 0

auto_labeling_for_BERT_VITS2

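This project handles data preprocessing. The first step processes the collected audio, using FunASR timestamps to remove empty background audio. It also covers the labels prepared before feeding the data to BERT.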

Stargazers: 0 | Issues: 0

Automatic_Speech_Annotator

Automatic speech annotator that processes speech with voice activity detection, overlapping speech detection, speaker diarization, and automatic speech recognition

Stargazers: 0 | Issues: 0

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

bulk_transcribe_youtube_videos_from_playlist

Easily take an entire YouTube playlist and turn it into high-quality transcripts using Whisper.

License: MIT | Stargazers: 0 | Issues: 0

CapsWriter-Offline

The offline version of CapsWriter, a handy voice input tool for PC

Stargazers: 0 | Issues: 0

Chenyme-AAVT-

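A fully automatic (audio) video translation project. It uses Whisper to recognize speech, translates the subtitles with a large AI model, then merges the subtitles into the video to produce the translated video.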

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

Dataset_Generator_For_VITS

A VITS dataset generation tool that converts video to short audio, based on Damo Academy video cutting technology

Language: Shell | License: MIT | Stargazers: 0 | Issues: 0

emotion2vec

Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Stargazers: 0 | Issues: 0

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

License: Apache-2.0 | Stargazers: 0 | Issues: 0

faster-whisper-GUI

faster_whisper GUI with PySide6

Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 0

GPT-SoVITS

Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

License: Apache-2.0 | Stargazers: 0 | Issues: 0

leedl-tutorial

Hung-yi Lee Deep Learning Tutorial (《李宏毅深度学习教程》), recommended by Prof. Hung-yi Lee 👍. PDF download: https://github.com/datawhalechina/leedl-tutorial/releases

License: NOASSERTION | Stargazers: 0 | Issues: 0

MakeDiffSinger

Pipelines and tools to build your own DiffSinger dataset.

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

MediaCrawler

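Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, and Weibo posts and comments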

License: Apache-2.0 | Stargazers: 0 | Issues: 0

ParaClipper

An automated video clipping tool based on FunASR, a high-accuracy open source speech recognition model.

License: MIT | Stargazers: 0 | Issues: 0

Pink-Trombone

A programmable version of Neil Thapen's Pink Trombone

License: GPL-3.0 | Stargazers: 0 | Issues: 0

PyQt-Fluent-Widgets

A fluent design widgets library based on C++ Qt/PyQt/PySide. Make Qt Great Again.

License: GPL-3.0 | Stargazers: 0 | Issues: 0

pyvideotrans

Translate a video from one language to another and add dubbing.

License: GPL-3.0 | Stargazers: 0 | Issues: 0

SpeechTasks

A list of speech tasks and datasets that can provide training data for generative AI (AIGC), AI model training, intelligent speech tool development, and speech applications.

Stargazers: 0 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

TTS-for-GPT-soVITS

This is a simple TTS backend project based on https://github.com/RVC-Boss/GPT-SoVITS that provides some inference optimization features.

Language: Python | Stargazers: 0 | Issues: 0

whisper-web

ML-powered speech recognition directly in your browser

Stargazers: 0 | Issues: 0

Whisper-WebUI

A Web UI for easy subtitle generation using the Whisper model.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

License: GPL-3.0 | Stargazers: 0 | Issues: 0