Beast code in Giters

Ethan's repositories

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language:Python000

actionformer_release

Code release for ActionFormer (ECCV 2022)

Language:PythonMIT010

AMR-Benchmark

A Unified Implementation of Several Baseline Deep Learning Models for Automatic Modulation Recognition

Language:Python010

asr

沪语（上海话）ASR（语音识别）模型

000

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

MIT000

AutoX

AutoX is an efficient automl tool, which is mainly aimed at data mining tasks with tabular data.

Apache-2.0000

bark

🔊 Text-prompted Generative Audio Model

NOASSERTION000

bisheng

Bisheng is an open LLM devops platform for next generation AI applications.

Apache-2.0000

ctc_decoder

A ctc decoder for both online and offline asr model

Language:C++010

DecryptPrompt

总结Prompt&LLM论文，开源数据&模型，AIGC应用

000

FastASR

基于PaddleSpeech所使用的conformer模型，使用C++的高效实现模型推理，在树莓派4B等ARM平台运行也可流畅运行。

Language:C++Apache-2.0010

FinGLM

000

HierSpeechpp

The official implementation of HierSpeech++

NOASSERTION000

HowToLiveLonger

程序员延寿指南 | A programmer's guide to live longer

Unlicense010

kws

An End-to-End Architecture for Keyword Spotting and Voice Activity Detection

MIT000

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

MIT000

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Model for All.

Language:PythonApache-2.0000

mmyolo_tensorrt

MIT000

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

MIT000

NaturalSpeech2

MIT000

phkit

phoneme toolkit. 好用的音素处理工具箱，包含中文音素、英文音素、文本转拼音、文本正则化等模块。

MIT000

Pix2Text

Pix In, Latex & Text Out. Recognize Chinese, English Texts, and Math Formulas from Images.

MIT000

Rerender_A_Video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

NOASSERTION000

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

Language:PythonMIT010

simple_ddp_test

toy code for ddp test

Language:Python020

SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Language:PythonApache-2.0010

StyleTTS

Official Implementation of StyleTTS

MIT000

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

MIT000

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language:PythonApache-2.0000

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

NOASSERTION000