Zhikang Niu (ZhikangNiu)

ZhikangNiu

Geek Repo

Company:Xidian Univeristy

Location:Beijing

Home Page:https://zhikangniu.github.io/

Github PK Tool:Github PK Tool

Zhikang Niu's repositories

encodec-pytorch

unofficial implementation of the High Fidelity Neural Audio Compression

Language:PythonLicense:MITStargazers:116Issues:4Issues:19

AI-research-tools

:hammer:AI 方向好用的科研工具

License:MITStargazers:3Issues:0Issues:0

pre-train-dockerfile

An Intro to set up your Speech Docker environment and debug using VSCode

Language:DockerfileStargazers:2Issues:0Issues:0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

License:MITStargazers:0Issues:0Issues:0

Awesome-VQVAE

📚 A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application

License:MITStargazers:0Issues:0Issues:0

CMG

The official implementation of Achieving Cross Modal Generalization with Multimodal Unified Representation (NeurIPS '23)

Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

descript-audio-vae

VAE GAN modified from Descript Audio Codec, which replaces the RVQ with VAE

License:MITStargazers:0Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

llama

Inference code for LLaMA models

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

llama-recipes

Examples and recipes for Llama 2 model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

M2UGen

This is the official repository for M2UGen

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

Language:GoLicense:MITStargazers:0Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Language:PythonStargazers:0Issues:0Issues:0

snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tango

Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0