Yueh-Po Peng's repositories
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
CLAP
Contrastive Language-Audio Pretraining
CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
compound-word-transformer
Official implementation of compound word transformer (AAAI'21)
DATA5009_2023fall
Computational Methods for Data Science
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
ggml
Tensor library for machine learning
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Homework2
FinTech Homework 2
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
llama-cpp-python
Python bindings for llama.cpp
llama.cpp
Port of Facebook's LLaMA model in C/C++
midi-model
Midi event transformer for music generation
mind-vis
Code base for MinD-Vis
MU-LLaMA
MU-LLaMA: Music Understanding Large Language Model
pytorch-lightning-template
An easy/swift-to-adapt PyTorch-Lighting template. 套壳模板,简单易用,稍改原来Pytorch代码,即可适配Lightning。You can translate your previous Pytorch code much easier using this template, and keep your freedom to edit all the functions as well. Big-project-friendly as well. No need to rewrite your config in hydra.
pytorch-metric-learning
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
RelTR
RelTR: Relation Transformer for Scene Graph Generation: https://arxiv.org/abs/2201.11460v2
Sklearn-genetic-opt
ML hyperparameters tuning and features selection, using evolutionary algorithms.
StableDiffusionReconstruction
Takagi and Nishimoto, CVPR 2023
whisper.cpp
Port of OpenAI's Whisper model in C/C++