yearnyeen ho's starred repositories

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5095Issues:0Issues:0

airgen

Official source codes of airsep

Language:PythonLicense:MITStargazers:25Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:6969Issues:0Issues:0

LGM

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Language:PythonLicense:MITStargazers:1391Issues:0Issues:0

papers.cool

Cool Papers - Immersive Paper Discovery

Language:HTMLStargazers:274Issues:0Issues:0

hubert

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Language:PythonLicense:MITStargazers:305Issues:0Issues:0

encodecmae

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

Language:PythonStargazers:64Issues:0Issues:0

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1551Issues:0Issues:0

stable-audio-tools

Generative models for conditional audio generation

Language:PythonLicense:MITStargazers:2185Issues:0Issues:0
Language:PythonLicense:LGPL-3.0Stargazers:3Issues:0Issues:0

wandb

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

Language:PythonLicense:MITStargazers:8514Issues:0Issues:0

hoac

Higher-Order Ambisonics Codec for Spatial Audio

Language:PythonLicense:MITStargazers:32Issues:0Issues:0

metavoice-src

Foundational model for human-like, expressive TTS

Language:PythonLicense:Apache-2.0Stargazers:3403Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:37Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:10167Issues:0Issues:0

chroma

the AI-native open-source embedding database

Language:RustLicense:Apache-2.0Stargazers:13227Issues:0Issues:0

PAM

PAM is a no-reference audio quality metric for audio generation tasks

Language:PythonLicense:MITStargazers:30Issues:0Issues:0

MARBLE-Benchmark

Music Audio Representation Benchmark for Universal Evaluation

Language:PythonLicense:MITStargazers:71Issues:0Issues:0

nomic

Interact, analyze and structure massive text, image, embedding, audio and video datasets

Language:PythonStargazers:1085Issues:0Issues:0

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5529Issues:0Issues:0

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:1805Issues:0Issues:0

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10335Issues:0Issues:0

JEN-1-COMPOSER-pytorch

Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.19180)

Language:PythonStargazers:25Issues:0Issues:0

JEN-1-pytorch

Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.04729)

Language:PythonStargazers:45Issues:0Issues:0

OpenSTL

OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

Language:PythonLicense:Apache-2.0Stargazers:641Issues:0Issues:0

makeMoE

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Language:Jupyter NotebookLicense:MITStargazers:544Issues:0Issues:0

Smooth-Diffusion

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024

Language:PythonLicense:MITStargazers:274Issues:0Issues:0

music-modeling-time-duration

Code of the paper "Impact of time and note duration tokenizations on deep learning symbolic music modeling" (ISMIR 2023)

Language:PythonStargazers:9Issues:0Issues:0
Language:PythonLicense:MITStargazers:116Issues:0Issues:0

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1968Issues:0Issues:0