Beast code in Giters

yearnyeen ho's starred repositories

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonApache-2.0509500

airgen

Official source codes of airsep

Language:PythonMIT2500

LWM

Language:PythonApache-2.0696900

LGM

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Language:PythonMIT139100

papers.cool

Cool Papers - Immersive Paper Discovery

Language:HTML27400

hubert

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Language:PythonMIT30500

encodecmae

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

Language:Python6400

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonApache-2.0155100

stable-audio-tools

Generative models for conditional audio generation

Language:PythonMIT218500

Physics-Informed-Differentiable-Piano

Language:PythonLGPL-3.0300

wandb

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

Language:PythonMIT851400

hoac

Higher-Order Ambisonics Codec for Spatial Audio

Language:PythonMIT3200

metavoice-src

Foundational model for human-like, expressive TTS

Language:PythonApache-2.0340300

aria

Language:PythonApache-2.03700

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

1016700

chroma

the AI-native open-source embedding database

Language:RustApache-2.01322700

PAM

PAM is a no-reference audio quality metric for audio generation tasks

Language:PythonMIT3000

MARBLE-Benchmark

Music Audio Representation Benchmark for Universal Evaluation

Language:PythonMIT7100

nomic

Interact, analyze and structure massive text, image, embedding, audio and video datasets

Language:Python108500

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonApache-2.0552900

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonApache-2.0180500

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonApache-2.01033500

JEN-1-COMPOSER-pytorch

Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.19180)

Language:Python2500

JEN-1-pytorch

Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.04729)

Language:Python4500

OpenSTL

OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

Language:PythonApache-2.064100

makeMoE

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Language:Jupyter NotebookMIT54400

Smooth-Diffusion

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024

Language:PythonMIT27400

music-modeling-time-duration

Code of the paper "Impact of time and note duration tokenizations on deep learning symbolic music modeling" (ISMIR 2023)

Language:Python900

tril

Language:PythonMIT11600

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookApache-2.0196800