Slice (xuanhan863)

Location: Los Angeles, USA

Slice's starred repositories

unsloth

Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 10332 | Issues: 74 | Issues: 399

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language: Python | License: BSD-3-Clause | Stargazers: 5220 | Issues: 59 | Issues: 86

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language: Python | License: MIT | Stargazers: 4140 | Issues: 62 | Issues: 89

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT | Stargazers: 4014 | Issues: 52 | Issues: 110

segment-anything-fast

A batched offline inference oriented version of segment-anything

Language: Python | License: Apache-2.0 | Stargazers: 1125 | Issues: 11 | Issues: 38

tarsier

Vision utilities for web interaction agents 👀

Language: Jupyter Notebook | License: MIT | Stargazers: 1079 | Issues: 5 | Issues: 10

stable-fast

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Language: Python | License: MIT | Stargazers: 984 | Issues: 14 | Issues: 108

PatchFusion

[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation

Language: Python | License: MIT | Stargazers: 905 | Issues: 20 | Issues: 34

punica

Serving multiple LoRA finetuned LLM as one

Language: Python | License: Apache-2.0 | Stargazers: 849 | Issues: 14 | Issues: 36

genmusic_demo_list

a list of demo websites for automatic music generation research

ziplora-pytorch

Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"

Language: Python | License: MIT | Stargazers: 462 | Issues: 11 | Issues: 19

glake

GLake: optimizing GPU memory management and IO transmission.

Language: C++ | License: Apache-2.0 | Stargazers: 288 | Issues: 5 | Issues: 17

SegDrawer

Simple static web-based mask drawer, supporting semantic segmentation with interactive Segment Anything Model (SAM) and video segmentation with XMem.

Language: Python | License: Apache-2.0 | Stargazers: 272 | Issues: 4 | Issues: 10

Pengi

An Audio Language model for Audio Tasks

Language: Python | License: MIT | Stargazers: 253 | Issues: 14 | Issues: 11

flash-fft-conv

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

Language: C++ | License: Apache-2.0 | Stargazers: 228 | Issues: 16 | Issues: 19

Consistent4D

[ICLR 2024] Official Implementation of Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video

Language: Python | License: Apache-2.0 | Stargazers: 212 | Issues: 9 | Issues: 9

Diff-HierVC

Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"

flare

[SIGGRAPH Asia '23] FLARE: Fast Learning of Animatable and Relightable Mesh Avatars

Language: Python | License: NOASSERTION | Stargazers: 128 | Issues: 9 | Issues: 5

d3po

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Language: Python | License: MIT | Stargazers: 121 | Issues: 7 | Issues: 8

DepthFlow

🌊 Image to → 2.5D Parallax Effect Video. High quality, user first.

Language: Python | License: AGPL-3.0 | Stargazers: 114 | Issues: 5 | Issues: 17

nxtp

Object Recognition as Next Token Prediction (CVPR 2024)

Language: Python | License: NOASSERTION | Stargazers: 109 | Issues: 2 | Issues: 2

AQUA-Tk

AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)

Language: Python | License: GPL-3.0 | Stargazers: 88 | Issues: 3 | Issues: 3

bandit

BandIt: Cinematic Audio Source Separation

Language: Python | License: Apache-2.0 | Stargazers: 47 | Issues: 3 | Issues: 3

MULTI-AUDIODEC

This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.

Language: Python | Stargazers: 36 | Issues: 2 | Issues: 0

ScorePerformer

ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)

Language: Python | License: NOASSERTION | Stargazers: 30 | Issues: 1 | Issues: 1

Language: Python | Stargazers: 27 | Issues: 0 | Issues: 0

cordvox

experiments of fast, lightweight, high-quality neural vocoder for single speaker's speech synthesization

Language: Python | Stargazers: 8 | Issues: 0 | Issues: 0