Slice (xuanhan863)

Location: Los Angeles, USA

Slice's starred repositories

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 15902 | Issues: 105 | Issues: 820

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | Stargazers: 5536 | Issues: 63 | Issues: 98

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT | Stargazers: 4473 | Issues: 58 | Issues: 152

SuGaR

[CVPR 2024] Official PyTorch implementation of SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering

Language: C++ | License: NOASSERTION | Stargazers: 2133 | Issues: 64 | Issues: 212

segment-anything-fast

A batched, offline-inference-oriented version of segment-anything

Language: Python | License: Apache-2.0 | Stargazers: 1186 | Issues: 10 | Issues: 44

stable-fast

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Language: Python | License: MIT | Stargazers: 1152 | Issues: 18 | Issues: 122

PatchFusion

[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation

Language: Python | License: MIT | Stargazers: 953 | Issues: 23 | Issues: 40

3D-LLM

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Language: Python | License: MIT | Stargazers: 907 | Issues: 16 | Issues: 62

genmusic_demo_list

A list of demo websites for automatic music generation research

ziplora-pytorch

Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"

Language: Python | License: MIT | Stargazers: 504 | Issues: 11 | Issues: 19

tokenize-anything

[ECCV 2024] Tokenize Anything via Prompting

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 502 | Issues: 6 | Issues: 16

Language: Python | License: GPL-3.0 | Stargazers: 482 | Issues: 11 | Issues: 61

glake

GLake: optimizing GPU memory management and IO transmission.

Language: Python | License: Apache-2.0 | Stargazers: 352 | Issues: 7 | Issues: 22

PoseAnything

A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024]

Language: Python | License: Apache-2.0 | Stargazers: 303 | Issues: 4 | Issues: 10

Language: Python | License: NOASSERTION | Stargazers: 288 | Issues: 13 | Issues: 0

Pengi

An Audio Language Model for Audio Tasks

Language: Python | License: MIT | Stargazers: 281 | Issues: 14 | Issues: 13

DepthFlow

🌊 Image to → 2.5D Parallax Effect Video. A Free and Open Source ImmersityAI alternative

Language: Python | License: AGPL-3.0 | Stargazers: 210 | Issues: 7 | Issues: 29

Diff-HierVC

Official PyTorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"

d3po

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Language: Python | License: MIT | Stargazers: 161 | Issues: 7 | Issues: 14

DCI

Densely Captioned Images (DCI) dataset repository.

Language: Python | License: NOASSERTION | Stargazers: 155 | Issues: 4 | Issues: 14

nxtp

Object Recognition as Next Token Prediction (CVPR 2024)

Language: Python | License: NOASSERTION | Stargazers: 153 | Issues: 2 | Issues: 5

inferno

🔥🔥🔥 Set the world of 3D faces on fire with INFERNO 🔥🔥🔥

Language: Python | License: NOASSERTION | Stargazers: 153 | Issues: 8 | Issues: 25

AQUA-Tk

AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)

Language: Python | License: GPL-3.0 | Stargazers: 93 | Issues: 3 | Issues: 3

bandit

BandIt: Cinematic Audio Source Separation

Language: Python | License: Apache-2.0 | Stargazers: 47 | Issues: 3 | Issues: 3

Gigabind

Fine-tuning "ImageBind: One Embedding Space to Bind Them All" with LoRA

Language: Python | Stargazers: 8 | Issues: 1 | Issues: 0

cordvox

Experiments with neural vocoders

Language: Python | License: MIT | Stargazers: 4 | Issues: 0 | Issues: 0