xk-huang

Xiaoke Huang's repositories

segment-caption-anything

[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloading the trained model checkpoints, and example notebooks / gradio demo that show how to use the model.

Language:PythonApache-2.0174 7 10

OrdinalCLIP

[NeurIPS 2022] OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal Regression

Language:PythonMIT38 5 7

ema

[Preprint 23] "Efficient Meshy Neural Fields for Animatable Human Avatars" https://arxiv.org/abs/2303.12965

Language:Python23 7 1

benchmark-referring-vllm

We benchmark VLLM for referring image captioning. From paper "Segment and Caption Anything"

Language:Python5 20

Promptable-GRiT

Promptable GRiT: support inference with both automatic proposal generation and custom point/box prompts.

Language:PythonMIT400

ATVGnet

(add docker, fix code) CVPR 2019

Language:Python200

dotfiles

A collection of my personal dotfiles

Language:Shell1 10

MakeItTalk

add docker

Language:Jupyter NotebookNOASSERTION100

phdrule

The Chinese version of the book: The Unwritten Rules of Ph.D. Research.

GPL-3.0100

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Apache-2.0100

Wav2Lip

(add docker) This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

Language:Python100

240416-research-contributions

(Custom) Try UNETR, MONAI research-contributions

Language:PythonApache-2.0000

All-Seeing-Dataset-Browser

000

All-Seeing-Model-Demo

Language:Python000

azfuse

(comp. w/ oldest azure-storage-blob) A lightweight blobfuse-like python tool with the data transfer through azcopy

Language:PythonMIT000

azure-storage-python

(change to old version) Microsoft Azure Storage Library for Python

Language:PythonMIT000

blog

Language:HTML010

cli-dictionary

(add phonetic and syn ant) Dictionary for command line.

Language:PythonMIT000

EasyMocap

(fix path, use my SCHP) Make human motion capture easier.

Language:PythonNOASSERTION000

embeddings

Fast, DB Backed pretrained word embeddings for natural language processing.

Language:PythonMIT000

nerf

(update llff) My re-implementation of NeRF.

Language:Python000

neuralbody

(add visualization, for ema video) Code for "Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans" CVPR 2021 best paper candidate

Language:PythonNOASSERTION000

segment-anything

(visualization mode in amg) The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.0000