zhiqic

Zhi-Qi Cheng's repositories

Rethinking-Counting

[CVPR 2022] Rethinking Spatial Invariance of Convolutional Networks for Object Counting

Language:Python59 2 9

Awesome-Video-Generation

Language:Python21 10

ChartReader

[ICCV 2023] ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules

Language:Jupyter Notebook18 3 9

GSRFormer

[ACM-MM 2022] GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement

Language:PythonApache-2.09 4 2

KeyPosS

[ACM MM 2023] KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration

Language:Python900

DAMO-StreamNet

[IJCAI 2023] DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving

Language:Python700

MotionEditor

MotionEditor is the first diffusion-based model capable of video motion editing.

200

11775-HW1-2023-Fall

This is the code repo of HW1 for 11775 Fall 2023

Language:PythonApache-2.0100

11775-HW1-2024-Spring

Language:PythonApache-2.0100

AdvancedProfanityFilter

A browser extension to filter profanity from webpages

Language:TypeScriptGPL-3.0100

BlockGCN

This is the official implementation of our paper "Overcoming Topology Agnosticism: Enhancing Skeleton-Based Action Recognition through Redefined Skeletal Topology Awareness"

Language:PythonApache-2.0100

CMU-2023-Fall-11-775-MultimediaAnalysis

1 10

CMU-2024-Spring-11-775-MultimediaAnalysis

1 10

DCPT

Language:PythonMIT100

Emotion-LLaMA

100

gen-ai-tutorials

Tutorials for CMU's 2023 Generative AI Tutorial Series

Language:Jupyter NotebookMIT100

HA3D_simulator

100

homepage

Language:HTMLNOASSERTION100

HQTrack

Tracking Anything in High Quality

Language:PythonMIT100

Hyperformer

This is the official implementation of our paper "Hypergraph Transformer for Skeleton-based Action Recognition."

Language:PythonApache-2.0100

IVAC-P2L

IVAC-P^2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting

MIT100

LongShortNet

[ICASSP 2023] LongShortNet: Exploring Temporal and Semantic Features Fusion in Streaming Perception

Language:PythonApache-2.0100

MMTTS

MM-TTS: A Unified Framework of Multi-modal Prompt-Induced Emotional Text-to-Speech Synthesis

Language:PythonGPL-2.0100

Music2P

Multi-modal Music ML service that will fulfil promotion needs of musicians and companies

100

PoSynDA

[ACM MM 2023] PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation

Language:PythonMIT100

ProContEXT

[ICASSP 2023] ProContEXT: Exploring Progressive Context Transformer for Tracking

Language:PythonMIT100

RTPCA

Language:Python100

TrackGPT

Tracking with Human-Intent Reasoning

Apache-2.0100

Video2ShopExactMatching

[CVPR 2017] Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images

Language:Python100

WordArt

This work introduces WordArt Designer, a user-driven framework for artistic typography synthesis, relying on Large Language Models (LLM).

Apache-2.0100