Zhi-Qi Cheng (zhiqic)

Company: Carnegie Mellon University

Location: Pittsburgh

Home Page: https://zhiqic.github.io/homepage/index.html

Zhi-Qi Cheng's repositories

Rethinking-Counting

[CVPR 2022] Rethinking Spatial Invariance of Convolutional Networks for Object Counting

ChartReader

[ICCV 2023] ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules

Language: Jupyter Notebook | Stargazers: 18 | Issues: 3 | Issues: 9

GSRFormer

[ACM MM 2022] GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement

Language: Python | License: Apache-2.0 | Stargazers: 9 | Issues: 4 | Issues: 2

KeyPosS

[ACM MM 2023] KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration

Language: Python | Stargazers: 9 | Issues: 0 | Issues: 0

DAMO-StreamNet

[IJCAI 2023] DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving

Language: Python | Stargazers: 7 | Issues: 0 | Issues: 0

MotionEditor

MotionEditor is the first diffusion-based model capable of video motion editing.

Stargazers: 2 | Issues: 0 | Issues: 0

11775-HW1-2023-Fall

This is the code repo for HW1 of 11775, Fall 2023

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

AdvancedProfanityFilter

A browser extension to filter profanity from webpages

Language: TypeScript | License: GPL-3.0 | Stargazers: 1 | Issues: 0 | Issues: 0

BlockGCN

This is the official implementation of our paper "Overcoming Topology Agnosticism: Enhancing Skeleton-Based Action Recognition through Redefined Skeletal Topology Awareness"

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

gen-ai-tutorials

Tutorials for CMU's 2023 Generative AI Tutorial Series

Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0
Language: HTML | License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0

HQTrack

Tracking Anything in High Quality

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

Hyperformer

This is the official implementation of our paper "Hypergraph Transformer for Skeleton-based Action Recognition."

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

IVAC-P2L

IVAC-P^2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting

License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

LongShortNet

[ICASSP 2023] LongShortNet: Exploring Temporal and Semantic Features Fusion in Streaming Perception

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

MMTTS

MM-TTS: A Unified Framework of Multi-modal Prompt-Induced Emotional Text-to-Speech Synthesis

Language: Python | License: GPL-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

Music2P

Multi-modal music ML service that will fulfil the promotion needs of musicians and companies

Stargazers: 1 | Issues: 0 | Issues: 0

PoSynDA

[ACM MM 2023] PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

ProContEXT

[ICASSP 2023] ProContEXT: Exploring Progressive Context Transformer for Tracking

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

TrackGPT

Tracking with Human-Intent Reasoning

License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

Video2ShopExactMatching

[CVPR 2017] Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

WordArt

This work introduces WordArt Designer, a user-driven framework for artistic typography synthesis, relying on Large Language Models (LLM).

License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0