Gary Feng's repositories
Global-Flow-Local-Attention
Source code for the paper "Deep Image Spatial Transformation for Person Image Generation"
google-images-download
Python script to download hundreds of images from Google Images. It is ready to run!
ActGPT
A chatbot that does what you ask, such as opening a Google search or posting a Tweet.
BackgroundMattingV2
Real-Time High-Resolution Background Matting
coco-annotator
✏️ Web-based image segmentation tool for object detection, localization, and keypoints
dev-gpt
Your Virtual Development Team
docker-hadoop-hive-parquet
Hadoop, Hive, Parquet and Hue in docker-compose v3
docker-pytorch-cpu
Docker image for PyTorch (CPU) + OpenCV 4.5.x + FFmpeg
embedding_vector_search
prototyping global search for embedding vectors
External-Attention-pytorch
🍀 PyTorch implementation of various attention mechanisms, MLP, re-parameterization, and convolution modules, helpful for understanding the papers further. ⭐⭐⭐
gpt-aria
gpt + aria = ability to read browser contents
knowledge-repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
mailarchiva-dockerized
Dockerized version of the professional enterprise grade email archiving solution MailArchiva from Stimulus Software
natbot
Drive a browser with GPT-3
OpenLineage
An Open Standard for lineage metadata collection
openlineage_decorator
Python decorator class for Open Lineage client
PaddleSpeech
Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
simple-chat-app
Vercel chat UI with AWS Bedrock Claude 3 Sonnet
UniPose
We propose UniPose, a unified framework for human pose estimation, based on our "Waterfall" Atrous Spatial Pooling architecture, that achieves state-of-the-art results on several pose estimation metrics. Current pose estimation methods utilizing standard CNN architectures heavily rely on statistical postprocessing or predefined anchor poses for joint localization. UniPose incorporates contextual segmentation and joint localization to estimate the human pose in a single stage, with high accuracy, without relying on statistical postprocessing methods. The Waterfall module in UniPose leverages the efficiency of progressive filtering in the cascade architecture, while maintaining multi-scale fields-of-view comparable to spatial pyramid configurations. Additionally, our method is extended to UniPose-LSTM for multi-frame processing and achieves state-of-the-art results for temporal pose estimation in video. Our results on multiple datasets demonstrate that UniPose, with a ResNet backbone and Waterfall module, is a robust and efficient architecture for pose estimation, obtaining state-of-the-art results in single-person pose detection for both single images and videos.