xiaohongzhong's starred repositories
stable-diffusion-webui
Stable Diffusion web UI
annotated_deep_learning_paper_implementations
🧑‍🏫 60 implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), GANs (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
styleguide
Style guides for Google-originated open-source projects
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
TurboTransformers
A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU.
mlp-mixer-pytorch
An All-MLP solution for Vision, from Google AI
Video-ChatGPT
"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
RealBasicVSR
Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"
Image-processing-algorithm
Paper implementations
All-In-One-Deflicker
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas
MIVisionX
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
VR-Baseline
Video Restoration Toolbox including FGST (ICML 2022), S2SVR (ICML 2022), etc.
acuity-models
Acuity Model Zoo
WACV2024-SAFA
WACV2024 - Scale-Adaptive Feature Aggregation for Efficient Space-Time Video Super-Resolution
winner-ntire22-vqe
Method and experience of winning the NTIRE'22 VQE challenge.
Real-Time-Multiple-Person-Recognition-and-Tracking-for-CCTV-Camera
A surveillance system for CCTV cameras that recognizes multiple selected target individuals and tracks them in real time across multiple cameras, with detection, recognition, and kernel-based tracking modules. Facial recognition uses HOG features and image embeddings from OpenFace. We were able to track and recognize multiple individuals simultaneously across multiple cameras in real time. Winning project, Smart India Hackathon 2019.