rookiexiao123

xiaohongzhong's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonAGPL-3.0129439 1026 7312

llama

Inference code for LLaMA models

Language:PythonNOASSERTION50895 499 872

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:Jupyter NotebookMIT47989 431 119

styleguide

Style guides for Google-originated open-source projects

Language:HTMLApache-2.036548 1298 324

leveldb

LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

Language:C++BSD-3-Clause35044 1314 743

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.032636 328 2504

serenity

The Serenity Operating System 🐞

Language:C++BSD-2-Clause28547 349 4100

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

Language:C++Apache-2.025455 493 4856

video2x

A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR. Started in Hack the Valley II, 2018.

Language:PythonAGPL-3.08617 121 951

sd-scripts

Language:PythonApache-2.04162 42 752

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonBSD-3-Clause2412 30 135

CV-CUDA

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

Language:C++NOASSERTION2191 47 133

TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

Language:C++NOASSERTION1442 41 118

mlp-mixer-pytorch

An All-MLP solution for Vision, from Google AI

Language:PythonMIT966 11 11

Video-ChatGPT

"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonCC-BY-4.0896 12 96

EGVSR

Efficient & Generic Video Super-Resolution

Language:PythonMIT883 19 24

RealBasicVSR

Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"

Language:PythonApache-2.0836 15 84

Image-processing-algorithm

paper implement

Language:C++830 33 3

hdrnet

An implementation of 'Deep Bilateral Learning for Real-Time Image Enhancement', SIGGRAPH 2017

Language:PythonApache-2.0791 34 18

All-In-One-Deflicker

[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas

Language:Python647 24 32

CUDA-code

Language:Cuda382 6 6

RVRT

Recurrent Video Restoration Transformer with Guided Deformable Attention (NeurlPS2022, official repository)

Language:PythonNOASSERTION326 23 28

MIVisionX

MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.

Language:C++MIT179 24 224

VR-Baseline

Video Restoration Toolbox including FGST (ICML 2022), S2SVR (ICML 2022), etc.

Language:PythonApache-2.0145 12 24

acuity-models

Acuity Model Zoo

Language:JavaScript129 16 10

WACV2024-SAFA

WACV2024 - Scale-Adaptive Feature Aggregation for Efficient Space-Time Video Super-Resolution

Language:PythonMIT88 8 3

Shift-Net

A Simple Baseline for Video Restoration with Grouped Spatial-temporal Shift

Language:Python81 6 14

winner-ntire22-vqe

Method and experience of winning the NTIRE'22 VQE challenge.

Apache-2.070 6 5

Real-Time-Multiple-Person-Recognition-and-Tracking-for-CCTV-Camera

a surveillance system for CCTV cameras which recognizes selected multiple target individuals and tracks in real time across multiple cameras, with detection, recognition, and kernel-based tracking modules. Facial recognition is done using HOG features and image embedding using OpenFace. We were able to perform simultaneous tracking and recognition of multiple individuals across multiple cameras in real time. Winning project, Smart India Hackathon 2019.

Language:Python5100

aml_npu_sdk

Language:CGPL-2.025 5 8