Xizi Wang's repositories
LoCoNet_ASD
code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection
ctr_prediction
conversion rate prediction of an online article
FPGA-verilog-AES
a group programme
active-speakers-context
Code for the Active Speakers in Context Paper (CVPR2020)
DDM
[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
epic-kitchens-100-annotations
:plate_with_cutlery: Annotations for the public release of the EPIC-KITCHENS-100 dataset
LaViLa
Code release for "Learning Video Representations from Large Language Models"
pic_classify2
picture classification
pytorch_face_landmark
Fast and accurate face landmark detection library using PyTorch; Support 68-point semi-frontal and 39-point profile landmark detection; Support both coordinate-based and heatmap-based inference; Up to 100 FPS landmark inference speed with SOTA face detector on CPU.
senet.pytorch
PyTorch implementation of SENet
splatter-image
Official implementation of `Splatter Image: Ultra-Fast Single-View 3D Reconstruction'
TalkNet_ASD
TalkNet: Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection
vedatad
A single stage temporal action detection toolbox based on PyTorch