Xing Liu (rosebbb)

rosebbb

Geek Repo

Github PK Tool:Github PK Tool

Xing Liu's repositories

speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

accessmath-icfhr2018

Lecture Video Summarization by Extracting Handwritten Content from Whiteboards

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

Light-ASD

The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)

Language:PythonStargazers:0Issues:0Issues:0

yolov7-face

yolov7 face detection with landmark

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

License:Apache-2.0Stargazers:0Issues:0Issues:0

DocEnTR

DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Pytorch-UNet

PyTorch implementation of the U-Net for image semantic segmentation with high quality images

License:GPL-3.0Stargazers:0Issues:0Issues:0

pan_pp.pytorch

Official implementations of PSENet, PAN and PAN++.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

HAT

Arxiv2022 - Activating More Pixels in Image Super-Resolution Transformer

License:MITStargazers:0Issues:0Issues:0

voxceleb_trainer

In defence of metric learning for speaker recognition

License:MITStargazers:0Issues:0Issues:0

SPELL

Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)

License:MITStargazers:0Issues:0Issues:0

mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

License:Apache-2.0Stargazers:0Issues:0Issues:0

FOTS.PyTorch

FOTS Pytorch Implementation

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

robin

RObust document image BINarization

License:MITStargazers:0Issues:0Issues:0

ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

License:MITStargazers:0Issues:0Issues:0

DB

A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

ssd.pytorch

A PyTorch Implementation of Single Shot MultiBox Detector

License:MITStargazers:0Issues:0Issues:0

TextFuseNet

A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".

License:MITStargazers:0Issues:0Issues:0

PAN.pytorch

A unofficial pytorch implementation of PAN(PSENet2): Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network

License:Apache-2.0Stargazers:0Issues:0Issues:0

active-speakers-context

Code for the Active Speakers in Context Paper (CVPR2020)

Stargazers:0Issues:0Issues:0

pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Transfer-Learning-in-keras---custom-data

Implementing Transfer Learning for custom data using VGG-16 and Resnet-50

Language:PythonStargazers:0Issues:0Issues:0

VGG16_feature_computation

c++ class to get the output of a pre-trained VGG16 network

Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0