wentaozhu

Wentao Zhu's repositories

DeepLung

WACV18 paper "DeepLung: Deep 3D Dual Path Nets for Automated Pulmonary Nodule Detection and Classification"

Language:Jupyter NotebookApache-2.0310 7 158

AnatomyNet-for-anatomical-segmentation

AnatomyNet: Deep 3D Squeeze-and-excitation U-Nets for fast and fully automated whole-volume anatomical segmentation

Language:PythonApache-2.0144 4 26

AutoShot

AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023

Language:PythonMIT57 5 5

adversarial-deep-structural-networks

ISBI2018: Adversarial Deep Structural Networks for Mammographic Mass Segmentation https://arxiv.org/abs/1612.05970

Language:PythonApache-2.051 5 9

speechnas

SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification

Language:PythonMIT30 3 3

leetcode-master

LeetCode 刷题攻略：200道经典题目刷题顺序，共60w字的详细图解，视频难点剖析，50余张思维导图，支持C++，Java，Python，Go，JavaScript等多语言版本，从此算法学习不再迷茫！🔥🔥 来看看，你会发现相见恨晚！🚀

100

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

BSD-3-Clause000

asv-subtools

An Open Source Tools for Speaker Recognition

Language:PythonApache-2.0000

ccf_2020_qa_match

ccf 2020 qa match competition top1

Language:Python000

CLIP

Contrastive Language-Image Pretraining

MIT000

D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

Language:Python000

DAT

Repository of Vision Transformer with Deformable Attention (CVPR2022)

Apache-2.0000

Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Apache-2.0000

Det3D

World's first general purpose 3D object detection codebse.

Apache-2.0000

Few-shot-NAS

The official repo for Few-Shot Neural Architecture Search (ICML'21 long oral)

Language:Python000

flamingo-pytorch

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

Language:PythonMIT000

GEBD

Generic Event Boundary Detection: A Benchmark for Event Segmentation

Language:PythonMIT000

machine-learning-systems-design

A booklet on machine learning systems design with exercises

000

manning

Repository for the book Grokking Machine Learning, by Manning Editors

Language:Jupyter Notebook000

mmt

Multi-Modal Transformer for Video Retrieval

Apache-2.0000

Pretrained-Language-Model

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

000

Q-A-matching-of-real-estate-industry

MIT000

TuRBO

NOASSERTION000

ufom

Language:PythonMIT000

UniFormer

[ICLR2022] official implementation of UniFormer

Apache-2.0000

vision

Datasets, Transforms and Models specific to Computer Vision

Language:PythonBSD-3-Clause000

ViT-pytorch

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

MIT000

vit-pytorch-1

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonMIT000

voxceleb_trainer

In defence of metric learning for speaker recognition

MIT000

wav2tok

Codebase for ICLR' 23 paper- ''Wav2Tok: Deep Sequence Tokenizer for Audio Retrieval"

Language:PythonNOASSERTION000