Beast code in Giters

Ryan Wong's starred repositories

docusaurus

Easy to maintain open source documentation websites.

Language:TypeScriptMIT55378 407 3081

yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Language:PythonAGPL-3.049505 368 9226

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonApache-2.031379 309 904

faiss

A library for efficient similarity search and clustering of dense vectors.

Language:C++MIT30377 482 2463

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT30126 428 4182

fastText

Library for fast text representation and classification.

Language:HTMLMIT25831 846 1078

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language:PythonApache-2.08061 55 1503

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language:PythonApache-2.06438 44 85

aim

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Language:PythonApache-2.05124 43 1014

nlpaug

Data augmentation for NLP

Language:Jupyter NotebookMIT4401 41 221

deit

Official DeiT repository

Language:PythonApache-2.03998 48 197

pytorch-openpose

pytorch implementation of openpose including Hand and Body Pose Estimation.

Language:Jupyter Notebook2053 24 78

sam

SAM: Sharpness-Aware Minimization (PyTorch)

Language:PythonMIT1735 12 81

pytorch_scatter

PyTorch Extension Library of Optimized Scatter Operations

Language:PythonMIT1536 17 384

pytorch-seq2seq

An open source framework for seq2seq models in PyTorch.

Language:PythonApache-2.01491 60 116

bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Language:PythonMIT1175 28 63

Neighborhood-Attention-Transformer

Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022

Language:PythonMIT1038 16 75

ml_collections

ML Collections is a library of Python Collections designed for ML use cases.

Language:PythonApache-2.0879 17 19

WLASL

WACV 2020 "Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison"

Language:Python795 10 69

InterHand2.6M

Official PyTorch implementation of "InterHand2.6M: A Dataset and Baseline for 3D Interacting Hand Pose Estimation from a Single RGB Image", ECCV 2020

Language:PythonNOASSERTION681 25 151

textaugment

TextAugment: Text Augmentation Library

Language:PythonMIT393 8 23

torch_videovision

Transforms for video datasets in pytorch

Language:PythonGPL-3.0269 8 8

UniPose

We propose UniPose, a unified framework for human pose estimation, based on our “Waterfall” Atrous Spatial Pooling architecture, that achieves state-of-art-results on several pose estimation metrics. Current pose estimation methods utilizing standard CNN architectures heavily rely on statistical postprocessing or predefined anchor poses for joint localization. UniPose incorporates contextual seg- mentation and joint localization to estimate the human pose in a single stage, with high accuracy, without relying on statistical postprocessing methods. The Waterfall module in UniPose leverages the efficiency of progressive filter- ing in the cascade architecture, while maintaining multi- scale fields-of-view comparable to spatial pyramid config- urations. Additionally, our method is extended to UniPose- LSTM for multi-frame processing and achieves state-of-the- art results for temporal pose estimation in Video. Our re- sults on multiple datasets demonstrate that UniPose, with a ResNet backbone and Waterfall module, is a robust and efficient architecture for pose estimation obtaining state-of- the-art results in single person pose detection for both sin- gle images and videos.

Language:PythonNOASSERTION211 10 44

ryanwongsa