ArrowLuo

ArrowLuo's repositories

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language:PythonMIT785 12 109

SegCLIP

PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"

Language:Python73 9 5

DOER

The implementation of ACL 2019 paper DOER: Dual Cross-Shared RNN for Aspect Term-Polarity Co-Extraction

Language:Python58 3 6

VideoFeatureExtractor

Video Feature Extractor for S3D-HowTo100M

Language:PythonApache-2.028 2 3

GRACE

The impletation of paper titled GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis

Language:PythonApache-2.019 3 7

FCMFrame

FCM visualization by Java awt

Language:JavaApache-2.013 20

BiDTree

The implementation of TASLP 2019 paper Improving Aspect Term Extraction with Bidirectional Dependency Tree Representation

Language:PythonMIT9 5 2

allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Language:PythonGPL-3.0000

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

000

CLIP

Contrastive Language-Image Pretraining

Language:Jupyter NotebookMIT010

ConvNeXt

Code release for ConvNeXt model

Language:PythonMIT000

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language:PythonMIT000

faceswap

Deepfakes Software For All

Language:PythonGPL-3.0010

fast_pytorch_kmeans

This is a pytorch implementation of k-means clustering algorithm

Language:PythonMIT000

frozen-in-time

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]

Language:Python010

GroupViT

Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.

Language:PythonNOASSERTION000

IDE-3D

[SIGGRAPH Asia 2022] IDE-3D: Interactive Disentangled Editing For High-Resolution 3D-aware Portrait Synthesis

Language:Jupyter Notebook000

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

Language:C++Apache-2.0000

MMSA

CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotations of Modality (ACL2020)

Language:PythonMIT010

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Language:PythonBSD-3-Clause000

ScreenToGif

🎬 ScreenToGif allows you to record a selected area of your screen, edit and save it as a gif or video.

Language:C#MS-PL010

sd-1click-colab

Language:Jupyter Notebook000

SLIP

Code release for SLIP Self-supervision meets Language-Image Pre-training

Language:PythonMIT010

TempFiles

020

Text2Video

ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary"

Language:Python010

uda

Unsupervised Data Augmentation (UDA)

Language:PythonApache-2.0010

unilm

UniLM AI - Unified "Language" Model Pre-training across Tasks, Languages, and Modalities

Language:PythonMIT010

UniVL

An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

Language:PythonMIT010

vision

Datasets, Transforms and Models specific to Computer Vision

Language:PythonBSD-3-Clause010

vissl

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

Language:Jupyter NotebookMIT010