ArrowLuo's repositories

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language:PythonLicense:MITStargazers:785Issues:12Issues:109

SegCLIP

PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"

DOER

The implementation of ACL 2019 paper DOER: Dual Cross-Shared RNN for Aspect Term-Polarity Co-Extraction

VideoFeatureExtractor

Video Feature Extractor for S3D-HowTo100M

Language:PythonLicense:Apache-2.0Stargazers:28Issues:2Issues:3

GRACE

The impletation of paper titled GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis

Language:PythonLicense:Apache-2.0Stargazers:19Issues:3Issues:7

FCMFrame

FCM visualization by Java awt

Language:JavaLicense:Apache-2.0Stargazers:13Issues:2Issues:0

BiDTree

The implementation of TASLP 2019 paper Improving Aspect Term Extraction with Bidirectional Dependency Tree Representation

Language:PythonLicense:MITStargazers:9Issues:5Issues:2

allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

Stargazers:0Issues:0Issues:0

CLIP

Contrastive Language-Image Pretraining

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

ConvNeXt

Code release for ConvNeXt model

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

faceswap

Deepfakes Software For All

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

fast_pytorch_kmeans

This is a pytorch implementation of k-means clustering algorithm

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

frozen-in-time

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]

Language:PythonStargazers:0Issues:1Issues:0

GroupViT

Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

IDE-3D

[SIGGRAPH Asia 2022] IDE-3D: Interactive Disentangled Editing For High-Resolution 3D-aware Portrait Synthesis

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

MMSA

CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotations of Modality (ACL2020)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

ScreenToGif

🎬 ScreenToGif allows you to record a selected area of your screen, edit and save it as a gif or video.

Language:C#License:MS-PLStargazers:0Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

SLIP

Code release for SLIP Self-supervision meets Language-Image Pre-training

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:2Issues:0

Text2Video

ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary"

Language:PythonStargazers:0Issues:1Issues:0

uda

Unsupervised Data Augmentation (UDA)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

unilm

UniLM AI - Unified "Language" Model Pre-training across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

UniVL

An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

vision

Datasets, Transforms and Models specific to Computer Vision

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

vissl

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0