Beast code in Giters

wanboyang's repositories

Anomaly_AR_Net_ICME_2020

This repository is for Weakly Supervised Video Anomaly Detection via Center-Guided Discriminative Learning(ICME 2020). The original paper can be found (https://ieeexplore.ieee.org/document/9102722) or (https://arxiv.org/abs/2104.07268)

Language:PythonMIT52 3 8

Awesome-Multimodal-Large-Language-Models

Latest Papers and Datasets on Multimodal Large Language Models

200

Protein-Localization-Transformer

Code for CELL-E: Biological Zero-Shot Text-to-Image Synthesis for Protein Localization Prediction

Language:PythonMIT100

IASGVD_ICASSP2022

NOASSERTION020

UCF_2018_CVPR

A reproduce code for Real-world Anomaly Detection in Surveillance Videos

Language:Python020

awesome-industrial-anomaly-detection

Paper list and datasets for industrial image anomaly/defect detection (updating). 工业异常/瑕疵检测论文及数据集检索库(持续更新)。

000

CAA

Channelized Axial Attention for Semantic Segmentation (AAAI-2022)

Language:PythonMIT010

camel

CaMEL: Mean Teacher Learning for Image Captioning. arXiv 2022.

Language:PythonBSD-3-Clause010

CBTrans

Language:PythonNOASSERTION010

Chinese-STD-GB-T-7714-related-csl

GB/T 7714相关的csl以及Zotero使用技巧及教程。

GPL-3.0010

DAT

Repository of Vision Transformer with Deformable Attention

Language:Python010

davit

Code for paper "DaViT: Dual Attention Vision Transformer"

Language:Jupyter NotebookMIT010

DIFNet

Language:PythonBSD-3-Clause010

docker-images

000

grit

GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)

Language:Python000

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonApache-2.0000

InternLM-XComposer

Language:Python000

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language:ShellNOASSERTION000

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Language:PythonMIT000

LLaMA-Adapter

Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

GPL-3.0000

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

000

LLMVA-GEBC

Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)

Language:PythonBSD-3-Clause000

Neighborhood-Attention-Transformer

[Preprint] Neighborhood Attention Transformer

Language:Python010

pykaldi

A Python wrapper for Kaldi

Language:PythonApache-2.0000

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookNOASSERTION000

Textual-Visual-Semantic-Dataset

Visual Semantic Relatedness Dataset for Image Captioning. https://arxiv.org/abs/2301.08784

Language:Python000

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonMIT000

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonBSD-3-Clause000

wanboyang.github.io

AcadHomepage: A Modern and Responsive Academic Personal Homepage

Language:SCSSMIT000

Xmodal-Ctx

Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning

Language:Python010