Beast code in Giters

[CVPR 2022 Oral] Official repository for "MAXIM: Multi-Axis MLP for Image Processing". SOTA for denoising, deblurring, deraining, dehazing, and enhancement.

Apache-2.0000

merlot_reserve

Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"

MIT000

NAFNet

The state-of-the-art image restoration model without nonlinear activation functions.

Language:PythonNOASSERTION000

NeuralNeighborStyleTransfer

Optimization based style transfer

Language:PythonMIT000

OFA

Official repository of OFA. Paper: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language:PythonApache-2.0000

omnivore

Omnivore A Single Model for Many Visual Modalities

Language:Jupyter NotebookNOASSERTION000

omnizart

Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.

Language:PythonMIT000

PICa

An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)

MIT000

StructuredDreaming

Repo for structured dreaming

Language:Jupyter Notebook000

StyleCLIP

Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)

MIT000

stylegan3-editing

Official Implementation of "Third Time's the Charm? Image and Video Editing with StyleGAN3" https://arxiv.org/abs/2201.13433

Language:PythonMIT000

SWAG

Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.

Language:Jupyter NotebookNOASSERTION000

TransEditor

[CVPR 2022] TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

MIT000

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0000

v-diffusion-jax

v objective diffusion inference code for JAX.

Language:PythonMIT000

VL-T5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

Language:PythonMIT000

VQGAN-CLIP

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

Language:PythonMIT000

wiki_crosslingual

Code to reproduce the NAACL 2021 paper "Wikipedia entities as rendezvous across languages: grounding multilingual LMs by predicting wikipedia hyperlinks".

MIT000