PL's repositories
ALIA
Dataset augmentation with language-guided image augmentation (ALIA)
Ask-Anything
[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based papers in computer vision and vision-language learning.
CLCAE
Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint
Clip2Protect
[CVPR 2023] Official repository of paper titled "CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search".
CUDA_LTR
Official Implementation of Curriculum of Data Augmentation for Long-tailed Recognition (CUDA) (ICLR'23 Spotlight)
DiLM
Implementation of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted to NAACL 2024 Findings).
EAT_code
Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".
Efficient-Dataset-Condensation
Official PyTorch implementation of "Dataset Condensation via Efficient Synthetic-Data Parameterization" (ICML'22)
FATE-LLM
Federated Learning for LLMs.
FreD
Official PyTorch implementation for Frequency Domain-based Dataset Distillation [NeurIPS 2023]
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
ILM-VP
[CVPR23] "Understanding and Improving Visual Prompting: A Label-Mapping Perspective" by Aochuan Chen, Yuguang Yao, Pin-Yu Chen, Yihua Zhang, and Sijia Liu
LLaMA-Adapter
Fine-tuning LLaMA to follow instructions within 1 hour and with 1.2M parameters
LLaVA
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
pingliu264.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
PoDD
Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.
Point-In-Context
Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding
SRe2L
Large-scale dataset distillation/condensation; at 50 IPC (images per class), achieves the highest accuracy of 60.8% on the original ImageNet-1K validation set.
stable-diffusion
A latent text-to-image diffusion model
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Video-LLaMA
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
video_distillation
Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.
WatermarkDM
Code of the paper: A Recipe for Watermarking Diffusion Models