PL's repositories

DiLM

Implementaiton of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL2024 Findings)".

License:MITStargazers:0Issues:0Issues:0

video_distillation

Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.

Stargazers:0Issues:0Issues:0

PoDD

Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.

Stargazers:0Issues:0Issues:0

FreD

Official PyTorch implementation for Frequency Domain-based Dataset Distillation [NeurIPS 2023]

Stargazers:0Issues:0Issues:0

WatermarkDM

Code of the paper: A Recipe for Watermarking Diffusion Models

License:MITStargazers:0Issues:0Issues:0

Efficient-Dataset-Condensation

Official PyTorch implementation of "Dataset Condensation via Efficient Synthetic-Data Parameterization" (ICML'22)

License:MITStargazers:0Issues:0Issues:0

FATE-LLM

Federated Learning for LLMs.

License:Apache-2.0Stargazers:0Issues:0Issues:0

LLaMA-Adapter

Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

License:GPL-3.0Stargazers:0Issues:0Issues:0

EAT_code

Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".

Stargazers:0Issues:0Issues:0

Video-LLaMA

Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

License:Apache-2.0Stargazers:0Issues:0Issues:0

Ask-Anything

[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

stable-diffusion

A latent text-to-image diffusion model

License:NOASSERTIONStargazers:0Issues:0Issues:0

ALIA

Augmenting with Language-guided Image Augmentation

Stargazers:0Issues:0Issues:0

Clip2Protect

[CVPR 2023] Official repository of paper titled "CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search".

Stargazers:0Issues:0Issues:0

pingliu264.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

License:MITStargazers:0Issues:0Issues:0

SRe2L

Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves highest 60.8% on original ImageNet-1K val set.

Stargazers:0Issues:0Issues:0

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

License:MITStargazers:0Issues:0Issues:0

Point-In-Context

Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

CLCAE

Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint

License:MITStargazers:0Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

ILM-VP

[CVPR23] "Understanding and Improving Visual Prompting: A Label-Mapping Perspective" by Aochuan Chen, Yuguang Yao, Pin-Yu Chen, Yihua Zhang, and Sijia Liu

Stargazers:0Issues:0Issues:0

CUDA_LTR

Official Implementation of Curriculum of Data Augmentation for Long-tailed Recognition (CUDA) (ICLR'23 Spotlight)

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

LLaVA

Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Awesome_Prompting_Papers_in_Computer_Vision

A curated list of prompt-based paper in computer vision and vision-language learning.

Stargazers:0Issues:0Issues:0