z-fabian's starred repositories
diffusiondb
A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
InterpretDiffusion
[CVPR 2024] Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation
sparse_autoencoder
Sparse Autoencoder for Mechanistic Interpretability
sparse_coding
Using sparse coding to find distributed representations used by neural networks.
cryoet-deepfinder
Macromolecules Localization and Identification in 3D Cellular Cryo-Electron Tomograms
llm_benchmarks
A collection of benchmarks and datasets for evaluating LLM.
MultiDiffusion
Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)
CLIP_benchmark
CLIP-like model evaluation
whatsup_vlms
Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".
tomotwin-cryoet
cryo-ET particle picking by representation and metric learning
RecvisProject
In this project, we propose to study Vision Transformers trained using the Barlow Twins self-supervised method, and compare the results with DINO. We demonstrate the effectiveness of the Barlow Twins method by showing that networks pretrained on the small PASCAL VOC 2012 dataset are able to generalize well. Authors: Apavou Clément & Zucker Arthur
Awesome-Foundation-Models
A curated list of foundation models for vision and language tasks
finetune-anything
Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios
img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.