z-fabian

Company: USC

Location: Los Angeles, CA

Home Page: https://z-fabian.github.io/


Organizations
MathFLDS

z-fabian's starred repositories

diffusiondb

A large-scale text-to-image prompt gallery dataset based on Stable Diffusion

Language: Python | License: MIT | Stargazers: 1197 | Issues: 0

InterpretDiffusion

[CVPR 2024] Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation

Language: Python | Stargazers: 28 | Issues: 0

SpLiCE

Sparse Linear Concept Embeddings

Language: Python | License: Apache-2.0 | Stargazers: 54 | Issues: 0

sparse_autoencoder

Sparse Autoencoder for Mechanistic Interpretability

Language: Python | License: MIT | Stargazers: 176 | Issues: 0

sparse_coding

Using sparse coding to find distributed representations used by neural networks.

Language: Jupyter Notebook | Stargazers: 171 | Issues: 0
Language: Python | Stargazers: 4 | Issues: 0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 9734 | Issues: 0

textgrad

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.

Language: Python | License: MIT | Stargazers: 1630 | Issues: 0

cryoet-deepfinder

Macromolecules Localization and Identification in 3D Cellular Cryo-Electron Tomograms

Language: Python | License: GPL-3.0 | Stargazers: 5 | Issues: 0

llm_benchmarks

A collection of benchmarks and datasets for evaluating LLM.

Stargazers: 261 | Issues: 0

ProLLaMA

A Protein Large Language Model for Multi-Task Protein Language Processing

Language: Python | License: Apache-2.0 | Stargazers: 129 | Issues: 0

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language: Python | License: Apache-2.0 | Stargazers: 1713 | Issues: 0

topaz

Pipeline for particle picking in cryo-electron microscopy images using convolutional neural networks trained from positive and unlabeled examples. Also featuring micrograph and tomogram denoising with DNNs.

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 171 | Issues: 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o.

Language: Python | License: MIT | Stargazers: 5676 | Issues: 0

MultiDiffusion

Official PyTorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

Language: Jupyter Notebook | Stargazers: 988 | Issues: 0

CLIP_benchmark

CLIP-like model evaluation

Language: Jupyter Notebook | License: MIT | Stargazers: 596 | Issues: 0

whatsup_vlms

Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".

Language: Python | License: MIT | Stargazers: 32 | Issues: 0

YOLOv6

YOLOv6: a single-stage object detection framework dedicated to industrial applications.

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 5689 | Issues: 0

detr

End-to-End Object Detection with Transformers

Language: Python | License: Apache-2.0 | Stargazers: 13433 | Issues: 0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language: Python | License: NOASSERTION | Stargazers: 6089 | Issues: 0

tomotwin-cryoet

cryo-ET particle picking by representation and metric learning

Language: Python | License: MPL-2.0 | Stargazers: 30 | Issues: 0

RecvisProject

In this project, we propose to study Vision Transformers trained using the Barlow Twins self-supervised method, and compare the results with DINO. We demonstrate the effectiveness of the Barlow Twins method by showing that networks pretrained on the small PASCAL VOC 2012 dataset are able to generalize well. Authors: Apavou Clément & Zucker Arthur

Language: Python | License: GPL-3.0 | Stargazers: 14 | Issues: 0

Awesome-Foundation-Models

A curated list of foundation models for vision and language tasks

Stargazers: 793 | Issues: 0

devit

CoRL 2024

Language: Python | License: MIT | Stargazers: 330 | Issues: 0

finetune-anything

Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios

Language: Python | License: MIT | Stargazers: 768 | Issues: 0

DCI

Densely Captioned Images (DCI) dataset repository.

Language: Python | License: NOASSERTION | Stargazers: 155 | Issues: 0

fiftyone

Refine high-quality datasets and visual AI models

Language: Python | License: Apache-2.0 | Stargazers: 8744 | Issues: 0

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language: Python | License: MIT | Stargazers: 3649 | Issues: 0

USCthesis

a LaTeX style for theses and dissertations at USC

Language: TeX | Stargazers: 25 | Issues: 0

RadFM

The official code for "Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data".

Language: Python | Stargazers: 333 | Issues: 0