There are 34 repositories under the zero-shot-classification topic.
An open source implementation of CLIP.
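For context, zero-shot classification with a CLIP-style model boils down to comparing an image embedding against embeddings of candidate prompts. A minimal sketch using the open_clip package (the checkpoint name and image path below are illustrative, not prescribed by the repo):

```python
# Minimal zero-shot classification with open_clip; the checkpoint name
# ("laion2b_s34b_b79k") and image path ("cat.jpg") are illustrative.
import torch
import open_clip
from PIL import Image

model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-B-32", pretrained="laion2b_s34b_b79k"
)
tokenizer = open_clip.get_tokenizer("ViT-B-32")

labels = ["a photo of a cat", "a photo of a dog", "a photo of a car"]
image = preprocess(Image.open("cat.jpg")).unsqueeze(0)
text = tokenizer(labels)

with torch.no_grad():
    img_feats = model.encode_image(image)
    txt_feats = model.encode_text(text)
    img_feats /= img_feats.norm(dim=-1, keepdim=True)   # unit-normalize
    txt_feats /= txt_feats.norm(dim=-1, keepdim=True)
    probs = (100.0 * img_feats @ txt_feats.T).softmax(dim=-1)

print(labels[probs.argmax().item()])
```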
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5-VL.
[ECCV 2024] Video Foundation Models & Data for Multimodal Understanding
Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training
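The underlying idea, roughly: condition the diffusion model on each candidate label, measure how well it predicts the noise added to the image, and pick the label with the lowest error. A simplified sketch (not the official code; `eps_model` and its inputs stand in for a pretrained text-conditioned diffusion model such as Stable Diffusion):

```python
# Sketch of the Diffusion Classifier idea (not the official code): score each
# label by the text-conditioned noise-prediction error and take the argmin.
import torch

def diffusion_classify(eps_model, x0, class_embeds, timesteps, alphas_cumprod):
    """eps_model(x_t, t, cond) -> predicted noise, as in a pretrained
    text-conditioned diffusion model; all names here are placeholders."""
    errors = []
    for cond in class_embeds:                  # one text embedding per label
        total = 0.0
        for t in timesteps:                    # Monte Carlo over noise levels
            noise = torch.randn_like(x0)
            a_bar = alphas_cumprod[t]
            x_t = a_bar.sqrt() * x0 + (1.0 - a_bar).sqrt() * noise  # forward process
            total += torch.mean((eps_model(x_t, t, cond) - noise) ** 2).item()
        errors.append(total / len(timesteps))
    return min(range(len(errors)), key=errors.__getitem__)  # lowest error wins
```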
[NeurIPS 2023] Official implementation of our paper "An Inverse Scaling Law for CLIP Training"
Cybertron: the home planet of the Transformers in Go
Official code for "OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding"
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
PyTorch code for MUST (Masked Unsupervised Self-Training for label-free image classification)
Multi-Aspect Vision Language Pretraining (CVPR 2024)
Official PyTorch implementation of MSDN (Mutually Semantic Distillation Network for zero-shot learning, CVPR 2022)
[TPAMI 2023] Generative Multi-Label Zero-Shot Learning
[ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"
[ICASSP 2025] Open-source code for the paper "Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification"
Evaluate custom and HuggingFace text-to-image/zero-shot-image-classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. Metrics include zero-shot accuracy, linear probe, image retrieval, and KNN accuracy.
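As one illustration, the linear-probe metric is typically computed by fitting a logistic regression on frozen embeddings; a minimal sketch (the hyperparameters are placeholder assumptions, not this toolkit's defaults):

```python
# Illustrative linear-probe evaluation: fit a logistic regression on frozen
# embeddings and report test accuracy. C and max_iter are placeholder choices.
from sklearn.linear_model import LogisticRegression

def linear_probe_accuracy(train_feats, train_labels, test_feats, test_labels):
    clf = LogisticRegression(C=1.0, max_iter=1000)
    clf.fit(train_feats, train_labels)         # only the probe is trained
    return clf.score(test_feats, test_labels)  # top-1 accuracy
```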
Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.
Alternate implementation of zero-shot text classification: instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models to do ZSC, so it can be lightweight and support more languages without trading off accuracy. - Prithivi Da
Low-latency ONNX- and TensorRT-based zero-shot classification and detection, driven by CLIP (contrastive language-image pre-training) prompts
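A rough sketch of what CLIP-style inference through ONNX Runtime looks like (the .onnx file names and the tensor names "pixel_values"/"input_ids" are assumptions for illustration, not this repo's actual exports):

```python
# Rough sketch of CLIP-style zero-shot inference via ONNX Runtime; file names
# and input tensor names are assumptions, not this repo's API.
import numpy as np
import onnxruntime as ort

providers = ["CUDAExecutionProvider", "CPUExecutionProvider"]
image_sess = ort.InferenceSession("clip_image_encoder.onnx", providers=providers)
text_sess = ort.InferenceSession("clip_text_encoder.onnx", providers=providers)

def classify(pixels: np.ndarray, token_ids: np.ndarray) -> int:
    img = image_sess.run(None, {"pixel_values": pixels})[0]
    txt = text_sess.run(None, {"input_ids": token_ids})[0]
    img /= np.linalg.norm(img, axis=-1, keepdims=True)   # unit-normalize
    txt /= np.linalg.norm(txt, axis=-1, keepdims=True)
    return int((img @ txt.T).argmax())                   # best-matching prompt index
```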
Code for the experiments in our EMNLP 2021 paper "Open Aspect Target Sentiment Classification with Natural Language Prompts"
[CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"
Airflow Pipeline for Machine Learning
Official PyTorch Code for "ATPrompt: Textual Prompt Learning with Embedded Attributes"
NeurIPS 2024 Track on Datasets and Benchmarks (Spotlight)
A minimal, but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch
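At the heart of any such implementation is CLIP's symmetric contrastive (InfoNCE) loss; a self-contained sketch, assuming L2-normalized image and text embeddings from N matched pairs:

```python
# CLIP's symmetric contrastive (InfoNCE) loss, assuming L2-normalized
# image/text embeddings of shape (N, D) from N matched pairs.
import torch
import torch.nn.functional as F

def clip_loss(image_emb, text_emb, temperature=0.07):
    logits = image_emb @ text_emb.T / temperature   # (N, N) similarity matrix
    targets = torch.arange(logits.size(0), device=logits.device)
    loss_i = F.cross_entropy(logits, targets)    # match each image to its caption
    loss_t = F.cross_entropy(logits.T, targets)  # and each caption to its image
    return (loss_i + loss_t) / 2
```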
Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]
Deep Learning for Computer Vision, a course by Frank Wang (王鈺強)
[ACL'23 Findings] Code for our paper "ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval"
Perform topic classification on news articles under several limited labeled-data regimes.
Code for the EMNLP 2019 paper "Benchmarking zero-shot text classification: datasets, evaluation and entailment approach"
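The entailment reframing from this line of work is essentially what Hugging Face's zero-shot-classification pipeline implements: each candidate label becomes a hypothesis scored by an NLI model. A minimal usage example (the NLI checkpoint and example text are illustrative):

```python
# Entailment-based zero-shot text classification via Hugging Face's pipeline;
# the checkpoint and example sentence are illustrative.
from transformers import pipeline

classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")
result = classifier(
    "The team clinched the championship with a last-minute goal.",
    candidate_labels=["sports", "politics", "technology"],
)
print(result["labels"][0])  # highest-scoring label
```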
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
A toolkit for research on multimodal representation learning
GPT-4o (with Vision) module for use with Autodistill.
From-scratch PyTorch implementation of CLIP (Radford et al., 2021), trained on Flickr8k + Flickr30k
Source code for the paper "PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training", published at EMNLP 2023.