Chaos (ranck626)

Company: University of Electronic Science and Technology of China

Chaos's starred repositories

MMTrustEval

A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust)

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 72 | Issues: 0

LLaVA-MOSS2

A modified LLaVA framework for MOSS2 that makes MOSS2 a multimodal model.

Language: Python | License: Apache-2.0 | Stargazers: 9 | Issues: 0

AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4803 | Issues: 0

AI_Gen_Novel

Exploring the limits of AI novel writing using large language models (LLMs) and multi-agent systems

Language: Python | License: MIT | Stargazers: 60 | Issues: 0

RemoteCLIP

🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 263 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 200 | Issues: 0
Language: Python | Stargazers: 13 | Issues: 0

WHU-OPT-SAR-dataset

Open-source dataset; multimodal fusion; remote sensing; optical images; SAR images; deep learning

Stargazers: 130 | Issues: 0

ONE-PEACE

A general representation model across vision, audio, and language modalities. Paper: "ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities"

Language: Python | License: Apache-2.0 | Stargazers: 916 | Issues: 0

dive-into-llms

"Dive into LLMs" (动手学大模型): a series of hands-on programming tutorials

Stargazers: 3003 | Issues: 0
Language: Python | Stargazers: 22 | Issues: 0
Language: Python | Stargazers: 518 | Issues: 0

CLIP-LoRA

An easy way to apply LoRA to CLIP. Implementation of the paper "Low-Rank Few-Shot Adaptation of Vision-Language Models" (CLIP-LoRA) [CVPRW 2024].

Language: Python | Stargazers: 58 | Issues: 0
Language: Python | Stargazers: 441 | Issues: 0
Language: Python | Stargazers: 1 | Issues: 0

LanguageBind

[ICLR 2024 🔥] Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language: Python | License: MIT | Stargazers: 654 | Issues: 0

ImageBind-LoRA

Fine-tuning "ImageBind: One Embedding Space to Bind Them All" with LoRA

Language: Python | License: NOASSERTION | Stargazers: 168 | Issues: 0

Chinese-LLaVA

An open-source, commercially usable multimodal model that supports bilingual (Chinese and English) visual-text dialogue.

Language: Python | License: Apache-2.0 | Stargazers: 348 | Issues: 0

Fine-Tuning-the-Image-Encoder-of-clip-using-pre-Trained-CLIP-ViT-Large-Patch14

Optimize CLIP-ViT-Large-Patch14.ipynb with our tailored image encoder fine-tuning script. Quickly adapt the model to your needs for enhanced performance on image-based tasks.

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0

executor-image-clip-encoder

CLIPImageEncoder is an image encoder that wraps the image embedding functionality using the CLIP

Language: Python | Stargazers: 8 | Issues: 0

ImageBind

ImageBind: One Embedding Space to Bind Them All

Language: Python | License: NOASSERTION | Stargazers: 8157 | Issues: 0

CLIP-API-service

CLIP as a service: embed images and sentences, object recognition, visual reasoning, image classification, and reverse image search

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 46 | Issues: 0

Chinese-CLIP

A Chinese version of CLIP that supports Chinese cross-modal retrieval and representation generation.

Language: Python | License: MIT | Stargazers: 4178 | Issues: 0
Language: Python | License: MIT | Stargazers: 297 | Issues: 0

Matrix-Theory

Review notes for the "Matrix Theory" course at the University of Electronic Science and Technology of China

Language: TeX | License: Apache-2.0 | Stargazers: 5 | Issues: 0

ollama

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Language: Go | License: MIT | Stargazers: 85458 | Issues: 0

RoCLIP

Robust Contrastive Language-Image Pretraining against Data Poisoning and Backdoor Attacks

Language: Python | Stargazers: 9 | Issues: 0

CLIP4CMR

A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval

Language: Python | Stargazers: 40 | Issues: 0

Adversarial-Prompt-Tuning

ECCV 2024: Adversarial Prompt Tuning for Vision-Language Models

Language: Python | License: MIT | Stargazers: 15 | Issues: 0

VLAttack

The official repository of "VLAttack: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models" (NeurIPS 2023).

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 26 | Issues: 0