Repositories under the qwen2-vl topic:
Use PEFT or full-parameter training to fine-tune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
Paddle Multimodal Integration and eXploration, supporting mainstream multimodal tasks, including end-to-end large-scale multimodal pretrained models and a diffusion model toolbox; built for high performance and flexibility.
An open-source implementation for fine-tuning the Qwen2-VL series by Alibaba Cloud.
A Python-based CLI tool for captioning images with WD series, Joy-caption-pre-alpha, Meta Llama 3.2 Vision Instruct, and Qwen2-VL Instruct models.
A sample for trying out QwenLM/Qwen2-VL on Colaboratory.
Run large language models easily.
A case study of fine-tuning Qwen2-VL with LLaMA-Factory in the culture and tourism domain.
Practical projects using LLM, VLM, and diffusion models.
An OCR and document search web application.