dvlab-research / MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

dvlab-research/MGM Issues

Why is the output NaN?
Closed 3 months ago · 1 comment
AttributeError: 'NoneType' object has no attribute 'tokenize' when using Stable Diffusion
Updated 21 days ago · 1 comment
llama3 result is repeated many times
Updated a month ago · 1 comment
Does MGM support in-context (few-shot) inference?
Updated a month ago
Will there be support for Qwen2?
Updated a month ago
How to access hidden states?
Updated a month ago · 1 comment
Error while loading model with transformers library
Closed 3 months ago · 3 comments
I get this error: WARNING: tokenization mismatch: 156 vs. 161. (ignored) when I finetune llama3
Updated 2 months ago · 1 comment
Error during chat when the worker is launched with multiple GPUs
Closed 2 months ago · 1 comment
Does the current inference code support multi-image input?
Updated 2 months ago · 1 comment
loss 0 and grad nan
Closed 2 months ago · 3 comments
How to fix [NETWORK ERROR DUE TO HIGH TRAFFIC.] on macOS?
Updated 2 months ago · 2 comments
| ERROR | stderr | RecursionError: maximum recursion depth exceeded in comparison
Updated 2 months ago · 1 comment
Has anyone encountered the "MGMConfig" error?
Updated 2 months ago
Missing dataset problem
Closed 2 months ago · 1 comment
error in loading
Closed 2 months ago
Requirement for pretraining weights of LLaMa-3-8B-Instruct
Updated 2 months ago
Can you provide a zip of the laion-gpt4v dataset images?
Updated 2 months ago
Multi-node multi-GPU training gives worse results than single-node multi-GPU training
Updated 2 months ago · 1 comment
Inference problem about the demo.
Updated 2 months ago · 1 comment
mgm-34b-hd, should have a 'model_type' key in its config.json
Updated 2 months ago · 2 comments
The data for alignment and finetuning contains duplicates. Can you please explain why this is happening?
Updated 2 months ago · 1 comment
Could you release the code used to generate the generation_pure_text data?
Closed 2 months ago · 4 comments
Generation-related Instructions dataset link
Updated 2 months ago · 1 comment
Error after changing the image input in a multi-turn conversation
Updated 2 months ago
Loss does not decrease
Closed 2 months ago
pretrain error: lack of preprocessor_config.json
Closed 3 months ago · 1 comment
Which DeepSpeed version is used?
Updated 3 months ago · 2 comments
LLaMA 70B support
Updated 3 months ago
Inference speed
Updated 3 months ago
Some weights of the model checkpoint were not used when initializing MGMLlamaForCausalLM
Updated 3 months ago · 2 comments
lora initialisation missing from builder.py
Updated 3 months ago · 1 comment
How to use the stage2 ckpt for stage3 fine-tuning?
Updated 3 months ago · 1 comment
Excessive Length of Responses from Mini Gemini
Closed 3 months ago · 1 comment
Use of OCR in evaluation
Updated 3 months ago · 1 comment
Why do the llama training scripts use a different conv for pretraining and finetuning?
Updated 3 months ago · 1 comment
Model asks itself questions and answers them
Updated 3 months ago · 1 comment
Congratulations on the best LLaVA-derived models!
Updated 3 months ago · 1 comment
Are there plans to add DPO training to mitigate model hallucination?
Updated 3 months ago
Some questions about the demo
Updated 3 months ago · 3 comments
AttributeError: 'OpenCLIPVisionTower' object has no attribute 'device'
Updated 3 months ago · 2 comments
Take input image as condition.
Closed 3 months ago · 2 comments
stage2 loss is 0
Closed 3 months ago · 1 comment
Calling a custom finetuned model via the CLI raises 'OpenCLIPVisionTower' object has no attribute 'device'
Closed 3 months ago · 15 comments
'LlamaForCausalLM' object has no attribute 'get_vision_tower'
Closed 3 months ago · 1 comment
Network error when running the inference command; the inference interface cannot be built
Closed 3 months ago · 2 comments
How to prompt for a short response
Updated 3 months ago · 2 comments
Huggingface inference script
Updated 3 months ago · 1 comment
Finetune
Updated 3 months ago · 7 comments
Deployed Mini-Gemini on Windows and encountered the following error during the “Launch a Graph web server” step. Seeking help to resolve the issue
Updated 3 months ago · 2 comments