findalexli / mllm-dpo

[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model

Home Page: https://aclanthology.org/2024.acl-long.765/

findalexli/mllm-dpo Issues