mesolitica / multimodal-LLM

Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

mesolitica/multimodal-LLM Watchers