Data format (input/output):
instruction <audio> audio file path </audio> instruction <image> image file path </image> instruction
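As a minimal sketch of how a sample in this format might be split into ordered text, audio, and image segments: the function name `parse_sample`, the regular expression, and the example file paths are illustrative assumptions, not part of the project's actual loader.

```python
import re
from typing import List, Tuple

# Matches <audio> ... </audio> or <image> ... </image>; the backreference \1
# requires the closing tag to match the opening one.
TAG_PATTERN = re.compile(r"<(audio|image)>(.*?)</\1>", re.DOTALL)


def parse_sample(text: str) -> List[Tuple[str, str]]:
    """Split a sample into ordered (kind, value) segments.

    kind is "text", "audio", or "image"; value is the instruction text or
    the file path, with surrounding whitespace removed.
    (Hypothetical helper, written only to illustrate the format.)
    """
    segments: List[Tuple[str, str]] = []
    pos = 0
    for match in TAG_PATTERN.finditer(text):
        # Instruction text that precedes this tag, if any.
        before = text[pos:match.start()].strip()
        if before:
            segments.append(("text", before))
        segments.append((match.group(1), match.group(2).strip()))
        pos = match.end()
    # Instruction text that follows the last tag, if any.
    tail = text[pos:].strip()
    if tail:
        segments.append(("text", tail))
    return segments


if __name__ == "__main__":
    # Example paths are made up for illustration.
    sample = (
        "Describe the sound <audio> clips/dog.wav </audio> "
        "and compare it with <image> imgs/dog.png </image> in one sentence."
    )
    for kind, value in parse_sample(sample):
        print(kind, value)
```

Keeping the segments in order preserves the interleaving of instructions and media, which a downstream model would presumably need to place audio and image inputs at the right positions in the sequence.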
Toward a Multimodal Language Model - an implementation of GPT-4o/Project Astra