QwenLM / Qwen3-Omni

Qwen3-Omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud. It understands text, audio, images, and video, and generates speech in real time.

Repository on GitHub: https://github.com/QwenLM/Qwen3-Omni
