๐Ÿค– The LMX-Omni-52B-Halo multimodal model from Lemonade SDK has been introduced.

It is a combination of four models: Qwen3.6-35B-A3B-MTP-GGUF (chat and vision), Flux-2-Klein-9B-GGUF (images), Whisper-Large-v3-Turbo (speech), and kokoro-v1 (voice synthesis). The system operates through a single interface compatible with OpenAI.

๐ŸŒ An approach via orchestration of SOTA models instead of training one giant architecture lowers the barrier to entry for creating multimodal agents and allows powerful solutions to run on local hardware.

๐Ÿ‘ค Users get a full-fledged AI assistant that can see, hear, and speak, working locally and supporting Open WebUI or AnythingLLM interfaces.

Source 1: https://huggingface.co/lemonade-sdk/LMX-Omni-52B-Halo