#MMaDA

4 messages · Page 1 of 1 (latest)

junior plume
#

Multimodal reasoning diffusion LLM, based on LLaDA
paper | github | hf | demo

It can read and generate images and text

Also, all of MMaDA's training data is open (but it initializes from LLaDA which has closed training data)

#

afaik this is the first open reasoning model that can do image generation

sharp veldt
junior plume
#

oh, I saw BAGEL but didn't see that it had reasoning. Nice!