This suite of generative flow matching models allows for advanced image generation and editing. Unlike other text-to-image models, it performs in-context image generation, accepting both text and image inputs to seamlessly extract and modify visual concepts, producing new, coherent renderings. It goes beyond simple text-to-image, understanding and creating from existing images. You can modify input images with simple text instructions, enabling flexible and instant image editing without the need for finetuning or complex workflows.
Key capabilities include:
This allows you to iteratively add instructions and build on previous edits, refining creations step-by-step with minimal latency, while preserving image quality and consistency.
+2 more