Source:
WIPO "tomato"
A method, apparatus, non-transitory computer-readable medium, and system for generating images based on a target prompt and an anchor prompt include obtaining the target prompt and the anchor prompt. The target prompt describes a first element, and the anchor prompt describes a second element. A first attention block of an image generation model generates a first attention output based on the target prompt, and a second attention block of the image generation model generates a second attention output based on the anchor prompt. The image generation model then generates a synthetic image that depicts the first element and excludes the second element by combining the first attention output and the second attention output.
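
The two-branch attention flow described above can be illustrated with a minimal sketch. The abstract does not state how the two attention outputs are combined, so the subtraction rule below, the `anchor_weight` parameter, and all function names are assumptions made for illustration only, not the claimed method. Prompts are represented as small hand-written token embeddings rather than the output of a real text encoder.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product attention: each image query attends to prompt tokens."""
    dim = len(keys[0])
    outputs = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim) for k in keys]
        weights = softmax(scores)
        outputs.append(
            [sum(w * v[j] for w, v in zip(weights, values)) for j in range(len(values[0]))]
        )
    return outputs


def combine(target_out, anchor_out, anchor_weight=0.5):
    """Hypothetical combination rule (an assumption, not from the source):
    subtract a scaled anchor output so anchor features are suppressed."""
    return [
        [t - anchor_weight * a for t, a in zip(t_row, a_row)]
        for t_row, a_row in zip(target_out, anchor_out)
    ]


# Toy data: two image-patch queries and one token embedding per prompt.
image_queries = [[1.0, 0.0], [0.0, 1.0]]
target_tokens = [[1.0, 0.0]]  # stands in for the target prompt (first element)
anchor_tokens = [[0.0, 1.0]]  # stands in for the anchor prompt (second element)

# First attention block: conditioned on the target prompt.
target_out = cross_attention(image_queries, target_tokens, target_tokens)
# Second attention block: conditioned on the anchor prompt.
anchor_out = cross_attention(image_queries, anchor_tokens, anchor_tokens)

combined = combine(target_out, anchor_out)
```

In this toy setup the combined features keep the target direction and carry a negative component along the anchor direction, which is one plausible way a downstream decoder could be steered away from the second element. A real image generation model would apply such a step inside its denoising network at each attention layer.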