USING OPTICAL CHARACTER RECOGNITION EXTRACTION AND LANGUAGE MODEL TO POPULATE AN ORDER WITH ITEMS FROM A RECIPE

Fuente: WIPO "tomato"
Embodiments relate to utilizing an optical character recognition extraction and a large language model (LLM) to automatically populate a shopping cart of a user of an online system with items from a physical recipe. The online system receives an image capturing the physical recipe and extracts a raw text from the received image. The online system generates a prompt for input into the LLM, the prompt including a task request for the LLM to generate a list of ingredients using the raw text. The online system inputs the prompt into the LLM to generate the list of ingredients. The online system maps the list of ingredients to a list of items available by one or more retailers associated with the online system. The online system causes a device of the user to display a user interface with the list of items.