[top]

SARI: A Stage-Aware Recognition Method for Ingredients Changing Appearance in Cooking Image Sequences

by Yixing Zhang, Yoko Yamakata, Keishi Tajima

Abstract

Recognizing ingredients in cooking images is a challenging task due to the significant visual changes that ingredients undergo throughout the cooking process. As ingredients are prepared, cooked, and served, their appearances vary greatly between the beginning, intermediate, and finishing stages. Traditional object recognition methods, which assume constant object appearances, struggle with this variability and are often not good at accurately identifying ingredients at different cooking stages. To address this challenge, we propose a stage-aware recognition method specifically designed for dynamically changing ingredients in cooking images. Our approach introduces two techniques: 1. Stage-Wise Model Learning: This technique involves training separate models for each stage of the cooking process. By adapting models to specific stages, we can better capture the distinct visual characteristics of ingredients as their appearances change. 2. Stage-Aware Curriculum Learning: This technique begins training with data from the beginning cooking stages and progressively incorporates data from later stages. This gradual approach helps the model adapt to the evolving appearances of ingredients. Our experimental results, using our published dataset, demonstrate that our stage-aware methods significantly outperform models trained without stage considerations, achieving higher accuracy in ingredient recognition.

Full Text: pdf

BibTex entry

Keywords

multimedia, food recognition, cooking, recipe data, datasets
Published in Journal of Information Processing, Vol.33, pp.104-114, 2025


tajima@i.kyoto-u.ac.jp / Fax: +81(Japan) 75-753-5978 / Office: Research Bldg. #7, room 404