Towards Learning Food Portion From Monocular Images With Cross-Domain
Feature Adaptation
- URL: http://arxiv.org/abs/2103.07562v1
- Date: Fri, 12 Mar 2021 22:58:37 GMT
- Title: Towards Learning Food Portion From Monocular Images With Cross-Domain
Feature Adaptation
- Authors: Zeman Shao, Shaobo Fang, Runyu Mao, Jiangpeng He, Janine Wright,
Deborah Kerr, Carol Jo Boushey, Fengqing Zhu
- Abstract summary: We propose a deep regression process for portion size estimation by combining features estimated from both RGB and learned energy distribution domains.
Our estimates of food energy achieved state-of-the-art with a MAPE of 11.47%, significantly outperforms non-expert human estimates by 27.56%.
- Score: 6.648441500207032
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We aim to estimate food portion size, a property that is strongly related to
the presence of food object in 3D space, from single monocular images under
real life setting. Specifically, we are interested in end-to-end estimation of
food portion size, which has great potential in the field of personal health
management. Unlike image segmentation or object recognition where annotation
can be obtained through large scale crowd sourcing, it is much more challenging
to collect datasets for portion size estimation since human cannot accurately
estimate the size of an object in an arbitrary 2D image without expert
knowledge. To address such challenge, we introduce a real life food image
dataset collected from a nutrition study where the groundtruth food energy
(calorie) is provided by registered dietitians, and will be made available to
the research community. We propose a deep regression process for portion size
estimation by combining features estimated from both RGB and learned energy
distribution domains. Our estimates of food energy achieved state-of-the-art
with a MAPE of 11.47%, significantly outperforms non-expert human estimates by
27.56%.
Related papers
- MetaFood3D: Large 3D Food Object Dataset with Nutrition Values [53.24500333363066]
This dataset consists of 637 meticulously labeled 3D food objects across 108 categories, featuring detailed nutrition information, weight, and food codes linked to a comprehensive nutrition database.
Experimental results demonstrate our dataset's significant potential for improving algorithm performance, highlight the challenging gap between video captures and 3D scanned data, and show the strength of the MetaFood3D dataset in high-quality data generation, simulation, and augmentation.
arXiv Detail & Related papers (2024-09-03T15:02:52Z) - NutritionVerse-Direct: Exploring Deep Neural Networks for Multitask Nutrition Prediction from Food Images [63.314702537010355]
Self-reporting methods are often inaccurate and suffer from substantial bias.
Recent work has explored using computer vision prediction systems to predict nutritional information from food images.
This paper aims to enhance the efficacy of dietary intake estimation by leveraging various neural network architectures.
arXiv Detail & Related papers (2024-05-13T14:56:55Z) - From Canteen Food to Daily Meals: Generalizing Food Recognition to More
Practical Scenarios [92.58097090916166]
We present two new benchmarks, namely DailyFood-172 and DailyFood-16, designed to curate food images from everyday meals.
These two datasets are used to evaluate the transferability of approaches from the well-curated food image domain to the everyday-life food image domain.
arXiv Detail & Related papers (2024-03-12T08:32:23Z) - NutritionVerse-Real: An Open Access Manually Collected 2D Food Scene
Dataset for Dietary Intake Estimation [68.49526750115429]
We introduce NutritionVerse-Real, an open access manually collected 2D food scene dataset for dietary intake estimation.
The NutritionVerse-Real dataset was created by manually collecting images of food scenes in real life, measuring the weight of every ingredient and computing the associated dietary content of each dish.
arXiv Detail & Related papers (2023-11-20T11:05:20Z) - NutritionVerse: Empirical Study of Various Dietary Intake Estimation Approaches [59.38343165508926]
Accurate dietary intake estimation is critical for informing policies and programs to support healthy eating.
Recent work has focused on using computer vision and machine learning to automatically estimate dietary intake from food images.
We introduce NutritionVerse- Synth, the first large-scale dataset of 84,984 synthetic 2D food images with associated dietary information.
We also collect a real image dataset, NutritionVerse-Real, containing 889 images of 251 dishes to evaluate realism.
arXiv Detail & Related papers (2023-09-14T13:29:41Z) - An End-to-end Food Portion Estimation Framework Based on Shape
Reconstruction from Monocular Image [7.380382380564532]
We propose an end-to-end deep learning framework for food energy estimation from a monocular image through 3D shape reconstruction.
Our method is evaluated on a publicly available food image dataset Nutrition5k, resulting a Mean Absolute Error (MAE) of 40.05 kCal and Mean Absolute Percentage Error (MAPE) of 11.47% for food energy estimation.
arXiv Detail & Related papers (2023-08-03T15:17:24Z) - Image Based Food Energy Estimation With Depth Domain Adaptation [6.602838826255494]
We propose an "Energy Density Map" which is a pixel-to-pixel mapping from the RGB image to the energy density of the food.
We then incorporate the "Energy Density Map" with an associated depth map that is captured by a depth sensor to estimate the food energy.
arXiv Detail & Related papers (2022-08-25T15:18:48Z) - Towards the Creation of a Nutrition and Food Group Based Image Database [58.429385707376554]
We propose a framework to create a nutrition and food group based image database.
We design a protocol for linking food group based food codes in the U.S. Department of Agriculture's (USDA) Food and Nutrient Database for Dietary Studies (FNDDS)
Our proposed method is used to build a nutrition and food group based image database including 16,114 food datasets.
arXiv Detail & Related papers (2022-06-05T02:41:44Z) - An End-to-End Food Image Analysis System [8.622335099019214]
We propose an image-based food analysis framework that integrates food localization, classification and portion size estimation.
Our proposed framework is end-to-end, i.e., the input can be an arbitrary food image containing multiple food items.
Our framework is evaluated on a real life food image dataset collected from a nutrition feeding study.
arXiv Detail & Related papers (2021-02-01T05:36:20Z) - Multi-Task Image-Based Dietary Assessment for Food Recognition and
Portion Size Estimation [6.603050343996914]
We propose an end-to-end multi-task framework that can achieve both food classification and food portion size estimation.
Our results outperforms the baseline methods for both classification accuracy and mean absolute error for portion estimation.
arXiv Detail & Related papers (2020-04-27T21:35:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.