Semi-Automated Construction of Food Composition Knowledge Base
- URL: http://arxiv.org/abs/2301.11322v1
- Date: Tue, 24 Jan 2023 22:08:49 GMT
- Title: Semi-Automated Construction of Food Composition Knowledge Base
- Authors: Jason Youn, Fangzhou Li, Ilias Tagkopoulos
- Abstract summary: We propose a semi-automated framework for constructing a knowledge base of food composition from the scientific literature available online.
Our work demonstrates how human-in-the-loop models are a step toward AI-assisted food systems that scale well to the ever-increasing big data.
- Score: 0.06445605125467573
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: A food composition knowledge base, which stores the essential phyto-, micro-,
and macro-nutrients of foods is useful for both research and industrial
applications. Although many existing knowledge bases attempt to curate such
information, they are often limited by time-consuming manual curation
processes. Outside of the food science domain, natural language processing
methods that utilize pre-trained language models have recently shown promising
results for extracting knowledge from unstructured text. In this work, we
propose a semi-automated framework for constructing a knowledge base of food
composition from the scientific literature available online. To this end, we
utilize a pre-trained BioBERT language model in an active learning setup that
allows the optimal use of limited training data. Our work demonstrates how
human-in-the-loop models are a step toward AI-assisted food systems that scale
well to the ever-increasing big data.
Related papers
- Building FKG.in: a Knowledge Graph for Indian Food [2.339288371903242]
We build an automated system for assimilating culinary information for Indian food in the form of a knowledge graph.
We present a novel workflow that uses AI, LLM, and language technology to curate information from recipe blog sites in the public domain.
The design is application-agnostic and can be used for AI-driven smart analysis, building recommendation systems for Personalized Digital Health.
arXiv Detail & Related papers (2024-09-01T20:18:36Z) - Computer Vision in the Food Industry: Accurate, Real-time, and Automatic Food Recognition with Pretrained MobileNetV2 [1.6590638305972631]
This study employs the pretrained MobileNetV2 model, which is efficient and fast, for food recognition on the public Food11 dataset, comprising 16643 images.
It also utilizes various techniques such as dataset understanding, transfer learning, data augmentation, regularization, dynamic learning rate, hyper parameter tuning, and consideration of images in different sizes to enhance performance and robustness.
Despite employing a light model with a simpler structure and fewer trainable parameters compared to some deep and dense models in the deep learning area, it achieved commendable accuracy in a short time.
arXiv Detail & Related papers (2024-05-19T17:20:20Z) - Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research [139.69207791947738]
Dolma is a three-trillion-token English corpus built from a diverse mixture of web content, scientific papers, code, public-domain books, social media, and encyclopedic materials.
We document Dolma, including its design principles, details about its construction, and a summary of its contents.
We present analyses and experimental results on intermediate states of Dolma to share what we have learned about important data curation practices.
arXiv Detail & Related papers (2024-01-31T20:29:50Z) - FoodLMM: A Versatile Food Assistant using Large Multi-modal Model [96.76271649854542]
Large Multi-modal Models (LMMs) have made impressive progress in many vision-language tasks.
This paper proposes FoodLMM, a versatile food assistant based on LMMs with various capabilities.
We introduce a series of novel task-specific tokens and heads, enabling the model to predict food nutritional values and multiple segmentation masks.
arXiv Detail & Related papers (2023-12-22T11:56:22Z) - Diversifying Knowledge Enhancement of Biomedical Language Models using
Adapter Modules and Knowledge Graphs [54.223394825528665]
We develop an approach that uses lightweight adapter modules to inject structured biomedical knowledge into pre-trained language models.
We use two large KGs, the biomedical knowledge system UMLS and the novel biochemical OntoChem, with two prominent biomedical PLMs, PubMedBERT and BioLinkBERT.
We show that our methodology leads to performance improvements in several instances while keeping requirements in computing power low.
arXiv Detail & Related papers (2023-12-21T14:26:57Z) - FIRE: Food Image to REcipe generation [10.45344523054623]
Food computing aims to develop end-to-end intelligent systems capable of autonomously producing recipe information for a food image.
This paper proposes FIRE, a novel methodology tailored to recipe generation in the food computing domain.
We showcase two practical applications that can benefit from integrating FIRE with large language model prompting.
arXiv Detail & Related papers (2023-08-28T08:14:20Z) - Cook-Gen: Robust Generative Modeling of Cooking Actions from Recipes [6.666528076345153]
Food computation models have become increasingly popular in assisting people in maintaining healthy eating habits.
In this study, we explore the use of generative AI methods to extend current food computation models to include cooking actions.
We propose novel aggregation-based generative AI methods, Cook-Gen, that reliably generate cooking actions from recipes.
arXiv Detail & Related papers (2023-06-01T18:49:47Z) - Learning Structural Representations for Recipe Generation and Food
Retrieval [101.97397967958722]
We propose a novel framework of Structure-aware Generation Network (SGN) to tackle the food recipe generation task.
Our proposed model can produce high-quality and coherent recipes, and achieve the state-of-the-art performance on the benchmark Recipe1M dataset.
arXiv Detail & Related papers (2021-10-04T06:36:31Z) - Online Continual Learning For Visual Food Classification [7.704949298975352]
Existing methods require static datasets for training and are not capable of learning from sequentially available new food images.
We introduce a novel clustering based exemplar selection algorithm to store the most representative data belonging to each learned food.
Our results show significant improvements compared with existing state-of-the-art online continual learning methods.
arXiv Detail & Related papers (2021-08-15T17:48:03Z) - Towards Building a Food Knowledge Graph for Internet of Food [66.57235827087092]
We review the evolution of food knowledge organization, from food classification to food to food knowledge graphs.
Food knowledge graphs play an important role in food search and Question Answering (QA), personalized dietary recommendation, food analysis and visualization.
Future directions for food knowledge graphs cover several fields such as multimodal food knowledge graphs and food intelligence.
arXiv Detail & Related papers (2021-07-13T06:26:53Z) - Structure-Aware Generation Network for Recipe Generation from Images [142.047662926209]
We investigate an open research task of generating cooking instructions based on only food images and ingredients.
Target recipes are long-length paragraphs and do not have annotations on structure information.
We propose a novel framework of Structure-aware Generation Network (SGN) to tackle the food recipe generation task.
arXiv Detail & Related papers (2020-09-02T10:54:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.