Creation of Food Specific Ontologies for Food Focused Text Mining

Capitalizing on the growth of scientific knowledge on food

Image: Food at Noordermarkt in Amsterdam, Kotomi Creations (CC License)

There is ample research in the West on foods and diet patterns that counter obesity, a medical condition that has been formally recognized as a global epidemic. A huge variety of diets has been developed to prevent obesity, which is also a growing problem in Asia. Wouldn’t it be great if we could translate those diets and recipes to Asian cuisine? What if this knowledge about food could be shared more easily, for example through a database showing which ingredients available in Asia could replace ingredients from the West? Scientific knowledge on food is growing quickly, but we could capitalize on the available data more efficiently.

Scientific publications contain a wealth of unexplored and unstructured data. If we could rapidly search this sea of knowledge, from articles to patents and blogs, for relevant insights, it would accelerate the research process tremendously. What is more, text mining bridges the boundaries between domains, so that hidden connections may emerge. Mining and cross-linking information from the food domain with information from other domains and sources can provide valuable insight into the physiology of organisms, explain experimental data or lead to new hypotheses.

Technological developments in life science research have led to a vast increase in the data available in public and proprietary databases. To capitalize efficiently on these data, dedicated vocabularies and algorithms are necessary for annotating, searching, filtering and integrating data from various sources. Although a number of generic knowledge discovery and knowledge management (KDKM) and text mining (TM) tools exist, their application in life science areas, and in food research in particular, is limited. One reason is the absence of structured vocabularies tailored to specific applications in food research.
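To make the role of a structured vocabulary in annotation concrete, here is a minimal sketch of dictionary-based annotation in Python. The concept IDs, the terms and the `annotate` function are hypothetical and chosen only for illustration; they are not taken from the project's ontologies or tools, which may work quite differently.

```python
import re
from dataclasses import dataclass

# Hypothetical mini-vocabulary: concept ID -> accepted terms (synonyms).
# IDs and terms are illustrative only, not the project's actual ontology.
VOCABULARY = {
    "FOOD:0001": ["yoghurt", "yogurt"],
    "FOOD:0002": ["soy sauce", "soya sauce"],
    "ORG:0001": ["lactobacillus plantarum", "l. plantarum"],
}


@dataclass(frozen=True)
class Annotation:
    concept_id: str
    start: int
    end: int
    text: str


def build_pattern(vocabulary):
    """Compile one case-insensitive regex that matches any vocabulary term."""
    term_to_id = {term: cid for cid, terms in vocabulary.items() for term in terms}
    # Longest terms first, so multi-word terms win over shorter overlapping ones.
    alternatives = sorted(term_to_id, key=len, reverse=True)
    pattern = re.compile(
        r"(?<!\w)(" + "|".join(re.escape(t) for t in alternatives) + r")(?!\w)",
        re.IGNORECASE,
    )
    return pattern, term_to_id


def annotate(text, vocabulary=VOCABULARY):
    """Return an Annotation for every vocabulary term found in the text."""
    pattern, term_to_id = build_pattern(vocabulary)
    return [
        Annotation(term_to_id[m.group(1).lower()], m.start(1), m.end(1), m.group(1))
        for m in pattern.finditer(text)
    ]


EXAMPLE_ANNOTATIONS = annotate("Yogurt fermented with L. plantarum and soy sauce.")
```

Because synonyms map to the same concept ID, texts that use "yogurt" and "yoghurt" end up tagged with the same concept, which is what makes searching and integrating across sources possible.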

In this research project, structured vocabularies covering the food domain are developed. These vocabularies will be incorporated into existing KDKM and TM tools to link potentially related research findings. Using these vocabularies, insights can be generated into, for example, the function of bacteria and other organisms involved in food processing. Furthermore, hidden relations can be identified that might lead to a better understanding of how processes work or to improved products. These relations can be used to generate hypotheses addressing important areas in food research.
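One simple way to surface candidate relations between organisms and food concepts is to count how often they are annotated in the same document. The sketch below assumes this co-occurrence approach purely for illustration; the project description does not specify its relation-finding algorithm, and the annotated documents and function names here are made up.

```python
from collections import Counter
from itertools import product

# Hypothetical annotated corpus: each document lists the organism and food
# concepts an annotator found in it. All entries are made up for illustration.
ANNOTATED_DOCS = [
    {"organisms": {"Lactobacillus plantarum"}, "food": {"sauerkraut", "fermentation"}},
    {"organisms": {"Lactobacillus plantarum"}, "food": {"fermentation", "sourdough"}},
    {"organisms": {"Streptococcus thermophilus"}, "food": {"yoghurt", "fermentation"}},
]


def cooccurrence_counts(docs):
    """Count in how many documents each (organism, food concept) pair co-occurs."""
    counts = Counter()
    for doc in docs:
        for pair in product(doc["organisms"], doc["food"]):
            counts[pair] += 1
    return counts


def top_food_concepts(counts, organism, n=3):
    """Return the n food concepts most often seen together with an organism."""
    ranked = sorted(
        ((food, c) for (org, food), c in counts.items() if org == organism),
        key=lambda item: (-item[1], item[0]),  # by count, then alphabetically
    )
    return ranked[:n]


COUNTS = cooccurrence_counts(ANNOTATED_DOCS)
```

Raw counts favour frequent concepts, so a real analysis would likely normalize them (for example with an association measure) before treating a pair as a hypothesis worth testing.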

The ontologies and related (web) services will be evaluated in two ways. Firstly, the ontologies and associated services will be validated by measuring the quality of semi-automatic annotations and by demonstrating improved integration of food research data. Secondly, the hypotheses generated with these computational methods will be validated in experiments in which the effects of probiotics and nutraceuticals are measured in in vitro and in vivo models for health. Visualizations of terms that often co-occur with bacteria already summarize the main applications of those bacteria at a single mouse click. Introducing food concepts enables us to tag those terms, enriching the set of terms with useful concepts and enabling the discovery of new relations between bacteria and concepts.
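Annotation quality is commonly expressed as precision, recall and F1 against a manually curated gold standard. The sketch below assumes that setup; the project description does not state which metrics were used, and the document IDs, concept IDs and function name are hypothetical.

```python
def precision_recall_f1(predicted, gold):
    """Compare predicted annotations with a gold standard.

    Both arguments are collections of hashable annotations, here
    (document ID, concept ID) pairs.
    """
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical gold-standard and tool-generated annotations (illustrative IDs).
GOLD = {
    ("doc1", "FOOD:0001"),
    ("doc1", "ORG:0001"),
    ("doc2", "FOOD:0002"),
    ("doc2", "FOOD:0003"),
}
PREDICTED = {
    ("doc1", "FOOD:0001"),
    ("doc1", "ORG:0001"),
    ("doc2", "FOOD:0004"),
}

PRECISION, RECALL, F1 = precision_recall_f1(PREDICTED, GOLD)
```

In this made-up example two of the three predicted annotations are correct and two of the four gold annotations are found, so precision is 2/3 and recall is 1/2.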

Participating organisations

NIZO
Netherlands eScience Center
Radboud University Medical Center
Radboud University Nijmegen
Vrije Universiteit Amsterdam
Leiden University
Life Sciences
Wageningen University & Research

Impact

Output

  • 1.
    Published in 2017
  • 2.
    Author(s): Wilco Fleuren
    Published by [s.n.] ; UB Nijmegen [host]
  • 1.
    Published in 2017
  • 2.
    Published in 2017
  • 3.
    Author(s): Wynand Alkema
    Published in 2015
  • 4.
    Author(s): Jan Top
    Published in 2015
  • 5.
    Published in 2015
  • 6.
    Published in 2015
  • 7.
    Published in 2015
  • 8.
    Author(s): Wynand Alkema
    Published in 2015
  • 9.
    Published in 2013
  • 10.
    Author(s): Wynand Alkema
    Published in 2013
  • 11.
    Author(s): Corrado Boscarino
    Published in 2013
  • 12.
    Published in 2013
  • 13.
    Author(s): Wilco Fleuren, Marijn Sanders, Nicole J.J.P. Koenderink
  • 14.
    Author(s): Wynand Alkema
  • 1.
    Published in 2017

Team

Wynand Alkema
Principal investigator
Radboud Universiteit Nijmegen
Jan Top
Co-Applicant
Vrije Universiteit Amsterdam
Wilco W.M. Fleuren
PhD student
Radboud Universiteit Nijmegen
Jason Maassen
eScience Coordinator
Netherlands eScience Center
Stefan Verhoeven
Senior eScience Research Engineer
Netherlands eScience Center
Milena Ivanova
RSE
Netherlands eScience Center
Marijn Sanders
RSE
Netherlands eScience Center
Scott Lusher
Advisor
Netherlands eScience Center
Lars Ridder

Related projects

3D-e-Chem

Efficient exploitation of the massive amount of modern-day life science data

Finished

candYgene

Prediction of candidate genes for traits using interoperable genome annotations

Finished

Chemical Analytics Platform

Managing and exploiting growing data resources in chemical design

Finished

VLPB

The Virtual Laboratory for Plant Breeding

Finished