Compilers' Toolbox™ - Data Interchange

Interchange of Food Composition Data

	Updated 2017-04-30

History

Interchange of food composition data in a broader sense has taken place, if only in a simple way, ever since the first food composition tables were published. Authors have borrowed, and still borrow, data from each other, using and reusing data from different sources, often propagating errors and mistakes as a result.

In the more modern sense, food composition data interchange can be regarded as moving the data via electronic means, i.e. electronic storage facilities, networks, etc.

The most important breakthrough came in the 1980s and 1990s with the development of computerised food composition databases and the resulting possibility of electronic interchange of data.

Today, two main types of interchange can be distinguished. The first is exchange between FCDMSs (Food Composition Data Management Systems) used for the compilation and evaluation of data collections, including the supply of data stored in LIMSs (Laboratory Information Management Systems) by analytical laboratories and food manufacturers. The second is the dissemination, often publication, of evaluated data sets.

Exchange between FCDMSs should provide the potential to interchange any data capable of being stored in such a system. FCDMS managers must be able to decide what data are appropriate to exchange in a given circumstance and have the facility to tailor the overall interchange specification to that requirement. Since the overall format must be able to accommodate any FCDMS data, logical data design for the FCDMS and for interchange can be considered to be a single operation.

NORFOODS

The first report of systematic interchange of food composition table data was of work undertaken in the Nordic Countries under the auspices of NORFOODS and supported by the Nordic Council of Ministers:

Møller A.:
NORFOODS computer group. Food composition data interchange among the Nordic countries: a report.
World review of nutrition and dietetics, 1992, Vol.68, pp.104-20.

A significant portion of the work involved the mechanics of interchange appropriate to the hardware used at the time in the various Nordic countries, in practice no longer an issue. The project compared the data files available for each country's food tables and made proposals for a minimum standard for food data interchange. This used a simple approach in which the data file was separated from the documentation file to the extent that components were not identified in the data files, values being recognised by their field position. This close adherence to the layout of printed food tables might now be considered over-restrictive for general data interchange.
Indeed, at a more recent CEC-funded workshop under the Eurofoods auspices, it was suggested that it would be useful to review the NORFOODS work in the light of more recent technological developments.

INFOODS

The INFOODS Food Composition Data Interchange Handbook:

Klensin J.C.:
INFOODS Food Composition Data Interchange Handbook.
United Nations University, Tokyo 1992.

forms the basis of more recent developments in food composition data interchange, such as those in the Eurofoods and EuroFIR projects. It defines the organising principles and formats to be used when transferring data, for example between regional centres. The report also gives an overview of some basic considerations relevant to the wider adoption of the interchange format.
The INFOODS format is based on SGML (Standard Generalized Markup Language), the predecessor of XML (eXtensible Markup Language), and on a clear, unambiguous description of foods and components. The component description is based on the INFOODS publication:

Klensin J.C., Feskanich D., Lin V., Truswell A.S., Southgate D.A.T.:
Identification of Food Components for INFOODS Data Interchange.
United Nations University, Tokyo 1989.

The report proposes how the INFOODS system relates to FCDMS requirements for structuring and handling data, describes the benefits of using an SGML-based format and their relevance to FCDMS development, and recommends a coordinated approach in defining the specifications for FCDMSs and an interchange format.

International Interface Standard

The development of the International Interface Standard (IIS) for the U.S. Food and Drug Administration (Douglass et al., 1995) is closely related to data interchange in that it provides an environment into which interchange files can be imported:

Pennington J.A.T., Hendricks T.C.:
Proposal for an international interface standard for food databases
Food Additives and Contaminants 1992, Vol. 9, No. 3, 265-275.

In this US system, items can be searched on specific criteria and the associated data retrieved. Food description based on the LanguaL system provides one of the main retrieval options. The IIS system serves as a reminder that the purpose of interchange is to move usable data from one place to another. Generally, and increasingly, this is between computer facilities managing collections of data. These facilities are frequently supported by FCDMSs, which thus need the means to generate interchange files for export and to import them so that the incoming data are correctly interpreted and stored by the receiving FCDMS.

COST Action 99 - Eurofoods

The COST Action 99 - Eurofoods (1995-1999) further developed the basics of food composition data management and data interchange. An overview of data interchange formats as import/export formats was published in 1996:

Unwin I., Møller A.:
Data Interchange Formats as import/export formats for food database management systems.
Report to the National Food Agency of Denmark and the COST99 working group on food data interchange.
National Food Agency of Denmark, 1996.

The relationship between interchange formats and FCDMS was considered further in the COST Action 99 - Eurofoods project with developments of food composition data structures and data interchange proposals:

Schlotke F.:
Using Internet services to improve international food data exchange.
Food Chemistry, Vol. 57, No. 1, pp. 137-143, 1996.
Schlotke F., Becker W., Ireland J., Møller A., Ovaskainen M.-L., Monspart J., Unwin I.:
EUROFOODS Recommendations for Food Composition Database Management and Data Interchange.
Journal of Food Composition and Analysis 13, 709-744, 2000.

To promote and encourage interchange of food composition data, the COST Action 99 - Eurofoods working group on food composition data management and interchange proposes a set of recommendations for data management and interchange using electronic media:

Schlotke F., Becker W., Ireland J., Møller A., Ovaskainen M.L., Monspart J., Unwin I. (Eds.):
COST Action 99 - Eurofoods recommendations for food composition database management and data interchange.
Report No. EUR 19538, Luxembourg: Office for Official Publications of the European Communities, 2000 (79 pp.), ISBN 92-828-9757-5.

The recommendations are firmly founded on previous work done internationally by INFOODS and by national agencies and institutes as well as international standards. The recommendations include guidelines for the description of foods, components, compositional values and data sources. A sufficiently generic conceptual schema for food composition is defined to handle food composition data at various levels of aggregation and with various levels of additional descriptive information.
The recommendations also include technical issues such as file formats and media for data interchange. Furthermore, software tools are presented to assist with implementation of the recommendations.

The Eurofoods recommendations themselves are the first step in a two-step approach and include a minimum set of requirements for food composition data interchange.

The Eurofoods requirements outline the main categories of data and the further data (metadata) that describe them. The recommendations for this first step proposed a text-based interchange format and media for data transfer. A subset of the data described in the first step of the Eurofoods recommendations has been used for data documentation and data transfer by the national compilers of the ten EPIC countries in the EPIC study (European Prospective Investigation into Cancer and Nutrition) coordinated by WHO IARC, Lyon. The subset is published in:

Vignat J., Unwin I., Ireland J., Møller A., Becker W., Charrondière U.R., Skeie G., and Slimani N.:
Guideline notes for preparing and exporting food composition data according to the common formats of export files.
Version 15 September 2003 - for use by the European Food Information Resource (EuroFIR).
EPIC Nutrient DataBase (ENDB) project coordinated by WHO IARC, 8 February 2006.

The Eurofoods recommendations mention that the second step in the two-step approach could be based on the concepts of SGML (ISO standard, 1986), which the World Wide Web Consortium later developed into XML (first published as a W3C Recommendation in 1998). This approach was taken up in CEECFOODS (the sub-regional food composition network for the Central and Eastern European States) and implemented in the Alimenta and the Data Center and Data Management System software developed by FloraFood (websites/references no longer available).
The software allowed for interchange of food composition data using a so-called TransportPackage in XML. This was the first attempt to use an open standard XML format to perform food composition data interchange between partners (in this case, the CEECFOODS network) and internationally. It represented a pragmatic, feasible and working system. The TransportPackage was presented at the International Food Data Conference in Bratislava in 2001.

FAO Technical workshop on Standards for food composition data interchange

The Technical workshop on Standards for food composition data interchange, held in Rome, 19-22 January 2004, "noted that XML would be the most suitable way to interchange data but that technical limitations exist which make it difficult for some compilers, including those in North America and Europe, to convert their data into XML. It was therefore agreed that the next step towards the XML interchange would be a set of relational files with standardized tags and definitions which would then be converted into XML format guidelines. These files should therefore be regarded as collections of elements and their attributes that can be presented in a single file or as a set of relational files". This approach is similar to the two-step approach of the Eurofoods recommendations, but introduces a more elaborate, theoretical concept of definitions and relationships than that implemented in the Eurofoods scheme.

EuroFIR Network of Excellence

The EuroFIR Network of Excellence (2006-2010) was a five-year Network of Excellence funded by the European Commission's Research Directorate General under the "Food Quality and Safety Priority" of the Sixth Framework Programme for Research and Technological Development. The network involved 49 partners from universities, research institutes and small-to-medium sized enterprises from 27 European countries.
In the EuroFIR project, one of the major tasks was to set up - and implement - the EuroFIR Databank System, the EuroFIR eSearch facility, now replaced by EuroFIR FoodEXplorer.
The constructed data retrieval facilities allow users to specify foods and components, return relevant data, and provide quality measures of the retrieved data matrix. The EuroFIR specifications for a European food composition databank system are based on the actual availability of data and metadata and use and implement an updated subset of the Eurofoods recommendations. The Eurofoods recommendations form the background of definitions and concepts used in the EuroFIR specifications. Whenever possible, the EuroFIR specifications take into account international recommendations and cross-references between local, national and international concepts and entities will be given.

To underpin the EuroFIR NoE specifications, a series of documentation reports concerning data structures and interchange facilities have been published:

Becker W., Unwin I., Ireland J., Møller A.:
Proposal for structure and detail of a EuroFIR standard on food composition data.
I: Description of the standard. EuroFIR Technical Report - 2007-07-13.
Becker W., Møller A., Ireland J., Roe M., Unwin I., Pakkala H.:
Proposal for structure and detail of a EuroFIR Standard on food composition data.
II. Technical Annex - Version 2008.
EuroFIR Technical Report D1.8.19.
Danish Food Information 2008. ISBN 978-87-92125-10-1.
Møller A., Unwin I.D., Ireland J., Roe M.A, Becker W., Colombani P.:
The EuroFIR Thesauri 2008.
EuroFIR Technical Report D1.8.22.
Danish Food Information 2008.
ISBN 978-87-92125-09-5.
Møller, A., Christensen T.:
EuroFIR Web Services - Food Data Transport Package, Version 1.3.
EuroFIR Technical Report D1.8.20.
Danish Food Information 2008.
ISBN 978-87-92125-08-8.
Pakkala H., Christensen T., Gunnarsson Í., Kadvan A., Keshet B., Korhonen T., Martínez de Victoria I, Møller A., Presser K., Colombani P., Nørby E.:
EuroFIR Web Services - Specification of request-response message exchange patterns - Version 1.0.
EuroFIR Technical Report D1.8.29.
Danish Food Information 2008.
ISBN 978-87-92125-12-5.

The system, which allows easy access to and interchange of European and other food composition data, is built on state-of-the-art techniques using XML (eXtensible Markup Language) and REST/SOAP implementations to transport data between local and central servers. The resources are built on the comprehensive value documentation developed by the EuroFIR projects and facilitate the retrieval and use of information on foods, food components, calculation parameters, analytical methods, source references and other food-related topics.
For more information on EuroFIR Thesauri and Value Documentation, see Value Documentation.

In connection with the development of the EuroFIR Food Data Transport Package and Meta Data Transport Package XML templates and the EuroFIR Web Services, a series of XML schemata (schemas) were developed to support the integrity and structure of the XML template:

Pakkala, H.:
EuroFIR FDTP Schemata Documentation, version 1.1
EuroFIR 2009
Pakkala, H.:
EuroFIR MDTP Schemata Documentation, version 1.1
EuroFIR 2009

The Schemata documentation includes definitions of the special data types required in food data interchange. One example concerns values, which should not be defined as numbers (decimal or integer): a numeric type would restrict the values being interchanged to real numbers and exclude the frequently occurring "no value" (an empty string, used in EuroFIR when no Selected value (mandatory) can be assigned because extreme variability characterises the value and only a minimum and a maximum can be given) and "0-1" (a string often used as a value in nutrition labelling to indicate that the value is lower than one but greater than zero). The value type chosen by EuroFIR is "decimal-as-string", which preserves the precision of a number (its significant digits) and can at the same time carry string information, such as the empty string (no value) or a value expressed as a string ("0-1").
For more details on values in numerical databanks, see Significant Digits.
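
The idea behind a "decimal-as-string" value can be illustrated with a minimal Python sketch. It is not taken from the EuroFIR schemata; the function name `parse_decimal_as_string` and the choice to map the empty string to `None` are assumptions made for illustration only. The sketch shows why a string-based type is useful: the trailing zero of "12.50" survives, and "0-1" and the empty string are accepted rather than rejected.

```python
from decimal import Decimal, InvalidOperation
from typing import Union


def parse_decimal_as_string(raw: str) -> Union[Decimal, str, None]:
    """Interpret a value that is transported as a string (hypothetical helper).

    Returns None for the empty string ("no value"), a Decimal for a plain
    number (keeping its written precision), and the original text otherwise,
    for example the labelling value "0-1".
    """
    text = raw.strip()
    if text == "":
        return None  # assumption: "no value" is represented as None
    try:
        value = Decimal(text)
    except InvalidOperation:
        return text  # not a number, keep it as string information
    if not value.is_finite():
        return text  # "NaN" or "Infinity" are treated as plain strings here
    return value


if __name__ == "__main__":
    for raw in ["12.50", "", "0-1"]:
        print(repr(raw), "->", repr(parse_decimal_as_string(raw)))
```

A binary floating-point type would read "12.50" as 12.5 and lose the information that two decimals were reported, which is the precision issue the string-based type avoids.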

Apart from the EuroFIR thesauri mentioned above, the EuroFIR Food Data Transport Package template makes use of several other thesauri:

LanguaL™
The LanguaL™ thesaurus is used for food description. Further information on the LanguaL™ thesaurus, its documentation and software can be found on the LanguaL™ website.
ISO 3166
Codes for the representation of countries and their subdivisions. The ISO 3166/MA maintains and updates the ISO 3166 standard on country codes. EuroFIR uses the short country codes from ISO 3166-1 (the so-called alpha-2 codes). The alpha-2 codes are made available by ISO at no charge for internal use and non-commercial purposes. The standard can be found on the ISO pages, and the codes can be copied from the ISO 3166 Online Browsing Platform.
ISO 639
Codes for the representation of languages. ISO 639 consists of two parts: ISO 639-1:2002 Codes for the representation of names of languages - Part 1: Alpha-2 code / Codes pour la représentation des noms de langue - Partie 1: Code alpha-2, for which Infoterm has been designated the Registration Authority (ISO 639-1/RA), and ISO 639-2:1998 Codes for the representation of names of languages - Part 2: Alpha-3 code / Codes pour la représentation des noms de langue - Partie 2: Code alpha-3, for which the Library of Congress (LoC) functions as the Registration Authority (ISO 639-2/RA).
EuroFIR uses the ISO 639-1 short codes (the alpha-2 codes), which can be downloaded from the website of the Library of Congress' ISO 639-2 Registration Authority (authorized by ISO) in two character encodings, UTF-8 and ISO 8859-1.
To read these files, please note that one line of text contains one entry. An alpha-3 (bibliographic) code, an alpha-3 (terminologic) code (when given), an alpha-2 code (when given), an English name, and a French name of a language are all separated by pipe (|) characters. If one of these elements is not applicable to the entry, the field is left empty, i.e., a pipe (|) character immediately follows the preceding pipe. The line terminator is the LF character.
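
The file layout described above can be read with a few lines of Python. The following is a minimal sketch under stated assumptions: the names `LanguageEntry`, `parse_iso639_2_line` and `alpha2_index` are hypothetical, and the three sample lines are illustrative entries in the described five-field layout rather than a copy of the official file. Stripping a leading byte order mark is a defensive assumption, not something stated in the description.

```python
from typing import Dict, NamedTuple, Optional


class LanguageEntry(NamedTuple):
    alpha3_bibliographic: str
    alpha3_terminologic: Optional[str]  # None when the field is empty
    alpha2: Optional[str]               # None when the field is empty
    english_name: str
    french_name: str


def parse_iso639_2_line(line: str) -> LanguageEntry:
    """Split one pipe-separated entry into its five fields."""
    fields = line.rstrip("\n").split("|")
    if len(fields) != 5:
        raise ValueError(f"expected 5 pipe-separated fields, got {len(fields)}")
    bibliographic, terminologic, alpha2, english, french = fields
    # Empty fields (two adjacent pipes) are mapped to None.
    return LanguageEntry(bibliographic, terminologic or None,
                         alpha2 or None, english, french)


def alpha2_index(text: str) -> Dict[str, LanguageEntry]:
    """Index all entries that carry an alpha-2 code, keyed by that code."""
    text = text.lstrip("\ufeff")  # defensive: drop a byte order mark if present
    entries = (parse_iso639_2_line(line) for line in text.split("\n") if line)
    return {entry.alpha2: entry for entry in entries if entry.alpha2}


if __name__ == "__main__":
    sample = ("ger|deu|de|German|allemand\n"
              "eng||en|English|anglais\n"
              "ace|||Achinese|aceh\n")
    for code, entry in sorted(alpha2_index(sample).items()):
        print(code, entry.english_name)
```

Since EuroFIR uses only the alpha-2 codes, the index deliberately skips entries whose alpha-2 field is empty.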

In addition to the language itself, it is often important to distinguish between dialects of a language, e.g. British English and American English. Although the ISO 639-6 standard includes ways of indicating languages used in different countries/regions, it was decided to use the current best practice as described in the Internet Society's RFC 4646 and RFC 4647 (Tags for the Identification of Languages).

In these documents, the Internet Society describes the structure, content, construction, and semantics of language tags in a faceted approach for use in cases where it is desirable to indicate the language used in an information object in Internet applications. It also describes how to register values for use in language tags and the creation of user-defined extensions for private interchange.

The language tag consists of a primary sub-tag and a series of subsequent sub-tags, each of which narrows or refines the range of languages identified by the overall tag. It enables the user to specify, in addition to the primary language, other characteristics such as script, country, or variant.

RFC 4646 is an Internet Best Current Practice for the Internet Community and gives guidance for the use of ISO 639 codes.

RFC 4646 specifies use of a 2-character code from ISO 639-1 when it exists; when a language does not have a 2-character code assigned the 3-character code is used. Although it states that the 3-character terminology code is used in these cases where no 2-character code exists, this situation will not occur, since the only alternative codes in ISO 639-2 are for languages that already have a 2-character code.
Some (simple) examples are:

Language	Language tag

English	en
British English	en-GB
American English	en-US
German	de
German German	de-DE
Swiss German	de-CH
Austrian German	de-AT

For further information, see the full documentation in RFC 4646 and RFC 4647.
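
How such a tag decomposes into sub-tags can be sketched in Python. This is a deliberately simplified illustration, not an implementation of RFC 4646: it handles only a primary language sub-tag, an optional four-letter script and an optional region, and rejects everything else (variants, extensions, private use). The names `SimpleLanguageTag` and `parse_simple_tag` are hypothetical.

```python
from typing import NamedTuple, Optional


class SimpleLanguageTag(NamedTuple):
    language: str
    script: Optional[str]
    region: Optional[str]


def _is_ascii_alpha(text: str) -> bool:
    return text.isascii() and text.isalpha()


def parse_simple_tag(tag: str) -> SimpleLanguageTag:
    """Split a tag such as 'en-GB' into language, script and region.

    Simplified sketch: only language[-script][-region] is supported.
    """
    subtags = tag.split("-")
    language = subtags[0].lower()
    if not (_is_ascii_alpha(language) and 2 <= len(language) <= 3):
        raise ValueError(f"unsupported primary sub-tag in {tag!r}")
    rest = subtags[1:]
    script = region = None
    # A four-letter sub-tag directly after the language is a script.
    if rest and len(rest[0]) == 4 and _is_ascii_alpha(rest[0]):
        script = rest.pop(0).title()
    # A region is two letters (country code) or three digits.
    if rest and ((len(rest[0]) == 2 and _is_ascii_alpha(rest[0]))
                 or (len(rest[0]) == 3 and rest[0].isdigit())):
        region = rest.pop(0).upper()
    if rest:
        raise ValueError(f"sub-tags not handled by this sketch: {rest}")
    return SimpleLanguageTag(language, script, region)


if __name__ == "__main__":
    for tag in ["en", "en-GB", "de-CH", "de-AT"]:
        print(tag, "->", parse_simple_tag(tag))
```

The sketch normalises case in the conventional way (lowercase language, uppercase region), so "de-ch" and "de-CH" yield the same result.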

EuroFIR Nexus

The specifications for the EuroFIR Web Services developed during the EuroFIR project were further developed during the two-year project extension named EuroFIR - Nexus (2011-2013). The following reports were published:

Møller A., Christensen T.:
EuroFIR Web Services - Food Data Transport Package, Version 1.4.
EuroFIR Nexus Technical Report D2.1.
Danish Food Information 2012.
ISBN 978-87-92125-15-6.
Pakkala H., Martínez de Victoria I, Christensen T., Unwin I., Gunnarsson Í., Korhonen T., Kadvan A., Møller A., Nørby E., Presser K., Colombani P., Keshet B.:
EuroFIR Web Services - Specifications for request-response based Web services.
Version 1.2.
EuroFIR Nexus D2.8 - April 2011.

CEN Standard for food data - EN 16104:2012

As a spin-off of the EuroFIR projects, a project committee under European standardisation auspices, the CEN/TC 387 food data project committee, was initiated. CEN/TC 387 finalised its work in March 2012; the result was approved in a final vote by the CEN member countries in August 2012 and published as a European Standard, EN 16104:2012, Food data - Structure and interchange format, on 3 November 2012.

The European Standard specifies requirements on the structure and semantics of food datasets and of interchange of food data for various applications. Food data refers to information on various food properties and includes various steps in the generation and publication of such data, e.g. sampling, analysis, food description, food property and value description.
The standard regards food data as datasets covering:

identification, description and classification of foods including food ingredients,
qualitative and quantitative food properties that can be measured, calculated or estimated,
data quality values and other metadata,
specifications of methods used for obtaining these values,
references to sources for the information reported.

The standard includes requirements on:

semantics and data structure for food data,
content of referenced controlled vocabularies,
XML encoding for interchange of food data.

The standard does not include:

food description methods,
quality assessment methods,
content of controlled vocabularies, for example controlled vocabularies for nutrients, nor does it have any preferences for country or language codes; the details should be negotiated and agreed upon by the users of the standard,
database implementation.

References

Møller A.:
NORFOODS computer group. Food composition data interchange among the Nordic countries: a report.
World review of nutrition and dietetics, 1992, Vol.68, pp.104-20.
Klensin J.C.:
INFOODS Food Composition Data Interchange Handbook.
United Nations University, Tokyo 1992.
Klensin J.C., Feskanich D., Lin V., Truswell A.S., Southgate D.A.T.:
Identification of Food Components for INFOODS Data Interchange.
United Nations University, Tokyo 1989.
Pennington J.A.T., Hendricks T.C.:
Proposal for an international interface standard for food databases.
Food Additives and Contaminants 1992, Vol. 9, No. 3, 265-275.
Unwin I., Møller A.:
Data Interchange Formats as import/export formats for food database management systems.
Report to the National Food Agency of Denmark and the COST99 working group on food data interchange.
National Food Agency of Denmark, 1996.
Schlotke F.:
Using Internet services to improve international food data exchange.
Food Chemistry, Vol. 57, No. 1, pp. 137-143, 1996.
Schlotke F., Becker W., Ireland J., Møller A., Ovaskainen M.-L., Monspart J., Unwin I.:
EUROFOODS Recommendations for Food Composition Database Management and Data Interchange.
Journal of Food Composition and Analysis 13, 709-744, 2000.
Schlotke F., Becker W., Ireland J., Møller A., Ovaskainen M.L., Monspart J., Unwin I. (Eds.):
COST Action 99 - Eurofoods recommendations for food composition database management and data interchange.
Report No. EUR 19538, Luxembourg: Office for Official Publications of the European Communities, 2000 (79 pp.), ISBN 92-828-9757-5.
Vignat J., Unwin I., Ireland J., Møller A., Becker W., Charrondière U.R., Skeie G., and Slimani N.:
Guideline notes for preparing and exporting food composition data according to the common formats of export files.
Version 15 September 2003 - for use by the European Food Information Resource (EuroFIR).
EPIC Nutrient DataBase (ENDB) project coordinated by WHO IARC, 8 February 2006.
Becker W., Unwin I., Ireland J., Møller A.:
Proposal for structure and detail of a EuroFIR standard on food composition data.
I: Description of the standard. EuroFIR Technical Report - 2007-07-13.
Becker W., Møller A., Ireland J., Roe M., Unwin I., Pakkala H.:
Proposal for structure and detail of a EuroFIR Standard on food composition data.
II. Technical Annex - Version 2008.
EuroFIR Technical Report D1.8.19.
Danish Food Information 2008. ISBN 978-87-92125-10-1.
Møller A., Unwin I.D., Ireland J., Roe M.A, Becker W., Colombani P.:
The EuroFIR Thesauri 2008.
EuroFIR Technical Report D1.8.22.
Danish Food Information 2008.
ISBN 978-87-92125-09-5.
Møller, A., Christensen T.:
EuroFIR Web Services - Food Data Transport Package, Version 1.3.
EuroFIR Technical Report D1.8.20.
Danish Food Information 2008.
ISBN 978-87-92125-08-8.
Pakkala H., Christensen T., Gunnarsson Í., Kadvan A., Keshet B., Korhonen T., Martínez de Victoria I, Møller A., Presser K., Colombani P., Nørby E.:
EuroFIR Web Services - Specification of request-response message exchange patterns - Version 1.0.
EuroFIR Technical Report D1.8.29.
Danish Food Information 2008.
ISBN 978-87-92125-12-5.
Pakkala, H.:
EuroFIR FDTP Schemata Documentation, version 1.1
EuroFIR 2009
Pakkala, H.:
EuroFIR MDTP Schemata Documentation, version 1.1
EuroFIR 2009
Møller A., Christensen T.:
EuroFIR Web Services - Food Data Transport Package, Version 1.4.
EuroFIR Nexus Technical Report D2.1.
Danish Food Information 2012.
ISBN 978-87-92125-15-6.
Pakkala H., Martínez de Victoria I, Christensen T., Unwin I., Gunnarsson Í., Korhonen T., Kadvan A., Møller A., Nørby E., Presser K., Colombani P., Keshet B.:
EuroFIR Web Services - Specifications for request-response based Web services.
Version 1.2.
EuroFIR Nexus D2.8 - April 2011.
European Committee for Standardization:
European Standard - Food data - Structure and interchange format.
EN 16104:2012, November 2012.
Library of Congress, ISO 639-2 Registration Authority:
Codes for the Representation of Names of Languages, ISO 639-2.
Library of Congress, Network Development and MARC Standards Office, Washington, DC 20540-4402, USA.
International Organization for Standardization (ISO):
Online Browsing Platform (OBP) - ISO 3166 Country codes.
International Organization for Standardization, Geneva, Switzerland.


	© 2025 Anders Møller, Danish Food Informatics
