In 2016, the “FAIR Guiding Principles for scientific data management and stewardship” were published in Scientific Data. The authors intended to provide guidelines to improve the Findability, Accessibility, Interoperability, and Reuse of digital assets. The principles emphasise machine-actionability (i.e., the capacity of computational systems to find, access, interoperate, and reuse data with none or minimal human intervention) because humans increasingly rely on computational support to deal with data as a result of the increase in volume, complexity, and creation speed of data.
The FAIR Data Principles make data more valuable as it is easier to find through unique identifiers and easier to combine and integrate thanks to the formal shared knowledge representation. Such data is easier to reuse, repurpose and share because machines have the means to understand where data comes from and what it is about. It also accelerates research, boosts cooperation, and facilitates reuse in scientific research. Policymakers and stakeholders have seen its value in driving innovation, and many have embraced these principles.
The four foundational principles:
Findable – The first step in (re)using data is to find them. Metadata and data should be easy to find for both humans and computers. Machine-readable metadata is essential for the automatic discovery of datasets and services;
Accessible – Once the user finds the required data, they need to know how they can be accessed, possibly including authentication and authorisation;
Interoperable – The data usually need to be integrated with other data. In addition, the data need to interoperate with applications or workflows for analysis, storage, and processing;
Reusable – The ultimate goal of FAIR is to optimise the reuse of data. To achieve this, metadata and data should be well-described so that they can be replicated and/or combined in different settings.
- support knowledge discovery and innovation
- support data and knowledge integration
- promote sharing and reuse of data
- are discipline independent and allow for differences in disciplines
- move beyond high-level guidance, containing detailed advice on activities that can be undertaken to make data more FAIR
- help data and metadata to be ‘machine readable’, supporting new discoveries through the harvest and analysis of multiple datasets.