New software helps assess interoperability and reusability of biomedical digital research objects

Mount Sinai investigators, together with an international consortium of colleagues, have developed a new toolbox to allow scientists to assess how easily the digital objects they generate as a result of their research projects can be used by other research teams.

The findings, reported today in Cell Systems, apply to digital objects including datasets, software, source code, and other products that scientists develop that may be useful to the research community.

The goal is to upgrade the way we are managing data in biomedical research-;sharing and reusing data from these projects to maximally extract value . While funders, researchers, and other stakeholders recognized that we need to do a better job with data management in biomedical research, the path toward doing so wasn't clear."

Avi Ma'ayan, PhD

Ma'ayan is the Director of the Mount Sinai Center for Bioinformatics, and senior researcher on the study.

The team hopes that as biomedical researchers apply the toolbox to evaluate their work, they'll become more aware of simple steps they can take to improve the usability of their digital objects.

In 2017 and 2018 the National Institutes of Health (NIH) funded the NIH Data Commons Pilot Phase Consortium, which worked to find ways to help improve the sharing of digital research objects through a cloud-based environment.

To implement such a virtual environment, researchers needed to develop standards and a platform to help scientists adhere to the FAIR (findable, accessible, interoperable, and reusable) guidelines which were previously established by members of the consortium.

First, the team developed metrics and rubrics to assess FAIRness, but recognized that these metrics and rubrics may not apply equally to all research domains. So they created the toolbox, called FAIRshake, which allows those creating digital objects not only to assess their FAIRness but also to develop new standards to define and evaluate FAIRness.

Thus, the researchers stated that "FAIRshake allows for the coexistence of multiple metrics and rubrics, enabling the community to develop standards more democratically."

Dr. Ma'ayan emphasizes that research does not need to be open-access in order to be FAIR. "It just requires people to provide a clear license of how others should use their data," he said.

The FAIRshake toolbox includes both manual and automated tools for assessing FAIRness, which ask questions such as, "Does the digital object have machine-readable metadata?" The toolbox has been tested over the past year on thousands of digital objects and the team has posted tutorials on Youtube about how to use FAIRshake.

While Dr. Ma'ayan says that there has been a "natural improvement of accessibility and adoption of those standards even without any push from us or from NIH," he also explains that the team was surprised by some of the trends they noticed when performing assessments with the toolkit.

He said that few researchers implemented application programming interfaces (APIs)-;software that allows different computer programs to communicate-;which are "critical for bringing data together." Researchers also frequently failed to provide licenses to tell other scientists how to get permission to use the data.

As research products become more FAIR, scientists will be able to make connections and build things that were not possible before using tools like machine learning to "transform the way we are reusing data," says Dr. Ma'ayan.


The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News-Medical.Net.
Post a new comment
You might also like... ×
New collaborative effort focuses on therapeutic research for CDKL5 Deficiency Disorder