SoFAIR
Making Software FAIR: A machine-assisted workflow for the research software lifecycle
Project main objective
A key issue hindering the discoverability, attribution and reusability of open research software is that its existence often remains hidden within the text of research papers. For these resources to become first-class bibliographic records, they must first be identified and then registered with persistent identifiers (PIDs) so that they can be made FAIR (Findable, Accessible, Interoperable and Reusable).
This project will extend the capabilities of critical and widely used open scholarly infrastructures (CORE, Software Heritage, HAL) and tools (GROBID) operated by the consortium partners, delivering and deploying an effective solution for the management of the research software lifecycle.
We are working with
Petr Knoth – Founder & Head of CORE
The key innovations of the project are:
SoFAIR will focus precisely on these two main issues by extending the training data and the Softcite models to new domains and by experimenting with recent supervised machine learning techniques for entity disambiguation, in particular using graph-based similarity techniques for entity matching/alignment.
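To make the idea of graph-based similarity for entity matching more concrete, here is a minimal sketch in Python. It is not the SoFAIR or Softcite implementation: it assumes each software mention is a node linked to hypothetical context attributes (a URL, a programming language, a task), and it uses plain, unsupervised Jaccard overlap of neighbour sets in place of the supervised techniques the project plans to explore. The names `jaccard`, `match_mentions`, the toy `mentions` data and the `0.5` threshold are all illustrative assumptions.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two neighbour sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def match_mentions(graph, threshold=0.5):
    """Return (mention, mention, score) for pairs whose neighbour sets overlap enough.

    `graph` maps each mention string to the set of attribute nodes it is
    linked to; two mentions are treated as candidate matches when they
    share a large enough fraction of their neighbours.
    """
    pairs = []
    for m1, m2 in combinations(sorted(graph), 2):
        score = jaccard(graph[m1], graph[m2])
        if score >= threshold:
            pairs.append((m1, m2, round(score, 2)))
    return pairs


# Hypothetical toy data: mention -> attributes observed in its context.
mentions = {
    "GROBID": {"url:example.org/grobid", "lang:Java", "task:pdf-extraction"},
    "Grobid tool": {"url:example.org/grobid", "lang:Java"},
    "scikit-learn": {"lang:Python", "task:machine-learning"},
}

for m1, m2, score in match_mentions(mentions):
    print(f"{m1!r} and {m2!r} look like the same software (score {score})")
```

In this toy example the two GROBID spellings share two of their three distinct attributes and are paired, while `scikit-learn` shares none and is left unmatched. A real pipeline would presumably learn the similarity function and threshold from annotated data rather than fix them by hand.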