Project main objective

SoFAIR

Making Software FAIR: A machine-assisted workflow for the research software lifecycle

A key issue hindering the discoverability, attribution and reusability of open research software is that its existence often remains hidden within the manuscripts of research papers. For these resources to become first-class bibliographic records, they first need to be identified and then registered with persistent identifiers (PIDs), which makes them FAIR (Findable, Accessible, Interoperable and Reusable).

This project will extend the capabilities of critical and widely used open scholarly infrastructures (CORE, Software Heritage, HAL) and tools (GROBID) operated by the consortium partners, delivering and deploying an effective solution for the management of the research software lifecycle.

We are working with full text documents, metadata records and repositories.

To incentivise good practices of software assets curation, we need to treat research software as first-class bibliographic records.

Petr Knoth – Founder & Head of CORE

The key innovations of the project are:

SoFAIR will focus precisely on these two main issues by extending the training data and the Softcite models to new domains and by experimenting with recent supervised machine learning techniques for entity disambiguation, in particular using graph-based similarity techniques for entity matching/alignment.