The integration and fusion of data and metadata in life and earth sciences calls for the proposal of data and knowledge representations to structure the diverse information collected and produced for/within an experimental framework. Data lakes appear to be a relevant solution for managing and making this diversity of data available. Metadata models need to be devised to connect the data, and appropriate organisation and exploration mechanisms relevant to the context of life and earth sciences need to be devised.
LETITIA aims to build a data lake for curating and exploring experimental data in the life and earth sciences.
Background
This project relies on previous research by the LIRIS group on data-driven geosciences funded by an IEA CNRS action named ADAGEO (2021-2022 https://adageo.github.io). Through the project’s results addressing data curation aspects, we realized the need for storing diverse metadata and data and, thus, the need for a data lake solution. The FIL project will allow funding internships for developing solutions and dissemination activities of the results. The external collaboration with the UFRN (Geosciences Department and Computing Science Department) and the UFPR (Center for Marine Studies and Computing Science Department) will provide a concrete experimental context. Besides, the qualitative curation aspects of the experiments will be integrated into the lake, relying on the current work done with Universidad de la República in Uruguay. Through projects in Brazil and Uruguay, colleagues will visit Lyon and work with the labs’ ERIC and LIRIS. The synergy of the complementary expertise of partners (LIRIS, ERIC and external) will be the key to the project’s success.