<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <generator uri="https://jekyllrb.com/" version="4.3.4">Jekyll</generator>
  <link href="https://index.biohackrxiv.org//feed.xml" rel="self" type="application/atom+xml"/>
  <link href="https://index.biohackrxiv.org/" rel="alternate" type="text/html"/>
  <updated>2026-04-10T13:10:20+00:00</updated>
  <id>https://index.biohackrxiv.org//archive.xml</id>
  <title type="html">BioHackrXiv Preprints</title>
  <subtitle>Preprints for BioHackathons</subtitle>
  <author>
    <name>BioHackrXiv</name>
    <uri>https://biohackrxiv.org/</uri>
  </author>

  
  <entry>
    <title type="html">Minimal information standardization of phenomic experimental data in animals</title>
    <link href="https://index.biohackrxiv.org//2026/04/10/ncrkm.html" rel="alternate" type="text/html" title="Minimal information standardization of phenomic experimental data in animals"/>
    <published>2026-04-10T00:00:00+00:00</published>
    <updated>2026-04-10T00:00:00+00:00</updated>
    <id>https://doi.org/10.37044/osf.io/ncrkm_v1</id>
    <content type="html" xml:base="https://index.biohackrxiv.org//2026/04/10/ncrkm.html">
      <![CDATA[ <p>The current landscape of animal phenomics is characterised by a substantial lack of standardisation, hindering data reuse, reproducibility, and interoperability across
studies, all of which are particularly important in light of the 3Rs principles for animal experiments (replace, reduce, refine). Within ELIXIR, the Domestic Animals
Genome and Phenome Focus Group emerged to establish standardised practices that enhance the quality and interoperability of animal research data. In this context, the
ISA model presents a robust, domain-agnostic framework well-established in the life sciences for describing experimental metadata. Notably, other scientific communities,
such as the ELIXIR Plant and Metabolomics Communities (MIAPPE, PhenoMeNal), have successfully leveraged the ISA model to improve the consistency and usability of their
metadata. Our project aims to develop a minimal information checklist tailored specifically for phenomics, facilitating the integration of diverse datasets, including
recirculation systems in agriculture, and fostering collaborative research efforts. We will focus on various goals.Identifying essential aspects of animal phenotyping,
informed by existing frameworks and community input. We aim to produce a concise and practical checklist that can be readily adopted by researchers, and promote a
culture of standardisation.Mapping the checklist to the ISA model ensures alignment with established standards, promotes interoperability and facilitates data reuse
while improving the overall quality of research outputs. Adopting existing ISA tools streamlines the implementation of our metadata checklist, providing user-friendly
interfaces for researchers to manage, document, and share animal phenotyping data efficiently.</p>


      <h4>References</h4>
      <ul>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Sarah Oranna Fischer-Zielke</name><uri>https://orcid.org/0000-0002-6218-7275</uri></author>
    
      <author><name>Rica Johanna Rehfeld</name><uri>https://orcid.org/0009-0007-3289-5872</uri></author>
    
      <author><name>McKinley Santiago</name><uri>https://orcid.org/0009-0009-1160-5041</uri></author>
    
      <author><name>Emily Clark</name><uri>https://orcid.org/0000-0002-9550-7407</uri></author>
    
      <author><name>Daniel Arend</name><uri>https://orcid.org/0000-0002-2455-5938</uri></author>
    
      <author><name>Manuel Feser</name><uri>https://orcid.org/0000-0001-6546-1818</uri></author>
    
    <category term="BioHackEU25"/>
    
    <summary type="html" xml:base="https://index.biohackrxiv.org//2026/04/10/ncrkm.html">
      <![CDATA[ The current landscape of animal phenomics is characterised by a substantial lack of standardisation, hindering data reuse, reproducibility, and interoperability across studies, all of which are particularly important in light of the 3Rs principles for animal experiments (replace, reduce, refine). Within ELIXIR, the Domestic Animals Genome and Phenome Focus Group emerged to establish standardised practices that enhance the quality and interoperability of animal research data. In this context, the ISA model presents a robust, domain-agnostic framework well-established in the life sciences for describing experimental metadata. Notably, other scientific communities, such as the ELIXIR Plant and Metabolomics Communities (MIAPPE, PhenoMeNal), have successfully leveraged the ISA model to improve the consistency and usability of their metadata. Our project aims to develop a minimal information checklist tailored specifically for phenomics, facilitating the integration of diverse datasets, including recirculation systems in agriculture, and fostering collaborative research efforts. We will focus on various goals.Identifying essential aspects of animal phenotyping, informed by existing frameworks and community input. We aim to produce a concise and practical checklist that can be readily adopted by researchers, and promote a culture of standardisation.Mapping the checklist to the ISA model ensures alignment with established standards, promotes interoperability and facilitates data reuse while improving the overall quality of research outputs. Adopting existing ISA tools streamlines the implementation of our metadata checklist, providing user-friendly interfaces for researchers to manage, document, and share animal phenotyping data efficiently. ]]>
    </summary></entry>
  
  <entry>
    <title type="html">Evolving FAIR Image Analysis in Galaxy for Cross-domain and AI-ready Applications</title>
    <link href="https://index.biohackrxiv.org//2026/03/31/tsxby.html" rel="alternate" type="text/html" title="Evolving FAIR Image Analysis in Galaxy for Cross-domain and AI-ready Applications"/>
    <published>2026-03-31T00:00:00+00:00</published>
    <updated>2026-03-31T00:00:00+00:00</updated>
    <id>https://doi.org/10.37044/osf.io/tsxby_v1</id>
    <content type="html" xml:base="https://index.biohackrxiv.org//2026/03/31/tsxby.html">
      <![CDATA[ <p>The increasing adoption of image-based technologies across life sciences, environmental research, and related domains has increased the demand for
interoperable, reproducible, and FAIR-compliant image analysis infrastructures. At ELIXIR BioHackathon Europe 2025, Project 9, “Evolving FAIR Image
Analysis in Galaxy for Cross-domain and AI-ready Applications”, addressed these challenges by enhancing the Galaxy platform for bioimage analysis
with a focus on semantic interoperability, content-based reproducibility validation, and user-centered onboarding tutorials.To advance semantic
interoperability, we developed a curated vocabulary based on the EDAM Bioimaging ontology, which was applied to annotate tutorials on the Galaxy
Training Network, improving discoverability and aligning with evolving community standards. For reproducibility and AI-readiness, we integrated
the International Standard Content Code (ISCC) via the ISCC-SUM tool suite, enabling format-independent content-based validation, dataset
deduplication, and assessment of data similarity for robust model training. Finally, usability improvements included a comprehensive onboarding
tutorial for newcomers, enhanced integration with OMERO and BioImage Archive, and generally improved tool interoperability, including support for
GeoJSON-based spatial annotations. Collectively, these developments establish a scalable, cross-domain image analysis framework within Galaxy,
promoting FAIR-aligned practices while enabling reproducible and AI-ready workflows.</p>

      <h4>References</h4>
      <ul>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Diana Chiang</name><uri>https://orcid.org/0000-0002-5857-1477</uri></author>
    
      <author><name>Pavankumar Videm</name><uri>https://orcid.org/0000-0002-5192-126X</uri></author>
    
      <author><name>David Lopez Tabernero</name><uri>https://orcid.org/0000-0002-9541-3961</uri></author>
    
      <author><name>Maarten W. Paul</name><uri>https://orcid.org/0000-0002-7990-6010</uri></author>
    
      <author><name>Alireza Heidari</name><uri>https://orcid.org/0000-0003-0315-4403</uri></author>
    
      <author><name>Martin Etzrodt</name><uri>https://orcid.org/0000-0003-1928-3904</uri></author>
    
      <author><name>Beatriz Serrano-Solano</name><uri>https://orcid.org/0000-0002-5862-6132</uri></author>
    
      <author><name>Leonid Kostrykin</name><uri>https://orcid.org/0000-0003-1323-3762</uri></author>
    
    <category term="BioHackEU25"/>
    
    <summary type="html" xml:base="https://index.biohackrxiv.org//2026/03/31/tsxby.html">
      <![CDATA[ The increasing adoption of image-based technologies across life sciences, environmental research, and related domains has increased the demand for interoperable, reproducible, and FAIR-compliant image analysis infrastructures. At ELIXIR BioHackathon Europe 2025, Project 9, “Evolving FAIR Image Analysis in Galaxy for Cross-domain and AI-ready Applications”, addressed these challenges by enhancing the Galaxy platform for bioimage analysis with a focus on semantic interoperability, content-based reproducibility validation, and user-centered onboarding tutorials.To advance semantic interoperability, we developed a curated vocabulary based on the EDAM Bioimaging ontology, which was applied to annotate tutorials on the Galaxy Training Network, improving discoverability and aligning with evolving community standards. For reproducibility and AI-readiness, we integrated the International Standard Content Code (ISCC) via the ISCC-SUM tool suite, enabling format-independent content-based validation, dataset deduplication, and assessment of data similarity for robust model training. Finally, usability improvements included a comprehensive onboarding tutorial for newcomers, enhanced integration with OMERO and BioImage Archive, and generally improved tool interoperability, including support for GeoJSON-based spatial annotations. Collectively, these developments establish a scalable, cross-domain image analysis framework within Galaxy, promoting FAIR-aligned practices while enabling reproducible and AI-ready workflows. ]]>
    </summary></entry>
  
  <entry>
    <title type="html">Towards Federated Learning Across Biobanks: Prototype Software from the 2026 Carnegie Mellon University–NVIDIA Hackathon</title>
    <link href="https://index.biohackrxiv.org//2026/03/20/5psfj.html" rel="alternate" type="text/html" title="Towards Federated Learning Across Biobanks: Prototype Software from the 2026 Carnegie Mellon University–NVIDIA Hackathon"/>
    <published>2026-03-20T00:00:00+00:00</published>
    <updated>2026-03-20T00:00:00+00:00</updated>
    <id>https://doi.org/10.37044/osf.io/5psfj_v1</id>
    <content type="html" xml:base="https://index.biohackrxiv.org//2026/03/20/5psfj.html">
      <![CDATA[ <p>The Carnegie Mellon University-NVIDIA Federated Learning Hackathon for Biomedical Applications (January 7-9, 2026) convened researchers
from academia, government, and industry to implement federated frameworks for disease subtyping, genetic association studies, and
multimodal clinical prediction using NVIDIA FLARE. This preprint presents ten projects spanninggenome-wide association analyses,
histopathology harmonization, pangenome construction, ancestry deconvolution, rare disease stratification, cancer subtyping, polygenic
risk score aggregation, and multimodal fusion. These proofs of principle collectively demonstrate both the versatility of federated
learning for biomedical applications and the technical considerations required for successful deployment.</p>


      <h4>References</h4>
      <ul>
      </ul>
      ]]>
    </content>
    
    
      <author><name>James Mu</name><uri>https://orcid.org/0009-0008-1598-9292</uri></author>
    
      <author><name>Aditya Kumar Karna</name><uri>https://orcid.org/0009-0000-0365-5748</uri></author>
    
      <author><name>Telaprolu Kumar Koushik</name><uri>https://orcid.org/0009-0006-5026-5201</uri></author>
    
      <author><name>Jeff Winchell</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Tyler Jay Yang</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Caiwei Maggie Zhang</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Jasmine Baker</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Espen Hagen</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Enamul Hoq</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Kyulin Kim</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Konstantinos Koukoutegos</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Peter Lawson</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Chantera Lazard</name><uri>https://orcid.org/0009-0006-1367-3812</uri></author>
    
      <author><name>Qianqian Liang</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Robert Loughnan</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Diya Patidar</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Chunduru Sri Abhijit</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Vibha Acharya</name><uri>https://orcid.org/0000-0001-6598-0052</uri></author>
    
      <author><name>Rahaf M. Ahmad</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Anna Boeva</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Jingyao Chen</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Ioannis Christofilogiannis</name><uri>https://orcid.org/0009-0008-5906-0776</uri></author>
    
      <author><name>Mariona Jaramillo Civill</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Heena Dalal</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Alina Devkota</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Amrit Gaire</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Dhruv Gor</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Aryan Sharan Guda</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Prashnna Gyawali</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Seungjin Han</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Jiahao He</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Yuan-Ting Hsieh</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Mengying Hu</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Peiran Jiang</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Pu Kao</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Adam Kehl</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Arnav Kharbanda</name><uri>https://orcid.org/0009-0007-9195-9960</uri></author>
    
      <author><name>Yajushi Khurana</name><uri>https://orcid.org/</uri></author>
    
      <author><name>KUSHAL KOIRALA</name><uri>https://orcid.org/0009-0009-7935-4533</uri></author>
    
      <author><name>Sumeet Kothare</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Jędrzej Kubica</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Seohyun Lee</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Zilinghan Li</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Yosen Lin</name><uri>https://orcid.org/</uri></author>
    
      <author><name>William Lu</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Jialan Ma</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Samarpan Mohanty</name><uri>https://orcid.org/0009-0001-1309-7425</uri></author>
    
      <author><name>Abraham G. Moller</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Derek Mu</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Shreyan Balaji Nalwad</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Shreya Nandakumar</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Hieu Ngo</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Bhanvi Paliwal</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Isha Parikh</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Zillur Rahman</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Arunannamalai Sujatha Bharath Raj</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Nikita Rajesh</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Shivank Sadasivan</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Ushta Samal</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Srikant Sarangi</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Andrew Scouten</name><uri>https://orcid.org/0009-0004-6418-7158</uri></author>
    
      <author><name>Aastha Shah</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Sanjnaa Sridhar</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Suratha Sriram</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Mrunali Abhijit Thokadiwala</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Jacob Thrasher</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Jeffrey Wang</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Yiman Wu</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Zhenghao Xiao</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Qiyu Yang</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Zhaoyi You</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Jiayi Zhao</name><uri>https://orcid.org/0009-0008-2597-6196</uri></author>
    
      <author><name>Jiayan Zhou</name><uri>https://orcid.org/0000-0001-5974-087X</uri></author>
    
      <author><name>Zheqian Zhu</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Pravesh Parekh</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Huajin Wang</name><uri>https://orcid.org/0000-0003-0121-4257</uri></author>
    
      <author><name>Melanie Gainey</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Sean Davis</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Beryl Rabindran</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Holger R. Roth</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Ben Busby</name><uri>https://orcid.org/0000-0001-5267-4988</uri></author>
    
    <category term="CMU26"/>
    
    <summary type="html" xml:base="https://index.biohackrxiv.org//2026/03/20/5psfj.html">
      <![CDATA[ The Carnegie Mellon University-NVIDIA Federated Learning Hackathon for Biomedical Applications (January 7-9, 2026) convened researchers from academia, government, and industry to implement federated frameworks for disease subtyping, genetic association studies, and multimodal clinical prediction using NVIDIA FLARE. This preprint presents ten projects spanninggenome-wide association analyses, histopathology harmonization, pangenome construction, ancestry deconvolution, rare disease stratification, cancer subtyping, polygenic risk score aggregation, and multimodal fusion. These proofs of principle collectively demonstrate both the versatility of federated learning for biomedical applications and the technical considerations required for successful deployment. ]]>
    </summary></entry>
  
  <entry>
    <title type="html">Tools to develop constraint-based models in R: adapting existing toolboxes</title>
    <link href="https://index.biohackrxiv.org//2026/03/13/ey4c5.html" rel="alternate" type="text/html" title="Tools to develop constraint-based models in R: adapting existing toolboxes"/>
    <published>2026-03-13T00:00:00+00:00</published>
    <updated>2026-03-13T00:00:00+00:00</updated>
    <id>https://doi.org/10.37044/osf.io/ey4c5_v1</id>
    <content type="html" xml:base="https://index.biohackrxiv.org//2026/03/13/ey4c5.html">
      <![CDATA[ <p>As part of the BioHackathon Europe 2025, we here report on the progress of the hacking team preparing tools to develop constraint-based
models in R for the Systems Biology community. This preliminary development relies on the adaptation of existing toolboxes. In this project,
we proposed the (re)development of an R based framework for developing and simulating constraint-based models. We proposed to expand
the Sybil library for model simulation with the functionalities for model reconstruction and analysis available in the widely used RAVEN
toolbox in Matlab. The outcome will facilitate constraint based modelling to experimental scientists, thereby contributing to bridge the
gap between data users and data generators. It will also be more FAIR by being usable with non-proprietary software, and align with
software best practices as collected by the ELIXIR Tools Platform. We will work towards increased reproducibility by also considering
implementation of FROG analysis in R. Moreover, as a tool developed by the ELIXIR Systems Biology Community for the wider community,
the long-term maintenance burden is spread across a wider membership.Two weeks before the BioHackathon, we discovered a new tool in R
allowing the simulation of models, called cobrar (https://github.com/Waschina/cobrar). Which calls for an assessment of its current
state and definition of new development areas.</p>


      <h4>References</h4>
      <ul>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Jesubukade Ajakaye</name><uri>https://orcid.org/0000-0002-8966-4422</uri></author>
    
      <author><name>Mihail Anton</name><uri>https://orcid.org/0000-0002-7753-9042</uri></author>
    
      <author><name>Iván Domenzain</name><uri>https://orcid.org/0000-0002-5322-2040</uri></author>
    
      <author><name>Tanisha Malpani</name><uri>https://orcid.org/0009-0007-8065-8492</uri></author>
    
      <author><name>Sebastien Moretti</name><uri>https://orcid.org/0000-0003-3947-488X</uri></author>
    
      <author><name>Rahuman S. Malik Sheriff</name><uri>https://orcid.org/0000-0003-0705-9809</uri></author>
    
      <author><name>Maria Suarez-Diez</name><uri>https://orcid.org/0000-0001-5845-146X</uri></author>
    
      <author><name>Silvio Waschina</name><uri>https://orcid.org/0000-0002-6290-3593</uri></author>
    
    <category term="BioHackEU25"/>
    
    <summary type="html" xml:base="https://index.biohackrxiv.org//2026/03/13/ey4c5.html">
      <![CDATA[ As part of the BioHackathon Europe 2025, we here report on the progress of the hacking team preparing tools to develop constraint-based models in R for the Systems Biology community. This preliminary development relies on the adaptation of existing toolboxes. In this project, we proposed the (re)development of an R based framework for developing and simulating constraint-based models. We proposed to expand the Sybil library for model simulation with the functionalities for model reconstruction and analysis available in the widely used RAVEN toolbox in Matlab. The outcome will facilitate constraint based modelling to experimental scientists, thereby contributing to bridge the gap between data users and data generators. It will also be more FAIR by being usable with non-proprietary software, and align with software best practices as collected by the ELIXIR Tools Platform. We will work towards increased reproducibility by also considering implementation of FROG analysis in R. Moreover, as a tool developed by the ELIXIR Systems Biology Community for the wider community, the long-term maintenance burden is spread across a wider membership.Two weeks before the BioHackathon, we discovered a new tool in R allowing the simulation of models, called cobrar (https://github.com/Waschina/cobrar). Which calls for an assessment of its current state and definition of new development areas. ]]>
    </summary></entry>
  
  <entry>
    <title type="html">Bidirectional bridge: GitHub  ⇄  bio.tools</title>
    <link href="https://index.biohackrxiv.org//2026/02/24/8ktd6.html" rel="alternate" type="text/html" title="Bidirectional bridge: GitHub  ⇄  bio.tools"/>
    <published>2026-02-24T00:00:00+00:00</published>
    <updated>2026-02-24T00:00:00+00:00</updated>
    <id>https://doi.org/10.37044/osf.io/8ktd6_v1</id>
    <content type="html" xml:base="https://index.biohackrxiv.org//2026/02/24/8ktd6.html">
      <![CDATA[ <p>Research software metadata can be found across many code repositories and software registries. Here, we describe the tooling for a
bidirectional bridge between the software development platform GitHub and the ELIXIR bio.tools registry of life sciences software
tools and data resources. The developed bridge maps and improves metadata records across these two platforms, thereby benefiting
both and helping make research software more FAIR: findable, accessible, interoperable, and reusable. Specifically, the bridge
enables production of high-quality, rich bio.tools entries from the content already available in GitHub repositories, and uses
bio.tools records to suggest improvements to GitHub repositories through pull requests or issues. This includes adding missing
information and standardized descriptions for increased compliance with Software Management Plans. The bidirectional bridge makes
extensive use of existing APIs (GitHub, bio.tools, Europe PMC) and large language models (LLMs) to enrich metadata on both
platforms. By automating metadata extraction, improvement suggestion, and integration, the bridge reduces the manual overhead
required to FAIRify research software, lowering barriers for researchers to contribute or maintain well-annotated, reusable software.</p>


      <h4>References</h4>
      <ul>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Mariia Steeghs-Turchina</name><uri>https://orcid.org/0000-0002-0852-4752</uri></author>
    
      <author><name>Anna Niehues</name><uri>https://orcid.org/0000-0002-9839-5439</uri></author>
    
      <author><name>Ana Mendes</name><uri>https://orcid.org/0009-0008-5170-0927</uri></author>
    
      <author><name>Erik Jaaniso</name><uri>https://orcid.org/0009-0003-4246-6546</uri></author>
    
      <author><name>Ove Johan Ragnar Gustafsson</name><uri>https://orcid.org/0000-0002-2977-5032</uri></author>
    
      <author><name>Walter Baccinelli</name><uri>https://orcid.org/0000-0001-8888-4792</uri></author>
    
      <author><name>Vedran Kasalica</name><uri>https://orcid.org/0000-0002-0097-1056</uri></author>
    
      <author><name>Sam Cox</name><uri>https://orcid.org/0000-0002-9841-9816</uri></author>
    
      <author><name>Ivan Topolsky</name><uri>https://orcid.org/0000-0002-7561-0810</uri></author>
    
      <author><name>Magnus Palmblad</name><uri>https://orcid.org/0000-0002-5865-8994</uri></author>
    
      <author><name>Veit Schwämmle</name><uri>https://orcid.org/0000-0002-9708-6722</uri></author>
    
    <category term="BioHackEU25"/>
    
    <summary type="html" xml:base="https://index.biohackrxiv.org//2026/02/24/8ktd6.html">
      <![CDATA[ Research software metadata can be found across many code repositories and software registries. Here, we describe the tooling for a bidirectional bridge between the software development platform GitHub and the ELIXIR bio.tools registry of life sciences software tools and data resources. The developed bridge maps and improves metadata records across these two platforms, thereby benefiting both and helping make research software more FAIR: findable, accessible, interoperable, and reusable. Specifically, the bridge enables production of high-quality, rich bio.tools entries from the content already available in GitHub repositories, and uses bio.tools records to suggest improvements to GitHub repositories through pull requests or issues. This includes adding missing information and standardized descriptions for increased compliance with Software Management Plans. The bidirectional bridge makes extensive use of existing APIs (GitHub, bio.tools, Europe PMC) and large language models (LLMs) to enrich metadata on both platforms. By automating metadata extraction, improvement suggestion, and integration, the bridge reduces the manual overhead required to FAIRify research software, lowering barriers for researchers to contribute or maintain well-annotated, reusable software. ]]>
    </summary></entry>
  
  <entry>
    <title type="html">BH25DE report: On the path to machine-actionable training materials</title>
    <link href="https://index.biohackrxiv.org//2026/01/26/un6cd.html" rel="alternate" type="text/html" title="BH25DE report: On the path to machine-actionable training materials"/>
    <published>2026-01-26T00:00:00+00:00</published>
    <updated>2026-01-26T00:00:00+00:00</updated>
    <id>https://doi.org/10.37044/osf.io/un6cd_v1</id>
    <content type="html" xml:base="https://index.biohackrxiv.org//2026/01/26/un6cd.html">
      <![CDATA[ <p>The fragmentation of training materials across research infrastructures often results in unsustainable resource
duplication and significant barriers to upskilling. This work aims to enable developers to build systems that
effectively discover relevant materials by promoting a federated, FAIR-compliant strategy for open training. The
project operated across three interrelated streams: metadata interoperability, material analysis, and the
definition and representation of learning paths in a machine readable manner.We demonstrated content federation
via the mTeSS-X platform, enabling cross-instance exchange and preparing for future integration with the EOSC
federation. To enhance interoperability, we indexed relevant ontologies and curated semantic crosswalks between
established metadata models, specifically MoDALIA and Schema.org/Bioschemas. These mappings were implemented
within the open-source OERbservatory Python package, providing a facility for exchanging data between platforms
such as DALIA and TeSS. For material analysis, we utilised Large Language Models (LLMs) and explored vectorisation
techniques to calculate similarity, allowing for the identification of related materials and the potential for
future deduplication of records across registries.To address the lack of machine-actionable trajectories across
related or sequential materials, we proposed new Bioschemas profiles specifically for learning paths. By extending
Schema.org types, including Course and Syllabus, we developed a schema that supports modular and linear orderings
of training materials. This model was validated using SPARQL queries on knowledge graphs derived from real-world
examples like the Galaxy Training Network. Such advancements provide a foundation for automated path generation
and improved discoverability within training catalogues, and serves as a use case and strategy with broader
applicability beyond those materials.</p>


      <h4>References</h4>
      <ul>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Phil Reed</name><uri>https://orcid.org/0000-0002-4479-715X</uri></author>
    
      <author><name>Nick Juty</name><uri>https://orcid.org/0000-0002-2036-8350</uri></author>
    
      <author><name>Petra Steiner</name><uri>https://orcid.org/0000-0001-8997-2620</uri></author>
    
      <author><name>Leyla Jael Castro</name><uri>https://orcid.org/0000-0003-3986-0510</uri></author>
    
      <author><name>Charles Tapley Hoyt</name><uri>https://orcid.org/0000-0003-4423-4370</uri></author>
    
      <author><name>Oliver Knodel</name><uri>https://orcid.org/0000-0001-8174-7795</uri></author>
    
      <author><name>Martin Voigt</name><uri>https://orcid.org/0000-0001-5556-838X</uri></author>
    
      <author><name>Roman Baum</name><uri>https://orcid.org/0000-0001-5246-9351</uri></author>
    
      <author><name>Dilfuza Djamalova</name><uri>https://orcid.org/0009-0004-7782-2894</uri></author>
    
      <author><name>Jacobo Miranda</name><uri>https://orcid.org/0009-0005-0673-021X</uri></author>
    
      <author><name>Alban Gaignard</name><uri>https://orcid.org/0000-0002-3597-8557</uri></author>
    
    <category term="BH25DE"/>
    
    <summary type="html" xml:base="https://index.biohackrxiv.org//2026/01/26/un6cd.html">
      <![CDATA[ The fragmentation of training materials across research infrastructures often results in unsustainable resource duplication and significant barriers to upskilling. This work aims to enable developers to build systems that effectively discover relevant materials by promoting a federated, FAIR-compliant strategy for open training. The project operated across three interrelated streams: metadata interoperability, material analysis, and the definition and representation of learning paths in a machine readable manner.We demonstrated content federation via the mTeSS-X platform, enabling cross-instance exchange and preparing for future integration with the EOSC federation. To enhance interoperability, we indexed relevant ontologies and curated semantic crosswalks between established metadata models, specifically MoDALIA and Schema.org/Bioschemas. These mappings were implemented within the open-source OERbservatory Python package, providing a facility for exchanging data between platforms such as DALIA and TeSS. For material analysis, we utilised Large Language Models (LLMs) and explored vectorisation techniques to calculate similarity, allowing for the identification of related materials and the potential for future deduplication of records across registries.To address the lack of machine-actionable trajectories across related or sequential materials, we proposed new Bioschemas profiles specifically for learning paths. By extending Schema.org types, including Course and Syllabus, we developed a schema that supports modular and linear orderings of training materials. This model was validated using SPARQL queries on knowledge graphs derived from real-world examples like the Galaxy Training Network. Such advancements provide a foundation for automated path generation and improved discoverability within training catalogues, and serves as a use case and strategy with broader applicability beyond those materials. ]]>
    </summary></entry>
  
  <entry>
    <title type="html">METRICS - Monitoring of Key Performance Indicators for ELIXIR Services</title>
    <link href="https://index.biohackrxiv.org//2026/01/22/2jgk4.html" rel="alternate" type="text/html" title="METRICS - Monitoring of Key Performance Indicators for ELIXIR Services"/>
    <published>2026-01-22T00:00:00+00:00</published>
    <updated>2026-01-22T00:00:00+00:00</updated>
    <id>https://doi.org/10.37044/osf.io/2jgk4_v1</id>
    <content type="html" xml:base="https://index.biohackrxiv.org//2026/01/22/2jgk4.html">
      <![CDATA[ <p>Key Performance Indicators (KPIs) are increasingly requested by a diverse range of stakeholders across the research
ecosystem. Funders want to measure the impact of projects and related services they fund, or research organisations
want to track the service use for informed decision making. Service providers themselves are also interested in
monitoring their services to gather feedback and improve service quality. KPIs are a simple, but powerful tool for
these purposes.As part of the BioHackathon Europe 2025, we report on the activities of the METRICS project, which
addresses the need for consistent and transparent evaluation of services across ELIXIR and related initiatives
using KPIs. The project brings together experts from multiple ELIXIR Nodes and scientific domains to identify,
harmonise, and semantically model KPIs that reflect service quality, usage, sustainability, and impact. By exploring
existing evaluation frameworks, and processes, the team aims to design a flexible yet coherent foundation for KPI
monitoring of ELIXIR services. This report summarises the project’s motivation, current landscape analysis, and
initial steps toward developing an ontology-driven framework for KPI representation, fostering interoperability
and supporting evidence-based management of life science infrastructures.</p>

      <h4>References</h4>
      <ul>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Nils-Christian Lübke</name><uri>https://orcid.org/0009-0009-4801-9978</uri></author>
    
      <author><name>Helena Schnitzer</name><uri>https://orcid.org/0000-0002-6382-9452</uri></author>
    
      <author><name>Julia Koblitz</name><uri>https://orcid.org/0000-0002-7260-2129</uri></author>
    
      <author><name>Saskia Lawson-Tovey</name><uri>https://orcid.org/0000-0002-8611-162X</uri></author>
    
      <author><name>Nicola Soranzo</name><uri>https://orcid.org/0000-0003-3627-5340</uri></author>
    
      <author><name>Karel Berka</name><uri>https://orcid.org/0000-0001-9472-2589</uri></author>
    
      <author><name>Séverine Duvaud</name><uri>https://orcid.org/0000-0001-7892-9678</uri></author>
    
      <author><name>Kristyna Kvizdova</name><uri>https://orcid.org/0009-0000-9827-1359</uri></author>
    
      <author><name>Manuel Feser</name><uri>https://orcid.org/0000-0001-6546-1818</uri></author>
    
      <author><name>Anna Golobardes Vilarasau</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Gavin Farrell</name><uri>https://orcid.org/0000-0001-5166-8551</uri></author>
    
      <author><name>Adel Bouhraoua</name><uri>https://orcid.org/0000-0001-9531-6339</uri></author>
    
      <author><name>Espen Åberg</name><uri>https://orcid.org/0000-0002-2280-7978</uri></author>
    
      <author><name>David Lloyd</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Sebastian Beier</name><uri>https://orcid.org/0000-0002-2177-8781</uri></author>
    
      <author><name>Vedran Kasalica</name><uri>https://orcid.org/0000-0002-0097-1056</uri></author>
    
      <author><name>Walter Baccinelli</name><uri>https://orcid.org/0000-0001-8888-4792</uri></author>
    
      <author><name>Mijke Jetten</name><uri>https://orcid.org/0000-0001-9114-2896</uri></author>
    
      <author><name>Laura Chabot</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Grégory Gimenez</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Daniel Arend</name><uri>https://orcid.org/0000-0002-2455-5938</uri></author>
    
    <category term="BioHackEU25"/>
    
    <summary type="html" xml:base="https://index.biohackrxiv.org//2026/01/22/2jgk4.html">
      <![CDATA[ Key Performance Indicators (KPIs) are increasingly requested by a diverse range of stakeholders across the research ecosystem. Funders want to measure the impact of projects and related services they fund, or research organisations want to track the service use for informed decision making. Service providers themselves are also interested in monitoring their services to gather feedback and improve service quality. KPIs are a simple, but powerful tool for these purposes.As part of the BioHackathon Europe 2025, we report on the activities of the METRICS project, which addresses the need for consistent and transparent evaluation of services across ELIXIR and related initiatives using KPIs. The project brings together experts from multiple ELIXIR Nodes and scientific domains to identify, harmonise, and semantically model KPIs that reflect service quality, usage, sustainability, and impact. By exploring existing evaluation frameworks, and processes, the team aims to design a flexible yet coherent foundation for KPI monitoring of ELIXIR services. This report summarises the project’s motivation, current landscape analysis, and initial steps toward developing an ontology-driven framework for KPI representation, fostering interoperability and supporting evidence-based management of life science infrastructures. ]]>
    </summary></entry>
  
  <entry>
    <title type="html">QPX: Pathway analysis environment</title>
    <link href="https://index.biohackrxiv.org//2026/01/06/m37f2.html" rel="alternate" type="text/html" title="QPX: Pathway analysis environment"/>
    <published>2026-01-06T00:00:00+00:00</published>
    <updated>2026-01-06T00:00:00+00:00</updated>
    <id>https://doi.org/10.37044/osf.io/m37f2_v1</id>
    <content type="html" xml:base="https://index.biohackrxiv.org//2026/01/06/m37f2.html">
      <![CDATA[ <p>Building on our work at DBCLS BioHackathon 2023 (BH23), where we introduced QPX and promoted pathway modeling with WikiPathways (Pico et al., 2008)
using PathVisio (Kutmon et al., 2015), we now focused on creating new pathway diagrams for diverse species and registering them in WikiPathways with
functional annotations. In parallel, we deployed WikiPathways node data into Elasticsearch to enable fast and flexible search and integration of
pathway information.</p>


      <h4>References</h4>
      <ul>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Hidemasa Bono</name><uri>https://orcid.org/0000-0003-4413-0651</uri></author>
    
      <author><name>Naoya Oec</name><uri>https://orcid.org/0000-0002-7491-4994</uri></author>
    
      <author><name>Airu Hayashi</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Chiharu Fujita</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Kotaro Uchida</name><uri>https://orcid.org/</uri></author>
    
      <author><name>Ryo Mameda</name><uri>https://orcid.org/0009-0007-5830-6482</uri></author>
    
      <author><name>Sora Yonezawa</name><uri>https://orcid.org/0009-0004-1874-3117</uri></author>
    
      <author><name>Kazuki Nakamae</name><uri>https://orcid.org/0000-0002-4469-664X</uri></author>
    
      <author><name>Ryo Nozu</name><uri>https://orcid.org/0000-0002-1099-3152</uri></author>
    
    <category term="BH25JP"/>
    
    <summary type="html" xml:base="https://index.biohackrxiv.org//2026/01/06/m37f2.html">
      <![CDATA[ Building on our work at DBCLS BioHackathon 2023 (BH23), where we introduced QPX and promoted pathway modeling with WikiPathways (Pico et al., 2008) using PathVisio (Kutmon et al., 2015), we now focused on creating new pathway diagrams for diverse species and registering them in WikiPathways with functional annotations. In parallel, we deployed WikiPathways node data into Elasticsearch to enable fast and flexible search and integration of pathway information. ]]>
    </summary></entry>
  
  <entry>
    <title type="html">BioHackEU25 Report Project 16: MiCoReCa (Microbiome Community Resource Catalogue) - Towards Centralized Curation And Integration Of Microbiome Bioinformatics Resources</title>
    <link href="https://index.biohackrxiv.org//2025/12/31/jfpsx.html" rel="alternate" type="text/html" title="BioHackEU25 Report Project 16: MiCoReCa (Microbiome Community Resource Catalogue) - Towards Centralized Curation And Integration Of Microbiome Bioinformatics Resources"/>
    <published>2025-12-31T00:00:00+00:00</published>
    <updated>2025-12-31T00:00:00+00:00</updated>
    <id>https://doi.org/10.37044/osf.io/jfpsx_v1</id>
    <content type="html" xml:base="https://index.biohackrxiv.org//2025/12/31/jfpsx.html">
      <![CDATA[ <p>The rapid growth of microbiome research has led to the development of numerous bioinformatics tools and databases, but information about them remains fragmented across disparate,
often outdated cataloging efforts, hindering resource discovery and utilization. To address this critical gap, the ELIXIR Microbiome Community proposes the development of MiCoReCa
(Microbiome Community Resource Catalogue), a comprehensive, dynamic, open-access catalogue of microbiome-related bioinformatics resources (tools, workflows, training, standards,
and databases). Leveraging our community’s expertise, this initiative will utilize standardized ontologies like EDAM and cross-reference established platforms like bio.tools and
WorkflowHub to create a centralized, findable inventory. A key feature is the community-driven process for identifying and curating missing ontological terms and metadata,
ensuring MiCoReCa’s accuracy and relevance in collaboration with partner platforms. Furthermore, the catalogue will integrate links to training materials from TeSS to support
appropriate tool usage, and connect with OpenEBench for benchmarking capabilities. This project will not only provide a vital resource for the microbiome field, enhancing
research efficiency and reproducibility, but will also establish a sustainable, adaptable infrastructure potentially applicable to other ELIXIR Communities. This effort
represents a significant contribution by the ELIXIR Microbiome Community to streamline microbiome bioinformatics.</p>


      <h4>References</h4>
      <ul>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Vivek Ashokan</name><uri>https://orcid.org/0009-0006-1470-3999</uri></author>
    
      <author><name>Clara Emery</name><uri>https://orcid.org/0009-0003-9572-6671</uri></author>
    
      <author><name>Agnès Barnabé</name><uri>https://orcid.org/0000-0002-8420-7556</uri></author>
    
      <author><name>Valentin Loux</name><uri>https://orcid.org/0000-0002-8268-915X</uri></author>
    
      <author><name>Christina Pavloudi</name><uri>https://orcid.org/0000-0001-5106-6067</uri></author>
    
      <author><name>Paul Zierep</name><uri>https://orcid.org/0000-0000-0000-0000</uri></author>
    
      <author><name>Nikolaos Strepis</name><uri>https://orcid.org/0000-0000-0000-0000</uri></author>
    
      <author><name>Bérénice Batut</name><uri>https://orcid.org/0000-0001-9852-1987</uri></author>
    
    <category term="BioHackEU25"/>
    
    <summary type="html" xml:base="https://index.biohackrxiv.org//2025/12/31/jfpsx.html">
      <![CDATA[ The rapid growth of microbiome research has led to the development of numerous bioinformatics tools and databases, but information about them remains fragmented across disparate, often outdated cataloging efforts, hindering resource discovery and utilization. To address this critical gap, the ELIXIR Microbiome Community proposes the development of MiCoReCa (Microbiome Community Resource Catalogue), a comprehensive, dynamic, open-access catalogue of microbiome-related bioinformatics resources (tools, workflows, training, standards, and databases). Leveraging our community’s expertise, this initiative will utilize standardized ontologies like EDAM and cross-reference established platforms like bio.tools and WorkflowHub to create a centralized, findable inventory. A key feature is the community-driven process for identifying and curating missing ontological terms and metadata, ensuring MiCoReCa’s accuracy and relevance in collaboration with partner platforms. Furthermore, the catalogue will integrate links to training materials from TeSS to support appropriate tool usage, and connect with OpenEBench for benchmarking capabilities. This project will not only provide a vital resource for the microbiome field, enhancing research efficiency and reproducibility, but will also establish a sustainable, adaptable infrastructure potentially applicable to other ELIXIR Communities. This effort represents a significant contribution by the ELIXIR Microbiome Community to streamline microbiome bioinformatics. ]]>
    </summary></entry>
  
  <entry>
    <title type="html">Enhancement of the Interoperability of Trait Data on Genetic Resources between Japan and France</title>
    <link href="https://index.biohackrxiv.org//2025/12/23/hw2fj.html" rel="alternate" type="text/html" title="Enhancement of the Interoperability of Trait Data on Genetic Resources between Japan and France"/>
    <published>2025-12-23T00:00:00+00:00</published>
    <updated>2025-12-23T00:00:00+00:00</updated>
    <id>https://doi.org/10.37044/osf.io/hw2fj_v1</id>
    <content type="html" xml:base="https://index.biohackrxiv.org//2025/12/23/hw2fj.html">
      <![CDATA[ <p>Japan’s National Agriculture and Food Research Organization initiated a collaborative research project with France’s National Research Institute
for Agriculture, Food and Environment to evaluate wheat genetic resources and to identify materials with desirable traits using standardized
criteria. This paper presents the current status of trait data standardization between the two organizations and outlines a direction for
standardization. Trait data for genetic resources in Japan and France are managed using independently developed standards. The lack of mapping
standards hinders data integration and interoperability. To support experts in the mapping process, we developed a tool that translates trait
terms. A generative AI-based translation tool appears to be applicable for collecting relevant information to support mapping between trait
terms, as well as translating newly submitted Japanese trait terms into English.</p>


      <h4>References</h4>
      <ul>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Akane Takezaki</name><uri>https://orcid.org/0009-0008-3547-0391</uri></author>
    
      <author><name></name><uri>https://orcid.org/0000-0002-5719-7559</uri></author>
    
      <author><name>Celia Michotey</name><uri>https://orcid.org/0000-0003-1877-1703</uri></author>
    
      <author><name>Raphael Flores</name><uri>https://orcid.org/0000-0002-0278-5441</uri></author>
    
      <author><name>Cyril Pommier</name><uri>https://orcid.org/0000-0002-9040-8733</uri></author>
    
    <category term="BH23JP"/>
    
    <summary type="html" xml:base="https://index.biohackrxiv.org//2025/12/23/hw2fj.html">
      <![CDATA[ Japan’s National Agriculture and Food Research Organization initiated a collaborative research project with France’s National Research Institute for Agriculture, Food and Environment to evaluate wheat genetic resources and to identify materials with desirable traits using standardized criteria. This paper presents the current status of trait data standardization between the two organizations and outlines a direction for standardization. Trait data for genetic resources in Japan and France are managed using independently developed standards. The lack of mapping standards hinders data integration and interoperability. To support experts in the mapping process, we developed a tool that translates trait terms. A generative AI-based translation tool appears to be applicable for collecting relevant information to support mapping between trait terms, as well as translating newly submitted Japanese trait terms into English. ]]>
    </summary></entry>
  
</feed>
