Meetings
BioHackSWAT4HCLS 2025
BioHackathon Europe 2024
3rd BioHackathon Germany
DBCLS BioHackathon 2024
ELIXIR INTOXICOM
Recent preprints
-
Synergising ELIXIR resources for training in systems biology
Systems biology (SB) is a new ELIXIR community, that aims to utilise the ELIXIR ecosystem, such as the Training eSupport System (TeSS) and bio.tools, a registry of software tools and data resources for life sciences. One of the main initial objectives of the SB community is to create an SB-themed domain hosted by TeSS, encompassing SB-related ELIXIR services and events, in a fully automated way.Most content in TeSS is sourced through automated aggregation (“scraping”) of external sources containing resources marked up with semantic metadata, like Bioschemas. Currently, TeSS cannot recognize references to bio.tools identifiers from a Bioschemas-annotated resource, so the number of resources linked to bio.tools is relatively low.In this project, we will focus on selected SB disciplines from the priority areas of the ELXIR SB community to integrate and cross-link related ELIXIR products - training events, training materials, computational and bioinformatics tools, databases and services from the bio.tools registry.This will be achieved using suitable ontologies identified by the SB community and by careful curation of SB-related materials. We aim to extend this work to other ELIXIR products such as lists of trainers, related ELIXIR Innovation and Industry events and publications. This will serve as a pilot project leading to broader integration with other SB disciplines, and will be of interest to several other ELIXIR communities. -
Enhancing the image analysis community in Galaxy
Project 6 during the 2023 BioHackathon Europe in Barcelona focused on “Enhancing the image analysis community in Galaxy.” Despite Galaxy’s strong presence in genomics and proteomics, its image analysis tools and workflows are currently scattered. This project aimed to gather efforts in image analysis across fields to build a robust interdisciplinary community. -
Secure data-out API - enabling encrypted htsget transactions
The European Genome-phenome Archive (EGA) is a service for archiving and sharing personally identifiable genetic and phenotypic data, while the The Genomic Data Infrastructure (GDI) project is enabling access to genomic and related phenotypic and clinical data across Europe. Both projects are focused on creating federated and secure infrastructure for researchers to archive and share data with the research community, to support further research.This project proposal is focusing on the data access part of the infrastructure. The files are encrypted in the archives, using the crypt4gh standard. Currently, there exist data access processes, where the files are either decrypted on the server side and then transferred to the user or re-encrypted server-side and provided to the user in an outbox.Htsget as a data access protocol also allows access to parts of files, but there’s currently no production-level client tools that support access to encrypted data. Our goal is to create a client tool that can access encrypted data over the htsget protocol. It should also work with the GA4GH Passport and Visa standard so we can then enhance the security of our data access interfaces. We will also modify htsget-rs, a Rust htsget server, and crypt4gh-rust as required to support the aforementioned standards. Finally, there will be an effort to implement this feature in already existing tools, like samtools and IGV. -
How to increase the findability, visibility, and impact of Galaxy tools for your scientific community
The scale and diversity of available software options in the Galaxy ecosystem can make domain or community specific discovery of software challenging. Here, we present a semi-automated and reusable pipeline for creating tailored interactive tables that list the identity and metadata (e.g. bio.tools, EDAM) available for Galaxy tools in a specific community (e.g. microGalaxy, imaging). In addition, we also describe an annotation framework to improve the quality of the table contents, and training material to support the reuse of both the pipeline and table by additional communities. The sum of these contributions is expected to make it easier for Galaxy users to discover and understand the software within their research area, improve the annotation of these software resources, and allow other domains to enable equivalent discovery processes for their community.This work is the outcome of a BioHackathon Europe 2023 project. -
BioHackEU23: FAIR Workflow Execution with WfExS and Workflow Run Crate
FAIR Computational Workflows argues that workflows should be FAIR scholarly community research objects in their own right as a kind of FAIR Research Software. In this project we go one step further, and argue that workflow executions should also be published with sufficient traces and structured metadata. Workflow Run RO-Crate is a set of profiles of RO-Crate that capture workflow provenance in a lightweight FAIR data package based on existing standards, in order to support traceability, reproducibility and interoperable description of diverse computational analysis. This use of RO-Crate allows the contextualization of a computational workflow and its execution, e.g. relating to people, organisations, projects, funding, data sources and wider research questions and studies.We have implemented the profile in multiple workflow systems, including Galaxy, COMPSs, StreamFlow, WfExS, Sapporo and Autosubmit. The command line tool runcrate can convert from the precursor CWLProv and display or validate crates according to the profiles. The crates are compatible with ELIXIR’s WorkflowHub and support increasing levels of details, including documenting ad-hoc scripts without a workflow engine.WfExS is a workflow orchestrator designed for reproducible and secure workflow executions in isolated environments (like HPC). Every input, workflow and container being used in an execution must have either a public or permanent identifier, or at least a resolvable URI, so the execution scenario can be materialised. The execution scenario before and/or after the execution can be saved to RO-Crate.Here we bring together FAIR Computational Execution practitioners to mature and generalise this approach using Workflow Run Crate. -
Building Towards a Machine-Actionable Software Management Plan: A BioHackathon Europe 2023 Report
This report provides an overview of our activities and accomplishments concerning machine-actionable Software Management Plans (SMPs) and the Software Management Wizard (SMW) during the ELIXIR BioHackathon Europe 2023. ELIXIR acknowledges the critical role of effective software management in facilitating sustainable and reproducible research outcomes. The Software Best Practices group is actively committed to establishing a robust framework for SMP creation. In this project, our primary focus is on streamlining the SMP creation process for research software within ELIXIR. To achieve this, we are working on developing essential integrators and identifying and reviewing the relevant metadata schema. This effort is closely aligned with various related initiatives such as OpenEBench, FAIR4RS, RDA, maSMPs, among others. The outcomes of the BioHackathon project are now available for immediate use and can be further refined in the future based on community feedback and advancements in research software best practices. -
Metadata handling for BioHackathon publications through BioHackrXiv
This paper presents the work executed on BioHackrXiv during the international ELIXIR BioHackathon Europe in Paris, France, 2022. BioHackrXiv is a scholarly publication service for BioHackathons and codefests that target biology and the biomedical sciences in the spirit of pre-publishing platforms.