Meetings

Recent preprints

Aug 29, 2024

Ontologies for single-cell experiments
Research data management is becoming increasingly important in the scientific community. Acritical challenge in this field is making research data FAIR (findable, accessible, interoperableand reusable, (Wilkinson et al., 2016)). Metadata plays a vital role in this challenge as it allowsresearchers to accurately understand and recreate experiments. To tackle this challenge, variousapproaches are being taken towards this goal, including the development of domain-overarchingand domain-specific standards.In the different scientific communities, multiple general, as well as domain-specific minimuminformation standards have been developed, such as MIAPPE (Ćwiek-Kupczyńska, 2016), theminimum information about a plant phenotyping experiment, MIAME (H. Brazma A., 2001),the minimum information about a microarray experiment, and MINSEQE (B. Brazma A., 2012),the minimum information about a high-throughput sequencing experiment. These standards aredesigned to describe specific types of experiments. Recently, a minimum information standardfor single-cell experiments, minSCe (Minimum Information about a Single-Cell Experiment),has been introduced (Füllgrabe et al., 2020). However, it is not yet widely applied.Minimum information standards are an important part of the solution and should be built upon.In addition, the use of controlled vocabularies and ontology terms is also essential. Ontologyterms have a persistent identifier, an expressive name and a curated definition. Using theseterms enables different researchers to understand and recreate annotated experiments. In thisBioHackathon Europe project, we propose to expand biological, experimental and technicalmetadata schema as well as ontologies for single-cell experiments across domains with a focuson transcriptomics. This will facilitate the sharing and reuse of single-cell data and promotecollaboration among researchers in different domains. Our goal is to improve data managementpractices and enhance the reproducibility of single-cell research. 1 minute read
Jul 22, 2024

An analysis of sex ratios using a biodiversity data cube
This investigation uses biodiversity data cubes derived from the datasets mobilised by the Global Biodiversity Information Facility (GBIF), to conduct an analysis of sex ratios of ducks across Europe. Encompassing over 4 million occurrences extracted from nearly 5000 datasets, this study elucidates sex distribution patterns across various species, focussing on temporal and spatial dynamics. The aim of this study is to highlight the availability of open sex data and its potential usefulness in research and monitoring of sex ratios of wild organisms, particularly in sexual dimorphic species. less than 1 minute read
Jun 25, 2024

VIB Hackathon on spatial omics tools and methods
During a three-day hackathon, work was performed on various topics within the field of spatial omics data analysis. The topics were organized in five workgroups and included benchmarking, pipelines, spatial transcriptomics, spatial proteomics, spatial multi-omics and cell-cell communication. Most tools and methods were considered in the context of the Python ecosystem for spatial (SpatialData) and single-cell (scverse) data analysis. less than 1 minute read
Jun 8, 2024

Phenological Diversity Trends with Remote Sensing Datacubes
During the 2024 B-Cubed Hackathon, we extended the R package “rasterdiv” by incorporating Time-Weighted Dynamic Time Warping (TWDTW) to the package’s pre-existing paRao() function for the calculation of parametric Rao’s Quadratic Diversity (Rao’s Q) index. This expands the user’s ability to biodiversity trends when using time series of Earth Observations. Biodiversity indices like Shannon’s H do not consider spatio-temporal dynamics, and others (e.g. Rao’s Q) only incorporate geographic distance between observations, often leaving phenological variation overlooked.Through integrating TWDTW into the paRao() function, users can assess different facets of an ecosystem’s biodiversity by incorporating phenological differences among its plant communities. This is also valuable to distinguish between natural habitats that follow a seasonal phenological trend and artificial land cover types, which may lack phenological changes. Previous studies have also found that the time weighting ability of TWDTW enables the discernment of different floral community types which could otherwise be misclassified as the same with traditional Dynamic Time Warping (DTW).To evaluate the efficacy of TWDTW within the paRao() function, we compared the ability of TWDTW Rao’s Q index against other biodiversity indices at classifying the different plant communities in a disturbed grassland in Calabria, Italy. Our study used a Plant Phenological Index (PPI) time series from the Sentinel-2 satellite network. The results indicated that accounting for phenological cycles can filter out artefacts and better distinguish habitats with differing plant species diversity. This improves the ability to assess ecosystem changes through space and time, providing a more comprehensive understanding of biodiversity dynamics, and the ability to gauge the resilience of different vegetation patches.We conclude that the inclusion of plant phenology in biodiversity assessment is necessary, and that our modifications to paRao() will be valuable to facilitate the accurate detection and description of ecosystem trends in response to our changing environment. 1 minute read
Apr 30, 2024

How to improve the annotation of Galaxy resources? Outcomes of an online hackathon for improving the annotation of Galaxy resources for microbial data resources
Galaxy hosts a vast array of tools, tutorials, and workflows, with the exact number of workflows remaining uncertain. To address the challenge of enhancing tool visibility within this expansive ecosystem, a pipeline called the Galaxy Tool Metadata Extractor was created during the BioHackathon Europe 2023. This pipeline aggregates Galaxy tool suites from various sources, automatically extracts metadata such as bio.tools identifiers and EDAM ontology, and presents the information in an interactive table. Users can filter this table to find tools relevant to their research community. Throughout development, it was noted that many tools lack EDAM annotations. An effort of the microGalaxy community was started to update 50+ microbial-related Galaxy tools, link them to their respective bio.tools entries, and collectively peer-review the results. However, that was far from enough to properly annotate all Galaxy tools and other types of Galaxy resources like training material and workflows. In addition to tools, the community offers other resources that are not properly annotated using ontologies like EDAM. Annotating all mentioned resources would improve their findability but also allow for aggregation and display of resources covering similar topics. To facilitate this work and work on a proof-of-concept for other communities, the microGalaxy community organized an online hackathon in April 2024. During this hackathon, 41 new bio.tools have been created, 85 Galaxy tool suites linked to bio.tools and EDAM terms, and 33 tutorials annotated with EDAM terms. Some microbial-related Galaxy tools have been improved. In addition, new features and improvements have been added to Galaxy Tool Metadata Extractor. The hackathon was successful with outcomes beyond the initial expectations. 1 minute read
Apr 24, 2024

SPARQL services for InterMine databases
InterMine is an open source data warehouse system that can be used to create biological databases that can be accessed via web query tools. There are many public InterMine instances that are currently deployed worldwide and they share a core data model pertaining to common biological entities. Besides the core data model, each instance of InterMine typically has an extended data model to cover data specific to that particular deployment. The data is organised according to the graph-based data model but exists in a relational store (Postgres). The goal of this project was to explore the possibility of translating InterMine data from relational form to a graph form using Resource Description Framework (RDF) as the exchange format. This could provide a route to exposing data from InterMine instances as RDF triples and thus making it possible to query the data using the SPARQL Protocol and RDF Querying Language (SPARQL). less than 1 minute read
Apr 8, 2024

Synergising ELIXIR resources for training in systems biology
Systems biology (SB) is a new ELIXIR community, that aims to utilise the ELIXIR ecosystem, such as the Training eSupport System (TeSS) and bio.tools, a registry of software tools and data resources for life sciences. One of the main initial objectives of the SB community is to create an SB-themed domain hosted by TeSS, encompassing SB-related ELIXIR services and events, in a fully automated way.Most content in TeSS is sourced through automated aggregation (“scraping”) of external sources containing resources marked up with semantic metadata, like Bioschemas. Currently, TeSS cannot recognize references to bio.tools identifiers from a Bioschemas-annotated resource, so the number of resources linked to bio.tools is relatively low.In this project, we will focus on selected SB disciplines from the priority areas of the ELXIR SB community to integrate and cross-link related ELIXIR products - training events, training materials, computational and bioinformatics tools, databases and services from the bio.tools registry.This will be achieved using suitable ontologies identified by the SB community and by careful curation of SB-related materials. We aim to extend this work to other ELIXIR products such as lists of trainers, related ELIXIR Innovation and Industry events and publications. This will serve as a pilot project leading to broader integration with other SB disciplines, and will be of interest to several other ELIXIR communities. 1 minute read

Meetings

Recent preprints

Ontologies for single-cell experiments

An analysis of sex ratios using a biodiversity data cube

VIB Hackathon on spatial omics tools and methods

Phenological Diversity Trends with Remote Sensing Datacubes

How to improve the annotation of Galaxy resources? Outcomes of an online hackathon for improving the annotation of Galaxy resources for microbial data resources

SPARQL services for InterMine databases

Synergising ELIXIR resources for training in systems biology