IBG-5 · Forschungszentrum Jülich

Data Science & Bioinformatics for Mass Spectrometry of Small Molecules

We build efficient algorithms, tools and workflows to identify and quantify lipids and other small molecules — applied to human, microbial and plant lipidomics.

Explore our research Our tools

Research

Four directions, one pipeline

From raw names and spectra to comparable, shareable results — each direction backed by an open-source tool or standard.

Harmonized naming and representation of ambiguity

Fast, exact algorithms for annotating small molecules are the core of our work. We treat lipid names and structures as formal objects that software can parse, normalize and compare — so identifications are consistent across labs, tools and databases.

GoslinReading any lipid dialect into one canonical shorthand

Input — any nomenclature

GPCho(16:0/18:1(9Z)) PtdCho 16:0-18:1 PC(16:0/18:1(9Z)) phosphatidylcholine 34:1

Goslin

→

normalize

Canonical — LIPID MAPS shorthand

PC 16:0/18:1(9Z)

Species

PC 34:1

sum composition

→

Molecular species

PC 16:0_18:1

acyl chains known

→

sn-position

PC 16:0/18:1

sn-1 / sn-2 assigned

→

Structure-defined

PC 16:0/18:1(9Z)

double-bond position

Targeted & quantitative workflows

Alongside untargeted discovery, we build reliable targeted and quantitative methods — the assays and spectral libraries needed to measure defined panels of lipids reproducibly.

LipidCreatorDesigning targeted assays and in-silico spectral libraries

Input — lipid classes & settings

PCPETGSMCer60+ classes

precursor / fragment ionscollision energy

LipidCreator

→

compute

Output — assay & library

in-silico spectral library

transition list→ Skyline

Select

classes & adducts

60+ lipid classes

→

Compute

fragment masses

MS/MS in silico

→

Build

spectral library

intensities & transitions

→

Export

Skyline assay

acquire & quantify

Comparison, ML & cloud analysis

We turn results into interpretable comparisons between whole lipidomes, and are beginning to complement our exact algorithms with machine learning, delivered through reproducible, cloud-native workflows.

LipidSpaceComparing whole lipidomes in a shared structural space

Input — lipidome datasets

Name	Study A	Study B	Study C
PC 34:1	0.42	0.38	0.51
PE 36:2	0.18	0.22	0.15
TG 52:3	0.27	0.31	0.24
SM 34:1	0.09	0.07	0.10

LipidSpace

→

embed

Output — structural space

LipidSpace projection of three lipidomes, points coloured by lipid category (SP, GL, GP, ST, FA)

Build

structural space

from the lipids themselves

→

Compare

lipidome distance

how similar are two studies

→

Select

explanatory lipids

ML feature selection

FAIR data & standards

Methods only matter if their results can be shared and reused. We help define the community data standards that make small-molecule mass spectrometry interoperable and AI-ready.

mzTab-M · mzQCTurning heterogeneous results into FAIR, AI-ready datasets

Input — heterogeneous results

vendor exportcustom .xlsxtool-specific table

mzTab-M + mzQC

→

standardize

Output — FAIR dataset

mzTab-M + mzQC

findableinteroperableAI-ready

Report

quantitative results

mzTab-M structure

→

Assess

quality metrics

mzQC

→

public repositories

reusable & AI-ready

Standards & interoperability

Making MS data FAIR

We help define community data standards through the HUPO Proteomics Standards Initiative — quality metrics, reporting and AI-readiness for small-molecule mass spectrometry.

mzQC

quality metrics

mzTab-M

reporting

HUPO-PSI

AI & formats

Tools · de.NBI “Lipidomics for Life Sciences”

Open-source software we build

As part of the German Network for Bioinformatics Infrastructure (de.NBI) and ELIXIR Germany - LIFS consortium — we contribute to a connected suite of lipidomics tools, from identification to structural comparison.