IBG-5 · Forschungszentrum Jülich

Data Science & Bioinformatics for Mass Spectrometry of Small Molecules

We build efficient algorithms, tools and workflows to identify and quantify lipids and other small molecules — applied to human, microbial and plant lipidomics.

Explore our research Our tools
DSBMS emblem
Research

Four directions, one pipeline

From raw names and spectra to comparable, shareable results — each direction backed by an open-source tool or standard.

01

Harmonized naming and representation of ambiguity

Fast, exact algorithms for annotating small molecules are the core of our work. We treat lipid names and structures as formal objects that software can parse, normalize and compare — so identifications are consistent across labs, tools and databases.

GoslinReading any lipid dialect into one canonical shorthand
Input — any nomenclature
GPCho(16:0/18:1(9Z)) PtdCho 16:0-18:1 PC(16:0/18:1(9Z)) phosphatidylcholine 34:1
Goslin
normalize
Canonical — LIPID MAPS shorthand
PC 16:0/18:1(9Z)
Species
PC 34:1
sum composition
Molecular species
PC 16:0_18:1
acyl chains known
sn-position
PC 16:0/18:1
sn-1 / sn-2 assigned
Structure-defined
PC 16:0/18:1(9Z)
double-bond position
02

Targeted & quantitative workflows

Alongside untargeted discovery, we build reliable targeted and quantitative methods — the assays and spectral libraries needed to measure defined panels of lipids reproducibly.

LipidCreatorDesigning targeted assays and in-silico spectral libraries
Input — lipid classes & settings
PCPETGSMCer60+ classes
precursor / fragment ionscollision energy
LipidCreator
compute
Output — assay & library
in-silico spectral library
transition list→ Skyline
Select
classes & adducts
60+ lipid classes
Compute
fragment masses
MS/MS in silico
Build
spectral library
intensities & transitions
Export
Skyline assay
acquire & quantify
03

Comparison, ML & cloud analysis

We turn results into interpretable comparisons between whole lipidomes, and are beginning to complement our exact algorithms with machine learning, delivered through reproducible, cloud-native workflows.

LipidSpaceComparing whole lipidomes in a shared structural space
Input — lipidome datasets
NameStudy AStudy BStudy C
PC 34:10.420.380.51
PE 36:20.180.220.15
TG 52:30.270.310.24
SM 34:10.090.070.10
LipidSpace
embed
Output — structural space
LipidSpace projection of three lipidomes, points coloured by lipid category (SP, GL, GP, ST, FA)
Build
structural space
from the lipids themselves
Compare
lipidome distance
how similar are two studies
Select
explanatory lipids
ML feature selection
04

FAIR data & standards

Methods only matter if their results can be shared and reused. We help define the community data standards that make small-molecule mass spectrometry interoperable and AI-ready.

mzTab-M · mzQCTurning heterogeneous results into FAIR, AI-ready datasets
Input — heterogeneous results
vendor exportcustom .xlsxtool-specific table
mzTab-M + mzQC
standardize
Output — FAIR dataset
mzTab-M + mzQC
findableinteroperableAI-ready
Report
quantitative results
mzTab-M structure
Assess
quality metrics
mzQC
Share
public repositories
reusable & AI-ready
Standards & interoperability

Making MS data FAIR

We help define community data standards through the HUPO Proteomics Standards Initiative — quality metrics, reporting and AI-readiness for small-molecule mass spectrometry.

mzQC
quality metrics
mzTab-M
reporting
HUPO-PSI
AI & formats
Tools · de.NBI “Lipidomics for Life Sciences”

Open-source software we build

As part of the German Network for Bioinformatics Infrastructure (de.NBI) and ELIXIR Germany - LIFS consortium — we contribute to a connected suite of lipidomics tools, from identification to structural comparison.