Making mass-spectrometry data FAIR and AI-ready
Data that cannot be exchanged cannot be reused. We contribute to the community standards that give small-molecule mass spectrometry a shared language — for reporting results, naming lipids, and measuring data quality — primarily through the HUPO Proteomics Standards Initiative (HUPO-PSI).
A single, well-specified tab-separated format for sharing quantitative metabolomics and lipidomics results between tools and repositories.
A format for capturing quality-control metrics for mass-spectrometry runs and whole datasets, so quality can be tracked and compared.
The community shorthand nomenclature for lipids, implemented as a formal grammar in Goslin so names normalize to one canonical form.
Cross-cutting work to make MS data formats and vocabularies interoperable and ready for machine-learning and AI applications.
An open community, twenty years on
PSI maintains the data formats and controlled vocabularies the field depends on. We contribute to extending this work toward small molecules, quality control, and AI-readiness.