Scientific article
Open access

Open and reusable annotated mass spectrometry dataset of a chemodiverse collection of 1,600 plant extracts

Published in: Gigascience, vol. 12, giac124
Publication date: 2023-01-18
First online date: 2023-01-18


As privileged structures, natural products often display potent biological activities. However, the discovery of novel bioactive scaffolds is often hampered by the chemical complexity of the biological matrices in which they are found. Large natural extract collections are thus extremely valuable for their chemical novelty potential but also complicated to exploit in the context of drug-discovery projects. In the end, it is the pure chemical substances that are desired for structural determination and bioactivity evaluation. Researchers interested in exploring large and chemodiverse extract collections should thus establish strategies to efficiently tackle such chemical complexity and access these structures. Establishing carefully crafted digital layers documenting the spectral and chemical complexity, as well as the bioactivity results, of natural extract collections can help prioritize time-consuming but mandatory isolation efforts. In this note, we report the results of our initial exploration of a collection of 1,600 plant extracts in the context of a drug-discovery effort. After describing the taxonomic coverage of this collection, we present the results of its liquid chromatography high-resolution mass spectrometric profiling and the exploitation of these profiles using computational solutions. The resulting annotated mass spectral dataset and associated chemical and taxonomic metadata are made available to the community, and data reuse cases are proposed. We are currently continuing our exploration of this plant extract collection for drug-discovery purposes (notably looking for novel antitrypanosomatid, anti-infective, and prometabolic compounds) and ecometabolomics insights. We believe that such a dataset can be exploited and reused by researchers interested in computational natural products exploration.

Keywords
  • LC-MS
  • biodiversity digitization
  • chemodiversity
  • drug discovery
  • mass spectrometry
  • metabolomics
  • natural products
  • open science
  • plant extracts collection
Citation (ISO format)
ALLARD, Pierre-Marie et al. Open and reusable annotated mass spectrometry dataset of a chemodiverse collection of 1,600 plant extracts. In: Gigascience, 2023, vol. 12, p. giac124. doi: 10.1093/gigascience/giac124
Main files (1)
Article (Published version)
ISSN of the journal: 2047-217X

Technical information

Creation: 01/18/2023 9:30:00 AM
First validation: 01/18/2023 9:30:00 AM
Update time: 03/16/2023 10:30:39 AM
Status update: 03/16/2023 10:30:38 AM
Last indexation: 02/01/2024 9:29:33 AM