Saer Samanipour from the Van ‘t Hoff Institute for Molecular Sciences (HIMS) at the University of Amsterdam spoke to LCGC International about the benefits of a data-driven reversed-phase liquid chromatography (RPLC) approach his team developed to enhance RPLC method development, including increased efficiency for non-targeted analysis and suspect screening, a reduction in the amount of false positives produced, and a predictive way to determine if a chemical can be separated using RPLC.
You recently published a paper entitled “Exploring The Chemical Subspace of RPLC: A Data-driven Approach” (1). What was the rationale behind this research? Why is exploring the chemical sub space important and what applications did you explore?
The main hypothesis here is not all organic small molecules can actually be analyzed using reversed-phase liquid chromatography (RPLC) or other specific selectivity. Moreover, it is difficult—almost impossible—to say what organic molecules are measurable by a specific method. At the moment, this is mainly done by assuming a linear relationship between the hydrophobicity of chemicals and their retention behaviour. We have seen time and time again that this assumption is not an accurate one, implying that a lack of detection for a chemical does not necessarily mean that it absent in the sample. Therefore, we decided to see whether the structure of a chemical can give us enough information to assess its measurability by RPLC.
In terms of application, this approach has been mainly used for the method development and structural elucidation during non-target analysis. As an example, you could easily exclude chemicals that are not measurable using RPLC when screening your samples against large databases such as Norman SuSDat. This reduces the number of candidates and consequently the number of false identifications.
Can you expand on your findings on what this data-driven approach discovered?
We have shown for the first time that molecular fingerprints alone, when optimized, have enough structural information to be used in QSAR models.
Around 20000 environmentally-relevant chemicals in Norman SuSDat database are not measurable with RPLC and need a different separation strategy. It should be noted that this does not mean that all measurable chemicals with RPLC can be separated in one single run.
The approach can help streamline the identification of compounds from complex environmental and biological samples by focusing on chemicals that are realistically detectable using RPLC, leading to more accurate and faster analysis.
What is novel about this approach and why is it useful to separation scientists?
This approach reduces false positives in chemical analysis. Separation scientists, particularly those working in non-targeted analysis (NTA), often face the challenge of dealing with false positives—chemicals that are predicted to be present in samples but are undetectable by the chosen method (in this case, RPLC). This approach directly addresses that by narrowing down the list of candidate chemicals to those likely to be retained in RPLC.
For suspect screening, it helps in filtering out compounds that will not elute properly, thereby reducing the list of potential matches. This significantly lowers the number of false positives and reduces the computational resources needed to process large datasets.
Additionally, this approach increases efficiency in NTA and suspect screening. By identifying chemicals that are outside the RPLC subspace, this approach allows scientists to focus on analyzable compounds, thereby streamlining the process of chemical analysis. Instead of spending time on compounds that cannot be detected, separation scientists can target those that fit within the method’s capabilities.
This model also saves computational time during suspect screening by automatically filtering out chemicals that will not fit the RPLC separation, thus accelerating the analysis workflow.
It also guides method development by offering separation scientists a predictive understanding of whether a given chemical can be separated using RPLC. This is particularly useful in method development, where knowing the chemical space covered by RPLC can inform decisions on solvent gradients, column selection, and mobile phase compositions. By applying this model early in the method development process, analysts can determine whether RPLC is suitable for a specific sample or whether an alternative chromatography method (such as hydrophilic interaction LC [HILIC]) is necessary, thus avoiding trial-and-error experimentation.
Are you planning to extend this approach further?
We are planning to expand this to other selectivities as well as the detection via mass spectrometry. The ultimate goal here is to comprehensively map the fraction of chemical space that is measurable with our current analytical technologies.
(1) van Herwerden, D.; Nikolopoulos, A.; Barron, L. P.; O’Brien, J. W.; Pirok, B. W. J.; Thomas, K. V.; Samanipour, S. Exploring the Chemical Subspace of RPLC: A Data-Driven Approach. Anal. Chim. Acta 2024, 1317, 342869. DOI: 10.1016/j.aca.2024.342869
LCGC’s Year in Review: Highlights in Liquid Chromatography
December 20th 2024This collection of technical articles, interviews, and news pieces delves into the latest innovations in LC methods, including advance in high performance liquid chromatography (HPLC), ultrahigh-pressure liquid chromatography (UHPLC), liquid chromatography–mass spectrometry (LC–MS), and multidimensional LC.
Using Chromatography to Study Microplastics in Food: An Interview with Jose Bernal
December 16th 2024LCGC International sat down with Jose Bernal to discuss his latest research in using pyrolysis gas chromatography–mass spectrometry (Py-GC–MS) and other chromatographic techniques in studying microplastics in food analysis.
The Use of SPME and GC×GC in Food Analysis: An Interview with Giorgia Purcaro
December 16th 2024LCGC International sat down with Giorgia Purcaro of the University of Liege to discuss the impact that solid-phase microextraction (SPME) and comprehensive multidimensional gas chromatography (GC×GC) is having on food analysis.
The Future of Digital Method Development: An Interview with Anne Marie Smith
December 12th 2024Following the HPLC 2024 Conference in Denver, Colorado, LCGC International spoke with Anne Marie Smith of ACD/Labs about the new ICH Q14 guidelines and how they impact analytical scientists and their work.