Removing Batch Effects in Large Liquid Chromatography-Mass Spectrometry Experiments

May 7, 2024

News

Article

A paper published by Nature Communications proposes a suite of batch effect removal neural networks (BERNN) to remove batch effects in large liquid chromatography-mass spectrometry (LC-MS) experiments, with the goal of maximizing sample classification performance between conditions.

The paper states that, while LC-MS is a powerful method for profiling complex biological samples, batch effects typically arise due to the omnipresence of confounding factors, which can be divided into those biological in nature (such as age or gender) and non-biological (such as batch effects). Non-biological factors are practically unavoidable in large-scale studies, due to limitations in instrument availability and timeline of sample collection. Ideally, batch effects would be removed from the final biological quantification value. It can be difficult to remove batch effects completely without the quality of the biological signal being affected. These effects can significantly impact the interpretability of results. Correcting batch effects is crucial for the reproducibility of omics research. Current methods, however, are not optimal for the removal of batch effects without compressing the genuine biological variation under study.

The authors in this multi-affiliated paper, representing laboratories in the United States, Canada, the Netherlands, France, and the United Kingdom, present an approach to countering batch effects that is different from most other solutions, as they do not rely on a single solution. Instead, they acknowledge that not all problems require the same solution and propose multiple potential solutions to address batch effects. They therefore aim to empower researchers to easily try multiple methods simultaneously, and then pick the optimal approach for their dataset and scientific questions.

Amongst this suite of models, the authors present the first use of Variational Autoencoders (VAE), Domain Adversarial Neural Networks (DANN), and Domain Inverse Triplet Loss (invTriplet) for batch correction in LC-MS. Furthermore, in contrast to other batch correction methods, they do not recommend using the corrected output of the autoencoder for biomarker discovery through downstream analysis (for example, using differential analysis). Rather, they demonstrate in their paper how SHapley Additive exPlanations (SHAP), a game theoretic approach explaining the output of any machine learning model which connects optimal credit allocation with local explanations using the classic Shapley values from game theory and their related extensions (2), can be used for biomarker discovery.

Comparison of batch effect correction methods across five diverse datasets presented in the paper (Alzheimer’s, Adenocarcinoma, aging mice, benchmark, and mixed tissues) demonstrated that BERNN models consistently showed the strongest sample classification performance. However, the model producing the greatest classification improvements did not always perform best in terms of batch effect removal.

The paper also presented findings that the overcorrection of batch effects resulted in the loss of some essential biological variability. These findings highlight the importance of balancing batch effect removal while preserving valuable biological diversity in large-scale LC-MS experiments.

The authors believe that, through their findings and this resulting paper, their contribution to researchers who are facing batch effect problems is threefold. First, they have demonstrated the effectiveness of models that, to their knowledge, have never been applied in LC-MS experiments to correct batch effects. Secondly, they showed the necessity of trying different models to solve different problems. Finally, they show that, to obtain the best classification on a given dataset, removing parts of the batch effects can improve the results, but removing too many batch effects might come at the cost of diminished classification performance.

References

Pelletier, S.J., Leclercq, M., Roux-Dalvai, F. et al. BERNN: Enhancing Classification of Liquid Chromatography Mass Spectrometry Data with Batch Effect Removal Neural Networks. Nat. Commun. 2024, 15, 3777 (2024). DOI: 10.1038/s41467-024-48177-5
SHAP Home Page. https://shap.readthedocs.io/en/latest (accessed 2024-05-07).

Related Content

Abstract flying dragons on a dark blue background. Technological background for design on the topic of artificial intelligence, neural networks, big data | Image Credit: © LariBat - stock.adobe.com

Predicting Concentration Profiles in Gradient LC Using Neural Networks

Aaron Acevedo

March 13th 2025

Article

Physics-informed neural networks were tested for their capabilities in predicting concentration profiles in gradient liquid chromatography.

The laboratory scientist prepares samples for download to High-performance Liquid Chromatograph Mass Spectrometr. | Image Credit: © Sodel Vladyslav - stock.adobe.com

New Study Investigates Optimizing Extra-Column Band Broadening in Micro-flow Capillary LC

Aaron Acevedo

March 12th 2025

Article

Shimadzu Corporation and Vrije Universiteit Brussel researchers recently investigated how extra-column band broadening (ECBB) can be optimized in micro-flow capillary liquid chromatography.

Scientist injecting a sample via the autosampler of HPLC system. High performance liquid chromatography at chemical laboratory. Developing of pharmaceuticals or vaccine. Biochemistry analysis | Image Credit: © lightpoet - stock.adobe.com

New Study Reviews 2D-LC for Natural Product Analysis

Aaron Acevedo

March 12th 2025

Article

A new review article provides an overview of the use of two-dimensional liquid chromatography (2D-LC) for the quantitative analysis of natural products.

Agilent Reports Financial Results from Q1 of 2025

Aaron Acevedo

March 11th 2025

Article

The company reported $1.68 billion in revenue at a 25.1% operating margin across its businesses in America, Europe, and Asia.

Optimizing Wastewater Treatment for Emerging Micropollutants Using Electrochemical Advanced Oxidation Processes and UHPLC–HRMS

Kate Jones

March 11th 2025

Article

Nadia Gadi and Allisson Barros de Souza spoke to LCGC International about their investigation into the application of electrochemical advanced oxidation processes (eAOPs) for the efficient removal of pharmaceutical residues in wastewater.

Ribonucleic acid strands consisting of nucleotides important for protein bio-synthesis entering cell wall | Image Credit: © Christoph Burgstedt - stock.adobe.com

Hydrophilic Interaction Liquid Chromatography for Oligonucleotide Therapeutics: Method Considerations

Szabolcs Fekete;Makda Araya;Balasubrahmanyam Addepalli;Matthew A. Lauber

March 10th 2025

Article

Hydrophilic interaction liquid chromatography (HILIC) has emerged as a promising alternative to traditional ion-pair reversed phase liquid chromatography (IP-RPLC) methods for separating oligonucleotides (ON). This work investigates the application of HILIC to the separation of ON sequence and length variants, duplexes, and single-stranded components.