Removing Batch Effects in Large Liquid Chromatography-Mass Spectrometry Experiments

May 7, 2024

News

Article

A paper published by Nature Communications proposes a suite of batch effect removal neural networks (BERNN) to remove batch effects in large liquid chromatography-mass spectrometry (LC-MS) experiments, with the goal of maximizing sample classification performance between conditions.

The paper states that, while LC-MS is a powerful method for profiling complex biological samples, batch effects typically arise due to the omnipresence of confounding factors, which can be divided into those biological in nature (such as age or gender) and non-biological (such as batch effects). Non-biological factors are practically unavoidable in large-scale studies, due to limitations in instrument availability and timeline of sample collection. Ideally, batch effects would be removed from the final biological quantification value. It can be difficult to remove batch effects completely without the quality of the biological signal being affected. These effects can significantly impact the interpretability of results. Correcting batch effects is crucial for the reproducibility of omics research. Current methods, however, are not optimal for the removal of batch effects without compressing the genuine biological variation under study.

The authors in this multi-affiliated paper, representing laboratories in the United States, Canada, the Netherlands, France, and the United Kingdom, present an approach to countering batch effects that is different from most other solutions, as they do not rely on a single solution. Instead, they acknowledge that not all problems require the same solution and propose multiple potential solutions to address batch effects. They therefore aim to empower researchers to easily try multiple methods simultaneously, and then pick the optimal approach for their dataset and scientific questions.

Amongst this suite of models, the authors present the first use of Variational Autoencoders (VAE), Domain Adversarial Neural Networks (DANN), and Domain Inverse Triplet Loss (invTriplet) for batch correction in LC-MS. Furthermore, in contrast to other batch correction methods, they do not recommend using the corrected output of the autoencoder for biomarker discovery through downstream analysis (for example, using differential analysis). Rather, they demonstrate in their paper how SHapley Additive exPlanations (SHAP), a game theoretic approach explaining the output of any machine learning model which connects optimal credit allocation with local explanations using the classic Shapley values from game theory and their related extensions (2), can be used for biomarker discovery.

Comparison of batch effect correction methods across five diverse datasets presented in the paper (Alzheimer’s, Adenocarcinoma, aging mice, benchmark, and mixed tissues) demonstrated that BERNN models consistently showed the strongest sample classification performance. However, the model producing the greatest classification improvements did not always perform best in terms of batch effect removal.

The paper also presented findings that the overcorrection of batch effects resulted in the loss of some essential biological variability. These findings highlight the importance of balancing batch effect removal while preserving valuable biological diversity in large-scale LC-MS experiments.

The authors believe that, through their findings and this resulting paper, their contribution to researchers who are facing batch effect problems is threefold. First, they have demonstrated the effectiveness of models that, to their knowledge, have never been applied in LC-MS experiments to correct batch effects. Secondly, they showed the necessity of trying different models to solve different problems. Finally, they show that, to obtain the best classification on a given dataset, removing parts of the batch effects can improve the results, but removing too many batch effects might come at the cost of diminished classification performance.

References

Pelletier, S.J., Leclercq, M., Roux-Dalvai, F. et al. BERNN: Enhancing Classification of Liquid Chromatography Mass Spectrometry Data with Batch Effect Removal Neural Networks. Nat. Commun. 2024, 15, 3777 (2024). DOI: 10.1038/s41467-024-48177-5
SHAP Home Page. https://shap.readthedocs.io/en/latest (accessed 2024-05-07).

Related Content

Chinese traditional herbal medicine in casserole | Image Credit: © xb100 - stock.adobe.com

UHPLC-Q-Orbitrap-MS/MS Used to Identify Active Compounds in Traditional Chinese Medicine

May 17th 2024

Article

Scientists from Changchun University of Chinese Medicine used UHPLC-Q-Orbitrap-MS/MS analysis to identify the active compounds in TongFu XieXia Decoction for treating intestinal obstruction.

Scenery of Linyi City, Shandong, China | Image Credit: © 昊周 - stock.adobe.com

UHPLC–MS/MS to Study Effects of Traditional Chinese Medicine

May 16th 2024

Article

Scientists from the Lunan Pharmaceutical Group explored the use of UHPLC-MS/MS to test the effects of Shouhui Tongbian capsules (SHTBs) on slow transit constipation (STC).

set healthy herbs | Image Credit: © nikolaydonetsk - stock.adobe.com

Integrated 4D Fingerprint Quality Assessment System for Traditional Chinese Medicine

May 14th 2024

Article

Scientists from the Shenyang Pharmaceutical University in China have developed a new quality assessment system for various types of traditional Chinese medicine (TCM).

Clinical trial | Image Credit: © Microgen - stock.adobe.com

Phospholipidomics and Retention Times Measured Using New Workflow

May 13th 2024

Article

Scientists from Chongqing University Cancer Hospital recently developed a new system for analyzing the potential impact of retention time (RT) prediction on targeted LC–MS-based lipidomics.

Woman lying on the floor at home, epilepsy, unconsciousness, faint, stroke, accident or other health problem. | Image Credit: © Tunatura - stock.adobe.com

LC–QTOF-MS Identifies Degradation Products and Process-Related Substances of Brivaracetam

May 13th 2024

Article

Brivaracetam has been approved for prescription use in the last decade for treatment of patients experiencing focal, or partial, seizures.

Best of the Week: Sustainability, PFAS, and ASMS

May 10th 2024

Article

Here are the top five articles that the editors of LCGC International published this week.

Related Content

UHPLC-Q-Orbitrap-MS/MS Used to Identify Active Compounds in Traditional Chinese Medicine

May 17th 2024

Article

Scientists from Changchun University of Chinese Medicine used UHPLC-Q-Orbitrap-MS/MS analysis to identify the active compounds in TongFu XieXia Decoction for treating intestinal obstruction.

UHPLC–MS/MS to Study Effects of Traditional Chinese Medicine

May 16th 2024

Article

Scientists from the Lunan Pharmaceutical Group explored the use of UHPLC-MS/MS to test the effects of Shouhui Tongbian capsules (SHTBs) on slow transit constipation (STC).

Integrated 4D Fingerprint Quality Assessment System for Traditional Chinese Medicine

May 14th 2024

Article

Scientists from the Shenyang Pharmaceutical University in China have developed a new quality assessment system for various types of traditional Chinese medicine (TCM).

Phospholipidomics and Retention Times Measured Using New Workflow

May 13th 2024

Article

Scientists from Chongqing University Cancer Hospital recently developed a new system for analyzing the potential impact of retention time (RT) prediction on targeted LC–MS-based lipidomics.

LC–QTOF-MS Identifies Degradation Products and Process-Related Substances of Brivaracetam

May 13th 2024

Article

Brivaracetam has been approved for prescription use in the last decade for treatment of patients experiencing focal, or partial, seizures.

Best of the Week: Sustainability, PFAS, and ASMS

May 10th 2024

Article

Here are the top five articles that the editors of LCGC International published this week.