AdaNovo presents a novel framework for adaptive de novo peptide sequencing, addressing challenges in identifying amino acids with post-translational modifications (PTMs) and dealing with data noise. The model outperforms existing methods on a 9-species benchmark, showcasing superior performance in predicting never-before-seen peptides and identifying amino acids with PTMs.
Tandem mass spectrometry plays a crucial role in proteomics research, enabling the identification of proteins in biological samples. AdaNovo's adaptive training approach based on conditional mutual information enhances precision in peptide-level identification and robustness against data noise.
The model architecture of AdaNovo consists of a mass spectrum encoder and two peptide decoders built on the Transformer. Training strategies include amino acid-level and PSM-level adaptive training to re-weight losses based on CMI values.
Extensive experiments demonstrate AdaNovo's state-of-the-art performance, surpassing previous de novo sequencing methods. The model excels in identifying amino acids with PTMs and demonstrates higher precision levels at both amino acid and peptide levels.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Jun Xia,Shao... lúc arxiv.org 03-13-2024
https://arxiv.org/pdf/2403.07013.pdfYêu cầu sâu hơn