Abstract

Predicting the impact of coding and noncoding variants on splicing is challenging, particularly in non-canonical splice sites, leading to missed diagnoses in patients. Existing splice prediction tools are complementary but knowing which to use for each splicing context remains difficult. Here, we describe Introme, which uses machine learning to integrate predictions from several splice detection tools, additional splicing rules, and gene architecture features to comprehensively evaluate the likelihood of a variant impacting splicing. Through extensive benchmarking across 21,000 splice-altering variants, Introme outperformed all tools (auPRC: 0.98) for the detection of clinically significant splice variants. Introme is available at https://github.com/CCICB/introme .

Download full-text PDF

Link Source
Download Source 1https://genomebiology.biomedcentral.com/articles/10.1186/s13059-023-02936-7Web Search
Download Source 2http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10190034PMC
Download Source 3http://dx.doi.org/10.1186/s13059-023-02936-7DOI Listing

Publication Analysis

Top Keywords

impact coding
8
coding noncoding
8
noncoding variants
8
variants introme
8
splicing
5
introme
4
introme accurately
4
accurately predicts
4
predicts impact
4
variants
4