Abstract

Here, we develop and apply a semi-quantitative method for the high-confidence identification of pseudouridylated sites on mammalian mRNAs via direct long-read nanopore sequencing. A comparative analysis of a modification-free transcriptome reveals that the depth of coverage and specific k-mer sequences are critical parameters for accurate basecalling. By adjusting these parameters for high-confidence U-to-C basecalling errors, we identify many known sites of pseudouridylation and uncover previously unreported uridine-modified sites, many of which fall in k-mers that are known targets of pseudouridine synthases. Identified sites are validated using 1000-mer synthetic RNA controls bearing a single pseudouridine in the center position, demonstrating systematic under-calling using our approach. We identify mRNAs with up to 7 unique modification sites. Our workflow allows direct detection of low-, medium-, and high-occupancy pseudouridine modifications on native RNA molecules from nanopore sequencing data and multiple modifications on the same strand.

Download full-text PDF

Link Source
Download Source 1https://www.nature.com/articles/s41467-023-35858-w?error=cookies_not_supported&code=38fc6d21-8a71-41e2-95ad-9dfb587fc00dWeb Search
Download Source 2http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9852470PMC
Download Source 3http://dx.doi.org/10.1038/s41467-023-35858-wDOI Listing

Publication Analysis

Top Keywords

pseudouridine modifications
8
mrnas direct
8
direct long-read
8
nanopore sequencing
8
sites
5
semi-quantitative detection
4
pseudouridine
4
detection pseudouridine
4
modifications type
4
type i/ii
4