Interpreting Viral Deep Sequencing Data with GLUE

by Joshua Singer, Emma Thomson, Joseph Hughes, Elihu Aranday-Cortes, John McLauchlan, Ana da Silva Filipe, Lily Tong, Carmen Manso, Robert Gifford, David Robertson, Eleanor Barnes, M. Ansari (+4 others)

Published in Viruses by MDPI AG.

2019   Volume 11, Issue 4, p323


Using deep sequencing technologies such as Illumina's platform, it is possible to obtain reads from the viral RNA population revealing the viral genome diversity within a single host. A range of software tools and pipelines can transform raw deep sequencing reads into Sequence Alignment/Map (SAM) files. We propose that interpretation tools should process these SAM files, directly translating individual reads to amino acids in order to extract statistics of interest such as the proportion of different amino acid residues at specific sites. This preserves per-read linkage between nucleotide variants at different positions within a codon location. The samReporter is a subsystem of the GLUE software toolkit which follows this direct read translation approach in its processing of SAM files. We test samReporter on a deep sequencing dataset obtained from a cohort of 241 UK HCV patients for whom prior treatment with direct-acting antivirals has failed; deep sequencing and resistance testing have been suggested to be of clinical use in this context. We compared the polymorphism interpretation results of the samReporter against an approach that does not preserve per-read linkage. We found that the samReporter was able to properly interpret the sequence data at resistance-associated locations in nine patients where the alternative approach was equivocal. In three cases, the samReporter confirmed that resistance or an atypical substitution was present at NS5A position 30. In three further cases, it confirmed that the sofosbuvir-resistant NS5B substitution S282T was absent. This suggests the direct read translation approach implemented is of value for interpreting viral deep sequencing data.
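The difference between direct read translation and an approach that ignores per-read linkage can be illustrated with a small sketch. This is not the samReporter implementation and does not use the GLUE API. The function names, the (start, sequence) read representation, and the toy reads are all hypothetical, and the reads are assumed to be gap-free alignments to a common reference. The sketch translates the codon carried by each read, then contrasts that with combining per-position nucleotide observations independently.

```python
from collections import Counter
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO[i]
    for i, codon in enumerate(product(BASES, repeat=3))
}


def codon_from_read(read_start, read_seq, codon_start):
    """Return the 3 bases a read carries at the codon, or None if not fully covered.

    Assumes a gap-free alignment: read_seq[0] sits at reference position read_start.
    """
    offset = codon_start - read_start
    if offset < 0 or offset + 3 > len(read_seq):
        return None
    return read_seq[offset:offset + 3]


def linked_amino_acid_counts(reads, codon_start):
    """Direct read translation: translate each read's own codon, then count residues."""
    counts = Counter()
    for read_start, read_seq in reads:
        codon = codon_from_read(read_start, read_seq, codon_start)
        if codon is not None:
            counts[CODON_TABLE[codon]] += 1
    return counts


def unlinked_amino_acid_candidates(reads, codon_start):
    """Linkage-free view: pool the bases seen at each codon position separately,
    then translate every combination of those bases."""
    seen = [set(), set(), set()]
    for read_start, read_seq in reads:
        codon = codon_from_read(read_start, read_seq, codon_start)
        if codon is not None:
            for i, base in enumerate(codon):
                seen[i].add(base)
    return {CODON_TABLE[a + b + c] for a, b, c in product(*seen)}


# Hypothetical toy reads (reference start, sequence); the codon of interest starts at position 2.
TOY_READS = [
    (0, "GGAGCTT"),  # carries AGC (serine)
    (1, "GAGCT"),    # carries AGC (serine)
    (0, "GGACATT"),  # carries ACA (threonine)
    (2, "ACATT"),    # carries ACA (threonine)
    (4, "CTT"),      # does not cover the codon, so it is skipped
]

if __name__ == "__main__":
    print(linked_amino_acid_counts(TOY_READS, 2))
    print(sorted(unlinked_amino_acid_candidates(TOY_READS, 2)))
```

In this toy data the per-read translation finds only serine and threonine, whereas the linkage-free combination also suggests arginine (AGA), a codon that no read actually carries. That spurious candidate is the kind of equivocal result the abstract attributes to approaches that do not preserve linkage.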

Type  article-journal
Stage   published
Date   2019-04-03
Language   en
DOI  10.3390/v11040323
PubMed  30987147
PMC  PMC6520954
Container Metadata
Open Access Publication
In Keepers Registry
ISSN-L:  1999-4915