CAMPAREE: a robust and configurable RNA expression simulator release_nisoplyczzewxojuyzclpqjonq

by Nicholas F. Lahens, Thomas G. Brooks, Dimitra Sarantopoulou, Soumyashant Nayak, Cris Lawrence, Antonijo Mrčela, Anand Srinivasan, Jonathan Schug, John B. Hogenesch, Yoseph Barash, Gregory R. Grant

Published in BMC Genomics by Springer Science and Business Media LLC.

2021   Volume 22, Issue 1, p692

Abstract

<jats:title>Abstract</jats:title><jats:sec> <jats:title>Background</jats:title> The accurate interpretation of RNA-Seq data presents a moving target as scientists continue to introduce new experimental techniques and analysis algorithms. Simulated datasets are an invaluable tool to accurately assess the performance of RNA-Seq analysis methods. However, existing RNA-Seq simulators focus on modeling the technical biases and artifacts of sequencing, rather than on simulating the original RNA samples. A first step in simulating RNA-Seq is to simulate RNA. </jats:sec><jats:sec> <jats:title>Results</jats:title> To fill this need, we developed the <jats:underline>C</jats:underline>onfigurable <jats:underline>A</jats:underline>nd <jats:underline>M</jats:underline>odular <jats:underline>P</jats:underline>rogram <jats:underline>A</jats:underline>llowing <jats:underline>R</jats:underline>NA <jats:underline>E</jats:underline>xpression <jats:underline>E</jats:underline>mulation (CAMPAREE), a simulator using empirical data to simulate diploid RNA samples at the level of individual molecules. We demonstrated CAMPAREE's use for generating idealized coverage plots from real data, and for adding the ability to generate allele-specific data to existing RNA-Seq simulators that do not natively support this feature. </jats:sec><jats:sec> <jats:title>Conclusions</jats:title> Separating input sample modeling from library preparation/sequencing offers added flexibility for both users and developers to mix-and-match different sample and sequencing simulators to suit their specific needs. Furthermore, the ability to maintain sample and sequencing simulators independently provides greater agility to incorporate new biological findings about transcriptomics and new developments in sequencing technologies. Additionally, by simulating at the level of individual molecules, CAMPAREE has the potential to model molecules transcribed from the same genes as a heterogeneous population of transcripts with different states of degradation and processing (splicing, editing, etc.). CAMPAREE was developed in Python, is open source, and freely available at <jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/itmat/CAMPAREE">https://github.com/itmat/CAMPAREE</jats:ext-link>. </jats:sec>
In application/xml+jats format

Archived Files and Locations

application/pdf  1.2 MB
file_aodh2oonvrfg5fpje2llp7cl64
bmcgenomics.biomedcentral.com (publisher)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  article-journal
Stage   published
Date   2021-09-25
Language   en ?
Container Metadata
Open Access Publication
In DOAJ
In ISSN ROAD
In Keepers Registry
ISSN-L:  1471-2164
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: c1df9ce9-c22e-420c-89b7-6199297b7573
API URL: JSON