PRADA: Pipeline for RNA sequencing data analysis

Wandaliz Torres-García; Siyuan Zheng; Andrey Sivachenko; Rahulsimham Vegesna; Qianghu Wang; Rong Yao; Michael F. Berger; John N. Weinstein; Gad Getz; Roel G.W. Verhaak

doi:10.1093/bioinformatics/btu169

Bioinformatics & Computational Biology

Research output: Contribution to journal › Article › peer-review

121 Scopus citations

Abstract

Summary: Technological advances in high-throughput sequencing necessitate improved computational tools for processing and analyzing large-scale datasets in a systematic automated manner. For that purpose, we have developed PRADA (Pipeline for RNA-Sequencing Data Analysis), a flexible, modular and highly scalable software platform that provides many different types of information available by multifaceted analysis starting from raw paired-end RNA-seq data: gene expression levels, quality metrics, detection of unsupervised and supervised fusion transcripts, detection of intragenic fusion variants, homology scores and fusion frame classification. PRADA uses a dual-mapping strategy that increases sensitivity and refines the analytical endpoints. PRADA has been used extensively and successfully in the glioblastoma and renal clear cell projects of The Cancer Genome Atlas program. Availability and implementation: http://sourceforge.net/projects/prada/.

Original language	English (US)
Pages (from-to)	2224-2226
Number of pages	3
Journal	Bioinformatics
Volume	30
Issue number	15
DOIs	https://doi.org/10.1093/bioinformatics/btu169
State	Published - Aug 1 2014

ASJC Scopus subject areas

Statistics and Probability
Biochemistry
Molecular Biology
Computer Science Applications
Computational Theory and Mathematics
Computational Mathematics

MD Anderson CCSG core facilities

Bioinformatics Shared Resource

Access to Document

10.1093/bioinformatics/btu169

Cite this

@article{a8204fcacaf3496fa43e2ee2833653d4,

title = "PRADA: Pipeline for RNA sequencing data analysis",

abstract = "Summary: Technological advances in high-throughput sequencing necessitate improved computational tools for processing and analyzing large-scale datasets in a systematic automated manner. For that purpose, we have developed PRADA (Pipeline for RNA-Sequencing Data Analysis), a flexible, modular and highly scalable software platform that provides many different types of information available by multifaceted analysis starting from raw paired-end RNA-seq data: gene expression levels, quality metrics, detection of unsupervised and supervised fusion transcripts, detection of intragenic fusion variants, homology scores and fusion frame classification. PRADA uses a dual-mapping strategy that increases sensitivity and refines the analytical endpoints. PRADA has been used extensively and successfully in the glioblastoma and renal clear cell projects of The Cancer Genome Atlas program. Availability and implementation: http://sourceforge.net/projects/prada/.",

author = "Wandaliz Torres-Garc{\'i}a and Siyuan Zheng and Andrey Sivachenko and Rahulsimham Vegesna and Qianghu Wang and Rong Yao and Berger, {Michael F.} and Weinstein, {John N.} and Gad Getz and Verhaak, {Roel G.W.}",

year = "2014",

month = aug,

day = "1",

doi = "10.1093/bioinformatics/btu169",

language = "English (US)",

volume = "30",

pages = "2224--2226",

journal = "Bioinformatics",

issn = "1367-4803",

publisher = "Oxford University Press",

number = "15",

}

TY - JOUR

T1 - PRADA

T2 - Pipeline for RNA sequencing data analysis

AU - Torres-García, Wandaliz

AU - Zheng, Siyuan

AU - Sivachenko, Andrey

AU - Vegesna, Rahulsimham

AU - Wang, Qianghu

AU - Yao, Rong

AU - Berger, Michael F.

AU - Weinstein, John N.

AU - Getz, Gad

AU - Verhaak, Roel G.W.

PY - 2014/8/1

Y1 - 2014/8/1

N2 - Summary: Technological advances in high-throughput sequencing necessitate improved computational tools for processing and analyzing large-scale datasets in a systematic automated manner. For that purpose, we have developed PRADA (Pipeline for RNA-Sequencing Data Analysis), a flexible, modular and highly scalable software platform that provides many different types of information available by multifaceted analysis starting from raw paired-end RNA-seq data: gene expression levels, quality metrics, detection of unsupervised and supervised fusion transcripts, detection of intragenic fusion variants, homology scores and fusion frame classification. PRADA uses a dual-mapping strategy that increases sensitivity and refines the analytical endpoints. PRADA has been used extensively and successfully in the glioblastoma and renal clear cell projects of The Cancer Genome Atlas program. Availability and implementation: http://sourceforge.net/projects/prada/.

AB - Summary: Technological advances in high-throughput sequencing necessitate improved computational tools for processing and analyzing large-scale datasets in a systematic automated manner. For that purpose, we have developed PRADA (Pipeline for RNA-Sequencing Data Analysis), a flexible, modular and highly scalable software platform that provides many different types of information available by multifaceted analysis starting from raw paired-end RNA-seq data: gene expression levels, quality metrics, detection of unsupervised and supervised fusion transcripts, detection of intragenic fusion variants, homology scores and fusion frame classification. PRADA uses a dual-mapping strategy that increases sensitivity and refines the analytical endpoints. PRADA has been used extensively and successfully in the glioblastoma and renal clear cell projects of The Cancer Genome Atlas program. Availability and implementation: http://sourceforge.net/projects/prada/.

UR - http://www.scopus.com/inward/record.url?scp=84904994551&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84904994551&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/btu169

DO - 10.1093/bioinformatics/btu169

M3 - Article

C2 - 24695405

AN - SCOPUS:84904994551

SN - 1367-4803

VL - 30

SP - 2224

EP - 2226

JO - Bioinformatics

JF - Bioinformatics

IS - 15

ER -
