Bioinformatics of high-throughput insertional mutagenesis

Keiko Akagi; Ming Yi; Jean Roayaei; Robert M. Stephens

doi:10.1007/978-1-4419-7656-7_7

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Abstract

Bioinformatics plays a critical role in handling the large amounts of sequence data generated by insertional mutagenesis. First, computational approaches are used to develop rapid sequence analysis pipelines and biological databases. Millions of reads from an insertional mutagenesis screen are rapidly mapped to genomic locations and annotated with their target genes by a pipeline, and the resulting sequence-based data are stored and managed in a database so that the information can be shared with the scientific community. Second, statistical techniques are used to distinguish true common insertion sites (loci that have been hit by insertions in multiple tumors and are therefore candidate loci for cancer genes) from background insertions in large-scale screens. Finally, advanced data mining techniques, namely pathway and network analysis, are used to give further biological meaning to insertion sites by identifying interactions among genes in cancer. In this chapter, we discuss the features of these three topics and address their future roles: (1) development of sequence analysis pipelines and databases, (2) detection of common insertion sites, and (3) network and pathway analysis of insertion sites.

Original language	English (US)
Title of host publication	Insertional Mutagenesis Strategies in Cancer Genetics
Publisher	Springer New York
Pages	167-188
Number of pages	22
ISBN (Print)	9781441976550
DOIs	https://doi.org/10.1007/978-1-4419-7656-7_7
State	Published - 2011
Externally published	Yes

ASJC Scopus subject areas

General Biochemistry, Genetics and Molecular Biology

Cite this

@inbook{e266eb119f6948b5aa754824be90a56e,
title = "Bioinformatics of high-throughput insertional mutagenesis",
abstract = "Bioinformatics plays a critical role in handling the large amounts of sequence data generated by insertional mutagenesis. First, computational approaches are used to develop rapid sequence analysis pipelines and biological databases. Millions of reads from an insertional mutagenesis screen are rapidly mapped to genomic locations and annotated with their target genes by a pipeline, and the resulting sequence-based data are stored and managed in a database so that the information can be shared with the scientific community. Second, statistical techniques are used to distinguish true common insertion sites (loci that have been hit by insertions in multiple tumors and are therefore candidate loci for cancer genes) from background insertions in large-scale screens. Finally, advanced data mining techniques, namely pathway and network analysis, are used to give further biological meaning to insertion sites by identifying interactions among genes in cancer. In this chapter, we discuss the features of these three topics and address their future roles: (1) development of sequence analysis pipelines and databases, (2) detection of common insertion sites, and (3) network and pathway analysis of insertion sites.",
author = "Akagi, Keiko and Yi, Ming and Roayaei, Jean and Stephens, {Robert M.}",
year = "2011",
doi = "10.1007/978-1-4419-7656-7_7",
language = "English (US)",
isbn = "9781441976550",
pages = "167--188",
booktitle = "Insertional Mutagenesis Strategies in Cancer Genetics",
publisher = "Springer New York",
}

TY  - CHAP
T1  - Bioinformatics of high-throughput insertional mutagenesis
AU  - Akagi, Keiko
AU  - Yi, Ming
AU  - Roayaei, Jean
AU  - Stephens, Robert M.
PY  - 2011
Y1  - 2011
AB  - Bioinformatics plays a critical role in handling the large amounts of sequence data generated by insertional mutagenesis. First, computational approaches are used to develop rapid sequence analysis pipelines and biological databases. Millions of reads from an insertional mutagenesis screen are rapidly mapped to genomic locations and annotated with their target genes by a pipeline, and the resulting sequence-based data are stored and managed in a database so that the information can be shared with the scientific community. Second, statistical techniques are used to distinguish true common insertion sites (loci that have been hit by insertions in multiple tumors and are therefore candidate loci for cancer genes) from background insertions in large-scale screens. Finally, advanced data mining techniques, namely pathway and network analysis, are used to give further biological meaning to insertion sites by identifying interactions among genes in cancer. In this chapter, we discuss the features of these three topics and address their future roles: (1) development of sequence analysis pipelines and databases, (2) detection of common insertion sites, and (3) network and pathway analysis of insertion sites.
UR  - http://www.scopus.com/inward/record.url?scp=84900567435&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=84900567435&partnerID=8YFLogxK
U2  - 10.1007/978-1-4419-7656-7_7
DO  - 10.1007/978-1-4419-7656-7_7
M3  - Chapter
AN  - SCOPUS:84900567435
SN  - 9781441976550
SP  - 167
EP  - 188
BT  - Insertional Mutagenesis Strategies in Cancer Genetics
PB  - Springer New York
ER  - 
