Research Interest
I have been working on developing statistical methods and platforms for cancer genome sequencing data analysis.
Variant Detection and Interpretation
Thanks to the advances in high-throughput sequence technologies,
genome-wide identification of genomic and transcriptome aberrations is
now possible. However, due to ambiguities of short read alignment and
sequence errors, detection of several types of genomic and transcriptome changes
(such as structural variations, gene fusions, complex indels and so on) is still challenging issue.
Furthermore, elucidation of the relationships among inter-omics somatic changes is yet in an early phase.
I’ve been working on developing statistical methods for identifying variety of genomic and
transcriptomic changes including but not limited to
somatic mutations (Shiraishi et al., Nucleic Acids Research, 2013),
gene fusions (Shiraishi et al., PLoS One, 2014),
structural variations (GenomonSV),
which have been integrated into our cancer genome analytical platform.
In addition, I have devised a novel statistical approach for extracting characteristic patterns of somatic mutations
(so-called mutation signatures), in which the mutation signatures are more robust and interpretable compared to previous approaches (Shiraishi et al., PLoS Genetics, 2015).
Currently, I’m especially focusing on the relationships between genomic transcriptomic changes.
We have developed a novel statistical framework to identify somatic variants causing splicing changes,
and applied it to ~9,000 cancer exome and transcriptome sequencing data,
generating a catalog of splicing associated variants (Shiraishi et al., Genome Research, 2018).
The next step would be to obtain more comprehensive relationships through further analysis (especially whole genome sequencing data). Also, I would like to tailor these insights to precision medicine.
I have worked on developing cancer genome analytical pipeline,
Genomon,
and have contributed to detections of a number of novel cancer drivers in
myelodysplasia (Yoshida, Sanada, Shiraishi et al., Nature, 2011),
kidney cancers (Sato, Yoshizato, Shiraishi et al., Nature Genetics, 2013),
adult T cell leukemia/lymphoma (Kataoka K, Nagata Y, Kitanaka A, Shiraishi Y et al., Nature Genetics, 2015) and so on.
Furthermore, we have identified common aberration of an immune checkpoint gene in multiple cancers
(Kataoka, Shiraishi et al., Nature, 2016) through pan-cancer analysis.
Also, we have developed a software for generating dynamic reports
that are frequently necessary for the post-analytical phases of cancer genome studies
(Okada et al., bioRxiv, 2017).
Now, the bioinformatics community is rapidly shifting to use cloud computing,
which is highly expected to enhance sharing of the excessive amount of sequencing data rapidly accumulating all over the world,
as well as analytical workflows developed by each research institute.
I’m very interested in devising an effective framework for using cloud computing to cancer genome sequencing data.
Selected Publications
Preprint
- Shiraishi Y, Koya J, Chiba K, Saito Y, Okada A, Kataoka K.
Precise characterization of somatic structural variations and mobile element insertions from paired long-read sequencing data with nanomonsv.
bioRxiv, doi: https://doi.org/10.1101/2020.07.22.214262.
Software.
- Shiraishi Y, Chiba K, Okada A.
SF3B1ness score: screening SF3B1 mutation status from over 60,000 transcriptomes based on a machine learning approach.
bioRxiv, doi: https://doi.org/10.1101/572834.
Software.
Peer reviewed
The only peer-reviewed journals to which I have contributed as a first, co-first or corresponding author are shown.
The more comprehensive list may be found here.
- PCAWG Transcriptome Core Group et al.
Genomic basis for RNA alterations in cancer.
Nature. 2020 Feb;578(7793):129-136.
- Shiraishi Y, Kataoka K, Chiba K, Okada A, Kogure Y, Tanaka H, Ogawa S, Miyano S.
A comprehensive characterization of cis-acting splicing-associated variants in human cancer.
Genome Res. 2018 Aug;28(8):1111-1125.
- Okada A, Chiba K, Tanaka H, Miyano S, Shiraishi Y.
A framework for generating interactive reports for cancer genome analysis.
Journal of Open Source Software. 2017 2(20),457, doi:10.21105/joss.00457.
Software.
- Kataoka K*, Shiraishi Y*, Takeda Y* et al.
Aberrant PD-L1 expression through 3’-UTR disruption in multiple cancers.
Nature. 2016 Jun 16;534(7607):402-6.
- Shiraishi Y, Tremmel G, Miyano S, Stephens M.
A Simple Model-Based Approach to Inferring and Visualizing Cancer Mutation Signatures.
PLoS Genet. 2015 Dec 2;11(12):e1005657.
Software.
- Kataoka K*, Nagata Y*, Kitanaka A*, Shiraishi Y*, Shimamura T*, Yasunaga J*, Totoki Y* et al.
Integrated molecular analysis of adult T cell leukemia/lymphoma.
Nat Genet. 2015 Nov;47(11):1304-15.
- Chiba K*, Shiraishi Y*, Nagata Y, Yoshida K, Imoto S, Ogawa S, Miyano S.
Genomon ITDetector: a tool for somatic internal tandem duplication detection from cancer genome sequencing data.
Bioinformatics. 2015 Jan 1;31(1):116-8.
Software.
- Shiraishi Y, Fujimoto A, Furuta M et al.
Integrated analysis of whole genome and transcriptome sequencing reveals diverse transcriptomic aberrations driven by somatic genomic changes in liver cancers.
PLoS One. 2014 Dec 19;9(12):e114263.
- Sato Y*, Yoshizato T*, Shiraishi Y*, Maekawa S*, Okuno Y* et al.
Integrated molecular analysis of clear-cell renal cell carcinoma.
Nat Genet. 2013 Aug;45(8):860-7.
- Shiraishi Y, Sato Y, Chiba K, Okuno Y, Nagata Y, Yoshida K, Shiba N, Hayashi Y, Kume H, Homma Y, Sanada M, Ogawa S, Miyano S.
An empirical Bayesian framework for somatic mutation detection from cancer genome sequencing data.
Nucleic Acids Res. 2013 Apr;41(7):e89.
Software.
- Yoshida K*, Sanada M*, Shiraishi Y*, Nowak D*, Nagata Y* et al.
Frequent pathway mutations of splicing machinery in myelodysplasia.
Nature. 2011 Sep 11;478(7367):64-9.
- Shiraishi Y, Okada-Hatakeyama M, Miyano S.
A rank-based statistical test for measuring synergistic effects between two gene sets.
Bioinformatics. 2011 Sep 1;27(17):2399-405.
Software.
- Shiraishi Y, Kimura S, Okada M.
Inferring cluster-based networks from differently stimulated multiple time-course gene expression data.
Bioinformatics. 2010 Apr 15;26(8):1073-81.
Curriculum Vitae
Academic qualifications
- BSc (Engineering), Mathematical Engineering Course, Department of Mathematical Engineering and Information Physics, Faculty of Engineering, The University of Tokyo, 2003.
- MSc (Information Science and Technology), Department of Mathematical Informatics, Graduate School of Information Science and Technology, The University of Tokyo, 2005.
- PhD (Statistical Science), Department of Statistical Science, The Graduate University for Advanced Studies, 2008.
Professional Records
- Postdoctoral Researcher, Cellular Systems Modeling Team, RIKEN Research Center for Allergy and Immunology, Japan (Apr. 2008 – Sep. 2010).
- Postdoctoral Researcher, Laboratory of DNA Information Analysis, Human Genome Center, Institute of Medical Science, The University of Tokyo, Japan (Oct. 2010 – May. 2012).
- Project Assistant Professor, Laboratory of DNA Information Analysis, Human Genome Center, Institute of Medical Science, The University of Tokyo, Japan (Jun. 2012 – Mar. 2014).
- Assistant Professor, Laboratory of DNA Information Analysis, Human Genome Center, Institute of Medical Science, The University of Tokyo, Japan (Apr. 2014 – Now).
- Visiting Scholar, Department of Human Genetics, The University of Chicago. USA. (Apr. 2014 - Nov. 2014).
- Unit Leader, Division of Cellular Signaling, National Cancer Center Research Institute, Japan (Apr. 2018 – Mar. 2020).
- Chief, Division of Genome Analysis Platform Development, National Cancer Center Research Institute, Japan (Apr. 2020 – Now).