Photo by Shopify Partners on burst

Colorectal cancer study

Last updated on Aug 28, 2019

During my doctoral training, most of my research focused on developing novel statistical methods for genomics data, specifically for studying cancer development and tumor growth. This project has inspired me to work on several interesting aspects of the problem, including hierarchical topic models, clustering, and interactive interfaces. Thanks to my advisor, Kimberly Siegmund, and committee members Paul Marjoram and Darryl Shibata.

Motivations

Topic models have been widely applied to extract topics from a wide range of documents and text collections, e.g., online customer reviews, medical records, scientific journals, legal documents, and books. They let us quickly grasp the most prominent and commonly shared information embedded in the texts without reading the entire collection. In addition, topic models allow us to assess the contribution of each topic and how it is represented across different documents. Human genomes are exposed to an assortment of mutational processes, each of which contributes a unique pattern of somatic mutations. What would happen if we applied the same concept to the somatic mutations obtained from cancer patients and looked for “topics” of mutations? What would these “topics” tell us about our health, genetics, risk factors for cancer, and anything else that slips under the radar?

News

Ever want to compare mutational signatures between different cancer types? Check out @zhiiiyang 's approach @thePeerJ https://t.co/TbCgXlmwlm with software available @Bioconductor #Bioinformatics #Genomics #Statistic #CancerResearch #USCBiostat
— Kim Siegmund (@KimSiegmund1) August 28, 2019

Six days after the paper being accepted, the package also got accepted to Bioconductor! I have to say the reviewer team truly made them much better. https://t.co/5lkhRlSXVn pic.twitter.com/FR3JPU9W8b
— Zhi Yang, PhD (@zhiiiyang) July 31, 2019

#IMAGEP01 investigators preparing for a great day of science @uscphs #KeckSOM #USC pic.twitter.com/DHxVxwYaqq
— USC Biostatistics (@USCBiostat) June 12, 2019

R Bayesian inference Statistical analysis

Zhi Yang

Senior Manager, Biostatistics

Publications

Models that combine transcriptomic with spatial protein information exceed the predictive value for either single modality

Ioannis A. Vathiotis, Zhi Yang, Jason Reeves, Maria Toki, Thazin Nwe Aung, Pok Fai Wong, Harriet Kluger, Konstantinos N. Syrigos, Sarah Warren, David L. Rimm

iMutSig - a web application to identify the most similar mutational signature using shiny

Zhi Yang, Priyatama Pandey, Paul Marjoram, Kimberly D. Siegmund

Mutational signatures in colon cancer

Priyatama Pandey, Zhi Yang, Darryl Shibata, Paul Marjoram, Kimberly D. Siegmund

HiLDA - a statistical approach to investigate differences in mutational signatures