Arshi Arora

Research Biostatistician

Memorial Sloan Kettering Cancer Center

About me

I am a thoughtful genomics data scientist statistician with strong programming skills in R and Python, and a formal training in both Computational Biology and Biostatistics. Currently, I work as a Principal Scientist at Incyte deploying ML models to mine for novel drug targets and making my work more accessible via data visualization and Shiny and FlexDashboard apps. Previously, I worked at Memorial Sloan Kettering Cancer Center and focused on methodological work in the field of Cancer Genomics.

The right hemisphere of my brain is into ceramics, painting, DIY crafts and biking. I am a minimalist and on a personal mission to reduce what goes in my trashbin. I also co-host a podcast on Computational Biology called Computationally Yours!

Education

MS Biostatistics, 2017
Columbia University
MS Computational Biology, 2010
Carnegie Mellon University
B.Tech Biotechnology, 2008
Amity University

Skills

R

Statistics

Perl,shell

Potter (wizarding and muggle)

Minimalist

Dancer

Projects

gnomeR

Package to wrangle and visualize genomic data in R

iCluster and TCGA

Integrative clustering of TCGA datasets

panelmap

Visualization tool for clustered groups

survClust

An outcome weighted supervised clustering algorithm

Recent & Upcoming Talks

AMSTAT feature, 2021

Jun 1, 2021 12:00 AM — Jun 1, 2020 12:00 AM

panelmap at WSDS, 2020

Sep 30, 2020 12:00 AM — Oct 2, 2020 12:00 AM

BIRSBIO 2020 Hackathon

Jun 14, 2020 12:00 AM — Jun 19, 2020 12:00 AM

MS Comp Bio Lightning Talk, CMU

Apr 7, 2020 12:00 AM

ISMCO (2019)

Oct 4, 2019 12:00 AM — Oct 6, 2019 12:00 AM

See all talks

Journey so far

Principal Investigator

Incyte

Feb 2022 – Present Wilmington, DE

Responsibilities include: My role at Incyte is highly cross-functional where I work with scientists from Discovery, Pharmacology and Biology to power their biological hypothesis with data. This exposed me to various view points sometimes of the same problem and in identifying answers supported by data to key questions in drug discovery and translation. I also lead Target Identification and Validation efforts with the help of Machine Learning and other deep learning models.

Research Biostatistician

Memorial Sloan Kettering Cancer Center

May 2012 – Feb 2022 New York

Responsibilities include:

Developed survClust, a semi-supervised classification algorithm that stratifies patients into cohorts driven by their genetic background and survival.
survClust was then used in a pancancer cohort of patients treated with immune checkpoint blockade therapies to stratify patients with worst prognosis. Read more here
Lead genomics analyst of the International consortium of Melanoma (InterMEL) and building a framework for identifying false positives from tumor-only somatic mutation calling pipeline. (Glitter)
Integrated analysis of various cancer types as part of The Cancer Genome Atlas (TCGA) consortium like Liver Hepatocellular Carcinoma (LIHC), Prostate Adenocarcinoma (PRAD) and Skin Cutaneous Melanoma (SKCM) using joint latent variable model implemented in iCluster, to arrive at molecularly distinct subtypes.
Providing genomics and analytical support to faculty members of Epidemiology and Biostatistics Department at Memorial Sloan Kettering Cancer Center on a broad range of analysis like copy number and clonal evolution, mutational signature analysis, and building statistical models to identify prognostic molecular features in exome sequencing and mutation panel testing datasets.
Understanding etiological tumor heterogeneity across various molecular assays like gene expression, mutation, copy number, and epigenetic data through known clinical risk factors to characterize distinct risk groups.
Developed a validated prognostic gene risk score of colorectal cancer liver metastasis patients.

Arshi Arora

Research Biostatistician

Memorial Sloan Kettering Cancer Center

About me

Education

Skills

R

Statistics

Perl,shell

Potter (wizarding and muggle)

Minimalist

Dancer

Projects

gnomeR

iCluster and TCGA

panelmap

survClust

Recent & Upcoming Talks

Recent Posts

A brief primer on scientific and mathematical notations

Academic Hugo Theme via Blogdown: Few more details and deployment (part 2)

Academic Hugo Theme via Blogdown: Where to start?

Journey so far

Principal Investigator

Incyte

Research Biostatistician

Memorial Sloan Kettering Cancer Center

Ceramics

Reach out to me