Research entry
Genomics PCA Dashboard
2021 · Academic Archive
The original interactive dashboard for the genomics PCA work, built as the first web interface for exploring population structure in the 1000 Genomes dataset.
About
Built as a college project to explore population genetics using principal component analysis (PCA). The dashboard lets users interactively explore how samples cluster by population ancestry when genome-wide SNP data is projected onto its principal components.
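As a rough illustration of the idea (not the dashboard's actual code), the sketch below runs PCA on a small synthetic genotype matrix using NumPy's SVD. The function name `pca_project`, the two simulated populations, and the allele frequencies are all assumptions made for the example; the real project worked on 1000 Genomes data.

```python
import numpy as np

def pca_project(genotypes, n_components=3):
    """Project samples (rows) onto the top principal components via SVD."""
    X = np.asarray(genotypes, dtype=float)
    X = X - X.mean(axis=0)  # centre each SNP column
    U, S, _ = np.linalg.svd(X, full_matrices=False)
    # Sample coordinates ("scores") along the leading components
    return U[:, :n_components] * S[:n_components]

# Synthetic data: two hypothetical populations with different allele frequencies
rng = np.random.default_rng(0)
freq_a = rng.uniform(0.05, 0.95, size=200)
freq_b = np.clip(freq_a + rng.normal(0, 0.3, size=200), 0.01, 0.99)
pop_a = rng.binomial(2, freq_a, size=(30, 200))  # genotypes coded 0/1/2
pop_b = rng.binomial(2, freq_b, size=(30, 200))
genotypes = np.vstack([pop_a, pop_b])

scores = pca_project(genotypes)
print(scores.shape)
```

Each row of `scores` is one sample, so plotting the first two or three columns and colouring by population label would give the kind of cluster view the dashboard offered.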
This page is kept as the original dashboard version of the genomics work. The broader public-facing evolution of the project now lives in the newer explorer app.
Features
- Scatter matrix — all PCs plotted against each other, coloured by country
- 3D PCA plot — PC1/PC2/PC3 in 3D space, coloured by superpopulation
- Choropleth map — geographic distribution of sample ancestry
- Scree plot — cumulative variance explained by each principal component
- Dropdown filter — filter by superpopulation (AFR, AMR, EAS, EUR, SAS)
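Two of the features above can be sketched in a few lines of NumPy: the variance figures behind a scree plot, and the superpopulation filter. This is a minimal sketch under stated assumptions, not the original implementation; `explained_variance_ratio`, `filter_by_superpopulation`, and the random data are hypothetical names and inputs chosen for illustration.

```python
import numpy as np

def explained_variance_ratio(X):
    """Fraction of total variance captured by each principal component."""
    X = np.asarray(X, dtype=float)
    X = X - X.mean(axis=0)  # centre each column
    s = np.linalg.svd(X, compute_uv=False)  # singular values, descending
    var = s ** 2
    return var / var.sum()

def filter_by_superpopulation(scores, labels, selected):
    """Keep only the rows whose label is in the selected superpopulations."""
    mask = np.isin(labels, selected)
    return scores[mask]

rng = np.random.default_rng(1)
X = rng.normal(size=(50, 10))  # placeholder data, not real genotypes
ratio = explained_variance_ratio(X)
cumulative = np.cumsum(ratio)  # the curve a cumulative scree plot would show

labels = np.array(["AFR", "AMR", "EAS", "EUR", "SAS"] * 10)
subset = filter_by_superpopulation(X, labels, ["AFR", "EUR"])
print(subset.shape)
```

In a Dash app, a function like the filter would typically sit inside a callback driven by the dropdown's selected values, with the filtered rows passed on to the Plotly figures.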
Public App (archived)
A later Streamlit-based successor to this dashboard was hosted at
genomics.connorfaulkner.com through early 2026 and has since been retired.
The analysis, code, and plots remain available as static portfolio content.
Tech Stack
Python · Dash · Plotly · Pandas · NumPy · Dash Bootstrap Components