Topological Recurrence Index
Persistent-homology pipeline that flags emerging adaptive mutations in viral genomes
The topological Recurrence Index (tRI) ranks SARS-CoV-2 mutations by how frequently they appear across independent, phylogenetically distant lineages. It leverages persistent homology to spot convergent evolutionary signals that classic tree-based pipelines struggle to surface quickly.
Why topology helps
- Phylogeny-free convergence detection – Vietoris–Rips filtrations on genomic Hamming graphs capture reticulate patterns that point to repeated selective advantages.
- Inherently confined to genetic background – tRI scores are always located within the context of specific genetic backgrounds, reducing noise from unrelated mutations.
- Actionable priorities – mutations with high tRI values align with variants of concern and experimentally validated fitness gains.
Workflow at a glance
- Construct Hamming graphs of viral genomes sampled over time windows.
- Compute Vietoris–Rips persistence in dimension one to identify loops corresponding to parallel mutational paths.
- Aggregate loop membership to score individual mutations (the tRI) and track trajectories over time.
- Cross-reference high-tRI mutations with structural data and epidemiological trends.
Highlights
- Early warning signals for spike mutations later dominant in Alpha, Beta, Gamma and Delta variants.
- Open-source implementation alongside interactive dashboards for public-health partners.
- Scales to millions of sequences using distributed ripser-based calculations.
Learn more
- Preprint: (Bleher et al., 2021)
- Contact: reach out if you are interested in applying tRI to other pathogens or longitudinal omics datasets.
Citation
2021
- preprint