A groundbreaking exploration into the human DNA virome reveals intricate associations between viral DNA loads and diverse clinical phenotypes, unveiling critical insights into how human genes and environments shape viral presence within blood. In an unprecedented study encompassing over 490,000 participants from the UK Biobank (UKB), researchers meticulously quantified viral DNA loads using whole genome sequencing (WGS) data to decode the interplay between viral DNA prevalence and a multitude of biological and clinical variables. This extensive analysis yielded 181 significant correlations between viral DNA load and disease phenotypes and 366 associations involving blood counts, biomarkers, and metabolites, with rigorous Bonferroni correction ensuring robustness of these findings.
Critically, the majority of disease associations highlighted elevated viral DNA loads in affected individuals, emphasizing conditions characterized by inflammation or compromised immune function. Diseases such as AIDS, anemia, diabetes, and renal failure prominently featured higher viral DNA burdens, underscoring the potential role of viral persistence or reactivation in these pathological states. Organ transplantation, a condition that necessitates immunosuppressive therapy, exhibited a pronounced association with increased anellovirus DNA load, likely reflecting immunosuppressive treatment effects. Concordantly, participants actively receiving common immunosuppressants demonstrated elevated Epstein-Barr Virus (EBV) and anellovirus DNA loads, possibly accounting for certain observed disease associations. Notably, carrier states for endogenous human herpesvirus 6A or 6B did not exhibit significant associations with biological or clinical phenotypes, suggesting differential impacts of viral types on health outcomes.
The investigation also illuminated the complex relationship between lifestyle factors and viral DNA dynamics. Smoking, in particular, emerged as a critical determinant, exhibiting opposite effects on EBV and human herpesvirus 7 (HHV-7) prevalence. Detailed analysis of smoking pack-years demonstrated a near twofold increase in detectable EBV DNA prevalence among the heaviest smokers compared to never smokers. In stark contrast, HHV-7 DNA prevalence inversely correlated with smoking exposure, depicting suppression with increased cigarette consumption. These smoking-associated trends extended beyond mere prevalence, with EBV abundance also heightened among smokers who were EBV DNA-positive. These findings were robustly replicated in an independent cohort from the All of Us Program (AoU), both in blood and saliva samples, consolidating the evidence for smoking’s divergent modulation of viral loads.
Delving deeper into the genetic architecture underlying viral DNA load variability, genome-wide association studies (GWAS) uncovered numerous genetic loci influencing EBV DNA abundance. Leveraging these genetic insights, the researchers implemented Mendelian randomization analyses to explore the causal relationships between viral DNA load and various disease phenotypes. This approach harnessed SNPs predictive of viral DNA load as instrumental variables, allowing differentiation between associations driven by viral load causality versus confounding or reverse causation. While many observed DNA load–phenotype associations probably result from the disease or treatment-induced changes in viral replication, genetic instruments implicated viral DNA load as a causative upstream factor in select pathologies.
Among autoimmune conditions and hematological malignancies epidemiologically linked to EBV, multiple sclerosis (MS) was a focal point. Despite MS patients exhibiting elevated EBV DNA loads in the UKB cohort, Mendelian randomization analyses incorporating data from FinnGen, UKB, and the Million Veteran Program did not support a causal effect of viral DNA load on MS risk. This finding nuances prior knowledge, suggesting that while primary EBV infection is a major risk factor for MS, variations in latent viral load post-infection have limited influence on disease development. Similar investigations into systemic lupus erythematosus and rheumatoid arthritis produced inconclusive results regarding causality, indicating the need for further research to disentangle the complex interactions between EBV and autoimmunity.
In striking contrast, robust evidence emerged linking elevated EBV DNA load causally with Hodgkin’s lymphoma risk. Across multiple Mendelian randomization methodologies, alleles increasing EBV DNA abundance consistently conferred heightened Hodgkin’s lymphoma susceptibility, underscoring viral DNA load as a significant factor in lymphomagenesis. Conversely, associations with non-Hodgkin’s lymphoma and other lymphoid malignancies were weaker and inconsistent, highlighting a degree of specificity in the oncogenic potential of EBV DNA load. The longitudinal data further reinforced this connection; individuals positive for EBV DNA at baseline demonstrated a twofold increased odds of developing Hodgkin’s lymphoma within approximately 15 years of follow-up, implying that viral burden in blood reflects oncogenic risk.
Mechanistically, increased EBV DNA in blood may correspond to expanded pools of latently infected B cells, providing more substrates for the oncogenic transformation characteristic of Hodgkin’s lymphoma. This hypothesis aligns with EBV’s established role in B cell proliferation and immortalization, where latent viral gene expression disrupts cellular controls and fosters malignancy. The ability to predict Hodgkin’s lymphoma incidence from viral DNA measurements opens avenues for early detection and potential intervention by targeting viral reservoirs or modulating latent infection dynamics.
This study’s comprehensive integration of large-scale WGS, detailed clinical phenotyping, and advanced genetic epidemiology sets a new standard for understanding the human virome’s role in health and disease. By elucidating the multifaceted relationships between viral DNA load, host genetics, environmental exposures such as smoking, and disease risk, it paves the way for novel diagnostic and therapeutic strategies. Future research expanding these findings across diverse populations and viral species will be essential to fully unravel the implications for personalized medicine and public health.
In summary, the nuanced interplay between latent viral DNA and human disease presents a complex landscape where viral load acts as both a biomarker and a causal factor in select pathological processes. The demonstrated causal link to Hodgkin’s lymphoma contrasts with non-significant causal effects for multiple sclerosis and other autoimmune phenotypes, highlighting differential roles of viral persistence in diverse diseases. Incorporating viral DNA load assessments into clinical practice may ultimately refine risk stratification and inform targeted therapies against viral oncogenesis and immune-related pathologies.
Taken together, these insights reinforce the paradigm that the human virome, modulated by genetic and environmental forces, intricately shapes disease susceptibility, progression, and outcomes. This landmark investigation lays the foundation for future virome-centric biomedical discoveries that could transform understanding and management of viral-associated diseases worldwide.
Subject of Research:
Associations between viral DNA load and human clinical phenotypes, including the genetic and environmental determinants of the DNA virome and its causal role in disease risk.
Article Title:
The DNA virome varies with human genes and environments
Article References:
Kamitaki, N., Tang, D., McCarroll, S.A. et al. The DNA virome varies with human genes and environments. Nature (2026). https://doi.org/10.1038/s41586-026-10288-y
Image Credits:
AI Generated

