In a groundbreaking study published in Nature, researchers have unveiled compelling connections between the DNA loads of various human viruses and a broad array of biological and clinical phenotypes. Leveraging an unprecedented dataset from over 490,000 participants in the UK Biobank (UKB), the team meticulously mapped viral DNA presence in blood samples and related these findings to health outcomes, genetic backgrounds, and lifestyle factors. Their findings deepen our understanding of how latent viral infections interact with human health and disease, with significant implications for clinical diagnostics and therapeutic strategies.
The study identified 181 statistically significant associations between viral DNA load and disease phenotypes, alongside 366 associations with blood counts, biomarkers, and metabolites, all rigorously adjusted for multiple testing. Among these disease associations, higher viral DNA loads were predominantly observed in individuals affected by conditions characterized by inflammation or compromised immune function, including AIDS, anemia, diabetes, and renal failure. Organ transplantation emerged as a notable factor correlating with elevated anellovirus DNA loads, which the authors attribute to the immunosuppressive regimens followed post-transplant. Indeed, the use of common immunosuppressive drugs was linked to increased DNA loads of Epstein-Barr virus (EBV) and anelloviruses, offering a plausible mechanistic explanation for some observed disease correlations.
Interestingly, the study also explored carrier status for endogenous herpesviruses HHV-6A and HHV-6B but found no significant associations with any measured biological or clinical phenotypes. This points to a nuanced interplay between different viral species and the human host, where some viruses may be more directly implicated in disease processes than others, or vary in their detectability and pathogenic potential.
Among the most striking discoveries were the divergent effects of smoking on the prevalence of two common herpesviruses, EBV and human herpesvirus 7 (HHV-7). Smoking exposure, measured both by cumulative pack-years and cigarettes smoked per day, was strongly associated with nearly doubled EBV DNA prevalence among the heaviest smokers compared to nonsmokers. In stark contrast, higher smoking levels corresponded with reduced HHV-7 DNA prevalence. These trends held true across multiple tissue types, including blood and saliva, and were replicated in the All of Us (AoU) cohort, underscoring their robustness. The dichotomous influence of smoking on these viral loads highlights complex virus-host-environment interactions that could modulate disease risk differently depending on viral species.
Beyond epidemiological correlations, the large-scale genome-wide association study (GWAS) performed illuminated numerous host genetic loci that modulate EBV DNA load. Exploiting these genetic instruments, the researchers applied Mendelian randomization analyses to probe causal relationships between viral DNA load and disease risk. This methodological approach eschews confounding common in observational studies by leveraging genetic variants as proxies for exposures—in this case, viral DNA load—allowing inference on whether viral load contributes to or results from disease phenotypes.
These causal analyses yielded nuanced insights. For multiple sclerosis (MS), a disease with a well-established epidemiological link to prior EBV infection, the study found that while affected individuals in UKB had higher EBV DNA loads, the Mendelian randomization results did not support a causal effect of viral DNA load levels on MS risk. This suggests that past EBV infection exposure remains crucial, but later variation in viral load among latently infected individuals may not materially influence MS susceptibility. Similarly, causal assessments for systemic lupus erythematosus and rheumatoid arthritis were inconclusive, indicating the need for further investigation.
A different picture emerged for Hodgkin’s lymphoma, an EBV-associated malignancy. Genetic variants increasing EBV DNA load were also significantly associated with heightened risk of Hodgkin’s lymphoma across multiple Mendelian randomization methods. This finding indicates a plausible causal role for elevated EBV DNA burden in oncogenesis, potentially mediated by increased numbers of latently infected B cells prone to malignant transformation. Supporting this, incident cases of Hodgkin’s lymphoma during follow-up displayed a two-fold higher risk among individuals with detectable EBV DNA at baseline, underscoring the prognostic relevance of viral DNA quantification.
The specificity of this association was further reinforced by the absence of consistent causal signals for other lymphoid neoplasms such as non-Hodgkin’s lymphoma, chronic lymphocytic leukemia, and lymphoid leukemia. Collectively, these results argue for a unique pathogenic interconnection between EBV load and Hodgkin’s lymphoma development, highlighting a potential avenue for targeted monitoring or intervention in high-risk individuals.
Methodologically, the study’s exclusion of the highly polymorphic major histocompatibility complex (MHC) region from Mendelian randomization analyses mitigated confounding due to linkage disequilibrium and pleiotropy, enhancing confidence in the inferred causal relationships. Additionally, the alignment across multiple statistical approaches, including inverse-variance weighted and MR-Egger tests, provided consistent support for the conclusions drawn regarding Hodgkin’s lymphoma.
The breadth of the dataset—exceeding 638,000 whole genome sequences—afforded unparalleled statistical power to detect subtle viral-host interactions. This enabled robust disentanglement of environmental exposures, such as smoking, from genetic predispositions affecting viral DNA load, and allowed precise evaluation of viral role in complex diseases. Furthermore, the replication of key findings in multiple independent cohorts strengthens the generalizability of these observations.
Importantly, the research elucidates the differential biological impact that common lifestyle factors exert on distinct herpesviruses in the human virome. While smoking exacerbates EBV prevalence and abundance, it hindered HHV-7 detection, illustrating the complex ecological dynamics within the human viral milieu and the potential consequences for disease outcomes linked to these persistent infections. These divergent effects could relate to differences in viral tropism, latency control mechanisms, or immune evasion strategies.
Clinically, the identification of elevated EBV DNA load as a marker predictive of Hodgkin’s lymphoma development opens promising possibilities for early detection and monitoring. Quantifying viral DNA in blood via sensitive whole genome sequencing could emerge as a non-invasive biomarker to stratify lymphoma risk, guiding surveillance and preventive interventions. Moreover, understanding host genetic influences on viral persistence may inform personalized medicine approaches and the design of therapeutics targeting viral latency or reactivation pathways.
In summary, this expansive investigation into the human DNA virome demonstrates that viral DNA loads are not passive bystanders but dynamic factors intertwined with human genetics, environmental exposures, and disease. The findings furnish valuable insights into the mechanistic underpinnings of viral contributions to immune dysfunction and oncogenesis, paving the way for novel diagnostic and therapeutic strategies that leverage the virome as a modifiable component of health.
Subject of Research:
Associations between human viral DNA loads, host genetics, environmental factors (notably smoking), and their impact on clinical phenotypes including autoimmune diseases and lymphoid cancers.
Article Title:
The DNA virome varies with human genes and environments.
Article References:
Kamitaki, N., Tang, D., McCarroll, S.A. et al. The DNA virome varies with human genes and environments. Nature (2026). https://doi.org/10.1038/s41586-026-10288-y
Image Credits:
AI Generated
DOI:
https://doi.org/10.1038/s41586-026-10288-y
Keywords:
DNA virome, viral DNA load, Epstein-Barr virus, human herpesvirus 7, smoking, Mendelian randomization, Hodgkin’s lymphoma, autoimmune disease, UK Biobank, host genetics

