Tuesday, June 23, 2026
Science
No Result
View All Result
  • Login
  • HOME
  • SCIENCE NEWS
  • CONTACT US
  • HOME
  • SCIENCE NEWS
  • CONTACT US
No Result
View All Result
Scienmag
No Result
View All Result
Home Science News Mathematics

ISTA scientists create algorithm to enhance biobank data analysis of human height using big data

February 19, 2026
in Mathematics
Reading Time: 4 mins read
0
ISTA scientists create algorithm to enhance biobank data analysis of human height using big data
65
SHARES
594
VIEWS
Share on FacebookShare on Twitter
ADVERTISEMENT

In the rapidly evolving landscape of biomedical research, the ability to extract meaningful insights from vast genomic datasets remains a formidable challenge. Large-scale biobanks, which house millions of genetic sequences alongside detailed health and lifestyle data, offer unprecedented opportunities to unravel the genetic underpinnings of complex human traits and diseases. However, analyzing such expansive datasets is computationally intensive, often forcing researchers to compromise between accuracy and feasibility. Traditional algorithms, which typically rely on sampling millions of individual data points, deliver high theoretical precision but at an enormous computational cost that limits their practical use.

A groundbreaking solution has emerged from the Institute of Science and Technology Austria (ISTA), where an interdisciplinary team has developed an innovative algorithm designed to navigate these challenges with remarkable efficiency. By integrating concepts from information theory, advanced mathematics, genomics, and software engineering, the team has created a method that surpasses previous computational techniques in both speed and precision. This new approach enables researchers to jointly analyze whole genome sequences at a scale previously unattainable.

The focal point of their research is a model complex trait, human height, which has long served as a paradigm for studying the genetic architecture of complex traits. Human height is influenced by an extraordinarily large number of genetic variants—on the order of 17 million—which makes it an ideal benchmark for testing the algorithm’s capability. The researchers leveraged the extensive UK Biobank dataset, the world’s most comprehensive resource containing hundreds of thousands of whole-genome sequences from anonymized participants, to validate their approach.

Traditional methodologies typically dissect the dataset into smaller fragments, analyzing each segment separately before synthesizing the results. In contrast, the newly developed “genomic Vector Approximate Message Passing” (gVAMP) algorithm operates under a fundamentally different principle known as joint estimation. This approach simultaneously accounts for the influence of all genetic variants across the entire genome on the trait of interest, thereby capturing complex interactions that fragmentary methods might miss. This innovation allows gVAMP to provide a holistic overview of genetic effects with enhanced interpretability and accuracy.

At the core of gVAMP lies the approximate message passing (AMP) framework—a recent mathematical construct that offers a principled way to perform inference in large, complex datasets. ISTA researcher Marco Mondelli, a key contributor to AMP’s foundational theory, guided the adaptation of this framework to genomic data. The gVAMP algorithm extends AMP’s capabilities, tailored specifically to handle the immense dimensionality and correlation structure characteristic of whole-genome sequence datasets.

The promise of gVAMP is not solely theoretical; it manifests in tangible performance improvements. When tasked with predicting human height from genomic data, gVAMP generated novel insights by identifying genetic variant contributions whose effects had not been previously quantified. The challenge, however, was how to benchmark these predictions in the absence of pre-existing datasets capturing such detailed genetic effect estimations. To address this, the ISTA team designed extensive data simulations, generating synthetic traits approximating the complexity of human height traits. By comparing gVAMP’s performance against established genomic analysis methods on these simulated datasets, they demonstrated superior accuracy and drastically reduced processing times.

Beyond its predictive prowess, gVAMP shines in its interpretability—a critical feature for biomedical applications. The algorithm not only forecasts complex traits with heightened precision but also pinpoints specific genomic regions responsible for trait variability. This granularity provides invaluable biological insights, unveiling the intricate genetic architecture underlying complex characteristics. Such clarity could propel both fundamental genetic research and translational applications, helping to elucidate mechanisms driving traits and diseases alike.

Looking forward, the potential applications of gVAMP stretch into personalized medicine and diagnostic advancements. By enabling accurate joint genomic analyses at unprecedented scales, gVAMP could empower predictive models that inform on disease onset timing, progression severity, and symptom emergence. Further developments aim to integrate additional layers of biological data—including proteomic and epigenetic information—to capture biological complexity beyond genetic sequences alone. Incorporating such multi-omics perspectives promises to refine clinical decision-making, enabling tailored therapeutic interventions and optimized patient stratification in clinical trials.

Moreover, the versatility of gVAMP could extend into less conventional arenas such as forensic science. The ability to accurately predict phenotypic traits like height from DNA profiles retrieved at crime scenes represents a transformative tool for law enforcement and forensic investigations. This application highlights the broader societal impact of the algorithm, showcasing how cutting-edge computational methods can bridge research and real-world problem solving.

The success of this project underscores the power of interdisciplinary collaboration. The combined expertise in theoretical mathematics, computer science, genomic statistics, and software engineering catalyzed an algorithmic breakthrough. ISTA PhD student Al Depope, computer scientist Jakub Bajzik, mathematician Marco Mondelli, and genomic statistician Matthew Robinson exemplify how cross-domain approaches fuel innovation. Their joint supervision and integration of diverse skill sets facilitated a methodological leap forward in computational genomics.

In summary, the gVAMP algorithm stands at the forefront of computational genomics, delivering a scalable, precise, and interpretable solution for analyzing whole-genome sequence data. By redefining the boundaries of data-driven genetic inference, it opens new avenues for understanding human biology, advancing personalized healthcare, and potentially enhancing forensic methodologies. As research progresses, gVAMP’s framework is poised to become a foundational tool in the era of big genomic data.


Subject of Research: Cells

Article Title: Joint modelling of whole genome sequence data for human height via approximate message passing

News Publication Date: 18-Feb-2026

Web References:
https://doi.org/10.1016/j.xgen.2026.101162

References:
Al Depope, Jakub Bajzik, Marco Mondelli, and Matthew R. Robinson. 2026. Joint modelling of whole genome sequence data for human height via approximate message passing. Cell Genomics. DOI: 10.1016/j.xgen.2026.101162

Image Credits: © ISTA

Keywords: Human genetics, Population genetics, Data sets, Big data, Data points, Information retrieval, Information processing, Data storage, Databases, Data analysis, DNA, Genomics, Phenotypes, Algorithms, Mathematics, Information theory

Tags: advanced mathematical modeling in geneticsbig data biobank analysisbiobank data integrationcomputational genomics methodsefficient genomic data processinggenetic architecture of complex traitsgenomic data algorithmhuman height geneticsinformation theory in genomicsinterdisciplinary biomedical researchlarge-scale genome sequencingsoftware engineering for genomics
Share26Tweet16
Previous Post

New Study Identifies Dog Breeds Most Prone to Breathing Difficulties

Next Post

Protein Power: How an Aussie Desert Plant Could Revolutionize Nutrition

Related Posts

New Study Questions Reliability of Widely Cited Gaza Mortality Survey in The Lancet Global Health — Mathematics
Mathematics

New Study Questions Reliability of Widely Cited Gaza Mortality Survey in The Lancet Global Health

June 22, 2026
ACM to Launch Prestigious Journal on Logic Programming — Mathematics
Mathematics

ACM to Launch Prestigious Journal on Logic Programming

June 17, 2026
Researchers Use Math to Crack Wordle: A S-M-A-R-T Breakthrough! — Mathematics
Mathematics

Researchers Use Math to Crack Wordle: A S-M-A-R-T Breakthrough!

June 17, 2026
Limited Data Access May Distort Medical Findings — Mathematics
Mathematics

Limited Data Access May Distort Medical Findings

June 17, 2026
Flexible Untethered Magnetic Actuators Enable Multimodal and Cross-Scale Functionality — Mathematics
Mathematics

Flexible Untethered Magnetic Actuators Enable Multimodal and Cross-Scale Functionality

June 17, 2026
Navigating the Adoption Paradox of AI in Computational Pathology: A Three-Stage Maturity Model from Algorithms to Clinical Practice — Mathematics
Mathematics

Navigating the Adoption Paradox of AI in Computational Pathology: A Three-Stage Maturity Model from Algorithms to Clinical Practice

June 17, 2026
Next Post
Protein Power: How an Aussie Desert Plant Could Revolutionize Nutrition

Protein Power: How an Aussie Desert Plant Could Revolutionize Nutrition

  • Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

    Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

    27656 shares
    Share 11059 Tweet 6912
  • University of Seville Breaks 120-Year-Old Mystery, Revises a Key Einstein Concept

    1061 shares
    Share 424 Tweet 265
  • Bee body mass, pathogens and local climate influence heat tolerance

    682 shares
    Share 273 Tweet 171
  • Researchers record first-ever images and data of a shark experiencing a boat strike

    546 shares
    Share 218 Tweet 137
  • Groundbreaking Clinical Trial Reveals Lubiprostone Enhances Kidney Function

    531 shares
    Share 212 Tweet 133
Science

Embark on a thrilling journey of discovery with Scienmag.com—your ultimate source for cutting-edge breakthroughs. Immerse yourself in a world where curiosity knows no limits and tomorrow’s possibilities become today’s reality!

RECENT NEWS

  • Collision in Space Fails to Confirm Dark Matter Presence After All
  • Illinois Research Investigates Sustainable Jet Fuel Production from Food Waste
  • AI-Driven Platform Accelerates Discovery of Promising Cancer Therapies
  • Beyond Energy: How Metabolites Orchestrate Embryo Implantation

Categories

  • Agriculture
  • Anthropology
  • Archaeology
  • Athmospheric
  • Biology
  • Biotechnology
  • Blog
  • Bussines
  • Cancer
  • Chemistry
  • Climate
  • Earth Science
  • Editorial Policy
  • Marine
  • Mathematics
  • Medicine
  • Pediatry
  • Policy
  • Psychology & Psychiatry
  • Science Education
  • Social Science
  • Space
  • Technology and Engineering

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 5,146 other subscribers

© 2025 Scienmag - Science Magazine

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • HOME
  • SCIENCE NEWS
  • CONTACT US

© 2025 Scienmag - Science Magazine

Discover more from Science

Subscribe now to keep reading and get access to the full archive.

Continue reading