In an era where artificial intelligence is revolutionizing medical diagnostics, the early detection of breast cancer—a persistently devastating disease affecting millions of women globally—stands at a critical juncture. A newly published systematic review in BMC Cancer unravels the intricate landscape of deep learning (DL) techniques applied to mammographic breast cancer detection, shedding light on significant advancements while unmasking pressing gaps, particularly within Asian populations. This comprehensive synthesis not only maps the trajectory of global research but also calls for a paradigm shift toward more inclusive and demographically representative models.
Breast cancer, notorious for its high mortality rate among women worldwide, presents unique challenges that vary widely by region. While Western countries often dominate the narrative in medical innovation, this review unequivocally demonstrates that Asian populations encounter distinct obstacles in mammographic diagnostics—rooted primarily in physiological differences such as higher breast density. These variations critically affect the performance of DL-based diagnostic systems which, until now, have been predominantly trained on datasets from Caucasian populations.
The authors undertook a rigorous systematic review following PRISMA guidelines, meticulously screening over a thousand scientific records from top-tier databases including Scopus and Web of Science. Spanning literature published between 2018 and 2025, the review narrowed down to 287 studies most relevant to deep learning applications in mammography. Their selection criteria underscore a growing trend: the surge in DL-based computer-aided diagnostic (CAD) systems that leverage convolutional neural networks and other neural architectures to enhance lesion classification, segmentation, and breast density assessment.
Among the key findings is the overwhelming emphasis on lesion classification, a cornerstone task wherein neural networks discern malignant from benign formations. However, a conspicuous scarcity of research addresses other vital components such as tumor detection, precise segmentation, and dynamic breast density quantification. These tasks are essential for improving diagnostic specificity and sensitivity but have been comparatively neglected in the literature.
Asian datasets, representing a demographic with notably denser breast tissue, emerge as a critical locus of study in this review. DL models trained primarily on Caucasian imagery falter when transferred to Asian populations, a phenomenon attributed to intrinsic anatomical and image-acquisition disparities. The compendium of Asian studies highlighted problems including limited availability of annotated datasets—a fundamental bottleneck for supervised learning—and insufficient representation of varied imaging modalities, which limits the robustness of predictive models.
The review also delves into the nuanced preprocessing techniques and augmentation strategies employed to overcome the inherent challenges associated with mammogram data. From noise reduction to contrast enhancement and advanced data augmentation—such as rotation, scaling, and synthetic image generation—researchers have applied diverse methodologies to bolster the generalizability of DL models. Yet, the authors emphasize that these efforts are often piecemeal and not standardized across studies, complicating cross-comparison and clinical translation.
One of the most revealing aspects of the review is the deployment of focus maps to visualize the geographical and topical distribution of DL research efforts. These visual tools starkly illustrate a global bias, with more than 80% of publicly available datasets and resulting studies centered on Caucasian populations. This imbalance not only limits the efficacy of DL models in multiethnic applications but may inadvertently exacerbate healthcare disparities, a concern of paramount importance given the global burden of breast cancer.
Moreover, the authors critically analyze the BI-RADS (Breast Imaging-Reporting and Data System) classification—a universally accepted radiological lexicon—and identify a significant gap in multi-class classification within deep learning studies. Most research simplifies the task to binary classification (benign vs. malignant), a reductionist approach that undermines the granularity needed for nuanced clinical decision-making and risk stratification.
The synthesis uncovers a pressing need for collaborative frameworks aiming at the curation of expansive, diverse mammography datasets encompassing various ethnic groups and geographic regions. Such initiatives would not only democratize access to high-quality data but also facilitate the development of deep learning models that are robust, adaptable, and clinically valid worldwide.
Importantly, the review calls for rigorous cross-populational validation pipelines to prevent the pitfalls of model overfitting and ensure that diagnostic algorithms maintain high sensitivity and specificity across heterogeneous cohorts. Clinical trials and prospective studies involving women from multiple demographic backgrounds must be mandated to verify the translational power of new CAD technologies.
At the core of these revelations lies a call to the global research community: inclusivity and diversity in training data are not merely ethical imperatives but scientific necessities. By embracing demographic heterogeneity, researchers can harness the full potential of deep learning to revolutionize breast cancer detection and screening effectiveness—saving countless lives.
This systematic review acts as both a reflection and a roadmap. It reflects the remarkable strides made in leveraging deep learning for breast cancer diagnostics and illuminates the persistent, subtle biases embedded within current methodologies. Simultaneously, it maps out clear directions for future inquiry—prioritizing ethnic diversity, promoting methodological standardization, and fostering international cooperation.
As breast cancer remains a paramount public health challenge, innovations in AI must be carefully tailored to accommodate anatomical and epidemiological variances that characterize disparate global populations. Only through such conscientious efforts can deep learning-powered mammography achieve its envisioned role: an equitable, precise, and life-saving diagnostic tool accessible to all women, regardless of their ethnicity or geographic location.
The integrative insights offered by this review underscore the multidimensional nature of deploying AI in medicine—an enterprise demanding more than technical sophistication, but also cultural sensitivity and commitment to fairness. In this light, it stands as a pivotal contribution to the evolving discourse on AI in healthcare, compelling researchers, clinicians, and policymakers to rethink, recalibrate, and renew their strategies for breast cancer detection.
Ultimately, the transformational potential of deep learning in mammography hinges on our ability to transcend data silos, confront systemic biases, and embrace diversity as a foundational principle. The future of breast cancer diagnostics depends not only on algorithmic innovation but on global inclusivity—making this comprehensive review both timely and indispensable.
Subject of Research: Deep learning techniques applied to mammography for breast cancer detection, focusing on global and Asian perspectives.
Article Title: A systematic literature review on mammography: deep learning techniques for breast cancer detection with global and Asian perspectives.
Article References: Amin, A., U, D.A., Koteshwara, P. et al. A systematic literature review on mammography: deep learning techniques for breast cancer detection with global and Asian perspectives. BMC Cancer 25, 1627 (2025). https://doi.org/10.1186/s12885-025-14876-5
Image Credits: Scienmag.com