In the relentless pursuit of new therapeutics, structure-based virtual screening (VS) through molecular docking has cemented its role as a cornerstone technique for the early stages of drug discovery. Researchers across the globe rely on this approach to sift through massive compound libraries in search of promising bioactive molecules. Now, a transformative leap in this field is on the horizon, driven by innovative artificial intelligence (AI) models that redefine the speed and accuracy of protein–ligand docking and scoring tasks. A team led by Gu et al. has unveiled the Comprehensive Virtual Screening Platform with AI Engine (CVSP-AIE), an unprecedented digital arsenal designed to revolutionize drug candidate identification and optimization.
CVSP-AIE is a highly sophisticated integration of three cutting-edge AI-powered models, each tailored for a distinct phase of the molecular docking pipeline. The platform harmonizes rapid docking with pinpoint precision and an advanced affinity scoring system, orchestrating a seamless balance between computational efficiency and predictive accuracy. This meticulous blend of speed and rigor addresses a long-standing bottleneck in virtual screening workflows: harnessing AI’s computational prowess without sacrificing the quality of predictions.
The first AI component, KarmaDock, is engineered to perform ultra-fast docking by directly refining atomic coordinates of molecules within the protein binding site. Unlike traditional docking methods that rely heavily on heuristic search algorithms and exhaustive conformational sampling, KarmaDock leverages a streamlined, data-driven approach. By rapidly adjusting atomic positions in three-dimensional space, it expedites the initial pose prediction phase, cutting down processing time dramatically without compromising the structural plausibility of the docked complex.
Following the rapid pose generation stage, the platform invokes CarsiDock, an AI model specializing in precision docking accuracy. Instead of depending solely on scoring functions to discern the best pose, CarsiDock predicts protein–ligand interatomic distances and reconstructs the binding conformations with remarkable fidelity. This approach circumvents common pitfalls of docking algorithms that struggle with flexible binding regions or subtle steric clashes, ensuring that the resultant poses reflect realistic biochemical interactions.
Complementing these docking innovations is RTMScore, an affinity prediction engine that pushes proteochemometric modeling to new heights. RTMScore learns from detailed residue–atom distance distributions within the complex, capturing nuanced intermolecular forces that dictate binding strength. This contrasts starkly with conventional scoring functions that often simplify interaction energy landscapes into aggregated terms, risking loss of critical molecular context. RTMScore’s fine-grained analytical capacity translates to more reliable identification of tightly binding ligands, elevating the hit-to-lead conversion success rate.
Together, these AI modules form a hierarchical workflow within CVSP-AIE, strategically modulating screening throughput and accuracy demands. Initial docking with KarmaDock screens large libraries quickly, funneling top candidates through the more computationally intensive but precise evaluation of CarsiDock and RTMScore. This tiered strategy mirrors human expert intuition: broad preliminary filtering followed by focused scrutiny, yet now performed by an intelligent, autonomous system at unprecedented scale.
Accessibility sets CVSP-AIE apart from many advanced in silico tools. Available as an online web-server via a user-friendly interface (https://cadd.zju.edu.cn/cvsp/), it democratizes access to advanced AI-driven VS technologies. Researchers simply upload a protein target structure alongside a known binder to define the binding pocket, jumpstarting the automated drug screening pipeline. This intuitive yet powerful capability lowers the barrier to entry for scientists, from academic labs to industry R&D teams, accelerating translational research timelines.
The platform’s workflow unfolds in three core phases, starting with comprehensive preprocessing. Here, protein structures undergo meticulous repair to address missing atoms or residues, while molecular inputs are standardized to ensure compatibility with downstream AI models. These steps are crucial to maintain the integrity and reproducibility of the docking process, mitigating errors that can cascade in large-scale virtual screening campaigns.
Following preprocessing, the heart of CVSP-AIE takes center stage: binding pose generation and affinity prediction. By deploying KarmaDock, CarsiDock, and RTMScore in sequence, the system iteratively refines candidate molecules’ spatial orientations and evaluates their binding potentials. This automated yet sophisticated processing pipeline runs efficiently at scale, requiring only 30 to 45 minutes to hierarchically screen 100,000 compounds — a remarkable performance benchmark that empowers rapid hit identification against diverse druggable targets.
Postprocessing completes the virtual screening cycle by calculating protein–ligand interaction profiles and generating visually interactive chemical space analyses. This integrative presentation enables researchers to explore structure-activity relationships and intermolecular contacts in an intuitive manner. The rich output not only prioritizes compounds by predicted affinities but also offers insights into binding mode diversity, guiding rational lead optimization strategies.
Beyond web accessibility, CVSP-AIE equips users with a robust local software package and a versatile command-line module, facilitating unrestricted large-scale screening on high-performance computing clusters. This on-premise deployment caters to projects requiring extensive compound libraries or privacy-sensitive datasets, reflecting the platform’s adaptability to varied scientific contexts.
In an era defined by computational breakthroughs powered by AI, CVSP-AIE exemplifies the fusion of machine intelligence and chemical biology to accelerate drug discovery pipelines fundamentally. By addressing the dual imperatives of screening speed and predictive robustness, it propels researchers toward unlocking new therapies with unprecedented efficiency. The platform’s emergence signals a paradigm shift where intelligent automation becomes the norm, transcending traditional computational limitations.
The team behind CVSP-AIE envisions broad future applications, extending beyond small-molecule drug discovery to encompass biologics and chemical probes, emphasizing modular AI enhancements to accommodate evolving structural biology challenges. Integration with complementary experimental validation workflows also promises to elevate translational success, transforming theoretical predictions into tangible therapeutic advances.
As CVSP-AIE gains traction within the scientific community, its open accessibility and hierarchical AI framework are poised to inspire further innovation in computational drug discovery. This breakthrough demonstrably bridges the gap between cutting-edge AI methodologies and real-world pipeline demands, fostering an era wherein rapid, accurate, and accessible virtual screening reshapes the quest for next-generation medicines.
Subject of Research: Structure-based virtual screening in drug discovery enhanced by artificial intelligence technologies.
Article Title: Facilitating structure-based drug discovery with an artificial intelligence-driven virtual screening platform.
Article References:
Gu, S., Zhang, X., Xiao, M. et al. Facilitating structure-based drug discovery with an artificial intelligence-driven virtual screening platform. Nat Protoc (2026). https://doi.org/10.1038/s41596-026-01389-z
Image Credits: AI Generated
