Monday, August 18, 2025
Science
No Result
View All Result
  • Login
  • HOME
  • SCIENCE NEWS
  • CONTACT US
  • HOME
  • SCIENCE NEWS
  • CONTACT US
No Result
View All Result
Scienmag
No Result
View All Result
Home Science News Biology

Insilico Medicine introduces nach0: A one-stop LLM for chemical and biomedical tasks

May 18, 2024
in Biology
Reading Time: 5 mins read
0
Introducing nach0: a New LLM for Chemical and Biomedical Tasks
72
SHARES
655
VIEWS
Share on FacebookShare on Twitter
ADVERTISEMENT
ADVERTISEMENT

In a new paper, researchers from clinical stage artificial intelligence (AI)-driven drug discovery company Insilico Medicine (“Insilico”), in collaboration with NVIDIA, present a new large language model (LLM) transformer for solving biological and chemical tasks called nach0. The multi-domain and multi-task LLM was trained on a diverse set of tasks, natural language understanding, synthetic route prediction, and molecular generation, and works across domains to answer biomedical questions and synthesize new molecules. The findings were published in Chemical Science Journal.

Introducing nach0: a New LLM for Chemical and Biomedical Tasks

Credit: Insilico Medicine

In a new paper, researchers from clinical stage artificial intelligence (AI)-driven drug discovery company Insilico Medicine (“Insilico”), in collaboration with NVIDIA, present a new large language model (LLM) transformer for solving biological and chemical tasks called nach0. The multi-domain and multi-task LLM was trained on a diverse set of tasks, natural language understanding, synthetic route prediction, and molecular generation, and works across domains to answer biomedical questions and synthesize new molecules. The findings were published in Chemical Science Journal.

While there are other LLMs designed for biomedical discovery, including BioBERT (Bidirectional Encoder Representations from Transformers for Biomedical Text Mining) and SciFive, these datasets rely mainly on biomedical natural language texts, such as drugs, genes, and cell line names, but do not contain chemical structure descriptions. Those that have emerged with both text and chemical structure descriptions, such as Galactica, have not yet been trained for diverse chemical tasks. 

Nach0 seeks to bridge this gap for the first time. It draws from a dataset that includes abstract texts extracted from PubMed and patent descriptions derived from the U.S. Patent and Trademark Office related to the chemistry domain – 100 million documents that became 355 million tokens worth of abstracts and 2.9 billion patents, as well as molecular structures using simplified molecular-input line-entry system (SMILES). To train the system, researchers turned this chemical information into tokens as well – 4.7 billion – and then annotated these tokens with special symbols. 

Using this dataset, researchers trained nach0 to perform three key tasks: natural language processing, such as document classification and question answering; chemistry-related tasks, such as molecular property prediction, molecular generation, and reagent prediction; and cross-domain tasks, including description-guided molecule design and molecular description generation.

“Nach0 represents a step forward in automating drug discovery through natural language prompts,” says Alex Zhavoronkov, PhD, founder and CEO of Insilico Medicine. “In the future, we foresee the potential inclusion of protein sequences with their own special tokens as well as fine-tuning the model in order to accommodate new modalities and exploring the fusion of information from text and knowledge graphs.”

Nach0 is built on the NVIDIA BioNeMo generative AI platform, enabling training and scaling of drug discovery applications. Specifically, the training was performed using NVIDIA NeMo, an end-to-end platform for developing custom generative AI. The research team leveraged NLP capabilities to train and evaluate the new model’s LMs. NVIDIA’s memory-mapped data loader modules allowed researchers to manage large datasets with small memory footprints and optimal reading speed. 

“Generative AI and LLMs are transforming the landscape of scientific discovery in biology and chemistry,” said Rory Kelleher, Global Head of Business Development for Life Sciences at NVIDIA. “Insilico’s domain-specific nach0 model, powered by NVIDIA BioNeMo, is a significant step toward unlocking the full potential of LLMs for drug discovery.”

Measured against other LLMs used for biomedical understanding, such as FLAN, SciFive, and MolT5, nach0 was found to have distinct advantages when performing molecular tasks using molecular data, and it significantly outperformed ChatGPT. 

Researchers tested nach0’s capabilities in two case studies. The first was to generate molecules that could be effective against Diabetes mellitus. Researchers entered the prompt “discover biological targets with potential therapeutic activity, analyze the mechanism of action, generate molecular structure, propose one-step synthesis, and predict molecular properties.” They generated 200 SMILES on the molecule generation prompt and selected one structure as the most promising from a chemical expert knowledge perspective. They also applied nach0 to a case study used as a demo for Insilico’s Chemistry42 generative AI drug design platform, with the model returning 8 molecules satisfying the prompt in just 15 minutes for generation and 30 minutes for scoring in Chemistry42. 

“We anticipate that as nach0 evolves, it will require less supervision, and it will be able to simply generate and validate promising therapeutic options for medicinal chemists,” says Maksim Kuznetsov, a senior research scientist at Insilico and one of the paper’s lead authors. 

Insilico Medicine is a pioneer in using generative AI for drug discovery and development. The Company first described the concept of using generative AI to design novel molecules in a peer-reviewed journal in 2016. Then, Insilico developed and validated multiple approaches and features for its generative adversarial network (GAN)-based AI platform and integrated those algorithms into the commercially available Pharma.AI platform, which includes generative biology, chemistry, and medicine, and has been used to produce a robust pipeline of promising therapeutic assets in multiple disease areas, including fibrosis, cancer, immunology, and aging-related disease, several of which have been licensed. Since 2021, Insilico has nominated 18 preclinical candidates in its comprehensive portfolio of over 30 assets and has advanced six pipelines to the clinical stage. In March 2024, the Company published a paper in Nature Biotechnology that discloses the raw experimental data and the preclinical and clinical evaluation of its lead drug – a potentially first-in-class TNIK inhibitor for the treatment of idiopathic pulmonary fibrosis discovered and designed using generative AI currently in Phase II trials with patients. 

 

About Insilico Medicine

Insilico Medicine, a global clinical-stage biotechnology company powered by generative AI, connects biology, chemistry, and clinical trial analysis using next-generation AI systems. The company has developed AI platforms that utilize deep generative models, reinforcement learning, transformers, and other modern machine learning techniques for novel target discovery and generating novel molecular structures with desired properties. Insilico Medicine is developing breakthrough solutions to discover and develop innovative drugs for cancer, fibrosis, immunity, central nervous system diseases, infectious diseases, autoimmune diseases, and aging-related diseases. www.insilico.com



Journal

Chemical Science

DOI

10.1039/D4SC00966E

Article Title

nach0: multimodal natural and chemical languages foundation model

Article Publication Date

8-May-2024

Share29Tweet18
Previous Post

Study reports ‘excellent’ outcomes for patients receiving optimized treatment for atrial fibrillation

Next Post

Bisoprolol in patients with COPD at high risk of exacerbation

Related Posts

blank
Biology

Unveiling Ancient Insights Behind Modern Cytoskeleton Evolution

August 15, 2025
blank
Biology

Researchers Identify Molecular “Switch” Driving Chemoresistance in Blood Cancer

August 15, 2025
blank
Biology

First Real-Time Recording of Human Embryo Implantation Achieved

August 15, 2025
blank
Biology

Opposing ATPases and ALKBH1 Shape Chromatin, Stress Response

August 15, 2025
blank
Biology

Ecophysiology and Spread of Freshwater SAR11-IIIb

August 15, 2025
blank
Biology

Multifocus Microscope Breaks New Ground in Rapid 3D Live Biological Imaging

August 15, 2025
Next Post
Bisoprolol in patients with COPD at high risk of exacerbation

Bisoprolol in patients with COPD at high risk of exacerbation

  • Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

    Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

    27535 shares
    Share 11011 Tweet 6882
  • University of Seville Breaks 120-Year-Old Mystery, Revises a Key Einstein Concept

    949 shares
    Share 380 Tweet 237
  • Bee body mass, pathogens and local climate influence heat tolerance

    641 shares
    Share 256 Tweet 160
  • Researchers record first-ever images and data of a shark experiencing a boat strike

    507 shares
    Share 203 Tweet 127
  • Warm seawater speeding up melting of ‘Doomsday Glacier,’ scientists warn

    311 shares
    Share 124 Tweet 78
Science

Embark on a thrilling journey of discovery with Scienmag.com—your ultimate source for cutting-edge breakthroughs. Immerse yourself in a world where curiosity knows no limits and tomorrow’s possibilities become today’s reality!

RECENT NEWS

  • Validating AI Ethics Scale for Nursing Students
  • How Identity Shapes New Nurses’ Turnover Intentions
  • Psychological Flexibility Shapes Lasting Effects of Childhood Trauma
  • New Metabolic Inflammation Model Explains Teen Reproductive Issues

Categories

  • Agriculture
  • Anthropology
  • Archaeology
  • Athmospheric
  • Biology
  • Bussines
  • Cancer
  • Chemistry
  • Climate
  • Earth Science
  • Marine
  • Mathematics
  • Medicine
  • Pediatry
  • Policy
  • Psychology & Psychiatry
  • Science Education
  • Social Science
  • Space
  • Technology and Engineering

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 4,859 other subscribers

© 2025 Scienmag - Science Magazine

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • HOME
  • SCIENCE NEWS
  • CONTACT US

© 2025 Scienmag - Science Magazine

Discover more from Science

Subscribe now to keep reading and get access to the full archive.

Continue reading