Software to generate mega chemical library

Non-toxic peptides that selectively degrade misfolded proteins for neurodegenerative diseases.
Technology No. 2025-LI-70824

Early drug discovery is a very time-consuming and costly process. To accelerate drug discovery, scientists often utilize computational tools to search for compounds that have potential to become pharmaceutically active molecules . Nowadays, public chemical databases or libraries typically contain 10 billion or few compounds. However, researchers at Purdue University have developed software to generate mega chemical libraries. Unlike other software products, this solution is not based on artificial intelligence or machine learning but is rather a systems and knowledge driven approach to identify potential lead molecules. The approach starts from a given compound and readily generates billions or trillions of compounds, covering an unprecedentedly large chemical space, which may contain improved candidates for drug discovery. More specifically, this fragment module library invented by Purdue scientists uses a fully automated approach that is composed of the following steps - 1) creation of backbone and peripheral libraries, 2) introduction of chemical mutations, 3) systematic combination of mutated backbone constituents and peripheral constituents, and 4) systematic attachment of formulated modules to the parent compound - to create a mega library of potential lead compounds that have high potentiality for biological activity and high synthesis accessibility.

Technology Validation:

The fragment module library generated using an example software program of the disclosure (viz., ChemHopperTM) versus Deepfrag was able to generate more known lead candidates when tested on three known small molecules.

Advantages:

-Fully automated

-Starts from a given compound and readily generates billions or trillions of compounds

-Extremely large library size compared to current public libraries

Applications:

-Synthetic biology

-Pharmaceuticals

-Drug Discovery

TRL: Biotechnology

Intellectual Property:

Provisional-Patent, 2024-09-13, United States

Utility Patent, 2025-09-12, United States

Keywords: Computer Technology, Drug Discovery, Fragment libraries, Lead optimization, Pharmaceuticals

  • expand_more cloud_download Supporting documents (1)
    Product brochure
    Software to generate mega chemical library.pdf
Questions about this technology?