CACHE

CRITICAL ASSESSMENT OF COMPUTATIONAL HIT-FINDING EXPERIMENTS

DONATE

  • About
    • WHAT IS CACHE
    • Read More
    • Spotlight
    • Conferences
  • CACHE News
  • CHALLENGES
    • Challenge #1
      • Announcement
      • Computation methods
      • Preliminary results
    • Challenge #2
      • Announcement
      • Computation methods
      • Preliminary results
    • Challenge #3
      • Announcement
      • Computation methods
    • Challenge #4
      • Announcement
      • Computation methods
    • FAQ
  • Sponsor a Challenge
  • CONTACT

Challenge #3

Hit Identification
Method type (check all that applies)
Deep learning
High-throughput docking
Description of your approach (min 200 and max 800 words)

We will deploy a proprietary deep learning-based framework to rapidly screen multi-billion small molecule libraries. The performance of the proposed framework is tested on several curated as well as publicly-available unbiased benchmarking datasets. To demonstrate the actual application of the framework, we have screened 1.37 billion molecules to discover new inhibitors of the epigenetic protein BRD9 bromodomain. We have identified and prioritized 17 molecules for in vitro testing, four of which were active, and three of which were chemically distinct from known binders. A novel, first-in-class hit PS-902 that demonstrated an IC50 of ~1.0 µM was discovered.

What makes your approach stand out from the community? (<100 words)

The proposed framework utilizes a lightweight convolutional neural network architecture highly optimized for handling low-level molecular characteristics capable of discovering novel chemotypes while screening one billion commercial, synthesizable libraries (e.g. ENAMINE Real) per day for both specific and multi-targeted hits identification. The successful validation is already done on one target. We are currently also evaluating the performance of the framework on multiple other targets in collaboration with different partners. 

For the CACHE #3 challenge, we are preparing to screen the current (November 2022, 1st week) ENAMINE Real Database with over 5.5 billion molecules. 

Method Name
PrDIN
Commercial software packages used

Maestro (protein preparation)
Glide for docking/hits prioritization (but we'll make a decision later whether to use it or Autodock4/Autpodock Vina or SMINA)

Free software packages used

Python, TensorFlow, Keras, RDkit, Autodock4/Autdock Vina/SMINA, Gromacs (if required), PyMOL

Relevant publications of previous uses by your group of this software/method

We have not published the work yet as the validation studies are currently undergoing. 

 

Cache

All rights reserved
v5.47.19.49

Footer first

  • Login
  • Applicant Login
  • Privacy Policy
  • FAQ
  • Docs
This website is licensed under CC-BY 4.0