CACHE

CRITICAL ASSESSMENT OF COMPUTATIONAL HIT-FINDING EXPERIMENTS

DONATE

  • About
    • WHAT IS CACHE
    • Read More
    • Spotlight
    • Conferences
  • CACHE News
  • CHALLENGES
    • Challenge #1
      • Announcement
      • Computation methods
      • Preliminary results
    • Challenge #2
      • Announcement
      • Computation methods
      • Preliminary results
    • Challenge #3
      • Announcement
      • Computation methods
    • Challenge #4
      • Announcement
      • Computation methods
    • FAQ
  • Sponsor a Challenge
  • CONTACT

Challenge #4

Hit Identification
Method type (check all that applies)
Deep learning
High-throughput docking
Machine learning
Description of your approach (min 200 and max 800 words)

In the hit identification phase, we plan to deploy a hybrid strategy combining the experience of medicinal chemists with EquiScore. EquiScore is a generic protein-ligand interaction prediction model based on geometric deep learning developed by our team. When designing the model, we thoroughly considered prior information from different sources, including chemical prior information, interaction prior information, spatial prior information, et. We integrated them into a deep learning framework to integrate multiple sources of information to characterize protein-ligand interactions in geometric space. In addition, we considered various potential problems in constructing protein-ligand prediction datasets and proposed several targeted data enhancement strategies so that the model can further extract representations that can be generalized to new targets and ultimately improve the model Screening capabilities on novel targets. In a large-scale retrospective benchmark test, EquiScore's screening ability surpassed the traditional scoring function GLIDE SP, a series of classic machine learning scoring functions, and the newly published deep learning scoring functions DeepDock, RTMScore, PIGNet, TANKBind, etc., and shows the best generalization performance on new targets. Within the team, we cooperated with the experimental department to carry out prospective experimental verification and successfully screened active small molecules of the target.

We will use the trained EquiScore model to screen active compounds in this competition. Specifically, we first use software to generate putative binding poses for all molecules in the database. Second, we will use EquiScore to score these poses and sort the generated pose. Finally, we will cluster the top-scoring molecules and select candidate compounds.

What makes your approach stand out from the community? (<100 words)

We employ several physical and a priori knowledge-based strategies, such as aromatic center, spatial distance, protein-ligand interaction information and space geometric information in the modeling process of EquiScore. At the same time, the advanced equivariant neural network and reasonable data augmentation strategies have further improved his expressive ability and generalization performance, and finally made our model show superior screening ability. Moreover, the performance of EquiScore has been validated both retrospectively and prospectively.

Method Name
EquiScore
Commercial software packages used

Schrödinger Suites 2020-4 version

Free software packages used

RDKit,ProLIF

Relevant publications of previous uses by your group of this software/method

Our article is being submitted

Cache

All rights reserved
v5.47.19.49

Footer first

  • Login
  • Applicant Login
  • Privacy Policy
  • FAQ
  • Docs
This website is licensed under CC-BY 4.0