Chapter4 : Application of Artificial Intelligence and Machine Learning in Drug Discovery_applications of artifificial intelligence and mach

作者：木道寻08 | 2024-07-13 19:12:30

踩

applications of artifificial intelligence and machine learning in heart fail

reading notes of《Artificial Intelligence in Drug Design》

1.Introduction

请添加图片描述

In addition to FAIR principles, Schneider et al. provide an excellent discussion on how data should also follow the ALCOA (Attributable, Legible, Contemporaneous, Original and Accurate) guidelines as defined by US FDA.
As a general principal when an opportunity or challenge is recognized within the drug discovery pipeline, we first ask ourselves if applying machine learning would be a good idea. Are there other methods that may be better as well as quicker to get us the desired information? This leads to investigating the actual use case as well as evaluating the amount and quality of data available for such application.

请添加图片描述

Generative chemistry methods can combine scoring based on multiparameters to allow picking compounds that check most of the criteria as set by the project teams.
There has been work done to bring chemistry and biology close to each other by utilizing gene expression information in de-novo compound generation.
Potentially possible, it would be useful to allow retrosynthesis be part of the latent space during the generative chemistry process so that users can get synthetically viable compounds.

The next challenge at hand is target profiling or target assessment. This also includes predicting polypharmacology as well as off-target effects (including toxicity predictions).
A wishful thinking in the area of target profiling may be to utilize machine learning models using clinical as well as real world evidence (RWE) data in addition to all available preclinical data for better target and disease validation.

Various academic groups and industry have invested a lot of resources to provide these models due to the fact that there are frequent late stage failures due to either undesirable ADME properties or toxicity issues. Some of these properties could be measured in a high throughput fashion and thereby leading to generation of large data sets suitable for machine learning.
It’s imperative to discuss a few best practices:
- models should be interpretable
- models should not only be predictable but provide “confidence” for every prediction
- models should be updated routinely to keep them up to data with newly measured data
- Some sort of prospective predictions should be captured at the time of model update process so that project teams can assess the quality of a model for their projects in a prospective way.
An interesting idea to work on would be to build machine learning models that can utilize predicted ADMET properties in addition to physchem properties and generate low dose compounds.

In a more recent work by Coley et al., a panel of ~140K reaction templates was developed as a framework.
There are several limitations:
- sufficiently cover the reaction space
- insufficient negative examples
To enable collection of a larger dataset that could potentially contain more diverse and both positive and negative examples, one could imagine building a consortium where various pharmaceutical industry representatives can encrypt their respective ELN datasets and share that publicly at a precompetitive level.

We strongly believe that this is the high time when industry embraces these methods and make them part of their routine drug discovery process.

声明：本文内容由网友自发贡献，不代表【wpsshop博客】立场，版权归原作者所有，本站不承担相应法律责任。如您发现有侵权的内容，请联系我们。转载请注明出处：https://www.wpsshop.cn/w/木道寻08/article/detail/821128