赞
踩
reading notes of《Artificial Intelligence in Drug Design》
Regarding the usability of the generative models we observe two main trends for de novo design: distribution-learning and goal- directed generation
From user’s perspective however it is much more important to understand how to use the generative models in order to achieve either exploration or exploitation of the chemical space. For exploitation, users define an area of interest and focus on generating compounds that share similar structural features. In contrast, the exploration mode enables them to obtain compounds that share less structural similarity but still satisfy other desired features.
Maximizing the outcome of a given scoring function may sometimes lead to get repeatedly stuck in a narrow space of solutions. This would in turn lead to a mode collapse of the generative model. In order to avoid such scenario REINVENT uses diversity filters (DF).
DF prevents from gaining reward when generating the same or similar compounds recurrently. This is achieved by memorizing the generated compounds.
Conducting a successful RL run can sometimes be challenging especially if the SF is composed by very strict components or by components that are often orthogonal. This would lead to longer learning times and reaching the state of productivity much later, thus generating a lower yield of compounds with a score that satisfies us. To help the learning process we may resort to prefocusing the generative model via TL. The focused model can be subsequently used as a starting point in the RL.
An alternative to prefocusing the generative model is to use “inception.” The inception feature in REINVENT is a modified version of experience replay.
A notable feature of REINVENT is the ability to combine together a variety of factors into a single scoring function. However, bringing multiple components together within a single generation run can hold some challenges. Therefore it is important to find the right balance and to include the most relevant factors so that the learning process is conducted at optimal speed and the generated compounds are scored reliably.
Copyright © 2003-2013 www.wpsshop.cn 版权所有,并保留所有权利。