Abstract & Introduction

最新推荐文章于 2024-10-06 18:52:33 发布

_森罗万象

最新推荐文章于 2024-10-06 18:52:33 发布

阅读量360

点赞数

分类专栏：读书笔记文章标签： AIDD

本文链接：https://blog.youkuaiyun.com/weixin_52812620/article/details/128938288

版权

读书笔记专栏收录该内容

33 篇文章

订阅专栏

该文探讨了深度学习模型在解决药物发现中的分子表示和生成问题上的应用。尽管深度学习在捕获问题统计特征上表现出色，但需要正确的归纳偏置。文章指出，传统的指纹技术和图神经网络可能无法捕捉到分子任务所需的复杂关系，特别是长程依赖性。文中还提到了高通量筛选和QSAR方法在药物发现早期阶段的作用，并预告将在后续章节介绍新的分子表示模型、原型学习启发的图神经网络范式、反向合成策略以及分子优化方法。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

reading notes of《Molecular Graph Representation Learning and Generation for Drug Discovery》

文章目录

Abstract
1.Introduction
- 1.1.Machine Learning Applications for Drug Discovery
- 1.2.Thesis Overview

Abstract

Deep learning models are powerful because they learn the important statistical features of the problem–but only with the correct inductive biases. We tackle this important problem in the context of two molecular problems: representation and generation.
Canonical success of deep learning is deeply rooted in its ability to map the input domain into a meaningful representation space. This is especially poignant for molecular problems, where the “right” relations between molecules is nuanced and complex.

1.Introduction

Within these methods, fingerprint techniques are widely popular, and can be broadly categorized into several types including structure-based [30], topological [1], circular [8] and pharmacophore fingerprints [91].
However, the problem still lies within the deterministic nature of the generating method: if these predefined rules do not capture the right representation for the task, they will not work well. For instance, property cliffs, a phenomenon in which similar molecules exhibit different properties, remain a challenging problem for many small molecule problems.
While sometimes effective,simple paradigm of GNN may not always incorporate the right kind of biases for molecular tasks. For instance, this local neighborhood aggregation can fail to capture long-range dependencies that are important when considering properties of molecules.

1.1.Machine Learning Applications for Drug Discovery

During the discovery phase, high throughput screening (HTS) is conducted on large libraries of molecules, which yields candidate molecules, known as hits. These hit molecules then undergo more screening and optimization to generate a smaller set of lead molecules. The selection of hit and lead compounds is the ideal frontier for machine learning methods to pave new improvements.
Prior to machine learning, QSAR methods were broadly applied to virtual screening. In its most basic form, QSAR methods use a variety of hand-engineered descriptors, such as simple features including atom and bond counts, molecular weight and ring information; more complex descriptors include higher-order topological features and physicochemical properties.

1.2.Thesis Overview

In Chapter 2, I’ll introduce the different rep- resentations of molecules, and new models for their improvement. In the following chapter (Chapter 3), I will talk about another new graph neural network paradigm that borrows ideas from prototype learning. Chapter 4 will talk about retrosynthesis, and how we can produce accurate and diverse synthesis suggestions. Lastly, Chapter 5 will introduce a new method for molecular optimization.