赞
踩
参考:
https://github.com/iwatobipen/chemo_info/blob/master/chemicalspace2/HDBSCAN_Chemoinfo.ipynb
https://www.cnblogs.com/liulunyang/p/14275612.html
本文章主要方法:
1、通过umap获取分子的降维特征数据
2、再通过hdbscan或kmeans聚类方法聚类(hdbscan不支持指定最后聚类总数量,可能非常多;kmeans可以指定总的聚类数)
代码已经上传github:https://github.com/lonngxiang/molecular-clustering
## 安装
1)pip install umap-learn -i https://pypi.douban.com/simple
2)hdbscan pip安装报错:
ERROR: Failed building wheel for hdbscan
Failed to build hdbscan
ERROR: Could not build wheels for hdbscan, which is required to install pyproject.toml-based projects
解决方法conda安装:conda install -c conda-forge hdbscan
conda安装后调用报错:
__init__() got an unexpected keyword argument 'cachedir'
解决方法joblib降级:
pip install joblib==1
Copyright © 2003-2013 www.wpsshop.cn 版权所有,并保留所有权利。