Getting Started with Python for Bioinformatics: Installing from Source
Date: 2023-07-30

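Programming is a foundational skill in bioinformatics, and no bioinformatics engineer can avoid it. Some people are still torn between Perl and Python, but Python, currently one of the most popular programming languages, is well worth trying. Beyond text processing, it is widely used for statistics, websites, games, web crawlers, and data visualization. It is more capable and general-purpose than Perl ever was, and its learning curve is gentler. Whether for long-term development or short-term needs, being able to write Python and read Perl (or learning Python first and Perl afterwards) should be a basic skill for every bioinformatics engineer.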

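To do a good job, one must first sharpen one's tools. Python installation tutorials are easy to find online, but they rarely feel complete, and their fixes for the problems that come up along the way are often unsatisfying. This article therefore summarizes how to compile and install python-3.7.3 from source on CentOS Linux release 6.5.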

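1. Installation environment

Operating system: CentOS Linux release 6.5 (x86_64)
GCC version: 4.8.5 20150623

2. Installation steps

If you have root privileges, install the following dependencies:

    yum install zlib
    yum install zlib-devel
    yum install openssl
    yum install openssl-devel
    yum install libffi
    yum install libffi-devel

If you do not have root privileges, see the article "A Step-by-Step Guide to Installing the Latest R from Source on Linux" (《手把手教你如何在 Linux 上源码安装最新版本 R》) and resolve the dependencies above by hand, one at a time.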

2.1 ssl

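Python 3 builds its ssl module against openssl, and Python 3.7 requires openssl 1.0.2 or later. CentOS 6.x ships 1.0.1 by default (1.0.1 is the newest openssl available from the yum repositories on CentOS-6.x), so openssl has to be updated by hand.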
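Before rebuilding OpenSSL, it can help to check what the system already provides. The sketch below assumes some Python 3 interpreter is already at hand and that `openssl version` prints a line of the form `OpenSSL 1.0.1e-fips 11 Feb 2013`. It parses such a line and compares it with the 1.0.2 minimum mentioned above. The function names are illustrative and not part of any library.

```python
import re

MIN_OPENSSL = (1, 0, 2)  # minimum stated in the text for building the ssl module


def parse_openssl_version(banner):
    """Extract (major, minor, patch) from `openssl version` output.

    Letter suffixes such as the "e" in 1.0.1e are ignored.
    Returns None if the text does not look like an OpenSSL banner.
    """
    match = re.search(r"OpenSSL\s+(\d+)\.(\d+)\.(\d+)", banner)
    if match is None:
        return None
    return tuple(int(part) for part in match.groups())


def meets_minimum(banner, minimum=MIN_OPENSSL):
    """Return True if the banner reports at least the minimum version."""
    version = parse_openssl_version(banner)
    return version is not None and version >= minimum


if __name__ == "__main__":
    # Sample banners (assumed formats); paste the real `openssl version` output here.
    for banner in ["OpenSSL 1.0.1e-fips 11 Feb 2013", "OpenSSL 1.1.1  11 Sep 2018"]:
        print(banner, "->", "OK" if meets_minimum(banner) else "too old")
```

Passing this check is necessary rather than sufficient: the notes in this section report that some 1.0.2 builds still failed in practice.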

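    # Download
    wget http://www.openssl.org/source/openssl-1.1.1.tar.gz
    # Unpack
    tar -zxvf openssl-1.1.1.tar.gz
    # Enter the source directory
    cd openssl-1.1.1
    # Configure with a custom install prefix
    ./config --prefix=$HOME/soft-repos/openssl shared zlib
    # Compile and install
    make && make install
    # Add the library directory to the user environment so it is found everywhere
    echo "export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$HOME/soft-repos/openssl/lib" >> $HOME/.bashrc
    # Make the environment change take effect
    source $HOME/.bashrc

Note:

  1. The shared option must be passed when configuring openssl (config). Otherwise, even if the Python source build is given the custom path via --with-openssl, it still fails with "Could not build the ssl module!".

  2. The openssl-1.0.2s and openssl-1.0.2m sources downloaded from https://www.openssl.org/source/, as well as openssl-1.0.2k (the newest version yum installs on CentOS-7.5), have so far still been found to produce "Could not build the ssl module". It is recommended to download the 1.1.1 source from https://www.openssl.org/source/old/ and build that instead.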

2.2 ctypes

Installing libffi-devel on CentOS 6.x may fail as follows:

    $ yum install -y libffi-devel
    Loaded plugins: product-id, refresh-packagekit, search-disabled-repos, security, subscription-manager
    This system is not registered to Red Hat Subscription Management. You can use subscription-manager to register.
    Setting up Install Process
    cdrom                                                    | 4.1 kB     00:00 ...
    No package libffi-devel available.
    Error: Nothing to do

In that case it can be installed directly from the RPM:

    [root@log01 ~]# rpm -ivh http://mirror.centos.org/centos/6/os/x86_64/Packages/libffi-devel-3.0.5-3.2.el6.x86_64.rpm
    Retrieving http://mirror.centos.org/centos/6/os/x86_64/Packages/libffi-devel-3.0.5-3.2.el6.x86_64.rpm
    warning: /var/tmp/rpm-tmp.V9ihbu: Header V3 RSA/SHA256 Signature, key ID c105b9de: NOKEY
    Preparing...                ########################################### [100%]
       1:libffi-devel           ########################################### [100%]
    [root@log01 ~]# rpm -qa|grep libffi
    libffi-3.0.5-3.2.el6.x86_64
    libffi-devel-3.0.5-3.2.el6.x86_64

To install from source instead:

    $ wget ftp://sourceware.org/pub/libffi/libffi-3.2.1.tar.gz
    $ tar zvxf libffi-3.2.1.tar.gz
    $ cd libffi-3.2.1
    $ ./configure --prefix=/Bioinfo/Pipeline/SoftWare/LibDependence/libffi-3.2.1
    $ make
    $ make install

2.3 pygraphviz

Installing pygraphviz directly with pip fails because pkg-config cannot find the Graphviz library libcgraph:

    $ /Bioinfo/Pipeline/SoftWare/Python-3.7.3/bin/pip3 install pygraphviz
    Collecting pygraphviz
      Using cached https://files.pythonhosted.org/packages/7e/b1/d6d849ddaf6f11036f9980d433f383d4c13d1ebcfc3cd09bc845bda7e433/pygraphviz-1.5.zip
    Installing collected packages: pygraphviz
      Running setup.py install for pygraphviz ... error
        Complete output from command /Bioinfo/Pipeline/SoftWare/Python-3.7.3/bin/python3.7 -u -c "import setuptools, tokenize;__file__='/tmp/pip-install-_zdjdg0j/pygraphviz/setup.py';f=getattr(tokenize, 'open', open)(__file__);code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, __file__, 'exec'))" install --record /tmp/pip-record-g0mz7lrq/install-record.txt --single-version-externally-managed --compile:
        running install
        Trying dpkg
        Failed to find dpkg
        Trying pkg-config
        Package libcgraph was not found in the pkg-config search path.
        Perhaps you should add the directory containing `libcgraph.pc'
        to the PKG_CONFIG_PATH environment variable
        No package 'libcgraph' found
        Traceback (most recent call last):
          File "<string>", line 1, in <module>
          File "/tmp/pip-install-_zdjdg0j/pygraphviz/setup.py", line 93, in <module>
            tests_require=['nose>=1.3.7', 'doctest-ignore-unicode>=0.1.2', 'mock>=2.0.0'],
          File "/Bioinfo/Pipeline/SoftWare/Python-3.7.3/lib/python3.7/site-packages/setuptools/__init__.py", line 145, in setup
            return distutils.core.setup(**attrs)
          File "/Bioinfo/Pipeline/SoftWare/Python-3.7.3/lib/python3.7/distutils/core.py", line 148, in setup
            dist.run_commands()
          File "/Bioinfo/Pipeline/SoftWare/Python-3.7.3/lib/python3.7/distutils/dist.py", line 966, in run_commands
            self.run_command(cmd)
          File "/Bioinfo/Pipeline/SoftWare/Python-3.7.3/lib/python3.7/distutils/dist.py", line 985, in run_command
            cmd_obj.run()
          File "/tmp/pip-install-_zdjdg0j/pygraphviz/setup_commands.py", line 44, in modified_run
            self.include_path, self.library_path = get_graphviz_dirs()
          File "/tmp/pip-install-_zdjdg0j/pygraphviz/setup_extra.py", line 162, in get_graphviz_dirs
            include_dirs, library_dirs = _try_configure(include_dirs, library_dirs, _pkg_config)
          File "/tmp/pip-install-_zdjdg0j/pygraphviz/setup_extra.py", line 117, in _try_configure
            i, l = try_function()
          File "/tmp/pip-install-_zdjdg0j/pygraphviz/setup_extra.py", line 72, in _pkg_config
            output = S.check_output(['pkg-config', '--libs-only-L', 'libcgraph'])
          File "/Bioinfo/Pipeline/SoftWare/Python-3.7.3/lib/python3.7/subprocess.py", line 395, in check_output
            **kwargs).stdout
          File "/Bioinfo/Pipeline/SoftWare/Python-3.7.3/lib/python3.7/subprocess.py", line 487, in run
            output=stdout, stderr=stderr)
        subprocess.CalledProcessError: Command '['pkg-config', '--libs-only-L', 'libcgraph']' returned non-zero exit status 1.

        ----------------------------------------
    Command "/Bioinfo/Pipeline/SoftWare/Python-3.7.3/bin/python3.7 -u -c "import setuptools, tokenize;__file__='/tmp/pip-install-_zdjdg0j/pygraphviz/setup.py';f=getattr(tokenize, 'open', open)(__file__);code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, __file__, 'exec'))" install --record /tmp/pip-record-g0mz7lrq/install-record.txt --single-version-externally-managed --compile" failed with error code 1 in /tmp/pip-install-_zdjdg0j/pygraphviz/

Reference: "Installation: fatal error: 'graphviz/cgraph.h' file not found". Graphviz can be built from source as follows:

    $ wget https://graphviz.gitlab.io/pub/graphviz/stable/SOURCES/graphviz.tar.gz
    $ tar zvxf graphviz.tar.gz
    $ cd graphviz-2.40.1
    $ ./configure --prefix=/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1
    $ make && make install

It is recommended to add the installed graphviz to the environment variables. This avoids the missing-header error "pygraphviz/graphviz_wrap.c:2987:29: fatal error: graphviz/cgraph.h: No such file or directory" when pygraphviz is compiled.

    export PKG_CONFIG_PATH=/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1/lib/pkgconfig:$PKG_CONFIG_PATH
    export LD_LIBRARY_PATH=/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1/lib:$LD_LIBRARY_PATH
    export C_INCLUDE_PATH=/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1/include:$C_INCLUDE_PATH
    export CPLUS_INCLUDE_PATH=/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1/include:$CPLUS_INCLUDE_PATH
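One detail of these exports: when a variable such as LD_LIBRARY_PATH or C_INCLUDE_PATH is initially unset, `dir:$VAR` leaves a trailing colon, that is, an empty entry, which the dynamic loader and GCC treat as the current directory. A minimal sketch of a helper that prepends a directory without leaving empty or duplicate entries might look like this (the function name `prepend_path` is hypothetical):

```python
import os


def prepend_path(current, directory, sep=":"):
    """Return `current` with `directory` moved to the front.

    Empty entries (from an unset variable or stray colons) and earlier
    copies of `directory` are dropped.
    """
    kept = [p for p in current.split(sep) if p and p != directory]
    return sep.join([directory] + kept)


if __name__ == "__main__":
    graphviz = "/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1"
    for var, sub in [
        ("PKG_CONFIG_PATH", "lib/pkgconfig"),
        ("LD_LIBRARY_PATH", "lib"),
        ("C_INCLUDE_PATH", "include"),
        ("CPLUS_INCLUDE_PATH", "include"),
    ]:
        value = prepend_path(os.environ.get(var, ""), graphviz + "/" + sub)
        # Print shell export lines that can be pasted into ~/.bashrc.
        print('export {}="{}"'.format(var, value))
```

Running the script prints four export lines built from the current environment, which can then be reviewed before being added to the shell profile.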

With graphviz in place, the pygraphviz Python package can be installed as follows, passing the graphviz include and library directories to the build explicitly:

    $ /Bioinfo/Pipeline/SoftWare/Python-3.7.3/bin/pip3 install --global-option=build_ext --global-option="-I/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1/include" --global-option="-L/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1/lib" pygraphviz
    /Bioinfo/Pipeline/SoftWare/Python-3.7.3/lib/python3.7/site-packages/pip/_internal/commands/install.py:207: UserWarning: Disabling all use of wheels due to the use of --build-options / --global-options / --install-options.
      cmdoptions.check_install_build_global(options)
    Collecting pygraphviz
      Using cached https://files.pythonhosted.org/packages/7e/b1/d6d849ddaf6f11036f9980d433f383d4c13d1ebcfc3cd09bc845bda7e433/pygraphviz-1.5.zip
    Installing collected packages: pygraphviz
      Running setup.py install for pygraphviz ... done
    Successfully installed pygraphviz-1.5
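A quick way to confirm that the package not only installed but also loads its Graphviz libraries at run time is a small smoke test. The sketch below is an assumption about how one might check this, not part of the original procedure. It builds a two-node directed graph and returns the node count, or None when pygraphviz is not installed. The node names are arbitrary.

```python
import importlib.util


def pygraphviz_smoke_test():
    """Build a two-node graph; return its node count, or None if pygraphviz is absent."""
    if importlib.util.find_spec("pygraphviz") is None:
        return None
    import pygraphviz

    graph = pygraphviz.AGraph(directed=True)
    graph.add_edge("reads", "alignment")  # also creates both nodes
    return graph.number_of_nodes()


if __name__ == "__main__":
    count = pygraphviz_smoke_test()
    print("pygraphviz not installed" if count is None else "nodes: {}".format(count))
```

If the import itself raises an error about a missing shared library, the graphviz lib directory is probably not on LD_LIBRARY_PATH.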

2.4 Compiling and installing

First, download the Python source and unpack it.

    # Official download page: https://www.python.org/downloads
    wget https://www.python.org/ftp/python/2.7.15/Python-2.7.15.tgz
    wget https://www.python.org/ftp/python/3.7.3/Python-3.7.3.tgz
    # Unpack into the target directory
    tar zvxf Python-3.7.3.tgz -C /usr/local/src

Second, enter the unpacked source directory and configure the build.

    cd /usr/local/src/Python-3.7.3
    export PKG_CONFIG_PATH=/Bioinfo/Pipeline/SoftWare/LibDependence/libffi-3.2.1/lib/pkgconfig:/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1/lib/pkgconfig:$PKG_CONFIG_PATH
    export LD_LIBRARY_PATH=/Bioinfo/Pipeline/SoftWare/LibDependence/libffi-3.2.1/lib64:/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1/lib:$LD_LIBRARY_PATH
    ./configure \
        --enable-optimizations \
        --prefix=/Bioinfo/Pipeline/SoftWare/Python-3.7.3 \
        --with-openssl=/Bioinfo/Pipeline/SoftWare/LibDependence/openssl-1.1.1 \
        CC=/Bioinfo/Pipeline/SoftWare/gcc-4.8.5/bin/gcc \
        CXX=/Bioinfo/Pipeline/SoftWare/gcc-4.8.5/bin/c++ \
        LDFLAGS="-L/Bioinfo/Pipeline/SoftWare/LibDependence/libffi-3.2.1/lib64 -L/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1/lib" \
        CPPFLAGS="-I/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1/include" \
        PKG_CONFIG_PATH="/Bioinfo/Pipeline/SoftWare/LibDependence/libffi-3.2.1/lib/pkgconfig:/Bioinfo/Pipeline/SoftWare/graphviz-2.40.1/lib/pkgconfig"

  • --enable-optimizations turns on the expensive build optimizations (PGO, etc.). A build made with this flag runs roughly 10% faster, at the cost of a noticeably longer compile time. Using it is recommended.

  • --prefix sets the installation path.

  • To install several Python versions side by side without --enable-shared, it is enough to give each one a different prefix. With --enable-shared, only one version of python is available by default.

  • For Python 3, the OpenSSL dependency can be resolved by passing --with-openssl=DIR, where DIR is the prefix OpenSSL was installed to (the --prefix given to its config step in section 2.1). Otherwise make may fail:

    $ make
    ……
    The following modules found by detect_modules() in setup.py, have been
    built by the Makefile instead, as configured by the Setup files:
    _abc                  atexit                pwd
    time

    Failed to build these modules:
    _ctypes               _hashlib              _ssl

    Could not build the ssl module!
    Python requires an OpenSSL 1.0.2 or 1.1 compatible libssl with X509_VERIFY_PARAM_set1_host().
    LibreSSL 2.6.4 and earlier do not provide the necessary APIs, https://github.com/libressl-portable/portable/issues/381
    ……

  • If make reports "ModuleNotFoundError: No module named '_ctypes'" or "INFO: Could not locate ffi libs and/or headers", see the libffi fix in section 2.2 (ctypes).

  • If --with-openssl=DIR still does not fix the ssl module, edit Modules/Setup.dist and then run configure again. The relevant block is commented out by default and only needs to be uncommented. It enables the SSL module; without it, the installed Python reports that the ssl module cannot be found. The edited block looks like this:

    # Socket module helper for SSL support; you must comment out the other
    # socket line above, and possibly edit the SSL variable:
    SSL=/usr/local/ssl
    _ssl _ssl.c \
        -DUSE_SSL -I$(SSL)/include -I$(SSL)/include/openssl \
        -L$(SSL)/lib -lssl -lcrypto
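The "Failed to build these modules:" section is easy to miss in a long make log. As a minimal sketch, assuming the build output was saved to a file (for example with `make 2>&1 | tee make.log`) and that the module list is terminated by a blank line as in the output shown above, the failed module names can be extracted like this (`failed_modules` is a hypothetical helper name):

```python
def failed_modules(make_log):
    """Return the module names listed under 'Failed to build these modules:'.

    Assumes the names start on the line after the header and that the
    list ends at the first blank line.
    """
    names = []
    collecting = False
    for line in make_log.splitlines():
        text = line.strip()
        if text.startswith("Failed to build these modules:"):
            collecting = True
        elif collecting:
            if not text:
                break
            names.extend(text.split())
    return names


if __name__ == "__main__":
    sample = (
        "Failed to build these modules:\n"
        "_ctypes               _hashlib              _ssl\n"
        "\n"
        "Could not build the ssl module!\n"
    )
    print(failed_modules(sample))  # ['_ctypes', '_hashlib', '_ssl']
```

An empty list means the header was not found, which for a complete log suggests that all optional modules were built.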

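Third, once the Makefile has been generated, run the build and install commands in the current directory.

    make && make install

Fourth, finish the installation. Once the commands above complete without errors, the default python can be switched to 3.7.3. Replacing the system python is generally not recommended, since system tools such as yum depend on it; I suggest simply adding the bin directory of the custom Python to the PATH environment variable:

    # Replace the system python (not recommended)
    mv /usr/bin/python /usr/bin/python2
    ln -s /Bioinfo/Pipeline/SoftWare/Python-3.7.3/bin/python3 /usr/bin/python
    # Add the new Python to the PATH environment variable (recommended)
    echo "export PATH=/Bioinfo/Pipeline/SoftWare/Python-3.7.3/bin:$PATH" >> ~/.bashrc
    source ~/.bashrc

Run python3 -V (or python -V if the system python was replaced) and check that it reports 3.7.3. If it does, the installation succeeded.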
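The version string alone does not show whether the optional modules discussed earlier were actually built. A minimal post-install check, sketched here under the assumption that it is run with the newly installed interpreter, tries to import the three extension modules named in the make error and, if they load, reports which OpenSSL the interpreter is linked against. The helper name `missing_modules` is illustrative.

```python
import importlib
import sys

REQUIRED = ["_ssl", "_ctypes", "_hashlib"]  # the modules that failed in the make log


def missing_modules(names=REQUIRED):
    """Return the subset of `names` that cannot be imported."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    print("Python", ".".join(str(n) for n in sys.version_info[:3]))
    missing = missing_modules()
    if missing:
        print("missing modules:", ", ".join(missing))
    else:
        import ssl

        print("linked against", ssl.OPENSSL_VERSION)
```

If any module is reported missing, the openssl and libffi sections are the first places to look.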

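3. Installing pip and setuptools

    # Download the installer for setuptools and pip
    curl https://bootstrap.pypa.io/get-pip.py -o get-pip.py
    # Run it with the new interpreter (use python3 if the system python was not replaced)
    python get-pip.py

This completes the installation of python-3.7.3 on CentOS Linux release 6.5. If other errors occur during installation, copy the key part of the error message into Google and look at how others have solved it.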



This article was shared from the WeChat public account 生信科技爱好者 (bioitee).