本篇文章是為了記錄本人在系統(tǒng)發(fā)育實(shí)驗(yàn)和數(shù)據(jù)分析過(guò)程中,所使用過(guò)的以及部分未使用但如雷貫耳的軟件的集合。當(dāng)然,相對(duì)于整個(gè)系統(tǒng)發(fā)育軟件之汪洋大海,此處不過(guò)一滴水!
1. 數(shù)據(jù)獲取
1.1 測(cè)序數(shù)據(jù)整理
- DNAstar DNAstar是一個(gè)序列編輯和分析的全功能包,其子包Seqman能比對(duì)和手動(dòng)調(diào)整Sanger測(cè)序文件,操作簡(jiǎn)單,但默認(rèn)不解析少量低質(zhì)量測(cè)序結(jié)果的測(cè)序文件
- Chromas 功能和Seqman類似,不支持測(cè)序結(jié)果反向互補(bǔ)查看,但沒(méi)有Seqman的低質(zhì)量限制,可做為Seqman的補(bǔ)充
1.2 序列比對(duì)
- Mega 全能型系統(tǒng)發(fā)育工具,比對(duì)和編輯序列較方便,比對(duì)算法集成ClustralX和Muscle方法
- Mafft 快速而準(zhǔn)確的序列比對(duì)工具,在序列比對(duì)方面比Mega效果更好
1.3 比對(duì)后處理
- PhyDE 序列編輯軟件,可以多開(kāi),方便多個(gè)比對(duì)結(jié)果之間進(jìn)行比較
- FastGap 可以對(duì)比對(duì)后的序列的gap進(jìn)行重編碼,提高序列信息使用效率
- Gblocks 在線工具,選擇序列保守區(qū),使得后續(xù)系統(tǒng)發(fā)育分析免受變異過(guò)大的比對(duì)區(qū)域的不良影響
- DAMBE 一個(gè)低調(diào)但全能的系統(tǒng)發(fā)育軟件,定位與Mega類似,包括飽和度檢測(cè)功能點(diǎn)
- DNAsp 序列分析軟件,計(jì)算各種序列多樣性數(shù)據(jù),如核苷酸多樣性、序列信息位點(diǎn)含量、單倍型等
2. 格式轉(zhuǎn)換
- DAMBE 少有的集成有格式轉(zhuǎn)換模塊的綜合性軟件
- PGDSpider 專門(mén)的格式轉(zhuǎn)換軟件,可以在眾多系統(tǒng)發(fā)育和群體遺傳學(xué)的文件格式之間轉(zhuǎn)換
- EasyCodeML 進(jìn)化選擇壓力檢測(cè)軟件,但是也集成了部分常見(jiàn)格式的轉(zhuǎn)換功能
3. 模型選擇
- Modeltest/MrModeltest---ModelGUI 最常使用的DNA突變模型選擇軟件組合,依附于PAUP,直接生成適用于MrBayes和Garli的對(duì)應(yīng)模型命令模塊
- jmodeltest 模型選擇軟件,優(yōu)勢(shì)是可選擇的模型很多,缺點(diǎn)是運(yùn)算速度很慢
- Moderfinder 較新的模型選擇軟件,速度很快,集成于iqtree里面
- PartitionFinder 較新的模型和數(shù)據(jù)分區(qū)選擇軟件
4. 發(fā)育樹(shù)構(gòu)建
4.1 最大簡(jiǎn)約法法
- PAUP 老牌著名系統(tǒng)發(fā)育軟件,支持UPGMA,MP,ML等方法。
- Mega 支持最大簡(jiǎn)約法
- DAMBE 支持最大簡(jiǎn)約法
- Phylip 老牌系統(tǒng)發(fā)育軟件,可覆蓋系統(tǒng)發(fā)育構(gòu)建的幾乎全部流程
4.2 最大似然法
PAUP 老牌著名系統(tǒng)發(fā)育軟件,支持UPGMA,MP,ML等方法。
Phylip 老牌系統(tǒng)發(fā)育軟件,可覆蓋系統(tǒng)發(fā)育構(gòu)建的幾乎全部流程
Garli 基于遺傳算法的構(gòu)建ML樹(shù)的軟件
RAxML 現(xiàn)在較為常用的ML樹(shù)構(gòu)建軟件,速度很快,從RAxML開(kāi)始,由于peusolikelihood和快速自展的引入,最大似然法的運(yùn)算速度得到極大改善,一改最大似然法計(jì)算強(qiáng)度大的印象
ExaML RAxML的擴(kuò)展,支持并行運(yùn)算,速度較快,適用于基因組數(shù)據(jù)
-
iqtree 集成了模型選擇和發(fā)育樹(shù)評(píng)估的建樹(shù)軟件,使用起來(lái)方便,在Zhou et al.對(duì)各種快速似然發(fā)育樹(shù)構(gòu)建方法(RAxML,PhyML,Fasttree,iqtree)的評(píng)價(jià)中,iqtree綜合表現(xiàn)最好
Programs Optimality Criterion Starting Tree Topological Moves Supported Models(DNA) Supported Models(AA) Partitioned Analysis RAxML v8.2.0 (ExaML v3.0.17) ML Parsimony/random/custom SPR Common and custom models JC69, K80, HKY85, GTR Y PhyML v20160530 ML Parsimony/random/custom Interleaved NNI and SPR Common and custom models Common and custom models Y IQ-TREE v1.4.2 ML BIONJ and multiple parsimony/random/custom NNI and stochastic perturbation Common and custom models Common and custom models Y FastTree v2.1.9 ML Heuristic NJ NNI and SPR (ME) followed by NNI (ML) JTT, WAG, LG JC69, GTR N 圖表參考自該論文
4.3 貝葉斯法
MrBayes 貝葉斯方法的主要軟件,但相對(duì)于支持快速自展的ML方法,速度較慢
BEAST 一個(gè)基于貝葉斯方法構(gòu)建發(fā)育樹(shù)、物種樹(shù)、時(shí)間樹(shù)以及時(shí)空動(dòng)態(tài)的平臺(tái)型軟件,分為兩個(gè)版本BEAST1和BEAST2。
-
ExaBayes 針對(duì)于大數(shù)據(jù)集的貝葉斯構(gòu)樹(shù)軟件
Program Brief description BEAST Implements a vast number of models. Examples are the simultaneous estimation of the tree topology and divergence times, phylodynamics, phylogeography, and species tree estimation under the MSC model. MrBayes Implements a large number of models for analysis of nucleotide, amino acid and morphological data. Estimates species phylogenies and species divergence times. RevBayes Similar to MrBayes, but with its own programming language to set up complex hierarchical Bayesian models. MCMCTree Estimates divergence times on a fixed phylogenetic tree. Phycas Estimates phylogenetic trees based on nucleotide data. This allows for multifurcating trees, helping to reduce spuriously high posterior probabilities for phylogenies. PhyloBayes Reconstructs phylogenetic trees using infinite mixture models to account for among-site and among-lineage heterogeneity in nucleotide or amino acid compositions, which may be important for inferring deep phylogenies. BPP Implements species tree estimation and species delimitation under the MSC model using multi-loci genomic sequence data. Migrate Estimates population sizes and migration rates under the population-subdivision model based on molecular data. IMa2 Estimates divergence times, population sizes and migration rates under the isolation-with-migration model using multi-loci DNA sequence data and a fixed phylogenetic tree for populations. Structure Estimates population structure from multi-loci genotype data. BAMM Estimates clade diversification rates on phylogenies. Tracer A program for MCMC diagnostics and summaries. AWTY A package for MCMC diagnostics for Bayesian phylogenetic inference. 圖表參考自此論文
5. 系統(tǒng)發(fā)育網(wǎng)絡(luò)構(gòu)建
TCS 簡(jiǎn)單的構(gòu)建系統(tǒng)發(fā)育網(wǎng)絡(luò)的軟件,官方的下載鏈接可能已經(jīng)失效
Network 同上
SplitsTree 支持多種算法(距離法,NJ法等)的發(fā)育網(wǎng)絡(luò)構(gòu)建軟件
Phylonet 利用最大似然算法推斷系統(tǒng)發(fā)育網(wǎng)絡(luò),能夠區(qū)分雜交事件和不完全譜系分選
JML 同樣宣稱可以區(qū)分雜交和不完全譜系分選的發(fā)育網(wǎng)絡(luò)構(gòu)建軟件
-
SplitsTree Network Arlequin TCS T-Rex SpectroNet CombineTrees Input format Nexus Unique Unique, can import Phylip Nexus Phylip Nexus Nexus Data types Characters, distances, trees Characters Characters Characters, distances Characters, distances Characters, trees Trees Network method Character-display networks Median network * * Qusai-median network * * Reduced-median network Greedily reduced-median netword Local buneman graph Quartet window annlysis Pruned median network Pruned quasi-median network * Parsimony splits * Splits decomposition * Neighobor-net * Parsimony networks Netting Union of maximum-parsimony trees * Minimum-spanning netword * * Median-joining * * Statistical-parsimony * Other networks Pyramids Statistical geometry Concordance tree Netview Reticulogram * Tree-display networks Consensus network * * Super-network * Filtered super-network * 該表參考自書(shū)籍 Morrison, D. A. (2011). An introduction to phylogenetic networks. RJR productions.
6. 物種樹(shù)構(gòu)建
BEAST
BPP BEAST和BPP軟件推算物種樹(shù)的時(shí)候同時(shí)計(jì)算每個(gè)基因的基因樹(shù)及其和每個(gè)物種樹(shù)的組合概率,稱為一步法,一般較為準(zhǔn)確,但是運(yùn)算量很大,難以適用于基因組時(shí)代的大量數(shù)據(jù)
ASTRAL 該軟件需先計(jì)算出每個(gè)基因的基因樹(shù),然后再根據(jù)基因樹(shù)利用溯祖原理推斷出物種樹(shù),稱為兩步法,速度很快
-
NJst
Program Analytical framework Input Output GMYC Best-fit tree branching models (coalescent vs Yule) Ultrametric gene tree Transition point from species to populations, and estimate of species number Brownie Maximum likelihood or gene tree parsimony Gene trees Species tree of delimited species and group membership SpedeSTEM Maximum likelihood and/or information theory Sequence alignments and group membership Species tree of delimited species BP&P Bayesian and/or reversible-jump MCMC Sequence alignments, group membership, and guide tree Posterior probability distribution of species delimitation models, coalescent times, and population sizes [圖片上傳失敗...(image-6debf-1639380305072)]
圖表參考自此論文論文
7. 發(fā)育樹(shù)注釋
-
祖先狀態(tài)重建
- Mesquite 功能相當(dāng)全面的系統(tǒng)發(fā)育軟件,包括其他軟件少有的發(fā)育樹(shù)編輯功能。對(duì)于祖先狀態(tài)重建,該軟件支持MP和ML方法
- BayesTraits 支持貝葉斯方法進(jìn)行狀態(tài)重建,這里狀態(tài)包括形態(tài)性狀和地理分布數(shù)據(jù)
-
祖先地理重建
- RASP 常見(jiàn)的祖先地理重建軟件,操作簡(jiǎn)便,支持多種模型和參數(shù)設(shè)置
- BayesTraits 支持貝葉斯方法進(jìn)行狀態(tài)重建,這里狀態(tài)包括形態(tài)性狀和地理分布數(shù)據(jù)
-
時(shí)間樹(shù)
- BEAST 最常用的時(shí)間樹(shù)構(gòu)建軟件,基于貝葉斯算法,速度較慢
- MCMCtree PAML軟件的子包,常用的時(shí)間樹(shù)構(gòu)建軟件,在基因組時(shí)代使用較多
- r8s 基于最大似然法的時(shí)間樹(shù)構(gòu)建軟件,計(jì)算相對(duì)時(shí)間,善于應(yīng)對(duì)沒(méi)有化石校準(zhǔn)點(diǎn)的狀況,貌似官方下載鏈接已失效
Program Method Brief description Beast Bayesian Comprehensive suite of models. Particularly strong for the analysis of serially sampled DNA sequences. Includes models of morphological traits DPPDiv Bayesian Dirichlet relaxed clock model. Fossilized birth–death process prior to calibrate time trees MCMCTree Bayesian Comprehensive suite of models of rate variation. Fast approximate likelihood method that allows the estimation of time trees using genome alignments MrBayes Bayesian Large suite of models for morphological and molecular evolutionary analysis. Comprehensive suite of models of rate variation Multidivtime Bayesian The first Bayesian clock dating program. Introduced the geometric Brownian model and the approximate likelihood method PhyloBayes Bayesian Broad suite of models. Uses data augmentation to speed up likelihood calculation and can be efficiently used in parallel computing environments (MPI enabled) r8s Penalized likelihood Very fast (uses Poisson densities on inferred mutations to approximate the likelihood). Suitable for the analysis of large phylogenies. Suitable for estimating relative ages (by fixing the age of the root to 1). Does not deal with fossil and branch length uncertainty correctly TreePL Penalized likelihood Similar to r8s 圖標(biāo)參考自此論文。
8. 發(fā)育樹(shù)評(píng)估
- Consel 一個(gè)年代相當(dāng)久遠(yuǎn)的發(fā)育樹(shù)評(píng)估軟件,但是功能相對(duì)全面,現(xiàn)在還少見(jiàn)可以替代的軟件
- Bucky 多基因樹(shù)中,可以評(píng)估發(fā)育樹(shù)中某個(gè)拓?fù)浣Y(jié)構(gòu)受到多少個(gè)基因數(shù)據(jù)的支持,而不是僅僅只看總體的支持率
9. 發(fā)育樹(shù)可視化
Figtree 常見(jiàn)又好用的發(fā)育樹(shù)可視化軟件,當(dāng)額外的形態(tài)性狀或者地理分布注釋數(shù)據(jù)不多的時(shí)候,很好用
Tree graph 發(fā)育樹(shù)可視化軟件,特色之一是可以手動(dòng)畫(huà)發(fā)育樹(shù),自定義拓?fù)浣Y(jié)構(gòu)和枝長(zhǎng)
iTol 發(fā)育樹(shù)可視化在線工具,可以注釋多種多樣的數(shù)據(jù)
EvolView 類似iTOL的在線工具,但是是由國(guó)人創(chuàng)造的
ggtree Y叔開(kāi)發(fā)的發(fā)育樹(shù)可視化R包,在最新的版本中g(shù)gtree被拆分成了樹(shù)文件讀取解析的treeio和專注于可視化的ggtree,如果你熟悉R語(yǔ)言,那么也不失于一個(gè)好選擇
10. 在線工具
CIPRES 系統(tǒng)發(fā)育在線工具, 支持大量主要的系統(tǒng)發(fā)育和群體遺傳性軟件( RAxML; MrBayes; BEAST; BEAST2; GARLI; MAFFT; ExaBayes; DPPDIV; FastTree, jModelTest2, ModelTest-NG, PAUP, ParallelStructure, PartitionFinder2, IQ Tree, Bali-Phy, and Migrate-N. ),并且這些軟件都被配置以支持并行計(jì)算
MABL 小眾的系統(tǒng)發(fā)育在線工具
GBlocks 保守區(qū)選擇在線工具
FindModel 堿基突變模型選擇工具
bPTP 物種劃分在線工具,基于泊松樹(shù)過(guò)程處理發(fā)育樹(shù),推斷出一棵樹(shù)上合理的物種數(shù)量
GMYC 和上一個(gè)功能類似,只是基于廣義混合Yule溯祖過(guò)程
RAxML BlackBox 一個(gè)RXaML在線工具
IQ-TREE 軟件官方提供的在線工具
HIV Database Tools 在線工具合集,主要針對(duì)于HIV數(shù)據(jù),但是也有一些通用的工具
iTol 發(fā)育樹(shù)可視化在線工具,可以注釋多種多樣的數(shù)據(jù)
EvolView 類似iTOL的在線工具,但是是由國(guó)人創(chuàng)造的
VizioMetrics 這不是一個(gè)數(shù)據(jù)分析工具,他可以根據(jù)圖表標(biāo)題搜索其SCI文章出處
11. 軟件合集
- PhyloSuite 雖然它只是一個(gè)軟件,而不是一個(gè)合集,但卻是迄今最全面的系統(tǒng)發(fā)育流程整合軟件,從序列下載、比對(duì)、模型選擇到發(fā)育樹(shù)構(gòu)建均可以完成,主要是還可以設(shè)置批處理流程,設(shè)置好后一鍵運(yùn)行,系統(tǒng)發(fā)育分析再也不用在軟件間繁瑣地跳來(lái)跳去了。
- ExPASy Bioinformatics Resource Portal
- Phylogeny Programs pages 系統(tǒng)發(fā)育軟件集合
- 生物軟件網(wǎng)