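A Collection of Common Phylogenetics Software and Online Tools

This article records the software I have used in phylogenetic experiments and data analysis, along with some programs I have not used but that are too well known to leave out. Measured against the vast ocean of phylogenetics software, of course, this list is no more than a single drop!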

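1. Data acquisition

1.1 Organizing sequencing data

  • DNAstar: A full-featured package for sequence editing and analysis. Its SeqMan module assembles Sanger sequencing files and lets you adjust them by hand. It is easy to use, but by default it will not parse the occasional sequencing file that contains low-quality reads.
  • Chromas: Similar in function to SeqMan. It cannot display the reverse complement of a sequencing result, but it has no low-quality restriction, so it works well as a complement to SeqMan.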
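
When a viewer cannot show the reverse complement of a read, it is easy to compute one yourself. The following is a minimal sketch in Python, not a feature of either program; the function name `reverse_complement` is my own, and it assumes plain DNA letters plus the ambiguity code N.

```python
# Complement table for DNA, including the ambiguity code N; case is preserved.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    # Complement every base first, then reverse the whole string.
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGCCN"))
```

Characters outside the table (for example other IUPAC ambiguity codes) pass through unchanged, so extend the table if your reads contain them.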

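1.2 Sequence alignment

  • Mega: An all-round phylogenetics tool that makes aligning and editing sequences convenient. Its built-in alignment methods are ClustalW and Muscle.
  • Mafft: A fast and accurate sequence alignment tool that generally aligns sequences better than Mega.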

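1.3 Post-alignment processing

  • PhyDE: A sequence editor that can run as several instances at once, which makes it easy to compare multiple alignments.
  • FastGap: Recodes the gaps in aligned sequences so that more of the information in the sequences can be used.
  • Gblocks: An online tool that selects the conserved blocks of an alignment, so that downstream phylogenetic analysis is not harmed by overly variable alignment regions.
  • DAMBE: A low-profile but all-round phylogenetics package, positioned much like Mega; its features include a substitution saturation test.
  • DNAsp: Sequence analysis software that computes a range of sequence diversity statistics, such as nucleotide diversity, the number of informative sites, and haplotypes.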
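
To make two of the statistics named for DNAsp concrete, here is a small Python sketch that computes nucleotide diversity (the mean proportion of differing sites over all sequence pairs) and the number of parsimony-informative sites. It is an illustration under my own assumptions, not DNAsp's implementation: sequences are uppercase and aligned, and any column containing a gap or ambiguity code is simply dropped.

```python
from collections import Counter


def clean_columns(seqs):
    """Keep only alignment columns made up entirely of A, C, G or T."""
    return [col for col in zip(*seqs) if set(col) <= set("ACGT")]


def nucleotide_diversity(seqs):
    """Mean proportion of differing sites over all sequence pairs (pi)."""
    cols = clean_columns(seqs)
    n = len(seqs)
    pairs = n * (n - 1) / 2
    diffs = 0.0
    for col in cols:
        counts = Counter(col)
        # Pairs that differ at this site = all pairs minus identical pairs.
        same = sum(c * (c - 1) / 2 for c in counts.values())
        diffs += pairs - same
    return diffs / (pairs * len(cols))


def parsimony_informative_sites(seqs):
    """Count columns with at least two states that each occur at least twice."""
    total = 0
    for col in clean_columns(seqs):
        frequent_states = [c for c in Counter(col).values() if c >= 2]
        if len(frequent_states) >= 2:
            total += 1
    return total


if __name__ == "__main__":
    alignment = ["ACGTAC", "ACGTTC", "ATGTTC", "ATGAAC"]
    print(nucleotide_diversity(alignment))
    print(parsimony_informative_sites(alignment))
```

Dropping gapped columns is the simplest policy; dedicated software offers other ways of handling missing data, and results can differ accordingly.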

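2. Format conversion

  • DAMBE: One of the few general-purpose packages with a built-in format conversion module.
  • PGDSpider: A dedicated format converter that translates between many phylogenetic and population genetic file formats.
  • EasyCodeML: Software for detecting selection pressure, which also bundles conversion between some common formats.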
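
For a one-off conversion, a few lines of code are sometimes enough. The sketch below converts aligned FASTA text to sequential, relaxed PHYLIP in plain Python. The function names are my own and the layout (names padded to a common width, then two spaces) is one common convention; check what your downstream program expects before relying on it.

```python
def parse_fasta(text):
    """Parse FASTA text into a list of (name, sequence) tuples."""
    records = []
    name, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                records.append((name, "".join(chunks)))
            # Use the first word of the header as the sequence name.
            name, chunks = line[1:].split()[0], []
        else:
            chunks.append(line)
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


def to_relaxed_phylip(records):
    """Format aligned records as sequential, relaxed PHYLIP text."""
    lengths = {len(seq) for _, seq in records}
    if len(lengths) != 1:
        raise ValueError("sequences must be aligned to the same length")
    width = max(len(name) for name, _ in records)
    lines = [f"{len(records)} {lengths.pop()}"]
    lines += [f"{name.ljust(width)}  {seq}" for name, seq in records]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    fasta = ">sp1 first sample\nACGT\nAC\n>species2\nAC-TAC\n"
    print(to_relaxed_phylip(parse_fasta(fasta)), end="")
```

The length check matters: PHYLIP requires an alignment, so unaligned FASTA input raises an error instead of producing a silently broken file.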

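3. Model selection

  • Modeltest/MrModeltest---ModelGUI: The most commonly used combination for selecting DNA substitution models. It depends on PAUP and directly generates the matching model command blocks for MrBayes and Garli.
  • jmodeltest: Model selection software. Its strength is the large number of candidate models; its weakness is that it runs slowly.
  • ModelFinder: A newer model selection program that is very fast and is built into iqtree.
  • PartitionFinder: A newer program that selects both substitution models and data partitioning schemes.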
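
Model selection programs of this kind typically rank candidate models with information criteria such as AIC and BIC, which trade goodness of fit against the number of free parameters. The Python sketch below shows the arithmetic only. The log-likelihoods, parameter counts, and alignment length are made-up illustrative numbers, and `best_model` is a hypothetical helper, not part of any of the programs above.

```python
import math


def aic(log_likelihood, k):
    """Akaike information criterion: 2k - 2 lnL (lower is better)."""
    return 2 * k - 2 * log_likelihood


def bic(log_likelihood, k, n_sites):
    """Bayesian information criterion: k ln(n) - 2 lnL (lower is better)."""
    return k * math.log(n_sites) - 2 * log_likelihood


def best_model(candidates, n_sites, criterion="bic"):
    """Return the name of the model with the lowest score."""
    def score(item):
        log_likelihood, k = item[1]
        if criterion == "aic":
            return aic(log_likelihood, k)
        return bic(log_likelihood, k, n_sites)
    return min(candidates.items(), key=score)[0]


if __name__ == "__main__":
    # Hypothetical (lnL, free substitution parameters) for three models.
    candidates = {
        "JC69": (-2150.0, 0),
        "HKY85": (-2100.0, 4),
        "GTR+G": (-2098.0, 9),
    }
    print(best_model(candidates, n_sites=1000, criterion="aic"))
    print(best_model(candidates, n_sites=1000, criterion="bic"))
```

Real programs also count branch lengths as parameters; on a fixed tree that adds the same constant to every model, so the ranking in this sketch is unaffected.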

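4. Tree building

4.1 Maximum parsimony

  • PAUP: A famous, long-established phylogenetics program that supports UPGMA, MP, ML, and other methods.
  • Mega: Supports maximum parsimony.
  • DAMBE: Supports maximum parsimony.
  • Phylip: A long-established phylogenetics package that covers almost the entire tree-building workflow.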
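
Maximum parsimony prefers the tree that needs the fewest character changes. As an illustration of how that score is counted on a given tree, here is a short Python sketch of Fitch's algorithm for unordered characters. It only scores a fixed tree and does not search tree space as the programs above do; the nested-tuple tree encoding and the function names are my own assumptions.

```python
def fitch(tree, states):
    """Return (state_set, changes) for one character on a rooted binary tree.

    A tree is either a leaf name (str) or a 2-tuple of subtrees.
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left_set, left_cost = fitch(tree[0], states)
    right_set, right_cost = fitch(tree[1], states)
    common = left_set & right_set
    if common:
        # The children agree on at least one state: no extra change needed.
        return common, left_cost + right_cost
    # The children disagree: take the union and count one change.
    return left_set | right_set, left_cost + right_cost + 1


def parsimony_score(tree, alignment):
    """Sum the Fitch change counts over every site of an alignment dict."""
    n_sites = len(next(iter(alignment.values())))
    total = 0
    for i in range(n_sites):
        site = {taxon: seq[i] for taxon, seq in alignment.items()}
        total += fitch(tree, site)[1]
    return total


if __name__ == "__main__":
    alignment = {"A": "ACA", "B": "ACT", "C": "ATT", "D": "ATA"}
    for tree in ((("A", "B"), ("C", "D")), (("A", "C"), ("B", "D"))):
        print(tree, parsimony_score(tree, alignment))
```

On this toy alignment the first tree needs fewer changes than the second, so parsimony prefers it.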

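4.2 Maximum likelihood

  • PAUP: A famous, long-established phylogenetics program that supports UPGMA, MP, ML, and other methods.

  • Phylip: A long-established phylogenetics package that covers almost the entire tree-building workflow.

  • Garli: Builds ML trees using a genetic algorithm.

  • RAxML: A widely used ML tree builder that is very fast. Starting with RAxML, the introduction of pseudolikelihood and rapid bootstrapping greatly improved the speed of maximum likelihood analysis and overturned the method's reputation for being computationally heavy.

  • ExaML: An extension of RAxML that supports parallel computation, runs fairly fast, and is suited to genome-scale data.

  • iqtree: A tree builder that integrates model selection and tree evaluation and is convenient to use. In the evaluation by Zhou et al. of fast likelihood-based tree-building programs (RAxML, PhyML, Fasttree, iqtree), iqtree had the best overall performance.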

    Programs | Optimality Criterion | Starting Tree | Topological Moves | Supported Models (AA) | Supported Models (DNA) | Partitioned Analysis
    RAxML v8.2.0 (ExaML v3.0.17) | ML | Parsimony/random/custom | SPR | Common and custom models | JC69, K80, HKY85, GTR | Y
    PhyML v20160530 | ML | Parsimony/random/custom | Interleaved NNI and SPR | Common and custom models | Common and custom models | Y
    IQ-TREE v1.4.2 | ML | BIONJ and multiple parsimony/random/custom | NNI and stochastic perturbation | Common and custom models | Common and custom models | Y
    FastTree v2.1.9 | ML | Heuristic NJ | NNI and SPR (ME) followed by NNI (ML) | JTT, WAG, LG | JC69, GTR | N

    Table adapted from Zhou et al.
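
Bootstrap support, mentioned for RAxML above, comes from re-analysing resampled copies of the alignment. The Python sketch below shows only the resampling step of the standard nonparametric bootstrap: alignment columns are drawn with replacement to build a pseudo-replicate of the same length. This is not RAxML's rapid bootstrap algorithm, and the function name and dict-based alignment are my own assumptions.

```python
import random


def bootstrap_replicate(alignment, rng):
    """Resample alignment columns with replacement.

    alignment maps taxon names to aligned sequences of equal length;
    rng is a random.Random instance so that runs can be reproduced.
    """
    n_sites = len(next(iter(alignment.values())))
    # Draw the same column indices for every taxon to keep columns intact.
    picks = [rng.randrange(n_sites) for _ in range(n_sites)]
    return {
        name: "".join(seq[i] for i in picks)
        for name, seq in alignment.items()
    }


if __name__ == "__main__":
    alignment = {"A": "ACGTAC", "B": "ACGTTC", "C": "ATGTTC"}
    rng = random.Random(42)
    for _ in range(3):
        print(bootstrap_replicate(alignment, rng))
```

Each replicate would then be passed to a tree builder, and the support for a branch is the share of replicate trees that contain it.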

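4.3 Bayesian inference

  • MrBayes: The main program for Bayesian inference, but slower than ML methods that support rapid bootstrapping.

  • BEAST: A platform-style package for Bayesian inference of phylogenetic trees, species trees, time trees, and spatiotemporal dynamics. It comes in two versions, BEAST1 and BEAST2.

  • ExaBayes: A Bayesian tree-building program aimed at large datasets.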

    Program | Brief description
    BEAST | Implements a vast number of models. Examples are the simultaneous estimation of the tree topology and divergence times, phylodynamics, phylogeography, and species tree estimation under the MSC model.
    MrBayes | Implements a large number of models for analysis of nucleotide, amino acid and morphological data. Estimates species phylogenies and species divergence times.
    RevBayes | Similar to MrBayes, but with its own programming language to set up complex hierarchical Bayesian models.
    MCMCTree | Estimates divergence times on a fixed phylogenetic tree.
    Phycas | Estimates phylogenetic trees based on nucleotide data. This allows for multifurcating trees, helping to reduce spuriously high posterior probabilities for phylogenies.
    PhyloBayes | Reconstructs phylogenetic trees using infinite mixture models to account for among-site and among-lineage heterogeneity in nucleotide or amino acid compositions, which may be important for inferring deep phylogenies.
    BPP | Implements species tree estimation and species delimitation under the MSC model using multi-loci genomic sequence data.
    Migrate | Estimates population sizes and migration rates under the population-subdivision model based on molecular data.
    IMa2 | Estimates divergence times, population sizes and migration rates under the isolation-with-migration model using multi-loci DNA sequence data and a fixed phylogenetic tree for populations.
    Structure | Estimates population structure from multi-loci genotype data.
    BAMM | Estimates clade diversification rates on phylogenies.
    Tracer | A program for MCMC diagnostics and summaries.
    AWTY | A package for MCMC diagnostics for Bayesian phylogenetic inference.

    Table adapted from this paper.

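5. Phylogenetic network construction

  • TCS: A simple program for building phylogenetic networks; the official download link may no longer work.

  • Network: Same as above.

  • SplitsTree: A network-building program that supports several algorithms (distance methods, NJ, and others).

  • Phylonet: Infers phylogenetic networks by maximum likelihood and can distinguish hybridization from incomplete lineage sorting.

  • JML: Another network-building program that claims to distinguish hybridization from incomplete lineage sorting.

  • Dendroscope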

    (feature) | SplitsTree | Network | Arlequin | TCS | T-Rex | SpectroNet | CombineTrees
    Input format | Nexus | Unique | Unique, can import Phylip | Nexus | Phylip | Nexus | Nexus
    Data types | Characters, distances, trees | Characters | Characters | Characters, distances | Characters, distances | Characters, trees | Trees
    Network method
    Character-display networks
    Median network * *
    Quasi-median network * *
    Reduced-median network
    Greedily reduced-median network
    Local Buneman graph
    Quartet window analysis
    Pruned median network
    Pruned quasi-median network *
    Parsimony splits *
    Splits decomposition *
    Neighbor-net *
    Parsimony networks
    Netting
    Union of maximum-parsimony trees *
    Minimum-spanning network * *
    Median-joining * *
    Statistical-parsimony *
    Other networks
    Pyramids
    Statistical geometry
    Concordance tree
    Netview
    Reticulogram *
    Tree-display networks
    Consensus network * *
    Super-network *
    Filtered super-network *

    Table adapted from the book: Morrison, D. A. (2011). An introduction to phylogenetic networks. RJR Productions.

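6. Species tree construction

  • BEAST

  • BPP: When BEAST and BPP estimate a species tree, they simultaneously compute the gene tree of every gene and its joint probability with each species tree. This is called the one-step approach. It is generally more accurate, but the computational cost is very high, which makes it hard to apply to the large datasets of the genomic era.

  • ASTRAL: Requires a gene tree for every gene first, then infers the species tree from those gene trees under the coalescent. This is called the two-step approach, and it is very fast.

  • STEM-hy

  • MP-EST

  • NJst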

    Program | Analytical framework | Input | Output
    GMYC | Best-fit tree branching models (coalescent vs Yule) | Ultrametric gene tree | Transition point from species to populations, and estimate of species number
    Brownie | Maximum likelihood or gene tree parsimony | Gene trees | Species tree of delimited species and group membership
    SpedeSTEM | Maximum likelihood and/or information theory | Sequence alignments and group membership | Species tree of delimited species
    BP&P | Bayesian and/or reversible-jump MCMC | Sequence alignments, group membership, and guide tree | Posterior probability distribution of species delimitation models, coalescent times, and population sizes


    Table adapted from this paper.

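7. Tree annotation

  • Ancestral state reconstruction

    • Mesquite: A very comprehensive phylogenetics package that includes tree editing, a feature few other programs offer. For ancestral state reconstruction it supports the MP and ML methods.
    • BayesTraits: Supports Bayesian state reconstruction, where the states can be morphological characters or geographic distribution data.
  • Ancestral area reconstruction

    • RASP: A common program for ancestral area reconstruction that is easy to operate and supports many models and parameter settings.
    • BayesTraits: Supports Bayesian state reconstruction, where the states can be morphological characters or geographic distribution data.
  • Time trees

    • BEAST: The most commonly used time-tree program. It is based on Bayesian inference and is relatively slow.
    • MCMCtree: A component of the PAML package and a commonly used time-tree program, widely used in the genomic era.
    • r8s: A likelihood-based time-tree program (maximum likelihood and penalized likelihood) that computes relative times and copes well when no fossil calibration points are available. The official download link appears to be dead.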
    Program | Method | Brief description
    BEAST | Bayesian | Comprehensive suite of models. Particularly strong for the analysis of serially sampled DNA sequences. Includes models of morphological traits
    DPPDiv | Bayesian | Dirichlet relaxed clock model. Fossilized birth–death process prior to calibrate time trees
    MCMCTree | Bayesian | Comprehensive suite of models of rate variation. Fast approximate likelihood method that allows the estimation of time trees using genome alignments
    MrBayes | Bayesian | Large suite of models for morphological and molecular evolutionary analysis. Comprehensive suite of models of rate variation
    Multidivtime | Bayesian | The first Bayesian clock dating program. Introduced the geometric Brownian model and the approximate likelihood method
    PhyloBayes | Bayesian | Broad suite of models. Uses data augmentation to speed up likelihood calculation and can be efficiently used in parallel computing environments (MPI enabled)
    r8s | Penalized likelihood | Very fast (uses Poisson densities on inferred mutations to approximate the likelihood). Suitable for the analysis of large phylogenies. Suitable for estimating relative ages (by fixing the age of the root to 1). Does not deal with fossil and branch length uncertainty correctly
    TreePL | Penalized likelihood | Similar to r8s

    Table adapted from this paper.

8. Tree evaluation

  • Consel: A rather old tree evaluation program, but relatively comprehensive, and programs that can replace it are still rare.
  • Bucky: For multi-gene trees, evaluates how many genes support a given topology in the tree, rather than looking only at the overall support value.
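
The idea of asking how many genes support a grouping can be illustrated with a plain count. The Python sketch below reports the fraction of rooted gene trees that contain a given clade. It is deliberately naive and is not BUCKy's Bayesian concordance analysis; the nested-tuple tree encoding and the function names are my own assumptions.

```python
def clades(tree):
    """Return (leaf_set, clade_set) for a rooted tree given as nested tuples.

    A tree is either a leaf name (str) or a tuple of subtrees.
    """
    if isinstance(tree, str):
        return frozenset([tree]), set()
    leaves = frozenset()
    found = set()
    for child in tree:
        child_leaves, child_clades = clades(child)
        leaves |= child_leaves
        found |= child_clades
    # Every internal node defines one clade: the set of leaves below it.
    found.add(leaves)
    return leaves, found


def clade_support(gene_trees, clade):
    """Fraction of gene trees that contain the given clade."""
    clade = frozenset(clade)
    hits = sum(1 for tree in gene_trees if clade in clades(tree)[1])
    return hits / len(gene_trees)


if __name__ == "__main__":
    gene_trees = [
        (("A", "B"), ("C", "D")),
        (("A", "B"), ("C", "D")),
        (("A", "C"), ("B", "D")),
        ((("A", "B"), "C"), "D"),
    ]
    print(clade_support(gene_trees, {"A", "B"}))
    print(clade_support(gene_trees, {"C", "D"}))
```

A count like this treats every gene tree as error-free, which is exactly the simplification that dedicated concordance methods try to avoid.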

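9. Tree visualization

  • Figtree: A common and handy tree viewer. It works very well when there is not much extra annotation data such as morphological characters or geographic distributions.

  • Tree graph: Tree visualization software. One distinctive feature is that you can draw a tree by hand, with a custom topology and branch lengths.

  • iTOL: An online tree visualization tool that can annotate trees with many kinds of data.

  • EvolView: An online tool similar to iTOL, developed by Chinese researchers.

  • ggtree: An R package for tree visualization developed by Guangchuang Yu ("Y叔"). In recent versions ggtree has been split into treeio, which reads and parses tree files, and ggtree, which focuses on visualization. If you are familiar with R, it is a good choice.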

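10. Online tools

  • CIPRES: An online phylogenetics portal that supports many of the major phylogenetic and population genetic programs (RAxML; MrBayes; BEAST; BEAST2; GARLI; MAFFT; ExaBayes; DPPDIV; FastTree, jModelTest2, ModelTest-NG, PAUP, ParallelStructure, PartitionFinder2, IQ Tree, Bali-Phy, and Migrate-N), all configured to support parallel computation.

  • MABL: A niche online phylogenetics tool.

  • GBlocks: An online tool for selecting conserved blocks.

  • FindModel: A tool for selecting nucleotide substitution models.

  • bPTP: An online species delimitation tool. It processes a phylogenetic tree with the Poisson tree processes model to infer a reasonable number of species on the tree.

  • GMYC: Similar in function to the previous tool, but based on the generalized mixed Yule coalescent.

  • RAxML BlackBox: An online RAxML tool.

  • IQ-TREE: The online tool provided by the software's own developers.

  • HIV Database Tools: A collection of online tools aimed mainly at HIV data, though it also includes some general-purpose tools.

  • iTOL: An online tree visualization tool that can annotate trees with many kinds of data.

  • EvolView: An online tool similar to iTOL, developed by Chinese researchers.

  • VizioMetrics: Not a data analysis tool. It searches for the SCI article a figure or table came from, based on its caption.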

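11. Software collections

  • PhyloSuite: Although it is a single program rather than a collection, it is the most comprehensive integrated phylogenetics workflow tool to date, covering sequence download, alignment, model selection, and tree building. Above all, it supports batch workflows: once configured, they run with one click, so phylogenetic analysis no longer means tediously hopping between programs.
  • ExPASy Bioinformatics Resource Portal
  • Phylogeny Programs pages: A collection of phylogenetics software.
  • 生物软件网 (a Chinese biological software site)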