Finding and testing hybridization events in Fragaria species history
Overview
This protocol was developed and employed for studying phylogeny of di- and polyploid Fragaria species using multi-locus sequence data obtained using Hyb-Seq approach (Kamneva et al. In revision) but can be applied to any group of organisms. It was developed on Mac OS but will probably work just as well in any Unix based environment.This tutorial is still work in progress. Please feel free to contact me if you have questions, suggestions, need help or something does not work.
Custom scripts supplied with the protocol are not guarantied to work. Here is the license statement [LICENSE].
Objectives
Sample data
Please download testing_hybridizations_input.tar.gz archive, that contains:Programs
Make sure that you have all of those programs installed on your computer, working and their location is added to the path. Please cite these programs if using in your research:Main steps of the protocol
Find candidate hybridizations using ILS-agnostic consensus network approach
Rscript testing_hybridizations_scripts/R_consensus_summary.r \
testing_hybridizations_input.txt \
testing_hybridizations_scripts/R_functions.R \
5 \
0.30 \
test_hybridizations_consensus_out
Test this hybridization using ILS-aware approach in PhyloNet
Rscript testing_hybridizations_scripts/R_phylonet.r \
testing_hybridizations_input.txt \
chiloensis,vesca,iinumae,Drymocallis \
test_hybridization \
12345 \
commands_phylonet \
10
Parameter files are set up to calculate likelihood of 3 elternative scenarios of species evolution. One with hybridization and two others without:
export PATH_PHYLONET=~/Downloads/
chmod 777 ./commands_phylonet
./commands_phylonet
They can be used to compute AIC of the 3 different evolutionary scenarios and select the one which seens to fit data (set of gene trees) best.
olga@strawbery$ tail -n 3 test_hybridization_*_phylonet.out
==> test_hybridization_1_phylonet.out <==
Species Network:
((((chiloensis:0.08177960134575445)#H1:0.0011774181844964955::0.7889362664736876,
iinumae:3.2322961834617074):0.21342487771271856,(vesca:0.03756955801084463,
#H1:0.9128026176817999::0.21106373352631236):0.8646947680550517):5.9063434149541605,
Drymocallis:1.0);
Total log probability: -4976.799536608153
==> test_hybridization_2_phylonet.out <==
Species Network:
(Drymocallis:1.0,(iinumae:3.299155992176229,(vesca:0.3672750518888784,chiloensis:0.13941635790686432)
:0.0011774181844964955):5.905554510350227);
Total log probability: -5220.087740719886
==> test_hybridization_3_phylonet.out <==
Species Network:
(Drymocallis:1.0,(vesca:0.35931455092773623,(iinumae:3.27864301747052,
chiloensis:0.05952780259580878):0.14214091413189794):5.904547370827603);
Total log probability: -5173.5750126386065
olga@strawbery$ R
> 2*9--4976.799536608153
[1] 4994.8 ## AIC for Scenario 1
> 2*5--5173.5750126386065
[1] 5183.575 ## AIC for Scenario 2
> 2*5--5220.087740719886
[1] 5230.088 ## AIC for Scenario 3