Difference between revisions of "Systematics Seminar"

From EEBedia
Jump to: navigation, search
(Monday, 20 October 2014)
(89 intermediate revisions by 16 users not shown)
Line 1: Line 1:
 
This is the home page of the UConn EEB department's Systematics Seminar (EEB 6486). This is a graduate seminar devoted to issues of interest to graduate students and faculty who make up the systematics program at the University of Connecticut.  
 
This is the home page of the UConn EEB department's Systematics Seminar (EEB 6486). This is a graduate seminar devoted to issues of interest to graduate students and faculty who make up the systematics program at the University of Connecticut.  
 +
<br><br>
 +
'''Seminar Format:''' Registered students be prepared to lead discussions, perhaps more than once depending on the number of participants.
  
[[Systematics Listserv|Click here for information about joining and using the Systematics email list]]
+
The leader(s) will be responsible both for (1) selection of readings, (2) announcing the selection, (3) an introductory presentation, (4) driving discussion and (5) setting up and putting away the projector. 
  
== Meeting time and place ==
+
'''Readings:''' In consultation with the instructors, each leader should assign one primary paper for discussion and up to two other ancillary papers or resources.  The readings should be posted to EEBedia at least 5 days in advance.
For the Fall 2014 semester, we are meeting in the '''Bamford Room (TLS 171B) Mondays 2:30-3:30pm'''
+
  
=== Topics ===
+
'''Announcing the reading:''' The leader should add an entry to the schedule (see below) by editing this page. There are two ways to create a link to the paper:
As the semester progresses, please feel free to add to this running list of sources of systematic error, tests for that error, methods to account for that error, and relevant literature for that error.
+
  
{| border="1" cellpadding="1"
+
1. If the paper is available online through our library, it is sufficient to create a link to the DOI:
!style="background:#C0C0C0;" width="250"|Systematic Error
+
<nowiki>:[http://dx.doi.org/10.1093/sysbio/syv041 Doyle et al. 2015. Syst. Biol. 64:824-837.]</nowiki>
!style="background:#C0C0C0;" width="350"|Tests for Systematic Error
+
In this case, you need not give all the citation details because the DOI should always be sufficient to find the paper. The colon (:) at the beginning of the link causes the link to be indented an placed on a separate line. Note that the DOI is in the form of a URL, starting with <code><nowiki>http://dx.doi.org/</nowiki></code>. Here is how the above link looks embedded in this EEBedia page:
!style="background:#C0C0C0;" width="350"|Programs / Methods Accounting for Systematic Error
+
:[http://dx.doi.org/10.1093/sysbio/syv041 Doyle et al. 2015. Syst. Biol. 64:824-837.]
!style="background:#C0C0C0;" width="200"|Associated Literature
+
  
|-  
+
2. If the paper is not available through the library, upload a PDF of the paper to [http://dropbox.uconn.edu the UConn dropbox], being sure to use the secure version so that it can be password protected. Copy the URL provided by dropbox, and create a link to it as follows (see the [[Dropbox Test]] page for other examples):
 +
<nowiki>:[https://dropbox.uconn.edu/dropbox?n=SystBiol-2015-Doyle-824-37.pdf&p=ELPFIc5NtO3c4V44Ls Doyle et al. 2015.]</nowiki>
 +
In this case, you should provide a full citation to the paper for the benefit of those that visit the site long after the dropbox link has expired; however, the full details need not be part of the link text. Here is what this kind of link looks like embedded in this EEBedia page:
  
|| Nucleotide Composition Bias ||  || Include additional taxa, RY recoding ||
+
:[https://dropbox.uconn.edu/dropbox?n=SystBiol-2015-Doyle-824-37.pdf&p=ELPFIc5NtO3c4V44Ls Doyle et al. 2015.] Full citation: Vinson P. Doyle, Randee E. Young, Gavin J. P. Naylor, and Jeremy M. Brown. 2015. Can We Identify Genes with Increased Phylogenetic Reliability? Systematic Biology 64 (5): 824-837. doi:10.1093/sysbio/syv041
  
|-
+
If you have ancillary papers, upload those to the dropbox individually and create separate links.
  
|| Amino Acid Composition Bias ||  || Dayhoff recoding ||
+
Finally, send a note to the [[Systematics Listserv]] letting everyone know that a paper is available.
  
|-
+
'''Introductory PowerPoint/KeyNote Presentation:''' Introduce your topic with a 10- to 15-minute PowerPoint or KeyNote presentation.  Dedicate at least 2/3 of that time to placing the subject into the broader context of the subject areas/themes and at most 1/3 of it introducing paper, special definitions, taxa, methods, etc. Never exceed 15 minutes.  (For example, for a reading on figs and fig-wasps, broaden the scope to plant-herbivore co-evolution.).  Add images, include short movie clips, visit web resources, etc. to keep the presentation engaging.  Although your presentation should not be a review of the primary reading, showing key figures from the readings may be helpful (and appreciated).  You may also want to provide more detail and background about ancillary readings which likely have not been read by all.
  
|| Incomplete Lineage Sorting || || ||
+
'''Discussion:''' You are responsible for driving the discussion. Assume everyone in attendance has read the main paper. There are excellent suggestions for generating class discussions on Chris Elphick’s Current Topics in Conservation Biology course site.  See section under expectations.  
  
|-
+
Prepare 3-5 questions that you expect will spur discussion.  Ideally, you would distribute questions a day or two before our class meeting.
  
|| Horizontal Gene Transfer / Hybridization / Gene Flow ||  ||  ||
+
'''Projector:'''
 +
The presenter will be responsible for setting up the projector for each class session—you will need to get it from the EEB office, make sure you have appropriate adaptors and have it set up so that class can begin on schedule. Kathy has reserved the pink projector for our class. If you do not have a laptop, let Wagner know and he will bring his. (Nick McIntosh may also be able to provide a loaner.)
  
|-
+
[[Systematics Listserv|Click here for information about joining and using the Systematics email list]]
  
|| Among Site Rate Heterogeneity (ASRV) ||  ||  ||
+
== Meeting time and place ==
 
+
For the Fall 2015 semester, we are meeting in the '''Bamford Room (TLS 171B), Tuesdays 2:30-3:30pm'''
|-
+
 
+
|| Among Lineage Rate Heterogeneity (ALRV) ||  ||  ||
+
 
+
|-
+
 
+
|| Heterotachy ||  ||  ||
+
 
+
|-
+
 
+
|| Paralogy ||  ||  ||
+
 
+
|-
+
 
+
|| Functional Convergence in Proteins / Selection ||  ||  || [[:File:Parker_et_al_2013.pdf‎|Parker et al. 2013]], sequence convergence in echolocating bats and cetaceans
+
 
+
|-
+
 
+
|| Missing Data (?) ||  ||  || [[:File:Wiens_and_Moen_2008.pdf‎|Wiens and Moen 2008]], but see
+
[[:File:Lemmon_et_al_2009.pdf‎|Lemmon et al. 2009]]
+
 
+
|-
+
  
|| Taxon Sampling (?) ||  ||  ||
+
=== Tuesday, 1 September 2015, 3pm, Bamford Room (TLS 171b) ===
 +
At this meeting we discussed possible themes for this semester's seminar, and determined the meeting time. For starters, we will explore how to use RevBayes, and afterwards explore current topics such as new developments in comparative methods.
  
|-
+
=== Tuesday, 8 September 2015, 2:30pm, Bamford Room (TLS 171b) ===
 +
Suman will demonstrate RevBayes using a simulated data file and a RevBayes script that is pretty bare-bones. If you would to see or play with the script and data yourself beforehand, it is available at the link below:
 +
:[https://dropbox.uconn.edu/dropbox?n=RevBayes1.zip&p=WHwP8lS7EY10jpwMs RevBayes1] (simdata.nex plus test.Rev script)
  
|| Non-Independence of Sites ||  ||  ||
+
:[http://revbayes.github.io/about.html RevBayes website]
  
|-
+
A sample qsub script to run RevBayes on the BBC cluster:
  
|| Overly Restrictive Priors || || ||
+
  #$ -S /bin/bash
 +
  #$ -cwd
 +
#$ -m ea
 +
#$ -M suman.neupane@uconn.edu
 +
#$ -N TimeTree
 +
rb GTR_Gamma.nonclock.Rev
  
|-
+
If you don't mind staying logged in, you can type qlogin to get assigned to a compute node that is not currently busy and then just type
 +
rb GTR_Gamma.nonclock.Rev
 +
This is a good method to use if you just want to test RevBayes; you'll want to use Suman's qsub script for long jobs because you will not probably not want to stay logged in overnight. If you don't have an account on the cluster, you can get one by filling out this form: [http://bioinformatics.uconn.edu/contact-us/ http://bioinformatics.uconn.edu/contact-us/]
  
|| Sequencing Hardware Error ||  ||  ||
+
=== Tuesday, 15 September 2015, 2:30pm, Bamford Room (TLS 171b) ===
 +
This week we will explore the graphical model descriptions used by RevBayes. Here is the paper (password is being sent over the systematics email list):
 +
:[https://dropbox.uconn.edu/dropbox?n=Syst%20Biol-2014-H%F6hna-753-71.pdf&p=EWMqcsduO0GeULlr8Y Höhna et al. 2014. Probabilistic Graphical Model Representation in Phylogenetics. Systematic Biology. 63:753–771]
 +
You can also get the paper without requiring a password if you are on campus or connected via VPN using this link:
 +
:[http://dx.doi.org/10.1093/sysbio/syu039 Höhna et al. 2014. Probabilistic Graphical Model Representation in Phylogenetics. Systematic Biology. 63:753–771]
 +
Paul will also demonstrate how to use RevBayes on our bioinformatics cluster (probably the best way to run it, especially for long runs).
  
|}
+
=== Tuesday, 29 September 2015, 2:30pm, Bamford Room (TLS 171b) ===
 +
This week we'll talk about the new "Open Tree of Life", reading the paper, and maybe exploring the website (http://opentreeoflife.org/)
 +
:[http://www.pnas.org/content/early/2015/09/16/1423041112 Hinchliff et al. Synthesis of phylogeny and taxonomy into a comprehensive tree of life. PNAS. ]
  
=== Monday, 25 August 2014 ===
+
=== Tuesday, 6 October 2015, 2:30pm, Bamford Room (TLS 171b) ===
At this meeting we will discuss possible themes for this semester's seminar:
+
This week we'll indulge Elizabeth's interest in salamanders, and also talk about molecular dating
 +
:[http://sysbio.oxfordjournals.org/content/early/2015/09/18/sysbio.syv061.full.pdf+html Shen et al. Enlarged Multilocus Dataset Provides Surprisingly Younger Time of Origin for the Plethodontidae, the Largest Family of Salamanders. Sys. Bio. in press]
  
=== Monday, 1 September 2014 ===
+
=== Tuesday, 13 October 2015, 2:30pm, Bamford Room (TLS 171b) ===
Labor Day, no meeting
+
This week we'll take a look at a paper comparing small and large datasets when constructing trees. Also snakes!
 +
:[https://sararuane.files.wordpress.com/2013/12/authors-accepted-copy-not-typeset-or-proofed.pdf Ruane et al. Comparing species-tree estimation with large anchored phylogenomic and small Sanger-sequenced molecular datasets: An empirical study on Malagasy pseudoxyrhophiine snakes. BMC Evolutionary Biology. in press]
  
=== Monday, 8 September 2014 ===
+
EDIT here is the published version
For this meeting, please come with an example (or examples) of a source of systematic error in datasets, and a paper that attempts to address this source of systematic error. We will use these examples and papers as a basis for discussions in upcoming weeks.
+
:[http://www.biomedcentral.com/content/pdf/s12862-015-0503-1.pdf Ruane et al., 2015]
  
=== Monday, 15 September 2014 ===
+
=== Tuesday, 20 October 2015, 2:30pm, Bamford Room (TLS 171b) ===
Topic: An overview of potential systematic errors found in phylogenomic data sets
+
This week we'll jump back into an anchored phylogenomics dataset, and talk about (yet another) bird phylogeny.
:[[:File:Rodriguez-Ezpeleta et al 2007. SystBiol.pdf|Rodriguez-Ezpeleta et al. 2007]], Detecting and Overcoming Systematic Errors in Genome-Scale Phylogenies
+
:[http://www.nature.com/nature/journal/vaop/ncurrent/pdf/nature15697.pdf Prum et al. A comprehensive phylogeny of birds (Aves) using targeted next-generation DNA sequencing. Nature.]
 +
:[http://www.nature.com/nature/journal/vaop/ncurrent/pdf/nature15638.pdf Accompanying News and Views article.]
 +
:[http://www.allaboutbirds.org/earliest-beginnings-of-bird-evolution-brought-into-focus-with-new-dna-analysis/ Relevant blog post!]
  
=== Monday, 22 September 2014 ===
+
=== Tuesday, 27 October 2015, 2:30pm, Bamford Room (TLS 171b) ===
Topic: Coalescent versus Concatenation Methods and the Placement of Amborella as Sister to Water Lilies
+
Some back and forth commentaries!
:{{pdf|http://hydrodictyon.eeb.uconn.edu/courses/systematicsseminar/restricted/Syst%20Biol-2014-Xi-sysbio_syu055.pdf}} Xi et al. 2014
+
:[http://www.sciencemag.org/content/350/6257/171.1.full.pdf Liu & Edwards. Comment on "Statistical binning enables an accurate coalescent based estimation of the avian tree"]
 +
:[http://www.sciencemag.org/content/350/6257/171.2.full.pdf Mirarab et al. Response to Comment on “Statistical binning enables an accurate coalescent-based estimation of the avian tree”]
  
=== Monday, 29 September 2014 ===
+
=== Tuesday, 3 November 2015, 2:30pm, Bamford Room (TLS 171b) ===
Topic: David Swofford's presentation at the Frontiers in Phylogenetics Symposium, "Filtering and Partitioning Strategies for Phylogenomic Analyses", and SVDQuartets method from Chifman and Kukatko 2014 <br/>
+
Springer and Gatesy. 2016. The gene tree delusion. A critique of species tree methods. The reason that it is called the gene tree delusion rather than the species tree delusion is I think because they are in favor of concatenation.
 +
:[[File:Pdficon small.gif|link=https://dropbox.uconn.edu/dropbox?n=Springer%20and%20Gatesy.%202015.%20%20The%20gene%20tree%20delusion.%201-s2.0-S1055790315002225-main.pdf&p=WpXucXwcz06hxwMDO]] Springer and Gatesy. 2016. MPE 94: 1-33. ([http://dx.doi.org/10.1016/j.ympev.2015.07.018 doi:10.1016/j.ympev.2015.07.018])
  
:{{pdf|http://hydrodictyon.eeb.uconn.edu/courses/systematicsseminar/restricted/Chifman%20and%20Kubatko%20-%202014%20-%20Quartet%20Inference%20from%20SNP%20Data%20Under%20the%20Coalesce.pdf}} Chifman and Kubatko 2014
 
  
Optional (but a nice supplement to the paper above and also reviews most other species tree methods): Laura Kubatko talked about SVDQuartets in her lecture at the Woods Hole Molecular Evolution Workshop this past summer. Click on the link below, then click on "Slides (draft)" to download the PDF: the SVDQuartets explanation begins at slide 63.
+
=== Tuesday, 10 November 2015, 2:30pm, Bamford Room (TLS 171b) ===
  
[https://molevol.mbl.edu/index.php/Laura_Kubatko Kubato lecture]
+
:[[File:Pdficon small.gif|link=https://dropbox.uconn.edu/dropbox?n=Edwardsetal_DelusionResponse_MPE2015.pdf&p=Loeg95jdKojP726iL]]  Edwards et al. 2016.  Response to Gene Tree Delusion.  You can read it online [http://ezproxy.lib.uconn.edu/login?url=http://www.sciencedirect.com/science/article/pii/S1055790315003309 here].
  
Symposium talk recordings:
+
=== Tuesday, 17 November 2015, 2:30pm, Bamford Room (TLS 171b) ===
:Part 1  http://www.ustream.tv/recorded/52713111 <br/>
+
Suman and Paul will discuss the "concaterpillar" paper below (an oldie but goodie) unless someone writes to us before the end of the week with a different idea. The idea here is to find clusters of genes that can tolerate sharing a single tree topology. The password is being sent over the [http://hydrodictyon.eeb.uconn.edu/eebedia/index.php/Systematics_Listserv systematics listserv], but you can also download the paper using the DOI link, which should work if you are on campus or connected to the campus network via VPN.
:Part 2  http://www.ustream.tv/recorded/52716590 (The first half of Swofford's talk starts towards the end of this recording) <br/>
+
:Part 3  http://www.ustream.tv/recorded/52720049 (The second half Swofford's talk picks up at the beginning of this recording) <br/>
+
  
Symposium schedule and abstracts:
+
:[[File:Pdficon small.gif|link=https://dropbox.uconn.edu/dropbox?n=Systematic%20Biology%202008%20Leigh-2.pdf&p=EW4tWZx0xJC76FwvVj]]  Leigh J.W., Susko E., Baumgartner M., Roger A.J. 2008. Testing congruence in phylogenomic analysis. Systematic Biology. 57:104–115. [http://dx.doi.org/10.1080/10635150801910436 doi:10.1080/10635150801910436]
:{{pdf|http://hydrodictyon.eeb.uconn.edu/courses/systematicsseminar/restricted/2014FrontiersSymposiumSchedule.pdf}} 2014 Frontiers in Phylogenetics Symposium Schedule
+
:{{pdf|http://hydrodictyon.eeb.uconn.edu/courses/systematicsseminar/restricted/2014FrontiersSymposiumAbstracts.pdf}} 2014 Frontiers in Phylogenetics Symposium Abstracts
+
  
=== Monday, 6 October 2014 ===
+
==== Running concaterpillar on the bbcsrv3 cluster ====
Topic: Paul Lewis's presentation at Evolution 2014, "Bayesian estimation of phylogenetic information content and implications for site-stripping"
+
Make sure your data file names end in .seq (not .nex), then create the following qsub script (you can name this anything, but I'll assume you named it cpillar.sh):
:https://www.youtube.com/watch?v=kHa57G1imNY
+
  
Here is the paper referenced in Paul's talk:
+
#!/bin/bash
 +
#$ -S /bin/bash
 +
#$ -cwd
 +
#$ -m ea
 +
#$ -M your.name@uconn.edu
 +
#$ -N cpillar
 +
/opt/python/bin/python /common/opt/bioinformatics/concaterpillar/concaterpillar.py -m GTR -t
  
:{{pdf|http://hydrodictyon.eeb.uconn.edu/courses/systematicsseminar/restricted/Genome%20Biology%20and%20Evolution%202011%20Zhong.pdf}} Zhong B., Deusch O., Goremykin V.V., Penny D., Biggs P.J., Atherton R.A., Nikiforova S.V., Lockhart P.J. 2011. Systematic error in seed plant phylogenomics. Genome Biology and Evolution. 3:1340–1348.
+
'''Important:''' be sure your qsub script has unix line endings. This is only an issue if you created it on a Windows machine - you can use Notepad++ to change the line endings.
  
=== Monday, 13 October 2014 ===
+
(Note that you should change your.name@uconn.edu to your own email address. You can also change cpillar to a job name that makes sense for your analysis.)
  
:[[:File:Parker_et_al_2013.pdf‎|Parker et al. 2013]], sequence convergence in echolocating bats and cetaceans
+
Run concaterpillar by navigating to the directory containing your qsub script and your .seq files and typing
  
=== Monday, 20 October 2014 ===
+
qsub cpillar.sh
Topic: Hybridization/reticulate evolution
+
:{{pdf|http://hydrodictyon.eeb.uconn.edu/eebedia/images/d/dc/Cui_2013.pdf}} Cui, R., Schumer, M., Kruesi, K., Walter, R., Andolfatto, P., Rosenthal, G.G. 2013. Phylogenomics reveals extensive reticulate evolution in Xiphophorus fishes. Evolution. 67(8):2166-2179.
+
  
=== Monday, 27 October 2014 ===
+
==== Running the MPI (parallel) version ====
  
=== Monday, 3 November 2014 ===
+
If you have a lot of genes, you can make concaterpillar go faster by using more CPU slots on the cluster. Use the following qsub script for an MPI run that uses 8 slots:
  
=== Monday, 10 November 2014 ===
+
#!/bin/bash
 +
#$ -S /bin/bash
 +
#$ -cwd
 +
#$ -m ea
 +
#$ -M your.name@uconn.edu
 +
#$ -N cpillar
 +
#$ -pe orte 8
 +
mpirun -np 8 /opt/python/bin/python /common/opt/bioinformatics/concaterpillar/concaterpillar.py -c 8 -m GTR -t
  
=== Monday, 17 November 2014 ===
+
Here I've specified 8 slots. Note that the number 8 appears 3 times - whatever number you decide to use, make sure to use that same number in all 3 places!
  
=== Monday, 1 December 2014===
+
=== Tuesday, 30 November 2015, 2:30pm, Bamford Room (TLS 171b) ===
 +
[http://www.biomedcentral.com/content/pdf/1471-2164-16-S10-S2.pdf A comparative study of SVDquartets and other coalescent-based species tree estimation methods. Chou et al 2015]
  
== Past Systematics Seminars ==
+
== Past Seminars ==
 +
* [[Systematics Seminar Fall 2014|Fall 2014]]
 
* [[Systematics Seminar Fall 2013|Fall 2013]]
 
* [[Systematics Seminar Fall 2013|Fall 2013]]
 
* [[Systematics Seminar Spring 2012|Spring 2012]]
 
* [[Systematics Seminar Spring 2012|Spring 2012]]

Revision as of 19:24, 23 November 2015

This is the home page of the UConn EEB department's Systematics Seminar (EEB 6486). This is a graduate seminar devoted to issues of interest to graduate students and faculty who make up the systematics program at the University of Connecticut.

Seminar Format: Registered students be prepared to lead discussions, perhaps more than once depending on the number of participants.

The leader(s) will be responsible both for (1) selection of readings, (2) announcing the selection, (3) an introductory presentation, (4) driving discussion and (5) setting up and putting away the projector.

Readings: In consultation with the instructors, each leader should assign one primary paper for discussion and up to two other ancillary papers or resources. The readings should be posted to EEBedia at least 5 days in advance.

Announcing the reading: The leader should add an entry to the schedule (see below) by editing this page. There are two ways to create a link to the paper:

1. If the paper is available online through our library, it is sufficient to create a link to the DOI:

:[http://dx.doi.org/10.1093/sysbio/syv041 Doyle et al. 2015. Syst. Biol. 64:824-837.]

In this case, you need not give all the citation details because the DOI should always be sufficient to find the paper. The colon (:) at the beginning of the link causes the link to be indented an placed on a separate line. Note that the DOI is in the form of a URL, starting with http://dx.doi.org/. Here is how the above link looks embedded in this EEBedia page:

Doyle et al. 2015. Syst. Biol. 64:824-837.

2. If the paper is not available through the library, upload a PDF of the paper to the UConn dropbox, being sure to use the secure version so that it can be password protected. Copy the URL provided by dropbox, and create a link to it as follows (see the Dropbox Test page for other examples):

:[https://dropbox.uconn.edu/dropbox?n=SystBiol-2015-Doyle-824-37.pdf&p=ELPFIc5NtO3c4V44Ls Doyle et al. 2015.]

In this case, you should provide a full citation to the paper for the benefit of those that visit the site long after the dropbox link has expired; however, the full details need not be part of the link text. Here is what this kind of link looks like embedded in this EEBedia page:

Doyle et al. 2015. Full citation: Vinson P. Doyle, Randee E. Young, Gavin J. P. Naylor, and Jeremy M. Brown. 2015. Can We Identify Genes with Increased Phylogenetic Reliability? Systematic Biology 64 (5): 824-837. doi:10.1093/sysbio/syv041

If you have ancillary papers, upload those to the dropbox individually and create separate links.

Finally, send a note to the Systematics Listserv letting everyone know that a paper is available.

Introductory PowerPoint/KeyNote Presentation: Introduce your topic with a 10- to 15-minute PowerPoint or KeyNote presentation. Dedicate at least 2/3 of that time to placing the subject into the broader context of the subject areas/themes and at most 1/3 of it introducing paper, special definitions, taxa, methods, etc. Never exceed 15 minutes. (For example, for a reading on figs and fig-wasps, broaden the scope to plant-herbivore co-evolution.). Add images, include short movie clips, visit web resources, etc. to keep the presentation engaging. Although your presentation should not be a review of the primary reading, showing key figures from the readings may be helpful (and appreciated). You may also want to provide more detail and background about ancillary readings which likely have not been read by all.

Discussion: You are responsible for driving the discussion. Assume everyone in attendance has read the main paper. There are excellent suggestions for generating class discussions on Chris Elphick’s Current Topics in Conservation Biology course site. See section under expectations.

Prepare 3-5 questions that you expect will spur discussion. Ideally, you would distribute questions a day or two before our class meeting.

Projector: The presenter will be responsible for setting up the projector for each class session—you will need to get it from the EEB office, make sure you have appropriate adaptors and have it set up so that class can begin on schedule. Kathy has reserved the pink projector for our class. If you do not have a laptop, let Wagner know and he will bring his. (Nick McIntosh may also be able to provide a loaner.)

Click here for information about joining and using the Systematics email list

Meeting time and place

For the Fall 2015 semester, we are meeting in the Bamford Room (TLS 171B), Tuesdays 2:30-3:30pm

Tuesday, 1 September 2015, 3pm, Bamford Room (TLS 171b)

At this meeting we discussed possible themes for this semester's seminar, and determined the meeting time. For starters, we will explore how to use RevBayes, and afterwards explore current topics such as new developments in comparative methods.

Tuesday, 8 September 2015, 2:30pm, Bamford Room (TLS 171b)

Suman will demonstrate RevBayes using a simulated data file and a RevBayes script that is pretty bare-bones. If you would to see or play with the script and data yourself beforehand, it is available at the link below:

RevBayes1 (simdata.nex plus test.Rev script)
RevBayes website

A sample qsub script to run RevBayes on the BBC cluster:

#$ -S /bin/bash
#$ -cwd
#$ -m ea
#$ -M suman.neupane@uconn.edu
#$ -N TimeTree
rb GTR_Gamma.nonclock.Rev

If you don't mind staying logged in, you can type qlogin to get assigned to a compute node that is not currently busy and then just type

rb GTR_Gamma.nonclock.Rev

This is a good method to use if you just want to test RevBayes; you'll want to use Suman's qsub script for long jobs because you will not probably not want to stay logged in overnight. If you don't have an account on the cluster, you can get one by filling out this form: http://bioinformatics.uconn.edu/contact-us/

Tuesday, 15 September 2015, 2:30pm, Bamford Room (TLS 171b)

This week we will explore the graphical model descriptions used by RevBayes. Here is the paper (password is being sent over the systematics email list):

Höhna et al. 2014. Probabilistic Graphical Model Representation in Phylogenetics. Systematic Biology. 63:753–771

You can also get the paper without requiring a password if you are on campus or connected via VPN using this link:

Höhna et al. 2014. Probabilistic Graphical Model Representation in Phylogenetics. Systematic Biology. 63:753–771

Paul will also demonstrate how to use RevBayes on our bioinformatics cluster (probably the best way to run it, especially for long runs).

Tuesday, 29 September 2015, 2:30pm, Bamford Room (TLS 171b)

This week we'll talk about the new "Open Tree of Life", reading the paper, and maybe exploring the website (http://opentreeoflife.org/)

Hinchliff et al. Synthesis of phylogeny and taxonomy into a comprehensive tree of life. PNAS.

Tuesday, 6 October 2015, 2:30pm, Bamford Room (TLS 171b)

This week we'll indulge Elizabeth's interest in salamanders, and also talk about molecular dating

Shen et al. Enlarged Multilocus Dataset Provides Surprisingly Younger Time of Origin for the Plethodontidae, the Largest Family of Salamanders. Sys. Bio. in press

Tuesday, 13 October 2015, 2:30pm, Bamford Room (TLS 171b)

This week we'll take a look at a paper comparing small and large datasets when constructing trees. Also snakes!

Ruane et al. Comparing species-tree estimation with large anchored phylogenomic and small Sanger-sequenced molecular datasets: An empirical study on Malagasy pseudoxyrhophiine snakes. BMC Evolutionary Biology. in press

EDIT here is the published version

Ruane et al., 2015

Tuesday, 20 October 2015, 2:30pm, Bamford Room (TLS 171b)

This week we'll jump back into an anchored phylogenomics dataset, and talk about (yet another) bird phylogeny.

Prum et al. A comprehensive phylogeny of birds (Aves) using targeted next-generation DNA sequencing. Nature.
Accompanying News and Views article.
Relevant blog post!

Tuesday, 27 October 2015, 2:30pm, Bamford Room (TLS 171b)

Some back and forth commentaries!

Liu & Edwards. Comment on "Statistical binning enables an accurate coalescent based estimation of the avian tree"
Mirarab et al. Response to Comment on “Statistical binning enables an accurate coalescent-based estimation of the avian tree”

Tuesday, 3 November 2015, 2:30pm, Bamford Room (TLS 171b)

Springer and Gatesy. 2016. The gene tree delusion. A critique of species tree methods. The reason that it is called the gene tree delusion rather than the species tree delusion is I think because they are in favor of concatenation.

Pdficon small.gif Springer and Gatesy. 2016. MPE 94: 1-33. (doi:10.1016/j.ympev.2015.07.018)


Tuesday, 10 November 2015, 2:30pm, Bamford Room (TLS 171b)

Pdficon small.gif Edwards et al. 2016. Response to Gene Tree Delusion. You can read it online here.

Tuesday, 17 November 2015, 2:30pm, Bamford Room (TLS 171b)

Suman and Paul will discuss the "concaterpillar" paper below (an oldie but goodie) unless someone writes to us before the end of the week with a different idea. The idea here is to find clusters of genes that can tolerate sharing a single tree topology. The password is being sent over the systematics listserv, but you can also download the paper using the DOI link, which should work if you are on campus or connected to the campus network via VPN.

Pdficon small.gif Leigh J.W., Susko E., Baumgartner M., Roger A.J. 2008. Testing congruence in phylogenomic analysis. Systematic Biology. 57:104–115. doi:10.1080/10635150801910436

Running concaterpillar on the bbcsrv3 cluster

Make sure your data file names end in .seq (not .nex), then create the following qsub script (you can name this anything, but I'll assume you named it cpillar.sh):

#!/bin/bash
#$ -S /bin/bash
#$ -cwd
#$ -m ea
#$ -M your.name@uconn.edu 
#$ -N cpillar
/opt/python/bin/python /common/opt/bioinformatics/concaterpillar/concaterpillar.py -m GTR -t

Important: be sure your qsub script has unix line endings. This is only an issue if you created it on a Windows machine - you can use Notepad++ to change the line endings.

(Note that you should change your.name@uconn.edu to your own email address. You can also change cpillar to a job name that makes sense for your analysis.)

Run concaterpillar by navigating to the directory containing your qsub script and your .seq files and typing

qsub cpillar.sh

Running the MPI (parallel) version

If you have a lot of genes, you can make concaterpillar go faster by using more CPU slots on the cluster. Use the following qsub script for an MPI run that uses 8 slots:

#!/bin/bash
#$ -S /bin/bash
#$ -cwd
#$ -m ea
#$ -M your.name@uconn.edu
#$ -N cpillar
#$ -pe orte 8
mpirun -np 8 /opt/python/bin/python /common/opt/bioinformatics/concaterpillar/concaterpillar.py -c 8 -m GTR -t

Here I've specified 8 slots. Note that the number 8 appears 3 times - whatever number you decide to use, make sure to use that same number in all 3 places!

Tuesday, 30 November 2015, 2:30pm, Bamford Room (TLS 171b)

A comparative study of SVDquartets and other coalescent-based species tree estimation methods. Chou et al 2015

Past Seminars