Phylogenetics: Bioinformatics Cluster

From EEBedia
Revision as of 01:10, 18 February 2007 by PaulLewis (Talk | contribs) (Obtaining the necessary communications software)

Jump to: navigation, search
Under construction.png This article is still under construction.
Expect it to change frequently until this notice is removed.
Adiantum.png EEB 349: Phylogenetics
The goal of this lab exercise is to show you how to use the Bioinformatics Facility computer cluster to run PAUP* and GARLI.

Part A: Using the UConn Bioinformatics Facility cluster

The Bioinformatics Facility is part of the UConn Biotechnology Center, which is located behind the Up-N-Atom Cafe in the lower level of the Biology/Physics building. Jeff Lary maintains a 17-node Apple Xserve G5 Cluster that can be used by UConn graduate students and faculty to conduct bioinformatics-related research (sequence analysis, biological database searches, phylogenetics, molecular evolution). You have each been given accounts on the cluster, and today you will learn how to start analyses remotely (i.e. from this computer lab), check on their status, and download the results when your analysis is finished.

Obtaining the necessary communications software

You will be using a couple of simple (and free) programs to communicate with the head node of the cluster. Visit the PuTTY web site, scroll down to the section labeled "Binaries" and save putty.exe and pscp.exe on your desktop.

PuTTY

The program PuTTY will allow you to communicate with the cluster using a protocol known as SSH (Secure Shell) that encrypts everything sent over the internet. You will use PuTTY to send commands to the cluster and see the output generated. In the old days, a protocol known as Telnet was used for this purpose, but it is no longer used because it did not encrypt anything, making it easy for someone with access to the network to snatch your password.

PSCP

The other program you will use is called PSCP. It allows you to transfer files back and forth using the SCP (Secure Copy) protocol. It replaces the old FTP protocol that, like Telnet, sends usernames and passwords unencrypted across the network.

Programs vs. protocols

SSH and SCP are protocols, not programs. PuTTY and PSCP are programs that implement the SSH and SCP protocols, respectively. In a little while from now, you may be thinking "I liked FTP much better than SCP!" because you are used to user-friendly, graphical FTP programs. There are much fancier programs for using SSH and SCP than PuTTY and PSCP, but these will serve us well today. The nice thing is that these programs are so small that you can just download them whenever and whereever you happen to need them. If you find yourself wanting a fancier SCP client, check out FileZilla (on Windows) or [1] (for Macs).

Logging in for the first time

On the whiteboard you will find your login id (user name) and password. The first step is to change your password. Be sure to remember your username and password for future labs.

Part B: Starting a PAUP* run on the cluster

Part C: Starting a GARLI run on the cluster