Using the Synteny Viewer with "Embedded link data"

Last updated 6/26/2006

By "embedded link" I mean that the synteny linking data is contained in the curation data set itself. This is true of both GAME and Chado JDBC data. In contrast, Ensembl and GFF load there linking data from a separate source, be it a database or file. Below is described working with GAME and after that Chado JDBC.

GAME

Here are the instructions for trying the GAME synteny viewer. Please note that until the synteny data becomes public, you will only be able to look at the example that is included with the Apollo distribution. Also be aware that the synteny viewer is still in active development, so there are still many bugs. You should see better performance in the next release.

When you start apollo, choose the data adapter called "Synteny". If you have the version of the apollo.cfg that came with the Apollo public release 1.4.2, this should give you the GAME synteny choices of Drosophila data, rather than the GFF synteny choices for Ensembl data.

You can choose "Load two species" and then under file, ask to browse to find the example GAME files showing a syntenic region between Drosophila melanogaster (Dmel) and Drosophila pseudoobscura (Dpse). Please note that this is only an example, and is not official FlyBase pseudoobscura data. For now, you will only be able to view this one example; complete data will be released at a later date.

The Synteny data loader should show the following example files:

Species 1:
data/example.xml

then hit the "Species 2" tab and select

Species 2:
data/dpse-example.xml

The GAME synteny viewer will allow you to see comparative links between the Dmel and Dpse sequence, in salmon pink for the syntenic block data and in tan for the BLASTX data, comparing the r3.1 annotations to the genewise predictions in Dpse which have been promoted to annotations.

It is often easier to see the tan BLASTX results if you hide the Synteny results, either with the third mouse menu or the types panel. You can also hide links with the link popup menu(described below)

You can click on the section with the colorful polygons and with the third mouse button ask to see only +/+, ±, -/-, or -/+ strands. You can also use that menu to "home in on a selected link". The menu also lists all the linked tiers, that can be selected for visibility.

When you choose ±, you usually need to reverse complement the species that is displaying the - strand. So if you click in the - strand species, then under the "view" menu you can select "reverse complement". Clicking in either the top species or bottom species window makes that window active for most menu options at the top. You can see which species window is active under the "Synteny" menu.

Under the Synteny menu, you can lock scrolling or zooming. Using the types panel, under the tiers menu, you can see various types of comparative data.

Please note: the EnsJ/GFF synteny viewer is currently out of commission.

Here is a screenshot of GAME data in the synteny viewer:

Advanced configuration

If you have GAME data from two species that you would like displayed in synteny it's a bit involved. Here is what you need to know:

Chado JDBC Data

The Chado JDBC data adapter can support embedded link data as well (as of June 2006). Like GAME synteny it is configured in the tiers file. Here is a Type from the tiers file configured for jdbc synteny from conf/paramecium.tiers:
[Type]
tiername : Gene Prediction
typename : Gaze
resulttype : Gaze
.....
synteny_link_type: SELF
synteny_link_level: PARENT
synteny_link_match_on: ID

SELF says that it links to features of its same type (just as in Game). synteny_link_level PARENT says to link features at the parent level not at the child/leaf level. For instance for a gene prediction predicting exons and transcripts, PARENT says to link the predicted transcripts, not exons. CHILD (which I believe is the default) would cause the exons to link. synteny_link_match_on: ID says that the linking actually happens via IDs. The Jdbc reader actually reads in the ids of the 2 mathcing features and stores them with each feature in the apollo datamodel. This contrasts from game where the id information is lost and features find each other via RANGE.

The jdbc adapter will query chado for this kind of relationship looking for a feature_relationship between the 2 features. I believe the type of this relationship is one of the new syntenic that were recently discussed on the gmod schema list.