These new databases still provide sequence information on isolated genes,
but now bring in the additional capacity of showing them
in relation to each
other
within the global framework of an entire genome.
The best way to interact with these databases is to zoom in and out until
you reach the level of detail you need for the question you’re interested in.
Genome-centric databases also provide the easiest way to gather every
gene/protein sequence of a given organism at once, with only a few mouse
clicks.
We start our exploration of whole genome databases by taking a peek at the
Viral Genome section available on the NCBI server.
Working with complete viral genomes
Viruses are fascinating objects, on the edge of the living world. They function
as minimal molecular bits of machinery, cleverly designed to ensure the mul-
tiplication of nucleic-acid molecules (the
viral genome) at the expense of cel-
lular hosts (eukaryotic, bacterial, or archaebacterial). While going about
their business, viruses might go unnoticed — or trigger dreadful diseases and
epidemics, such as smallpox, poliomyelitis, or AIDS. Viral genomes come in
all varieties of biochemical forms (RNA/DNA, circular/linear, single/double
stranded) and size (a few kb to a million bp).
Visiting the NCBI Viral Genome resource is a great way to find out more about
them. To demonstrate, we show you what you can find out about the dreadful
AIDS-causing, type-1 human immunodeficiency virus, or HIV-1.
1. Point your browser to www.ncbi.nlm.nih.gov/entrez/.
You’ve probably memorized this address by now. The NCBI PubMed
home page appears.
2. On the black menu bar at the top of the form, click Genome.
This takes you to the Entrez Genome page.
3. Click the Viruses link (on the right side of the form).
The Viral Genomes reference page appears.
4. Scroll down the Viral Reference Genomes page until you reach the
table of available viral-genome sequences grouped by class
(Deltavirus, Retroid viruses, and so on), as shown in Figure 3-13.
5. Type
HIV1 in the Search window, and then click the Find button.
Your browser returns a nice global summary of the HIV-1 genome, as
shown in Figure 3-14. At the bottom, a clickable picture indicates the
identity and respective positions of all the genes.
89
Chapter 3: Using Nucleotide Sequence Databases