NCBI FTP
NCBI Home EBI Databases EBI Downloads

Practical lesson 1

Accessing GenBank and EMBL databases by FTP and the WWW.

By Juan Carlos Sanchez CNB, CSIC.

You will get the genomic sequence of Mycoplasma pneumoniae, from the GenBank and EMBL nucleotide databases, in FASTA, GenBank and EMBL formats, as well as a file with the sequences of all proteins codified in the Mycoplasma pneumoniae genome. You will access the databases both using FTP and through a WWW server.

For comparison, you will also retrieve another version of the Mycoplasma pneumoniae genome that has been curated by NCBI staff, and that, therefore, is not part of GenBank. In doing so, you will visualize examples of different data formats and entry identifiers, and you will surf a little bit through the NCBI WWW server.



A. Accessing the NCBI server by FTP

By command line.

A variation to compare GenBank versus NCBI's curated databases

Using a WWW Browser.



B. Accessing the NCBI server through the WWW.

C. Accessing the EMBL nucleotide database through the WWW.

D. Accessing the EMBL nucleotide database by FTP (using a WWW browser).
  • Open Netscape or a similar browser and make a connection to the EMBL databases server, at the address: http://www.ebi.ac.uk/Databases/index.html
  • Click on the tab labeled as Downloads (on the upper part); then on the link to Databases (in the table).
  • You will be in the main directory of the EBI databases FTP site. You can use the browser to travel through the directories. Take some time to have a look and check what you can find.
  • Follow the path to: genomes/bacteria/mpneumoniae/U00089.embl

  • February 2007
    Manuel J. Gómez
    updated by Juan Carlos Sanchez
    Grupo de Diseño de Proteínas
    Centro Nacional de Biotecnología, CSIC