Browsing precomputed genomes
cPSORTdb is designed to allow convenient retrieval of PSORTb predictions by genomes. Newly sequenced genomes from NCBI are automatically analyzed by PSORTb v.3 and PSORTb v.2, and the results are stored into cPSORTdb. You can access the data by typing in the first few letters of your genome of interest and select from the list of genomes from the drop-down list.
Genome summary page
When you select a genome name from the precomputed genomes page or from general search result, you will come to the genome summary page which lists the following information for the genome:
- A: Predicted localization break down - a pie chart showing the percentage of proteins predicted to reside in each of the possible localizations for the listed genome
- B: Basic information for the organism - taxonomic information, organism type and number of predicted genes in the genome
- C: Prediction results - number of proteins within the genome predicted to be localized to each of the possible localizations for the organism type. Each of the localization label links to a list of all the proteins predicted to reside in that localization. Both PSORTb v.3 and PSORTb v.2 predictions are available for all genomes.
- D: Download Predictions - click this button to download a tab-delimited version fo prediction results for all the proteins in this genome.
- A: Database selection - allows user to choose between ePSORTdb and cPSORTdb.
- B: Drop down menu - selects a field to search against.
- C: Drop down menu of selections or autocomplete search input text field - allows user to start entering keywords and will auto-suggest keywords most likely to match the letters entered.
- D: Add or remove button - allows user to input additional search criteria to narrow down searches.
Database selection: Searches can be carried out against ePSORTdb and cPSORTdb(A). In the latter case, the search is carried out by default against the predictions from PSORTb v.3.0 (the most recent version of PSORTb).
Flexible database browsing options: PSORTdb can be browsed by selecting one or more of the search categories as listed in the table below. When you choose a particular field from the drop down menu list (B), a drop down menu of possible values appropriate to this field, or a text field with autocomplete capabilities (C) appear on the right, letting you know the correct keywords or the correct syntax to be used. You can then either highlight keywords from the drop down menu, or you may type your own keywords into box (D), and then choose the appropriate taxnomic name for the organism or organism group you are looking for.
The following table lists the possible searching criteria used for browsing PSORTdb:
|Field name||Possible values|
Archaea / Gram-positive bacteria: Cytoplasmic, Cytoplasmic membrane, Cell wall, Extracellular
Gram-negative: Cytoplasmic, Cytoplasmic membrane, Periplasmic, Outer membrane, Extracellular
Gram-positive with outer membrane: same as Gram-negative
Gram-negative with no outer membrane: Cytoplasmic, Cytoplasmic membrane, Extracellular
|Secondary localization||Flagellar, Fimbrial, Host-associated, T3SS, Spore, Toga, S-layer|
|Domain||Archaea or Bacteria|
|Protein name||Name of protein|
|Gram stain||Gram-positive or Gram-negative|
|Organism name||Name of organism as listed in NCBI completed microbial genomes|
|Phylum name||Name of phylum from the listed of NCBI completed microbial genomes|
|Class name||Name of phylum from the listed of NCBI completed microbial genomes|
|Order name||Name of order from the listed of NCBI completed microbial genomes|
|Family name||Name of family from the listed of NCBI completed microbial genomes|
|Genus name||Name of genus from the listed of NCBI completed microbial genomes|
Text search results
A brief explanation of columns from left to right:
- GI - Genbank ID of each protein
- Name - protein name with a hyperlink to protein information and details of PSORTb prediction results
- Organism - organism in which the protein belongs to. Clicking on the organism name will lead to PSORTb prediction summary page for that particular genome.
- Localization - localization rediction results made by PSORTb.
- Score - confidence score of PSORTb's subcellular localization prediction result.
Clicking on one the column header will allow the results to be sorted by that particular column. Results of browsing and keyword searches can be viewed page by page. Furthermore, from the results list the user can download the result list by clicking on the "Download predictions" button on the top right corner.
Both ePSORTdb and cPSORTdb datasets can be searched using the BLASTP program. One or more proteins can be submitted at the same time, these must be in FASTA format. A sequence with a FASTA sequence file consists of three parts:
- A title line, which must begin with a '>' symbol, and may be followed by any type of text
- A new line character at the end of the title line
- The sequence itself, which continues until the end of file or the next `>' is reached
An example of FASTA format is shown below:
>sp|O52956|A85A_MYCAV Antigen 85-A precursor (85A) MTLVDRLRGAVAGMPRRLVVGAAGAALLSGLIGAVGGSATAGAFSRPGLPVEYLQVPSAAMG RDIKVQFQSGGANSPALYLLDGMRAQDDFNGWDINTPAFEWYNQSGISVAMPVGGQSSFYSD WYKPACGKAGCTTYKWETFLTSELPQYLSAQKQVKPTGSGVVGLSMAGSSALILAAYHPDQF VYAGSLSALLDPSQGMGPSLIGLAMGDAGGYKAADMWGPKEDPAWARNDPSLQVGKLVANNT RIWVYCGNGKPSDLGGDNLPAKFLEGFVRTSNLKFQDAYNGAGGHNAVWNFDANGTHDWPYW GAQLQAMKPDLQSVLGATPGAGPATAAATNAGNGQGT
For more information, see the description at NCBI.
BLAST search results
Results of a BLAST search are presented through the standard BLASTP layout, displaying the retrieved proteins with their associated parameters.
Summary of the retrieved proteins with their Score and E-value. Clicking on the name of the protein (e.g.anitgen 85-A ) will take you to the complete PSORTdb entry of the protein. Click on "BLAST Alignment" to view the seqeuence alignment result of your query protein to the listed BLAST hit. Selecting the genome name will lead you to the summary genome page. to the detailed results of the BLAST search. You may also click the "BLAST Report" tab on the top right corner to view the plain text version of the BLAST results.
Downloading data from PSORTdb
From the Downloads tab, the user can choose to download any of the following:
- For individural prokaryotic genomes, a FASTA file of all predicted proteins in the genome, as well as PSORTb version 3.0 and version 2.0 prediction results for each genome.
- Complete ePSORTdb and cPSORTdb dataset (for both version 3.0 and 2.0).
We're always looking for new proteins to add to our database!
If you think you've got a good candidate, please submit it to us!
Interested in hearing about our latest updates? Enter your email below to subscribe to our mailing list!