Cross-references
Last modified February 17, 2011
This section is used to point to information related to entries and found in data collections other than UniProtKB.
The full list of all databases to which UniProtKB is cross-referenced can be found in the document Databases cross-referenced in UniProtKB. It also contains a list of references describing these databases and provides a link to the web sites.
List of categories of the databases cross-referenced in UniProtKB
- 2D gel databases
- 3D structure databases
- Enzyme and pathway databases
- Family and domain databases
- Gene expression databases
- Genome annotation databases
- Ontologies
- Organism-specific databases
- Phylogenomic databases
- Polymorphism databases
- Proteomic databases
- Protein-protein interaction databases
- Protein family/group databases
- PTM databases
- Sequence databases
- Other
Cross-references to ‘Sequence databases’
If a sequence submitted to EMBL/GenBank/DDBJ present some discrepancies with the canonical sequence, these latter are documented in the ‘Sequence conflict’ subsection of the ‘Sequence annotation (Features)’ section. However, if these discrepancies are severe, the underlying cross-reference to EMBL/GenBank/DDBJ is tagged.
Four different tags are used to mark problematic sequences submitted to the EMBL/GenBank/DDBJ databases:
- ‘Frameshift’: discrepancies are due to the insertion or deletion of one or more nucleotides in the submitted nucleotide sequence.
Example: O14467
- ‘Different initiation’: discrepancies are due to an erroneous initiation codon choice in the submitted sequence.
Example: O14467
- ‘Different termination’: the termination codon of the submitted sequence differs from that of the sequence displayed
Example: P59263
- ‘Sequence problems’: discrepancies are due to an erroneous gene model prediction, erroneous ORF assignement, miscellaneous discrepancy, etc.
Example: Q7XR80, P76164, Q8TBF5
The discrepancies are described in more details in the ‘Sequence Caution’ section.
If no CoDing Sequence (CDS) has been annotated in a submitted nucleotide sequence (most frequently genomic sequence), the corresponding cross-reference is tagged with the label: ‘No translation available’.
Example: B2RU33
