UniProt release 9.5
Published January 23, 2007
Headlines
Reintroduction of the initiator methionine
In UniProtKB/Swiss-Prot, the sequence data corresponds to the precursor form of a protein, i.e. before post-translational modifications such as cleavage of the signal peptide or other processing. However, for historical reasons, a notable exception was made: when the initiator methionine was post-translationally removed, the sequence stored in UniProtKB/Swiss-Prot did not include the methionine and instead started with the second residue.
As a consequence, our sequence data differed from that shown in other sequence databases where the initiator methionine is usually not removed. This discrepancy was confusing for users and was the subject of one of the most frequently asked questions to UniProtKB/Swiss-Prot.
This is no longer the case. With this release, all initiator methionines have been reintroduced to the UniProtKB/Swiss-Prot entries (over 10'000) from which it is cleaved. This caused a major change, since all amino acid positions described in these entries have now been updated to reflect the new sequence numbering.
The cleavage of the initiator methionine is still indicated by the INIT_MET line in the feature table but the sequence position is 1 instead of 0. We also added the comment Removed in the description field of INIT_MET line to indicate that the initiator methionine is indeed removed post-translationally.
Example P51487:
Previous format:
FT INIT_MET 0 0
FT CHAIN 1 400 Phosrestin-1.
...
SQ SEQUENCE 400 AA; 44781 MW; DA786D7E9FFB4A29 CRC64;
VVSVKVFKK ATPNGKVTFY LGRRHFIDHF DYIDPVDGVI VVDPDYLKNR KVFAQLATIY
New format:
FT INIT_MET 1 1 Removed.
FT CHAIN 2 401 Phosrestin-1.
...
SQ SEQUENCE 401 AA; 44912 MW; 1212C2422CD35A94 CRC64;
MVVSVKVFKK ATPNGKVTFY LGRRHFIDHF DYIDPVDGVI VVDPDYLKNR KVFAQLATIY
