Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Protein SFI1 homolog

Gene

Sfi1

Organism
Mus musculus (Mouse)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Experimental evidence at transcript leveli

Functioni

Plays a role in the dynamic structure of centrosome-associated contractile fibers via its interaction with CETN2.By similarity

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Enzyme and pathway databases

ReactomeiR-MMU-2565942. Regulation of PLK1 Activity at G2/M Transition.
R-MMU-380259. Loss of Nlp from mitotic centrosomes.
R-MMU-380270. Recruitment of mitotic centrosome proteins and complexes.
R-MMU-5620912. Anchoring of the basal body to the plasma membrane.
R-MMU-8854518. AURKA Activation by TPX2.

Names & Taxonomyi

Protein namesi
Recommended name:
Protein SFI1 homolog
Gene namesi
Name:Sfi1
Synonyms:Kiaa0542
OrganismiMus musculus (Mouse)
Taxonomic identifieri10090 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus
Proteomesi
  • UP000000589 Componenti: Chromosome 11

Organism-specific databases

MGIiMGI:1926137. Sfi1.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Cytoplasm, Cytoskeleton

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00003346221 – 1216Protein SFI1 homologAdd BLAST1216

Proteomic databases

PaxDbiQ3UZY0.
PRIDEiQ3UZY0.

PTM databases

iPTMnetiQ3UZY0.
PhosphoSitePlusiQ3UZY0.

Expressioni

Gene expression databases

BgeeiENSMUSG00000023764.
ExpressionAtlasiQ3UZY0. baseline and differential.
GenevisibleiQ3UZY0. MM.

Interactioni

Subunit structurei

Interacts with CETN2 (via C-terminus).By similarity

GO - Molecular functioni

Protein-protein interaction databases

STRINGi10090.ENSMUSP00000080066.

Structurei

3D structure databases

ProteinModelPortaliQ3UZY0.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Repeati114 – 146HAT 1Add BLAST33
Repeati148 – 177HAT 2Add BLAST30
Repeati246 – 278HAT 3Add BLAST33
Repeati375 – 407HAT 5Add BLAST33
Repeati1122 – 1154HAT 6Add BLAST33

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni87 – 106Interaction with CETN2By similarityAdd BLAST20
Regioni451 – 470Interaction with CETN2By similarityAdd BLAST20
Regioni617 – 636Interaction with CETN2By similarityAdd BLAST20

Domaini

CETN2-binding regions contains a conserved Trp residue in their C-terminal ends, which seems critical for interaction with CETN2.

Sequence similaritiesi

Belongs to the SFI1 family.Curated
Contains 6 HAT repeats.Curated

Keywords - Domaini

Repeat

Phylogenomic databases

eggNOGiKOG4775. Eukaryota.
ENOG4111JHQ. LUCA.
GeneTreeiENSGT00390000018445.
HOVERGENiHBG108479.
InParanoidiQ3UZY0.
KOiK16489.
OMAiARWKVHH.
OrthoDBiEOG091G0EXO.
PhylomeDBiQ3UZY0.
TreeFamiTF328940.

Family and domain databases

InterProiIPR030516. SFI1.
[Graphical view]
PANTHERiPTHR22028:SF4. PTHR22028:SF4. 1 hit.

Sequences (5)i

Sequence statusi: Complete.

This entry describes 5 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q3UZY0-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MEKKIGSRSF RDGVVKKPCS PKTLPLKKSS AFSGIQREPS RSCHSIYYHA
60 70 80 90 100
SQNWTRYRLQ ELRIRCVARK FLYLWIRVTF GRVTPSRARI FHEQKILQKV
110 120 130 140 150
FGEWREEWWV SQREWKLCVR ADCHYRYYLY NLIFQNWKTF VHQQREMRKR
160 170 180 190 200
FRIAEHHDTK QKMCQAWKSW LIYMVSRRTK LHMKTTALEF RRQSVLCFWW
210 220 230 240 250
SKWRWRLGQA HAEHALHAVA VKHRALSLQL QGWLRWQEQL LISQRDRRKE
260 270 280 290 300
ATAVQHYQHW QKQRSLKAWL KYLQICRVKR WQNEMAVQFH RATVLQIHFC
310 320 330 340 350
DWQWAWEWRQ SLSAHQALVV KLAGRMVLRR AFTHWKHYML LQAEEAAQRE
360 370 380 390 400
AAAEHRQHYL LYSCFRAFKD NVTQARLQQT RKKLAQQLRD TTLLHRFWNL
410 420 430 440 450
WQSRIEQREE RVQTPSLHAA LSHYRVTVLH KCVRVWLRYV HKRQWQQLLR
460 470 480 490 500
ARADGHFQQR ALPAAFYTWY RGWLWHQQRR ILHTKAVRFH RGTLEKQVFA
510 520 530 540 550
LWRQKMSQHR ENCLAERMAI LQAEQQLLRR FWFVWHQQAA VCQLERQQQA
560 570 580 590 600
MAIAHHHSGL LRRAFCIWKE STQGFRIERM GRAQAAHFHS AQLLSRAWSM
610 620 630 640 650
WRECLALRLE EQQKLKCAAL HSQCILLRRA LQKWLVYQNR VRSVLREVAA
660 670 680 690 700
RERQHNRQLL WWALHLWREN TMARLDGAKK TSQARVHYSR TLCSKVLVQW
710 720 730 740 750
REVTSVQIYY RQKEAAALRE ARKALDRGRL QNWFQHWRFC SQRAAQQRFQ
760 770 780 790 800
LGQAAQHHHW QLLMEAMARW KAHHLGCIRK KFLQRQAAQL LAQRLSRACF
810 820 830 840 850
CQWRKQLAVR KQEQWGTARA LWLWAFSLQA KVWTAWLGFV LERRRKKARL
860 870 880 890 900
ERAMQAYQQQ LLQEGATRLL RFTAGTKAFR QQLQAQQQVQ AAHSLHCAVR
910 920 930 940 950
HCAELWKKKV LGPGKTSQPP APTTFSKRVT FKDSFLSGHA AEAGDATQET
960 970 980 990 1000
KKLRAPPSQG VLGSLAGAAG EPCHLDLNAA RSSRKQPRRP SFLLERLGSQ
1010 1020 1030 1040 1050
RSPEWYSLGE QQLEKPPEEE STALLGGSSL TRPFLPGVLP NVPGPKLPPT
1060 1070 1080 1090 1100
ASPGLELLPP SSIMPHAAGG TARVSAKPSI PGPQPWGCPS LPRDLDPQLL
1110 1120 1130 1140 1150
PGDSISTRTE PVYGSEATGH TELEAELEGI QQQLQHYQTT KQNLWSCQRQ
1160 1170 1180 1190 1200
ANSLRRWLEL SQEEPKSEDL HLEEQVKTEL EEVELQVQQL AKELEAQRQP
1210
VGTCIARVRA LRRALC
Length:1,216
Mass (Da):144,033
Last modified:October 11, 2005 - v1
Checksum:i304079976848CD01
GO
Isoform 2 (identifier: Q3UZY0-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     604-635: Missing.

Show »
Length:1,184
Mass (Da):140,274
Checksum:iC201AFF8B3936AE0
GO
Isoform 3 (identifier: Q3UZY0-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     197-425: CFWWSKWRWR...SLHAALSHYR → W
     604-635: Missing.
     978-1216: Missing.

Show »
Length:717
Mass (Da):85,938
Checksum:i5899B47FF129A3DD
GO
Isoform 4 (identifier: Q3UZY0-4) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-7: MEKKIGS → MTAEVNGSTSGNH
     115-1216: Missing.

Note: No experimental confirmation available.
Show »
Length:120
Mass (Da):14,287
Checksum:i6DB7304C35633F41
GO
Isoform 5 (identifier: Q3UZY0-5) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     362-366: YSCFR → VRAWS
     367-1125: Missing.
     1182-1182: E → EVRPGQPRASPWLSFLSACLVPPSRPCPQ

Show »
Length:485
Mass (Da):58,870
Checksum:iC4139B193B0955DB
GO

Sequence cautioni

The sequence AAH48950 differs from that shown. Reason: Erroneous initiation.Curated

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti23T → A in AAI06125 (PubMed:15489334).Curated1
Sequence conflicti133I → S in BAC97969 (PubMed:14621295).Curated1
Sequence conflicti134F → L in BAC97969 (PubMed:14621295).Curated1
Sequence conflicti759H → R in AAH26390 (PubMed:15489334).Curated1
Sequence conflicti779R → I in BAB31543 (PubMed:16141072).Curated1
Sequence conflicti1098Q → H in AAH26390 (PubMed:15489334).Curated1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0337091 – 7MEKKIGS → MTAEVNGSTSGNH in isoform 4. 1 Publication7
Alternative sequenceiVSP_033710115 – 1216Missing in isoform 4. 1 PublicationAdd BLAST1102
Alternative sequenceiVSP_033711197 – 425CFWWS…LSHYR → W in isoform 3. 1 PublicationAdd BLAST229
Alternative sequenceiVSP_033712362 – 366YSCFR → VRAWS in isoform 5. 1 Publication5
Alternative sequenceiVSP_033713367 – 1125Missing in isoform 5. 1 PublicationAdd BLAST759
Alternative sequenceiVSP_033714604 – 635Missing in isoform 2 and isoform 3. 1 PublicationAdd BLAST32
Alternative sequenceiVSP_033715978 – 1216Missing in isoform 3. 1 PublicationAdd BLAST239
Alternative sequenceiVSP_0337161182E → EVRPGQPRASPWLSFLSACL VPPSRPCPQ in isoform 5. 1 Publication1

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK129159 mRNA. Translation: BAC97969.1.
AK019095 mRNA. Translation: BAB31543.1.
AK133558 mRNA. Translation: BAE21725.1.
BX572640, AL671968 Genomic DNA. Translation: CAI26156.1.
AL671968, BX572640 Genomic DNA. Translation: CAI35196.1.
BC026390 mRNA. Translation: AAH26390.1.
BC046305 mRNA. Translation: AAH46305.1.
BC048950 mRNA. Translation: AAH48950.1. Different initiation.
BC106124 mRNA. Translation: AAI06125.1.
CCDSiCCDS36094.1. [Q3UZY0-1]
RefSeqiNP_084483.2. NM_030207.2. [Q3UZY0-1]
XP_006514959.1. XM_006514896.3. [Q3UZY0-1]
XP_006514961.1. XM_006514898.3. [Q3UZY0-2]
UniGeneiMm.394071.
Mm.485866.

Genome annotation databases

EnsembliENSMUST00000066391; ENSMUSP00000067261; ENSMUSG00000023764. [Q3UZY0-2]
ENSMUST00000081318; ENSMUSP00000080066; ENSMUSG00000023764. [Q3UZY0-1]
GeneIDi78887.
KEGGimmu:78887.
UCSCiuc007hrs.1. mouse. [Q3UZY0-2]
uc007hrt.1. mouse. [Q3UZY0-1]
uc007hrx.1. mouse. [Q3UZY0-3]

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK129159 mRNA. Translation: BAC97969.1.
AK019095 mRNA. Translation: BAB31543.1.
AK133558 mRNA. Translation: BAE21725.1.
BX572640, AL671968 Genomic DNA. Translation: CAI26156.1.
AL671968, BX572640 Genomic DNA. Translation: CAI35196.1.
BC026390 mRNA. Translation: AAH26390.1.
BC046305 mRNA. Translation: AAH46305.1.
BC048950 mRNA. Translation: AAH48950.1. Different initiation.
BC106124 mRNA. Translation: AAI06125.1.
CCDSiCCDS36094.1. [Q3UZY0-1]
RefSeqiNP_084483.2. NM_030207.2. [Q3UZY0-1]
XP_006514959.1. XM_006514896.3. [Q3UZY0-1]
XP_006514961.1. XM_006514898.3. [Q3UZY0-2]
UniGeneiMm.394071.
Mm.485866.

3D structure databases

ProteinModelPortaliQ3UZY0.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi10090.ENSMUSP00000080066.

PTM databases

iPTMnetiQ3UZY0.
PhosphoSitePlusiQ3UZY0.

Proteomic databases

PaxDbiQ3UZY0.
PRIDEiQ3UZY0.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSMUST00000066391; ENSMUSP00000067261; ENSMUSG00000023764. [Q3UZY0-2]
ENSMUST00000081318; ENSMUSP00000080066; ENSMUSG00000023764. [Q3UZY0-1]
GeneIDi78887.
KEGGimmu:78887.
UCSCiuc007hrs.1. mouse. [Q3UZY0-2]
uc007hrt.1. mouse. [Q3UZY0-1]
uc007hrx.1. mouse. [Q3UZY0-3]

Organism-specific databases

CTDi9814.
MGIiMGI:1926137. Sfi1.
RougeiSearch...

Phylogenomic databases

eggNOGiKOG4775. Eukaryota.
ENOG4111JHQ. LUCA.
GeneTreeiENSGT00390000018445.
HOVERGENiHBG108479.
InParanoidiQ3UZY0.
KOiK16489.
OMAiARWKVHH.
OrthoDBiEOG091G0EXO.
PhylomeDBiQ3UZY0.
TreeFamiTF328940.

Enzyme and pathway databases

ReactomeiR-MMU-2565942. Regulation of PLK1 Activity at G2/M Transition.
R-MMU-380259. Loss of Nlp from mitotic centrosomes.
R-MMU-380270. Recruitment of mitotic centrosome proteins and complexes.
R-MMU-5620912. Anchoring of the basal body to the plasma membrane.
R-MMU-8854518. AURKA Activation by TPX2.

Miscellaneous databases

PROiQ3UZY0.
SOURCEiSearch...

Gene expression databases

BgeeiENSMUSG00000023764.
ExpressionAtlasiQ3UZY0. baseline and differential.
GenevisibleiQ3UZY0. MM.

Family and domain databases

InterProiIPR030516. SFI1.
[Graphical view]
PANTHERiPTHR22028:SF4. PTHR22028:SF4. 1 hit.
ProtoNetiSearch...

Entry informationi

Entry nameiSFI1_MOUSE
AccessioniPrimary (citable) accession number: Q3UZY0
Secondary accession number(s): Q3KQN2
, Q5NC00, Q6ZQ99, Q80ZJ4, Q80ZY0, Q8R0V8, Q9CTY8
Entry historyi
Integrated into UniProtKB/Swiss-Prot: May 20, 2008
Last sequence update: October 11, 2005
Last modified: November 2, 2016
This is version 77 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.