
Reviewed, UniProtKB/Swiss-Prot Q12766 (SMF_HUMAN)

Last modified November 4, 2008. Version 52.

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein names
Recommended name:
    Protein SMF
Gene names
Name: SMF
Synonyms: KIAA0194
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 1538 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is not processed.
Protein existence: Evidence at transcript level.

General annotation (Comments)

Subcellular location

Nucleus (Potential).

Sequence similarities

Contains 1 HMG box DNA-binding domain.

Ontologies

Keywords

   Cellular component: Nucleus
   Ligand: DNA-binding

Gene Ontology (GO)

   Cellular component: nucleolus

Inferred from direct assay. Source: HPA

   Molecular function: kinase activity (Ref.4)

Non-traceable author statement. Source: UniProtKB

Sequence annotation (Features)

Feature key          Position(s)    Length   Description                 Feature identifier

Molecule processing

Chain                1 – 1538       1538     Protein SMF                 PRO_0000048806

Regions

DNA binding          288 – 356      69       HMG box
Compositional bias   62 – 239       178      Arg-rich
Compositional bias   276 – 282      7        Poly-Lys
Compositional bias   518 – 524      7        Poly-Ser
Compositional bias   1523 – 1532    10       Poly-Glu

Experimental info

Sequence conflict    734            1        A → V in AAH51025. Ref.2
Sequence conflict    734            1        A → V in BAA12107. Ref.3
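
The Length values listed above are consistent with 1-based positions in which both endpoints are counted. A minimal Python sketch of that arithmetic, assuming inclusive coordinates; the function name `feature_length` and the `FEATURES` list are illustrative and not part of any UniProt tool:

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    if start < 1 or end < start:
        raise ValueError(f"invalid feature range: {start}-{end}")
    # Both endpoints are counted, hence the +1.
    return end - start + 1


# (feature key, start, end, length as listed in the entry)
FEATURES = [
    ("Chain", 1, 1538, 1538),
    ("DNA binding", 288, 356, 69),
    ("Compositional bias (Arg-rich)", 62, 239, 178),
    ("Compositional bias (Poly-Lys)", 276, 282, 7),
    ("Compositional bias (Poly-Ser)", 518, 524, 7),
    ("Compositional bias (Poly-Glu)", 1523, 1532, 10),
    ("Sequence conflict", 734, 734, 1),
]

for key, start, end, listed in FEATURES:
    computed = feature_length(start, end)
    status = "ok" if computed == listed else "MISMATCH"
    print(f"{key:32s} {start:>5d}-{end:<5d} listed={listed:<5d} computed={computed:<5d} {status}")
```

A single-residue feature such as the sequence conflict at 734 has equal start and end, so its length is 1.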

Sequences

Q12766-1 [UniParc].

Last modified November 13, 2007. Version 2.
Checksum: 966C2EBE269BA118

Length: 1,538 AA. Mass: 168,334 Da.
        10         20         30         40         50         60 
MQRTQPRPCY LNAPQQCPGA ERPGRPTAGS HSFLLRPGPL AGSSPFALLD PLQAFEQFVW 

        70         80         90        100        110        120 
VRSQARAGLL RLRQGSHAVT RCRPLPVRRE GRRDGSPWRS VVCRYCRCSR QTGASVTTVS 

       130        140        150        160        170        180 
LPSSSSSPGL DPRGPRQASV RSLRSEPVLL FLPFRTPYRD SEEGKREGLS RLRAVCRRAG 

       190        200        210        220        230        240 
PRGRGSFSPR DARASPRLHF LVAAVTTGAA SRRQRGARVR QPSPSSSRRA KRLRECERRS 

       250        260        270        280        290        300 
LHAPPAMDAS YDGTEVTVVM EEIEEAYCYT SPGPPKKKKK YKIHGEKTKK PRSAYLLYYY 

       310        320        330        340        350        360 
DIYLKVQQEL PHLPQSEINK KISESWRLLS VAERSYYLEK AKLEKEGLDP NSKLSALTAV 

       370        380        390        400        410        420 
VPDIPGFRKI LPRSDYIIIP KSSLQEDRSC PQLELCVAQN QMSPKGPPLV SNTAPETVPS 

       430        440        450        460        470        480 
HAGMAEQCLA VEALAEEVGA LTQSGAVQEI ATSEILSQDV LLEDASLEVG ESHQPYQTSL 

       490        500        510        520        530        540 
VIEETLVNGS PDLPTGSLAV PHPQVGESVS VVTVMRDSSE SSSSAPATQF IMLPLPAYSV 

       550        560        570        580        590        600 
VENPTSIKLT TTYTRRGHGT CTSPGCSFTY VTRHKPPKCP TCGNFLGGKW IPKEKPAKVK 

       610        620        630        640        650        660 
VELASGVSSK GSVVKRNQQP VTTEQNSSKE NASKLTLENS EAVSQLLNVA PPREVGEESE 

       670        680        690        700        710        720 
WEEVIISDAH VLVKEAPGNC GTAVTKTPVV KSGVQPEVTL GTTDNDSPGA DVPTPSEGTS 

       730        740        750        760        770        780 
TSSPLPAPKK PTGADLLTPG SRAPELKGRA RGKPSLLAAA RPMRAILPAP VNVGRGSSMG 

       790        800        810        820        830        840 
LPRARQAFSL SDKTPSVRTC GLKPSTLKQL GQPIQQPSGP GEVKLPSGPS NRTSQVKVVE 

       850        860        870        880        890        900 
VKPDMFPPYK YSCTVTLDLG LATSRGRGKC KNPSCSYVYT NRHKPRICPS CGVNLAKDRT 

       910        920        930        940        950        960 
EKTTKAIEVS SPLPDVLNAT EPLSTAQREI QRQSTLQLLR KVLQIPENES ELAEVFALIH 

       970        980        990       1000       1010       1020 
ELNSSRLILS NVSEETVTIE QTSWSNYYES PSTQCLLCSS PLFKGGQNSL AGPQECWLLT 

      1030       1040       1050       1060       1070       1080 
ASRLQTVTAQ VKMCLNPHCL ALHSFIDIYT GLFNVGNKLL VSLDLLFAIR NQIKLGEDPR 

      1090       1100       1110       1120       1130       1140 
VSINVVLKSV QEQTEKTLTS EELSQLQELL CNGYWAFECL TVRDYNDMIC GICGVAPKVE 

      1150       1160       1170       1180       1190       1200 
MAQRSEENVL ALKSVEFTWP EFLGSNEVNV EDFWATMETE VIEQVAFPAS IPITKFDASV 

      1210       1220       1230       1240       1250       1260 
IAPFFPPLMR GAVVVNTEKD KNLDVQPVPG SGSALVRLLQ EGTCKLDEIG SYSEEKLQHL 

      1270       1280       1290       1300       1310       1320 
LRQCGIPFGA EDSKDQLCFS LLALYESVQN GARAIRPPRH FTGGKIYKVC PHQVVCGSKY 

      1330       1340       1350       1360       1370       1380 
LVRGESARDH VDLLASSRHW PPVYVVDMAT SVALCADLCY PELTNQMWGR NQGCFSSPTE 

      1390       1400       1410       1420       1430       1440 
PPVSVSCPEL LDQHYTVDMT ETEHSIQHPV TKTATRRIVH AGLQPNPGDP SAGHHSLALC 

      1450       1460       1470       1480       1490       1500 
PELAPYATIL ASIVDSKPNG VRQRPIAFDN ATHYYLYNRL MDFLTSREIV NRQIHDIVQS 

      1510       1520       1530 
CQPGEVVIRD TLYRLGVAQI KTETEEEGEE EEVAAVAE 
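
The sequence above is displayed in blocks of ten residues under position rulers. A minimal Python sketch of how such a display could be joined into one string and sliced by feature position, assuming ruler lines contain only digits and whitespace; `join_sequence_blocks`, `feature_residues` and the `first_position` parameter are hypothetical helpers introduced for illustration:

```python
def join_sequence_blocks(text: str) -> str:
    """Concatenate residue lines, skipping blank lines and numeric ruler lines."""
    residues = []
    for line in text.splitlines():
        compact = "".join(line.split())  # drop all whitespace
        if not compact or compact.isdigit():
            continue  # blank line or position ruler
        residues.append(compact)
    return "".join(residues)


def feature_residues(sequence: str, start: int, end: int, first_position: int = 1) -> str:
    """Residues for a 1-based, inclusive range.

    first_position is the entry position of sequence[0]; it is 1 for a full
    sequence and larger when only a fragment of the display is supplied.
    """
    return sequence[start - first_position : end - first_position + 1]


# One row of the display above, covering positions 241-300.
block = """\
       250        260        270        280        290        300
LHAPPAMDAS YDGTEVTVVM EEIEEAYCYT SPGPPKKKKK YKIHGEKTKK PRSAYLLYYY
"""

fragment = join_sequence_blocks(block)
poly_lys = feature_residues(fragment, 276, 282, first_position=241)
print(len(fragment), poly_lys)
```

With the full 1,538-residue display as input, `first_position` can be left at its default of 1.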

References

[1] "The DNA sequence and comparative analysis of human chromosome 5."
Schmutz J., Martin J., Terry A., Couronne O., Grimwood J., Lowry S., Gordon L.A., Scott D., Xie G., Huang W., Hellsten U., Tran-Gyamfi M., She X., Prabhakar S., Aerts A., Altherr M., Bajorek E., Black S., Branscomb E., Caoile C., Challacombe J.F., Chan Y.M., Denys M., Detter J.C., Escobar J., Flowers D., Fotopulos D., Glavina T., Gomez M., Gonzales E., Goodstein D., Grigoriev I., Groza M., Hammon N., Hawkins T., Haydu L., Israni S., Jett J., Kadner K., Kimball H., Kobayashi A., Lopez F., Lou Y., Martinez D., Medina C., Morgan J., Nandkeshwar R., Noonan J.P., Pitluck S., Pollard M., Predki P., Priest J., Ramirez L., Retterer J., Rodriguez A., Rogers S., Salamov A., Salazar A., Thayer N., Tice H., Tsai M., Ustaszewska A., Vo N., Wheeler J., Wu K., Yang J., Dickson M., Cheng J.-F., Eichler E.E., Olsen A., Pennacchio L.A., Rokhsar D.S., Richardson P., Lucas S.M., Myers R.M., Rubin E.M.
Nature 431:268-274(2004) [PubMed: 15372022]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[2] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 100-1538.
Tissue: Brain.
[3] "Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1."
Nagase T., Seki N., Ishikawa K., Tanaka A., Nomura N.
DNA Res. 3:17-24(1996) [PubMed: 8724849]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 104-1538.
Tissue: Bone marrow.
[4] "Sequence analysis of two genomic regions containing the KIT and the FMS receptor tyrosine kinase genes."
Andre C., Hampe A., Lachaume P., Martin E., Wang X.P., Manus V., Hu W.X., Galibert F.
Genomics 39:216-226(1997) [PubMed: 9027509]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1161-1538.
Tissue: Placenta.

Cross-references

Sequence databases

AC011406 Genomic DNA. No translation available.
BC051025 mRNA. Translation: AAH51025.1.
D83778 mRNA. Translation: BAA12107.1.
U63963 Genomic DNA. Translation: AAB51697.1.
UniGene: Hs.586219

PTM databases

PhosphoSite: Q12766.

Genome annotation databases

Ensembl: ENSG00000113716. Homo sapiens.

Organism-specific databases

HPA: HPA002354.
HPA: HPA005967.

Phylogenomic databases

HOGENOM: Q12766.
HOVERGEN: Q12766.

Gene expression databases

GermOnline: ENSG00000113716. Homo sapiens.

Family and domain databases

InterPro: IPR000910. HMG_1/2_box.
Gene3D: G3DSA:1.10.30.10. HMG-box. 1 hit.
Pfam: PF00505. HMG_box. 1 hit.
SMART: SM00398. HMG. 1 hit.
PROSITE: PS50118. HMG_BOX_2. 1 hit.

Entry information

Entry name: SMF_HUMAN
Accession
Primary (citable) accession number: Q12766
Secondary accession number(s): Q86UG3, Q9UMF4
Entry history
Integrated into UniProtKB/Swiss-Prot: November 1, 1997
Last sequence update: November 13, 2007
Last modified: November 4, 2008
This is version 52 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation project: HPI (Human Proteome Initiative)

Relevant documents

Human chromosome 5

Human chromosome 5: entries, gene names and cross-references to MIM

UniProtKB secondary accession numbers

Index of UniProtKB secondary accession numbers

SIMILARITY comments

Index of protein domains and families