Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

P27790 (CENPB_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 114. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Major centromere autoantigen B
Alternative name(s):
Centromere protein B
Short name=CENP-B
Gene names
Name:Cenpb
Synonyms:Cenp-b
OrganismMus musculus (Mouse) [Reference proteome]
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus

Protein attributes

Sequence length599 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at transcript level

General annotation (Comments)

Function

Interacts with centromeric heterochromatin in chromosomes and binds to a specific subset of alphoid satellite DNA, called the CENP-B box. May organize arrays of centromere satellite DNA into a higher-order structure which then directs centromere formation and kinetochore assembly in mammalian chromosomes By similarity.

Subunit structure

Antiparallel homodimer. Interacts with CENPT By similarity.

Subcellular location

Nucleus. Chromosomecentromere.

Post-translational modification

Poly-ADP-ribosylated by PARP1. Ref.4

N-terminally methylated by METTL11A/NTM1. Alpha-N-methylation is stimulated in response to extracellular stimuli, including increased cell density and heat shock, and seems to facilitate binding to CENP-B boxes. Chromatin-bound CENP-B is primarily trimethylated By similarity.

Sequence similarities

Contains 1 HTH CENPB-type DNA-binding domain.

Contains 1 HTH psq-type DNA-binding domain.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Initiator methionine11Removed By similarity
Chain2 – 599598Major centromere autoantigen B
PRO_0000126126

Regions

Domain2 – 5251HTH psq-type
Domain65 – 13672HTH CENPB-type
DNA binding28 – 4821H-T-H motif By similarity
DNA binding97 – 12933H-T-H motif By similarity
Region536 – 59964Homodimerization By similarity
Compositional bias404 – 46562Glu-rich (acidic)
Compositional bias508 – 53831Asp/Glu-rich (acidic)

Amino acid modifications

Modified residue21N,N,N-trimethylglycine By similarity
Modified residue1651Phosphoserine By similarity
Modified residue3981Phosphothreonine By similarity

Experimental info

Sequence conflict1451S → T in CAA38878. Ref.1
Sequence conflict150 – 1523APA → PQP in CAA38878. Ref.1

Sequences

Sequence LengthMass (Da)Tools
P27790 [UniParc].

Last modified July 27, 2011. Version 2.
Checksum: EBDB7C76BA87DC73

FASTA59965,381
        10         20         30         40         50         60 
MGPKRRQLTF REKSRIIQEV EENPDLRKGE IARRFNIPPS TLSTILKNKR AILASERKYG 

        70         80         90        100        110        120 
VASTCRKTNK LSPYDKLEGL LIAWFQQIRA AGLPVKGIIL KEKALRIAEE LGMDDFTASN 

       130        140        150        160        170        180 
GWLDRFRRRH GVVACSGVTR SRARSSAPRA PAAPAGPATV PSEGSGGSTP GWHTREEQPP 

       190        200        210        220        230        240 
SVAEGYASQD VFSATETSLW YDFLSDQASG LWGGDGPARQ ATQRLSVLLC ANADGSEKLP 

       250        260        270        280        290        300 
PLVAGKSAKP RAGQGGLPCD YTANSKGGVT TQALAKYLKA LDTRMAAESR RVLLLAGRLA 

       310        320        330        340        350        360 
AQSLDTSGLR HVQLAFFPPG TVHPLERGVV QQVKGHYRQA MLLKAMAALE GQDPSGLQLG 

       370        380        390        400        410        420 
LVEALHFVAA AWQAVEPSDI ATCFREAGFG GGLNATITTS FKSEGEEEEE EEEEEEEEEE 

       430        440        450        460        470        480 
EEGEGEEEEE EEEEGEEEGG EGEEEGEEEV EEEGEVDDSD EEEEESSSEG LEAEDWAQGV 

       490        500        510        520        530        540 
VEASGGFGGY SVQEEAQFPT LHFLEGGEDS DSDSDEEEDD EEEDEEDEDE EDDEDGDEVP 

       550        560        570        580        590 
VPSFGEAMAY FAMVKRYLTS FPIDDRVQSH ILHLEHDLVH VTRKNHARQA GVRGLGHQS 

« Hide

References

« Hide 'large scale' references
[1]"CENP-B is a highly conserved mammalian centromere protein with homology to the helix-loop-helix family of proteins."
Sullivan K.F., Glass C.A.
Chromosoma 100:360-370(1991) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: C57BL/6.
Tissue: Liver.
[2]"Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S. expand/collapse author list , Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[3]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Strain: FVB/N.
Tissue: Colon, Embryo, Jaw and Limb.
[4]"Centromere proteins Cenpa, Cenpb, and Bub3 interact with poly(ADP-ribose) polymerase-1 protein and are poly(ADP-ribosyl)ated."
Saxena A., Saffery R., Wong L.H., Kalitsis P., Choo K.H.
J. Biol. Chem. 277:26921-26926(2002) [PubMed] [Europe PMC] [Abstract]
Cited for: POLY-ADP-RIBOSYLATION BY PARP1.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
X55038 Genomic DNA. Translation: CAA38878.1.
AL831736 Genomic DNA. No translation available.
BC053333 mRNA. Translation: AAH53333.1.
BC071269 mRNA. Translation: AAH71269.1.
BC075733 mRNA. Translation: AAH75733.1.
CCDSCCDS16757.1.
RefSeqNP_031708.2. NM_007682.2.
UniGeneMm.440169.

3D structure databases

ProteinModelPortalP27790.
SMRP27790. Positions 1-129, 540-585.
ModBaseSearch...
MobiDBSearch...

Protein-protein interaction databases

BioGrid198675. 1 interaction.
IntActP27790. 1 interaction.
MINTMINT-237475.

PTM databases

PhosphoSiteP27790.

Proteomic databases

PRIDEP27790.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENSMUST00000089510; ENSMUSP00000086938; ENSMUSG00000068267.
GeneID12616.
KEGGmmu:12616.
UCSCuc008mkx.1. mouse.

Organism-specific databases

CTD1059.
MGIMGI:88376. Cenpb.

Phylogenomic databases

eggNOGNOG241149.
GeneTreeENSGT00740000115260.
HOGENOMHOG000111537.
HOVERGENHBG050890.
KOK11496.
OMAKRRQLTF.
OrthoDBEOG7HTHGS.
TreeFamTF101131.

Gene expression databases

BgeeP27790.
CleanExMM_CENPB.
GenevestigatorP27790.

Family and domain databases

Gene3D1.10.10.60. 2 hits.
InterProIPR015115. Centromere_CenpB_dimerisation.
IPR004875. DDE_SF_endonuclease_CENPB-like.
IPR009057. Homeodomain-like.
IPR006600. HTH_CenpB_DNA-bd_dom.
IPR007889. HTH_Psq.
[Graphical view]
PfamPF09026. CENP-B_dimeris. 1 hit.
PF04218. CENP-B_N. 1 hit.
PF03184. DDE_1. 1 hit.
PF03221. HTH_Tnp_Tc5. 1 hit.
[Graphical view]
SMARTSM00674. CENPB. 1 hit.
[Graphical view]
SUPFAMSSF46689. SSF46689. 2 hits.
PROSITEPS51253. HTH_CENPB. 1 hit.
PS50960. HTH_PSQ. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

NextBio281782.
PROP27790.
SOURCESearch...

Entry information

Entry nameCENPB_MOUSE
AccessionPrimary (citable) accession number: P27790
Secondary accession number(s): Q7TSG8
Entry history
Integrated into UniProtKB/Swiss-Prot: August 1, 1992
Last sequence update: July 27, 2011
Last modified: July 9, 2014
This is version 114 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot