Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Integrator complex subunit 4

Gene

Ints4

Organism
Mus musculus (Mouse)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Experimental evidence at protein leveli

Functioni

Component of the Integrator complex, a complex involved in the small nuclear RNAs (snRNA) U1 and U2 transcription and in their 3'-box-dependent processing. The Integrator complex is associated with the C-terminal domain (CTD) of RNA polymerase II largest subunit (POLR2A) and is recruited to the U1 and U2 snRNAs genes (By similarity).By similarity

GO - Biological processi

Complete GO annotation...

Enzyme and pathway databases

ReactomeiR-MMU-6807505. RNA polymerase II transcribes snRNA genes.

Names & Taxonomyi

Protein namesi
Recommended name:
Integrator complex subunit 4
Short name:
Int4
Gene namesi
Name:Ints4
OrganismiMus musculus (Mouse)
Taxonomic identifieri10090 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus
Proteomesi
  • UP000000589 Componenti: Chromosome 7

Organism-specific databases

MGIiMGI:1917164. Ints4.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Nucleus

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 964964Integrator complex subunit 4PRO_0000259539Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Modified residuei27 – 271N6-acetyllysineCombined sources
Cross-linki792 – 792Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2)By similarity

Keywords - PTMi

Acetylation, Isopeptide bond, Ubl conjugation

Proteomic databases

EPDiQ8CIM8.
MaxQBiQ8CIM8.
PaxDbiQ8CIM8.
PRIDEiQ8CIM8.

PTM databases

iPTMnetiQ8CIM8.
PhosphoSiteiQ8CIM8.

Expressioni

Gene expression databases

BgeeiQ8CIM8.
CleanExiMM_INTS4.
GenevisibleiQ8CIM8. MM.

Interactioni

Subunit structurei

Belongs to the multiprotein complex Integrator, at least composed of INTS1, INTS2, INTS3, INTS4, INTS5, INTS6, INTS7, INTS8, INTS9/RC74, INTS10, CPSF3L/INTS11 and INTS12.By similarity

Protein-protein interaction databases

BioGridi221739. 9 interactions.
IntActiQ8CIM8. 10 interactions.
MINTiMINT-4116412.
STRINGi10090.ENSMUSP00000026126.

Structurei

3D structure databases

ProteinModelPortaliQ8CIM8.
SMRiQ8CIM8. Positions 71-312, 378-472.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Repeati67 – 10640HEAT 1Add
BLAST
Repeati146 – 18439HEAT 2Add
BLAST
Repeati191 – 22939HEAT 3Add
BLAST
Repeati230 – 26435HEAT 4Add
BLAST
Repeati278 – 31437HEAT 5Add
BLAST
Repeati370 – 40637HEAT 6Add
BLAST
Repeati407 – 44539HEAT 7Add
BLAST
Repeati447 – 48539HEAT 8Add
BLAST

Sequence similaritiesi

Belongs to the Integrator subunit 4 family.Curated
Contains 8 HEAT repeats.Curated

Keywords - Domaini

Repeat

Phylogenomic databases

eggNOGiKOG2259. Eukaryota.
ENOG410XPX8. LUCA.
GeneTreeiENSGT00390000010128.
HOGENOMiHOG000007958.
HOVERGENiHBG081801.
InParanoidiQ8CIM8.
KOiK13141.
OMAiFNAAKSC.
OrthoDBiEOG70CR6Q.
PhylomeDBiQ8CIM8.
TreeFamiTF315047.

Family and domain databases

Gene3Di1.25.10.10. 4 hits.
InterProiIPR011989. ARM-like.
IPR016024. ARM-type_fold.
IPR026003. Cohesin_HEAT.
[Graphical view]
PfamiPF12765. Cohesin_HEAT. 1 hit.
[Graphical view]
SUPFAMiSSF48371. SSF48371. 2 hits.

Sequences (3)i

Sequence statusi: Complete.

This entry describes 3 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q8CIM8-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MAAHLKKRVY EEFTKVVQQQ QEEIATKKLR LTKPSKSAAL HIDLCKATSP
60 70 80 90 100
ADALQYLLQF ARKPVEAESV EGVVRILLEH YYKENDPSVR LKIASLLGLL
110 120 130 140 150
SKTAGFSPDC IMDDAINILQ NEKSHQVLAQ LLDTLLAIGS KLPENQATQV
160 170 180 190 200
RLVDVACKHL TDTSHGVRNK CLQLLGNLGS LEKSVTKDTE GSAARDVQKI
210 220 230 240 250
IGDHFSDQDP RVRTAAIKAM LQLHERGLKL HQTIYNQACK LLSDDYEQVR
260 270 280 290 300
SAAVQLIWVV SQLYPESIVP IPSSNEEIRL VDDAFGKICH MVSDGSWVVR
310 320 330 340 350
VQAAKLLGSM EQVSSHFLEQ TLDKKLMSDL RRKRTAHERA KELYSSGEFS
360 370 380 390 400
SGRKWGDDAP KEEIDTGAVN LIESGACGAF VHGLEDEMYE VRIAAVEALC
410 420 430 440 450
MLAQSSPSFA EKCLDFLVDM FNDEIEEVRL QSIHTMRKIS NNITLREDQL
460 470 480 490 500
DTVLAVLEDS SRDIREALHE LLCCTNVSTK EGIHLALVEL LKNLTKYPTD
510 520 530 540 550
RDSIWKCLKF LGSRHPTLVL PLVPELLSTH PFFDTAEPDM DDPAYIAVLV
560 570 580 590 600
LIFNAAKTCP TMPALFSDHT LRHYAYLRDS LSHLVPALRL PGRKLVSSTV
610 620 630 640 650
PSNITPHEDP SQQFLQQSLE RVYSVQHLDP QGAQELLEFT IRDLQRLGEL
660 670 680 690 700
QSELAGVADF SATYLQCQLL LIKALQEKLW NVAAPLYLKQ SDLASAAAKQ
710 720 730 740 750
IMEETYKMEF MYSGVENKQV VIIQHMRLQA KALQLIVTAR TTRGVDPLFG
760 770 780 790 800
MCEKFLQEVD FFQRCFIADL PHLQDSFVDK LLDLMPRLMA SKPVEVIKIL
810 820 830 840 850
QTMLRQSTFL HLPLPEQIHK ASATIIEPAG ESDNPLRFTS GLVVALDVDA
860 870 880 890 900
TLEHVQDPQN TVKVQVLYPD GQAQMIHPKP ADFRNPGPGR HRLLTQVYLS
910 920 930 940 950
HTAWTEPCQV EVRLLLAYNS GARIPKSPWL EGSEMSPQVE TSIEGTIPFS
960
KPVKVYIMPK PARR
Length:964
Mass (Da):108,193
Last modified:March 1, 2003 - v1
Checksum:iBF00ADA2FF1076FF
GO
Isoform 2 (identifier: Q8CIM8-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     507-964: Missing.

Show »
Length:506
Mass (Da):56,643
Checksum:iF9E9574E3DC335BE
GO
Isoform 3 (identifier: Q8CIM8-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     221-223: LQL → VTG
     224-964: Missing.

Note: No experimental confirmation available.
Show »
Length:223
Mass (Da):24,565
Checksum:i6D48C272553B4C8C
GO

Sequence cautioni

The sequence AAH13813.1 differs from that shown. Reason: Erroneous initiation. Curated
The sequence BAE37328.1 differs from that shown. Reason: Frameshift at positions 165 and 191. Curated

Alternative sequence

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Alternative sequencei221 – 2233LQL → VTG in isoform 3. 1 PublicationVSP_021454
Alternative sequencei224 – 964741Missing in isoform 3. 1 PublicationVSP_021455Add
BLAST
Alternative sequencei507 – 964458Missing in isoform 2. 1 PublicationVSP_021456Add
BLAST

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK011676 mRNA. Translation: BAB27773.1.
AK088635 mRNA. Translation: BAC40468.1.
AK163383 mRNA. Translation: BAE37328.1. Frameshift.
BC013710 mRNA. Translation: AAH13710.1.
BC013813 mRNA. Translation: AAH13813.1. Different initiation.
CCDSiCCDS21460.1. [Q8CIM8-1]
RefSeqiNP_081532.1. NM_027256.2. [Q8CIM8-1]
UniGeneiMm.27383.

Genome annotation databases

EnsembliENSMUST00000026126; ENSMUSP00000026126; ENSMUSG00000025133. [Q8CIM8-1]
GeneIDi101861.
KEGGimmu:101861.
UCSCiuc009ijg.2. mouse. [Q8CIM8-3]
uc009ijh.2. mouse. [Q8CIM8-1]

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK011676 mRNA. Translation: BAB27773.1.
AK088635 mRNA. Translation: BAC40468.1.
AK163383 mRNA. Translation: BAE37328.1. Frameshift.
BC013710 mRNA. Translation: AAH13710.1.
BC013813 mRNA. Translation: AAH13813.1. Different initiation.
CCDSiCCDS21460.1. [Q8CIM8-1]
RefSeqiNP_081532.1. NM_027256.2. [Q8CIM8-1]
UniGeneiMm.27383.

3D structure databases

ProteinModelPortaliQ8CIM8.
SMRiQ8CIM8. Positions 71-312, 378-472.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi221739. 9 interactions.
IntActiQ8CIM8. 10 interactions.
MINTiMINT-4116412.
STRINGi10090.ENSMUSP00000026126.

PTM databases

iPTMnetiQ8CIM8.
PhosphoSiteiQ8CIM8.

Proteomic databases

EPDiQ8CIM8.
MaxQBiQ8CIM8.
PaxDbiQ8CIM8.
PRIDEiQ8CIM8.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSMUST00000026126; ENSMUSP00000026126; ENSMUSG00000025133. [Q8CIM8-1]
GeneIDi101861.
KEGGimmu:101861.
UCSCiuc009ijg.2. mouse. [Q8CIM8-3]
uc009ijh.2. mouse. [Q8CIM8-1]

Organism-specific databases

CTDi92105.
MGIiMGI:1917164. Ints4.

Phylogenomic databases

eggNOGiKOG2259. Eukaryota.
ENOG410XPX8. LUCA.
GeneTreeiENSGT00390000010128.
HOGENOMiHOG000007958.
HOVERGENiHBG081801.
InParanoidiQ8CIM8.
KOiK13141.
OMAiFNAAKSC.
OrthoDBiEOG70CR6Q.
PhylomeDBiQ8CIM8.
TreeFamiTF315047.

Enzyme and pathway databases

ReactomeiR-MMU-6807505. RNA polymerase II transcribes snRNA genes.

Miscellaneous databases

PROiQ8CIM8.
SOURCEiSearch...

Gene expression databases

BgeeiQ8CIM8.
CleanExiMM_INTS4.
GenevisibleiQ8CIM8. MM.

Family and domain databases

Gene3Di1.25.10.10. 4 hits.
InterProiIPR011989. ARM-like.
IPR016024. ARM-type_fold.
IPR026003. Cohesin_HEAT.
[Graphical view]
PfamiPF12765. Cohesin_HEAT. 1 hit.
[Graphical view]
SUPFAMiSSF48371. SSF48371. 2 hits.
ProtoNetiSearch...

Publicationsi

  1. "The transcriptional landscape of the mammalian genome."
    Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J.
    , Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
    Science 309:1559-1563(2005) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 3), NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 622-964 (ISOFORM 1).
    Strain: C57BL/6J and NOD.
    Tissue: Thymus.
  2. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
    Strain: FVB/N.
    Tissue: Mammary tumor.
  3. Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
    Tissue: Liver, Pancreas, Spleen and Testis.
  4. "SIRT5-mediated lysine desuccinylation impacts diverse metabolic pathways."
    Park J., Chen Y., Tishkoff D.X., Peng C., Tan M., Dai L., Xie Z., Zhang Y., Zwaans B.M., Skinner M.E., Lombard D.B., Zhao Y.
    Mol. Cell 50:919-930(2013) [PubMed] [Europe PMC] [Abstract]
    Cited for: ACETYLATION [LARGE SCALE ANALYSIS] AT LYS-27, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
    Tissue: Embryonic fibroblast.

Entry informationi

Entry nameiINT4_MOUSE
AccessioniPrimary (citable) accession number: Q8CIM8
Secondary accession number(s): Q3TQQ4
, Q8C2H1, Q91YV5, Q9CSY4
Entry historyi
Integrated into UniProtKB/Swiss-Prot: October 31, 2006
Last sequence update: March 1, 2003
Last modified: June 8, 2016
This is version 105 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.