

Q9DB00 (GON4L_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified November 16, 2011. Version 75.


Names and origin

Protein names
Recommended name: GON-4-like protein
Alternative name(s): GON-4 homolog
Gene names
Name: Gon4l
Synonyms: Gon4, Kiaa1606
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus

Protein attributes

Sequence length: 2260 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Subcellular location

Nucleus (Potential).

Sequence similarities

Contains 1 Myb-like domain.

Contains 2 PAH (paired amphipathic helix) domains.

Sequence caution

The sequence AAH16616.1 differs from that shown. Reason: Erroneous initiation.

The sequence BAB23992.1 differs from that shown. Reason: Chimeric cDNA.

The sequence BAC65813.1 differs from that shown. Reason: Erroneous initiation.

Ontologies

Keywords
   Cellular component: Nucleus
   Domain: Repeat
   PTM: Isopeptide bond, Phosphoprotein, Ubl conjugation
   Technical term: 3D-structure, Complete proteome, Reference proteome
Gene Ontology (GO)
   Biological process: regulation of transcription, DNA-dependent
   Inferred from electronic annotation. Source: InterPro

   Cellular component: nucleus
   Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular function: DNA binding
   Inferred from electronic annotation. Source: InterPro


Sequence annotation (Features)

Feature key | Position(s) | Length | Description | Feature identifier

Molecule processing

Chain | 1 – 2260 | 2260 | GON-4-like protein | PRO_0000197110

Regions

Domain | 1648 – 1720 | 73 | PAH 1
Domain | 1730 – 1801 | 72 | PAH 2
Domain | 2167 – 2220 | 54 | Myb-like
Compositional bias | 242 – 248 | 7 | Poly-Lys
Compositional bias | 380 – 385 | 6 | Poly-Glu
Compositional bias | 450 – 456 | 7 | Poly-Pro
Compositional bias | 519 – 578 | 60 | Asp-rich
Compositional bias | 622 – 625 | 4 | Poly-Glu
Compositional bias | 1476 – 1570 | 95 | Glu-rich
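
The Length column in the Regions listing appears to be the inclusive span of each position range, that is, end minus start plus one. A minimal Python sketch of that check, using only the values listed in this entry; the names `REGIONS`, `feature_length` and `mismatches` are illustrative and not part of any UniProt tool:

```python
# Region features copied from this entry:
# (feature key, start, end, stated length, description)
REGIONS = [
    ("Domain", 1648, 1720, 73, "PAH 1"),
    ("Domain", 1730, 1801, 72, "PAH 2"),
    ("Domain", 2167, 2220, 54, "Myb-like"),
    ("Compositional bias", 242, 248, 7, "Poly-Lys"),
    ("Compositional bias", 380, 385, 6, "Poly-Glu"),
    ("Compositional bias", 450, 456, 7, "Poly-Pro"),
    ("Compositional bias", 519, 578, 60, "Asp-rich"),
    ("Compositional bias", 622, 625, 4, "Poly-Glu"),
    ("Compositional bias", 1476, 1570, 95, "Glu-rich"),
]


def feature_length(start, end):
    """Length of an inclusive, 1-based residue range."""
    return end - start + 1


def mismatches(regions):
    """Return the rows whose stated length disagrees with their range."""
    return [row for row in regions if feature_length(row[1], row[2]) != row[3]]


if __name__ == "__main__":
    bad = mismatches(REGIONS)
    print("all lengths consistent" if not bad else bad)
```

The same check is a quick way to confirm that a fused string such as "1648 – 172073" has been split into positions and length correctly.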

Amino acid modifications

Modified residue | 188 | 1 | Phosphoserine (By similarity)
Modified residue | 1998 | 1 | Phosphoserine (By similarity)
Modified residue | 2074 | 1 | Phosphoserine (By similarity)
Cross-link | 536 | | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) (By similarity)

Experimental info

Sequence conflict | 259 | 1 | P → L in BAC65813. Ref.3
Sequence conflict | 734 | 1 | I → V in BAC65813. Ref.3
Sequence conflict | 1080 | 1 | A → S in BAC65813. Ref.3
Sequence conflict | 1341 | 1 | S → P in BAC65813. Ref.3
Sequence conflict | 1354 | 1 | Q → E in BAC65813. Ref.3
Sequence conflict | 1364 | 1 | F → L in BAC65813. Ref.3
Sequence conflict | 1392 | 1 | S → R in BAC65813. Ref.3
Sequence conflict | 1407 | 1 | M → T in BAC65813. Ref.3
Sequence conflict | 1438 | 1 | L → P in BAC65813. Ref.3
Sequence conflict | 1458 | 1 | A → V in BAC65813. Ref.3
Sequence conflict | 2069 | 1 | S → P in BAC65813. Ref.3
Sequence conflict | 2069 | 1 | S → P in AAH16616. Ref.4
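
Each sequence conflict gives a 1-based position, the residue in the sequence shown in this entry, and the residue reported in the conflicting record. As a rough illustration, the Python sketch below applies the first conflict (P → L at position 259) to residues 241-300 of the sequence shown in this entry. The helper `apply_conflict` and the constants are hypothetical names introduced for this example only:

```python
# Residues 241-300 of the sequence shown in this entry, spaces removed.
FRAGMENT_START = 241
FRAGMENT = "DKKKKTKKGTKRKRDGKGPEQGTMVYDPKLDDMLDRTLEDGAKQHNLTAVNVRNILHEVI"


def apply_conflict(fragment, fragment_start, position, expected, replacement):
    """Return the fragment with one residue substituted.

    position is 1-based on the full protein; fragment_start is the
    1-based position of the fragment's first residue.
    """
    index = position - fragment_start
    if not 0 <= index < len(fragment):
        raise ValueError(f"position {position} lies outside the fragment")
    if fragment[index] != expected:
        raise ValueError(
            f"expected {expected} at {position}, found {fragment[index]}"
        )
    return fragment[:index] + replacement + fragment[index + 1:]


if __name__ == "__main__":
    variant = apply_conflict(FRAGMENT, FRAGMENT_START, 259, "P", "L")
    print(variant)
```

Checking the expected residue before substituting catches off-by-one errors in position handling, which are easy to make when working from a fragment rather than the full sequence.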

Secondary structure

Graphical view of helix, strand and turn positions along residues 1 – 2260 (not reproduced here).

Sequences

Sequence | Length | Mass (Da)
Q9DB00 [UniParc] | 2,260 | 248,742

Last modified July 27, 2011. Version 3.
Checksum: BB0F605666D642C6
        10         20         30         40         50         60 
MLPCKKRRLS VTESSQQQDD QEGDDLDLEA AVKPDTDQLP DSASESLSWG QSQDSAVCPE 

        70         80         90        100        110        120 
GLSMQDGDDQ LRAEGLSLNS KMLAQHVNLA VLEAVDVAVS QEIPLPSLES SHSLPVHVDK 

       130        140        150        160        170        180 
GRLQVSASKK GKRVVFTPGQ VTREDRGDHP VPEEPPSGEP AEEAKTEGGE LELRSDGEVP 

       190        200        210        220        230        240 
LLSSSSQSAK PGAQPRKSVQ PDGSAFPQDK PLGPLVRQAE EEMEDGGLFI PTEEQDSEES 

       250        260        270        280        290        300 
DKKKKTKKGT KRKRDGKGPE QGTMVYDPKL DDMLDRTLED GAKQHNLTAV NVRNILHEVI 

       310        320        330        340        350        360 
TNEHVVAMMK AAISETEDMP LFEPKMTRSK LKEVVEKGVV IPTWNISPIK KASEIKQPPQ 

       370        380        390        400        410        420 
FVDIHLEEDD SSDEEYSPDE EEEDETAEES LLESDVESTA SSPRGVKRSR LRLSSEVAET 

       430        440        450        460        470        480 
DEESGMLSEV EKAATPALRH ISAEVVPMGP PPPPKPKQSR DSVFMEKLDA VDEELASSPV 

       490        500        510        520        530        540 
CMDSFQPMED SLIAFRTRSK MPLKDVPLGQ LEAELQAPDI TPDMYDPNTA DDEDWKQWLG 

       550        560        570        580        590        600 
GLINDDVENE DEADDDDDPE YNFLEDLDEP DTEDFRTDRA VRITKKEVNG LMEELFETVQ 

       610        620        630        640        650        660 
SVVPSKFQDE MGFSNMEDDG PEEEERATES RPSFNTPQAL RFEEPLANLL NERHRTVKEL 

       670        680        690        700        710        720 
LEQLKMKKPS VRQQPEVEKL KPQEEAAHQT LVLDPAQRSR LQQQMQQHVQ LLTQIYLLTT 

       730        740        750        760        770        780 
SNPNLSSEAS TTRIFLKELG TFAENSIALH QQFNPRFQTL FQPCNWMGAM RLIEDFTQVS 

       790        800        810        820        830        840 
IDCSPHKTAK KTASEFPCLP KQVAWILATN KVFMYPELLP ICSLKANNPR DKTIFTKAED 

       850        860        870        880        890        900 
NLLALGLKHF EGTEFPKPLI SKYLVTCKTA HQLTVRIKNL NLNRAPNNVI KFYKKTKQLP 

       910        920        930        940        950        960 
VLVRCCEEIQ PHQWKPPFEK EEHRLPFWLK ASLQSIQDEL RNISEGATEG GSVTTATESS 

       970        980        990       1000       1010       1020 
TDQHLQKASP ALGDEPQYPL LLPKGVVLKL KPGSKRFSRK AWRQKRPLVQ KPLLIQPSPS 

      1030       1040       1050       1060       1070       1080 
VQPVFNPGKM ATWPTQSEVP PSNTVVQIPH LIQPAAVLQT LPGFPSVGVR GEDGFESPTA 

      1090       1100       1110       1120       1130       1140 
LPAMPCGSEA RTTFPLSETQ SAPPSCSAPK LMLPSLAPSK FRKPYVRRKP TRRKGAKVSP 

      1150       1160       1170       1180       1190       1200 
CVKPAPIIHP TPVIFTVPAT TVKVVSIGGG CNMIQPVSAA VAPSPQTIPI TTLLVNPTTF 

      1210       1220       1230       1240       1250       1260 
PCSLNQPLVA SSISPLIVSS NPLTLPVTSI PEDKAQVKLD VAEGKNAPQN PESKLKPQEL 

      1270       1280       1290       1300       1310       1320 
TPLCTTVFSK EEPKSWHSSA DTGSQEAFSE SSACSWAVVK TESQEGSSEK SACGWTVVKT 

      1330       1340       1350       1360       1370       1380 
EDGGHAVEPL PQNLQDSLSS SSKDLLNMVK MEAQDCMVEI SSNFPKQDIG EEVKEECSME 

      1390       1400       1410       1420       1430       1440 
LDSESPQEKP SSASEMSKQT VLQREDMQAA KSPSVPQDAA AEGRTSSHAS RGLPKSTLSS 

      1450       1460       1470       1480       1490       1500 
MGQGGGLSGP PGKLEDSANA DGQSVGTPAG PDTGGEKDGP EEEEEEDFDD LTQDEEDELS 

      1510       1520       1530       1540       1550       1560 
SASEESVLSV PELQETMEKL TWLASERRMS QEGESEEENS QEENSEPEEE EEEEAEGMET 

      1570       1580       1590       1600       1610       1620 
LQKEDEVNDE AVGDAAKKPP STLASPQTAP EIETSIAPAG ESIKAAGKGR SSHRARNKRG 

      1630       1640       1650       1660       1670       1680 
SRARASKDTS KLLLLYDEDI LDRDPLREQK DLAFAQAYLT RVREALQHTP GKYEDFLQII 

      1690       1700       1710       1720       1730       1740 
YEFESSTQMH SAVDLFKSLQ TLLQDWPQLL KDFAAFLLPE QALSCGLFEE QQAFEKSRKF 

      1750       1760       1770       1780       1790       1800 
LRQLEICFAE NPSHHQKIIK VLQGCADCLP QDIAELKTQM WQLLRGHDHL QDEFSIFFDH 

      1810       1820       1830       1840       1850       1860 
LRPAASRMGD FEEINWTEEK EYEFDGFEEV ILPDVEEDEE PAKVSTASKS KRRKEIGVQH 

      1870       1880       1890       1900       1910       1920 
QDKDTEWPEA AKDCSCSCHE GGPESKLKKS KRRNCHCSSK VCDSKPYKSK EPPELVGSGP 

      1930       1940       1950       1960       1970       1980 
LHEASTVPGS KEAGQGKDML EEEILEEQEN MEVTQSKTAR TTRKGEAPAP GSTIGSTLLC 

      1990       2000       2010       2020       2030       2040 
PAEVTPMELL LEGPALCSAE TPRLPPQTGA VVCSVRRNQA GPEVVSCLST SSLPPEEGED 

      2050       2060       2070       2080       2090       2100 
QRAAANSETI APHREASETE RLPETVEHSA PLPSPVSTRT RDTGRRHICG KAGSQSWLIE 

      2110       2120       2130       2140       2150       2160 
SRAEAEAAHV AAPICEKSSG ARASEAAPKT AREVLAEDSG TQGMGPEGAL PKASEATVCA 

      2170       2180       2190       2200       2210       2220 
NNSKVSSTGE KVVLWTREAD RVILTMCQEQ GAQPHTFSVI SQQLGNKTPV EVSHRFRELM 

      2230       2240       2250       2260 
QLFHTACEAS SEDEDDATST SNADQLSDHG DLLSEEELDE 
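
The sequence above is laid out in rows of 60 residues, split into groups of 10, with a ruler line giving the position of the last residue in each group. A minimal Python sketch of how such a block might be turned into a plain string and indexed by 1-based position; it uses only the first two rows of this entry, and `parse_block` and `subsequence` are illustrative names, not an existing library API:

```python
# First two rows (residues 1-120) of the sequence block in this entry.
BLOCK = """
        10         20         30         40         50         60
MLPCKKRRLS VTESSQQQDD QEGDDLDLEA AVKPDTDQLP DSASESLSWG QSQDSAVCPE

        70         80         90        100        110        120
GLSMQDGDDQ LRAEGLSLNS KMLAQHVNLA VLEAVDVAVS QEIPLPSLES SHSLPVHVDK
"""


def parse_block(text):
    """Join the residue groups, skipping blank lines and ruler lines."""
    groups = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(token.isdigit() for token in tokens):
            continue  # blank line or position ruler
        groups.extend(tokens)
    return "".join(groups)


def subsequence(sequence, start, end):
    """Residues start..end, both 1-based and inclusive."""
    return sequence[start - 1:end]


if __name__ == "__main__":
    seq = parse_block(BLOCK)
    print(len(seq), subsequence(seq, 61, 70))
```

Applied to the full block, the same parsing should give a string of 2,260 residues, matching the stated sequence length.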


References

[1] "Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009) [PubMed: 19468303]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[2] "The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005) [PubMed: 16141072]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-168.
Strain: C57BL/6J.
Tissue: Cerebellum.
[3] "Prediction of the coding sequences of mouse homologues of KIAA gene: II. The complete nucleotide sequences of 400 mouse KIAA-homologous cDNAs identified by screening of terminal sequences of cDNA clones randomly sampled from size-fractionated libraries."
Okazaki N., Kikuno R., Ohara R., Inamoto S., Aizawa H., Yuasa S., Nakajima D., Nagase T., Ohara O., Koga H.
DNA Res. 10:35-48(2003) [PubMed: 12693553]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 208-2260.
Tissue: Brain.
[4] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1531-2260.
Strain: FVB/N.
Tissue: Mammary gland.
[5] "Solution structure of mouse hypothetical gene (2610100B20Rik) product homologous to Myb DNA-binding domain."
RIKEN structural genomics initiative (RSGI)
Submitted (JUN-2004) to the PDB data bank
Cited for: STRUCTURE BY NMR OF 2148-2230.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
AC127377 Genomic DNA. No translation available.
AK005385 mRNA. Translation: BAB23992.1. Different termination.
AK122531 mRNA. Translation: BAC65813.1. Different initiation.
BC016616 mRNA. Translation: AAH16616.1. Different initiation.
IPI: IPI00854904.
UniGene:
Mm.126870.
Mm.482358.

3D structure databases

PDBe / RCSB PDB / PDBj:
Entry | Method | Resolution (Å) | Chain | Positions
1UG2 | NMR | - | A | 2148-2228
ProteinModelPortal: Q9DB00.
SMR: Q9DB00. Positions 2140-2229.
ModBase: Search...

Protein-protein interaction databases

STRING: Q9DB00.

PTM databases

PhosphoSite: Q9DB00.

Proteomic databases

PRIDE: Q9DB00.

Protocols and materials databases

StructuralBiologyKnowledgebase: Search...

Genome annotation databases

Ensembl: ENSMUST00000090942; ENSMUSP00000088461; ENSMUSG00000054199.
UCSC: uc008pwv.2. mouse.

Organism-specific databases

MGI: MGI:1917579. Gon4l.
Rouge: Search...

Phylogenomic databases

GeneTree: ENSGT00390000016256.
HOVERGEN: HBG081563.

Gene expression databases

ArrayExpress: Q9DB00.
Bgee: Q9DB00.
Genevestigator: Q9DB00.
GermOnline: ENSMUSG00000054199. Mus musculus.

Family and domain databases

InterPro:
IPR009057. Homeodomain-like.
IPR012287. Homeodomain-rel.
IPR017877. MYB-like.
IPR003822. PAH.
IPR001005. SANT_DNA-bd.
Gene3D:
G3DSA:1.10.10.60. Homeodomain-rel. 1 hit.
G3DSA:1.20.1160.11. PAH. 1 hit.
Pfam: PF02671. PAH. 1 hit.
SMART: SM00717. SANT. 1 hit.
SUPFAM:
SSF46689. Homeodomain_like. 1 hit.
SSF47762. PAH. 2 hits.
PROSITE:
PS50090. MYB_LIKE. 1 hit.
PS51477. PAH. 2 hits.
ProtoNet: Search...

Other

SOURCE: Search...

Entry information

Entry name: GON4L_MOUSE
Accession
Primary (citable) accession number: Q9DB00
Secondary accession number(s): E9Q5N3, Q80TB4, Q91YI9
Entry history
Integrated into UniProtKB/Swiss-Prot: December 20, 2005
Last sequence update: July 27, 2011
Last modified: November 16, 2011
This is version 75 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families