Q0VF58 (COJA1_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 67.

Names and origin

Protein names
Recommended name: Collagen alpha-1(XIX) chain
Alternative name(s): Collagen alpha-1(Y) chain
Gene names
Name: Col19a1
Organism: Mus musculus (Mouse) [Reference proteome]
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus

Protein attributes

Sequence length: 1136 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at transcript level

General annotation (Comments)

Function

May act as a cross-bridge between fibrils and other extracellular matrix molecules. Involved in skeletal myogenesis in the developing esophagus. May play a role in the organization of the pericellular matrix or the sphincteric smooth muscle. Ref.5

Subunit structure

Oligomer; disulfide-linked (by similarity to UniProtKB Q14993).

Subcellular location

Secreted, extracellular space, extracellular matrix (by similarity).

Developmental stage

Expressed in the myotome of somites from E9.5. In muscular tissues, expression is transient and is confined to a few sites of the developing embryo, such as limbs, tongue, and smooth muscle layers of stomach and esophagus. Also detected in skin at E16.5 and in cerebral cortex and hippocampus of the newborn brain. In adult, expression is only observed in cerebrum, cerebellum, eyes, and testis. In CNS, expression gradually increases following birth. Also expressed in embryonic fibroblasts and to a lesser extent in adult fibroblasts. Ref.1 Ref.4

Domain

The numerous interruptions in the triple helix may make this molecule either elastic or flexible (by similarity to UniProtKB Q14993).

Post-translational modification

Prolines at the third position of the tripeptide repeating unit (G-X-Y) are hydroxylated in some or all of the chains.
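
As an illustration of the G-X-Y repeating unit, the following minimal Python sketch splits residues 301-360 of the sequence given in this entry into triplets and lists the prolines that fall at the third (Y) position. The function names and the choice of segment are assumptions made for this example only. The result lists candidate positions; the entry does not state which prolines are actually hydroxylated.

```python
# Residues 301-360 of Q0VF58, copied from the sequence block of this entry.
# This stretch overlaps the Collagen-like 1 domain (292-346).
SEGMENT = (
    "GHKGDKGEPG" "EHGLDGTPGL" "PGQKGEQGLE"
    "GIKGEIGEKG" "EPGAKGDSGL" "DGLNGQDGLK"
)
SEGMENT_START = 301  # 1-based position of the first residue in SEGMENT


def gxy_triplets(segment, start_pos):
    """Split a segment into consecutive triplets.

    Returns a list of (position_of_first_residue, triplet) pairs,
    using 1-based protein coordinates.
    """
    return [
        (start_pos + i, segment[i:i + 3])
        for i in range(0, len(segment) - 2, 3)
    ]


def y_position_prolines(segment, start_pos):
    """Return positions of prolines at the third position of G-X-Y triplets."""
    return [
        pos + 2
        for pos, triplet in gxy_triplets(segment, start_pos)
        if triplet[0] == "G" and triplet[2] == "P"
    ]


if __name__ == "__main__":
    for pos, triplet in gxy_triplets(SEGMENT, SEGMENT_START):
        print(pos, triplet)
    print("Y-position prolines:", y_position_prolines(SEGMENT, SEGMENT_START))
```

The sketch assumes the segment starts in frame, that is, with the glycine of a triplet. Because the triple helix of this protein is interrupted in many places, the frame has to be re-established after each interruption rather than carried across the whole sequence.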

Disruption phenotype

Mice show severe signs of malnourishment and the majority die within the first three weeks of postnatal life. Newborn homozygotes do not show gross anatomical abnormalities, except for smaller size of the internal organs. However, necropsy of the mice that survive past the weaning stage reveals a dilated esophagus (megaesophagus) with retention of ingesta immediately above the diaphragm level. Mutant mice also exhibit an additional defect, namely impaired smooth-to-skeletal muscle cell transdifferentiation in the abdominal segment of the esophagus. Heterozygotes by comparison are morphologically normal, viable and fertile. Ref.5

Sequence similarities

Belongs to the fibril-associated collagens with interrupted helices (FACIT) family.

Contains 9 collagen-like domains.

Contains 1 laminin G-like domain.

Sequence annotation (Features)

Feature key        Position(s)   Length  Description                         Feature identifier

Molecule processing

Signal peptide     1 – 23        23      Potential
Chain              24 – 1136     1113    Collagen alpha-1(XIX) chain         PRO_0000284731

Regions

Domain             47 – 231      185     Laminin G-like
Domain             292 – 346     55      Collagen-like 1
Domain             347 – 388     42      Collagen-like 2
Domain             389 – 430     42      Collagen-like 3
Domain             519 – 577     59      Collagen-like 4
Domain             578 – 618     41      Collagen-like 5
Domain             620 – 673     54      Collagen-like 6
Domain             722 – 777     56      Collagen-like 7
Domain             778 – 810     33      Collagen-like 8
Domain             833 – 891     59      Collagen-like 9
Region             289 – 348     60      Triple-helical region 1 (COL1)
Region             367 – 426     60      Triple-helical region 2 (COL2)
Region             442 – 682     241     Triple-helical region 3 (COL3)
Region             694 – 812     119     Triple-helical region 4 (COL4)
Region             827 – 1006    180     Triple-helical region 5 (COL5)
Region             1048 – 1105   58      Triple-helical region 6 (COL6)
Motif              946 – 948     3       Cell attachment site (Potential)

Amino acid modifications

Glycosylation      47            1       N-linked (GlcNAc...) (Potential)

Experimental info

Sequence conflict  110           1       K → Q in BAA23578. Ref.1
Sequence conflict  315           1       D → N in BAA23578. Ref.1
Sequence conflict  315           1       D → N in AAI18971. Ref.3
Sequence conflict  528           1       G → K in BAA23578. Ref.1
Sequence conflict  597           1       G → E in BAA23578. Ref.1
Sequence conflict  717           1       D → E in BAA23578. Ref.1
Sequence conflict  717           1       D → E in AAI18971. Ref.3
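
The Length column of the feature table follows from the inclusive 1-based positions: length = end - start + 1. The short Python sketch below checks this for a handful of rows taken from the table. The tuple layout and the function names are assumptions chosen for the example and are not part of any UniProt tooling.

```python
# A few rows from the feature table of this entry:
# (description, start, end, length as listed in the table).
FEATURES = [
    ("Signal peptide", 1, 23, 23),
    ("Chain", 24, 1136, 1113),
    ("Laminin G-like", 47, 231, 185),
    ("Collagen-like 1", 292, 346, 55),
    ("Triple-helical region 3 (COL3)", 442, 682, 241),
    ("Cell attachment site", 946, 948, 3),
]


def feature_length(start, end):
    """Length of a feature given inclusive 1-based start and end positions."""
    return end - start + 1


def length_mismatches(features):
    """Return the descriptions of rows whose listed length disagrees."""
    return [
        description
        for description, start, end, listed in features
        if feature_length(start, end) != listed
    ]


if __name__ == "__main__":
    for description, start, end, listed in FEATURES:
        print(f"{description}: {start}-{end} -> {feature_length(start, end)}")
    print("Mismatches:", length_mismatches(FEATURES))
```

A check of this kind is a convenient way to confirm that positions and lengths were separated correctly when the table is recovered from flattened text.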

Sequences

Q0VF58 [UniParc]. Length: 1,136 AA. Mass: 114,197 Da.

Last modified April 17, 2007. Version 2.
Checksum: C480216027D70B43

        10         20         30         40         50         60 
MRHTGSWKLW TWVTTFLLPA CTCLTVRDKP ETTCPTLRTE RYQDDRNKSE LSGFDLGESF 

        70         80         90        100        110        120 
ALRHAFCEGD KTCFKLGSVL LIRDTVKIFP KGLPEEYAIA VMFRVRRSTK KERWFLWKIL 

       130        140        150        160        170        180 
NQQNMAQISV VIDGTKKVVE FMFRGAEGDL LNYVFKNREL RPLFDRQWHK LGIGVQSRVL 

       190        200        210        220        230        240 
SLYMDCNLIA SRHTEEKNSV DFQGRTIIAA RASDGKPVDI ELHQLRIYCN ANFLAEESCC 

       250        260        270        280        290        300 
NLSPTKCPEQ DDFGSTTSSW GTSNTGKMSS YLPGKQELKD TCQCIPNKEE AGLPGTLRSI 

       310        320        330        340        350        360 
GHKGDKGEPG EHGLDGTPGL PGQKGEQGLE GIKGEIGEKG EPGAKGDSGL DGLNGQDGLK 

       370        380        390        400        410        420 
GDSGPQGPPG PKGDKGDMGP PGPPALTGSI GIQGPQGPPG KEGQRGRRGK TGPPGNPGPP 

       430        440        450        460        470        480 
GPPGPPGLQG LQQPFGGYFN KGTGEHGASG PKGEKGDTGL PGFPGSVGPK GHKGEPGEPL 

       490        500        510        520        530        540 
TKGEKGDRGE PGLLGPQGIK GEPGDPGPPG LLGSPGLKGQ QGPAGSMGPR GPPGDVGLPG 

       550        560        570        580        590        600 
EHGIPGKQGV KGEKGDPGGR LGPPGLPGLK GDAGPPGISL PGKPGLDGNP GSPGPRGPKG 

       610        620        630        640        650        660 
ERGLPGLHGS PGDTGPPGVG IPGRTGSQGP AGEPGIQGPR GLPGLPGTPG MPGNDGAPGK 

       670        680        690        700        710        720 
DGKPGLPGPP GDPIALPLLG DIGALLKNFC GNCQANVPGL KSIKGDDGST GEPGKYDPAA 

       730        740        750        760        770        780 
RKGDVGPRGP PGFPGREGPK GSKGERGYPG IHGEKGDEGL QGIPGLSGAP GPTGPPGLTG 

       790        800        810        820        830        840 
RTGHPGPTGA KGDKGSEGPP GKPGPPGPPG VPLNEGNGMS SLYKIQGGVN VPGYPGPPGP 

       850        860        870        880        890        900 
PGPKGDPGPV GEPGAMGLPG LEGFPGVKGD RGPAGPPGIA GISGKPGAPG PPGVPGEQGE 

       910        920        930        940        950        960 
RGPIGDTGFP GPEGPSGKPG INGKDGLPGA QGIMGKPGDR GPKGERGDQG IPGDRGPQGE 

       970        980        990       1000       1010       1020 
RGKPGLTGMK GAIGPVGPAG SKGSTGPPGH QGPPGNPGIP GTPADAVSFE EIKHYINQEV 

      1030       1040       1050       1060       1070       1080 
LRIFEERMAV FLSQLKLPAA MLSAQAHGRP GPPGKDGLPG PPGDPGPQGY RGQKGERGEP 

      1090       1100       1110       1120       1130 
GIGLPGSPGL PGSSAVGLPG SPGAPGPQGP PGPSGRCNPE DCLYPAPPPH QQAGGK 
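
The numbered, space-grouped layout above can be turned back into a plain residue string and indexed by the 1-based positions used in the feature table. The following Python sketch does this for the first two rows only (residues 1-120) and looks up two annotated positions: the potential N-linked glycosylation site at 47 and the conflict position 110. The function names are assumptions for this example. The N-X-S/T pattern (X not proline) used in `has_sequon` is the general consensus for N-linked glycosylation and is not stated in this entry.

```python
# First two rows of the sequence block of this entry (residues 1-120).
BLOCK = """\
        10         20         30         40         50         60
MRHTGSWKLW TWVTTFLLPA CTCLTVRDKP ETTCPTLRTE RYQDDRNKSE LSGFDLGESF

        70         80         90        100        110        120
ALRHAFCEGD KTCFKLGSVL LIRDTVKIFP KGLPEEYAIA VMFRVRRSTK KERWFLWKIL
"""


def parse_block(text):
    """Join the residue rows of a numbered sequence block into one string.

    Blank lines and ruler lines (only numbers) are skipped; the spaces
    between groups of ten residues are removed.
    """
    rows = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        rows.append("".join(tokens))
    return "".join(rows)


def residue_at(sequence, position):
    """Return the residue at a 1-based position."""
    return sequence[position - 1]


def has_sequon(sequence, position):
    """True if an N-X-S/T motif (X not proline) starts at the 1-based position."""
    motif = sequence[position - 1:position + 2]
    return (
        len(motif) == 3
        and motif[0] == "N"
        and motif[1] != "P"
        and motif[2] in "ST"
    )


if __name__ == "__main__":
    seq = parse_block(BLOCK)
    print(len(seq), residue_at(seq, 47), has_sequon(seq, 47), residue_at(seq, 110))
```

Applied to the full block, the same parser should yield a string whose length matches the 1,136 residues stated for the entry.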

References

[1] "Ubiquitous expression of the alpha1(XIX) collagen gene (Col19a1) during mouse embryogenesis becomes restricted to a few tissues in the adult organism."
Sumiyoshi H., Inoguchi K., Khaleduzzaman M., Ninomiya Y., Yoshioka H.
J. Biol. Chem. 272:17104-17111(1997)
Cited for: NUCLEOTIDE SEQUENCE [MRNA], DEVELOPMENTAL STAGE.
Strain: BALB/c.
[2] "Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[3] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
[4] "Embryonic expression of type XIX collagen is transient and confined to muscle cells."
Sumiyoshi H., Laub F., Yoshioka H., Ramirez F.
Dev. Dyn. 220:155-162(2001)
Cited for: DEVELOPMENTAL STAGE.
[5] "Esophageal muscle physiology and morphogenesis require assembly of a collagen XIX-rich basement membrane zone."
Sumiyoshi H., Mor N., Lee S.Y., Doty S., Henderson S., Tanaka S., Yoshioka H., Rattan S., Ramirez F.
J. Cell Biol. 166:591-600(2004)
Cited for: FUNCTION, DISRUPTION PHENOTYPE.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
AB000636 mRNA. Translation: BAA23578.1.
AC116998 Genomic DNA. No translation available.
AC130201 Genomic DNA. No translation available.
AC161879 Genomic DNA. No translation available.
BC118970 mRNA. Translation: AAI18971.1.
RefSeq: NP_031759.2. NM_007733.2.
XP_006495708.1. XM_006495645.1.
UniGene: Mm.329196.

3D structure databases

ProteinModelPortal: Q0VF58.
SMR: Q0VF58. Positions 48-242.

PTM databases

PhosphoSite: Q0VF58.

Proteomic databases

PaxDb: Q0VF58.
PRIDE: Q0VF58.

Genome annotation databases

Ensembl: ENSMUST00000115244; ENSMUSP00000110899; ENSMUSG00000026141.
GeneID: 12823.
KEGG: mmu:12823.
UCSC: uc007amq.1. mouse.

Organism-specific databases

CTD: 1310.
MGI: MGI:1095415. Col19a1.

Phylogenomic databases

eggNOG: NOG275976.
GeneTree: ENSGT00750000117628.
HOGENOM: HOG000085653.
HOVERGEN: HBG060240.
InParanoid: Q0VF58.
OMA: ERWFLWQ.
OrthoDB: EOG7353W7.
PhylomeDB: Q0VF58.
TreeFam: TF351778.

Gene expression databases

ArrayExpress: Q0VF58.
Bgee: Q0VF58.
CleanEx: MM_COL19A1.
Genevestigator: Q0VF58.

Family and domain databases

InterPro: IPR008160. Collagen.
IPR008985. ConA-like_lec_gl_sf.
IPR001791. Laminin_G.
Pfam: PF01391. Collagen. 9 hits.
SMART: SM00210. TSPN. 1 hit.
SUPFAM: SSF49899. SSF49899. 1 hit.

Other

NextBio: 282302.
PRO: Q0VF58.

Entry information

Entry name: COJA1_MOUSE
Accession: Primary (citable) accession number: Q0VF58
Secondary accession number(s): O35053
Entry history
Integrated into UniProtKB/Swiss-Prot: April 17, 2007
Last sequence update: April 17, 2007
Last modified: April 16, 2014
This is version 67 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot