Protein
Submitted name:

Collagen alpha-3(IV) chain

Gene

Col4a3

Organism
Mus musculus (Mouse)
Status
Unreviewed - Annotation score: 2 out of 5 - Experimental evidence at protein level

Function

GO - Molecular function

Names & Taxonomy

Protein names
Submitted name:
Collagen alpha-3(IV) chain (Imported)
Gene names
Name: Col4a3 (Imported)
Organism: Mus musculus (Mouse) (Imported)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Myomorpha > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Chromosome 1

Organism-specific databases

MGI: MGI:104688. Col4a3.

Subcellular location

GO - Cellular component

Keywords - Cellular component

Basement membrane (PROSITE-ProRule annotation), Extracellular matrix, Secreted

PTM / Processing

Amino acid modifications

Feature key | Position(s) | Description
Disulfide bond | 68 ↔ 74 | PROSITE-ProRule annotation

Post-translational modification

Prolines at the third position of the tripeptide repeating unit (G-X-Y) are hydroxylated in some or all of the chains. (PROSITE-ProRule annotation)
Type IV collagens contain numerous cysteine residues which are involved in inter- and intramolecular disulfide bonding. 12 of these, located in the NC1 domain, are conserved in all known type IV collagens. (PROSITE-ProRule annotation)
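
As a quick consistency check on the disulfide bond annotated at positions 68 and 74, the following minimal Python sketch confirms that both positions hold a cysteine in the fragment and lists every cysteine position. The sequence is copied from the Sequence section of this entry; the helper names `residue_at` and `is_plausible_disulfide` are hypothetical, and positions are assumed to be 1-based and relative to the fragment, not to the full-length protein.

```python
# Fragment sequence as displayed in this entry; spaces are removed on load.
FRAGMENT = (
    "ATGTRMRGFI FTRHSQTTAI PSCPEGTQPL YSGFSLLFVQ GNKRAHGQDL "
    "GTLGSCLQRF TTMPFLFCNI NNVCNFASRN DYSYWLSTPA LMPMDMAPIS "
    "GRALEPYISR CTVCEGPAMA IAVHSQTTAI PPCPQDWVSL WKGFSFIMKT "
    "YSINCESWRL RENHKPLSGV HEEKTLTKSK KPEPFFFFFF FFLFLLK"
).replace(" ", "")


def residue_at(sequence: str, position: int) -> str:
    """Return the residue at a 1-based position (hypothetical helper)."""
    if not 1 <= position <= len(sequence):
        raise IndexError(f"position {position} is outside 1..{len(sequence)}")
    return sequence[position - 1]


def is_plausible_disulfide(sequence: str, first: int, second: int) -> bool:
    """A disulfide bond needs a cysteine at both ends; this checks only that."""
    return residue_at(sequence, first) == "C" and residue_at(sequence, second) == "C"


if __name__ == "__main__":
    print("68-74 both cysteine:", is_plausible_disulfide(FRAGMENT, 68, 74))
    cysteines = [i for i, aa in enumerate(FRAGMENT, start=1) if aa == "C"]
    print("cysteine positions:", cysteines)
```

The check is necessary but not sufficient: two cysteines at the annotated positions are compatible with a disulfide bond, but whether the bond forms is not something the sequence alone can show.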

Keywords - PTM

Disulfide bond (PROSITE-ProRule annotation)

Proteomic databases

MaxQB: F6RIS8.
PaxDb: F6RIS8.
PeptideAtlas: F6RIS8.

Expression

Gene expression databases

Bgee: ENSMUSG00000079465.
ExpressionAtlas: F6RIS8. baseline and differential.

Structure

3D structure databases

SMR: F6RIS8.
ModBase: Search...
MobiDB: Search...

Family & Domains

Domain

Alpha chains of type IV collagen have a non-collagenous domain (NC1) at their C-terminus, frequent interruptions of the G-X-Y repeats in the long central triple-helical domain (which may cause flexibility in the triple helix), and a short N-terminal triple-helical 7S domain. (PROSITE-ProRule annotation)
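
To make the G-X-Y repeat pattern concrete, here is a minimal Python sketch that locates uninterrupted runs of Gly-X-Y triplets in an amino acid string; the gaps between reported runs correspond to interruptions. The function name `find_gxy_runs`, the `min_repeats` parameter, and the demo peptide are all assumptions made for illustration and are not part of this entry. The regular expression treats any uppercase letter as a valid X or Y residue, which is a simplification.

```python
import re
from typing import List, Tuple


def find_gxy_runs(sequence: str, min_repeats: int = 3) -> List[Tuple[int, int, int]]:
    """Return (start, end, repeats) for each uninterrupted G-X-Y run.

    Positions are 1-based and inclusive. A run is a stretch of consecutive
    triplets that each begin with glycine (hypothetical helper, simplified).
    """
    pattern = re.compile(r"(?:G[A-Z]{2}){%d,}" % min_repeats)
    runs = []
    for match in pattern.finditer(sequence):
        repeats = (match.end() - match.start()) // 3
        runs.append((match.start() + 1, match.end(), repeats))
    return runs


if __name__ == "__main__":
    # Made-up demo peptide: two flanking alanines around three G-X-Y triplets.
    demo = "AAGPPGAPGKDAA"
    print(find_gxy_runs(demo))
```

The fragment shown in this entry is annotated with an NC1 domain hit, so long G-X-Y runs are not necessarily expected in it; the sketch is more informative on a full-length collagen chain.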

Sequence similarities

Belongs to the type IV collagen family. (PROSITE-ProRule annotation)

Keywords - Domain

Collagen (PROSITE-ProRule annotation)

Phylogenomic databases

eggNOG: KOG3544. Eukaryota.
ENOG410XNMM. LUCA.
GeneTree: ENSGT00840000129673.

Family and domain databases

Gene3D: 2.170.240.10. 1 hit.
InterPro:
IPR001442. Collagen_VI_NC.
IPR016187. CTDL_fold.
Pfam:
PF01413. C4. 2 hits.
SMART:
SM00111. C4. 1 hit.
SUPFAM: SSF56436. SSF56436. 2 hits.
PROSITE:
PS51403. NC1_IV. 1 hit.

Sequence

Sequence status: Fragment.

F6RIS8-1 [UniParc]

        10         20         30         40         50
ATGTRMRGFI FTRHSQTTAI PSCPEGTQPL YSGFSLLFVQ GNKRAHGQDL
        60         70         80         90        100
GTLGSCLQRF TTMPFLFCNI NNVCNFASRN DYSYWLSTPA LMPMDMAPIS
       110        120        130        140        150
GRALEPYISR CTVCEGPAMA IAVHSQTTAI PPCPQDWVSL WKGFSFIMKT
       160        170        180        190
YSINCESWRL RENHKPLSGV HEEKTLTKSK KPEPFFFFFF FFLFLLK
Length: 197
Mass (Da): 22,381
Last modified: July 27, 2011 - v1
Checksum: 0EEA01E6BA6E5AAC
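
The numbered display above mixes position rulers with residue blocks, so it has to be cleaned before use in other tools. The following minimal Python sketch strips the ruler lines and spaces and confirms that the result matches the stated length of 197. The function name `parse_sequence_block` is hypothetical, and the rule "a line made only of digits is a ruler" is an assumption about this display format.

```python
RAW_BLOCK = """
        10         20         30         40         50
ATGTRMRGFI FTRHSQTTAI PSCPEGTQPL YSGFSLLFVQ GNKRAHGQDL
        60         70         80         90        100
GTLGSCLQRF TTMPFLFCNI NNVCNFASRN DYSYWLSTPA LMPMDMAPIS
       110        120        130        140        150
GRALEPYISR CTVCEGPAMA IAVHSQTTAI PPCPQDWVSL WKGFSFIMKT
       160        170        180        190
YSINCESWRL RENHKPLSGV HEEKTLTKSK KPEPFFFFFF FFLFLLK
"""


def parse_sequence_block(block: str) -> str:
    """Join residue blocks into one string, skipping ruler and blank lines."""
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Skip empty lines and ruler lines, which contain only numbers.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


if __name__ == "__main__":
    sequence = parse_sequence_block(RAW_BLOCK)
    print(len(sequence))
    print(sequence[:10], "...", sequence[-7:])
```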

Experimental Info

Feature key | Position(s) | Description | Length
Non-terminal residue | 1 | Imported | 1

Sequence databases

EMBL / GenBank / DDBJ:
AC123746 Genomic DNA. No translation available.
AC138214 Genomic DNA. No translation available.

Genome annotation databases

Ensembl: ENSMUST00000152664; ENSMUSP00000119094; ENSMUSG00000079465.

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
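
The three membership criteria described above can be written down as simple predicates. This is a minimal Python sketch of the stated thresholds only, not of how UniRef clusters are actually computed; the names `Threshold`, `may_join_cluster`, and `is_uniref100_member` are hypothetical, and identity and overlap are assumed to be precomputed fractions between 0 and 1 relative to the seed sequence.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Threshold:
    min_identity: float  # fraction of identical residues versus the seed
    min_overlap: float   # fraction of the seed covered by the alignment


# Values taken from the UniRef90 and UniRef50 descriptions in this entry.
THRESHOLDS = {
    "UniRef90": Threshold(min_identity=0.90, min_overlap=0.80),
    "UniRef50": Threshold(min_identity=0.50, min_overlap=0.80),
}


def may_join_cluster(level: str, identity: float, overlap: float) -> bool:
    """Check precomputed identity and overlap against the stated cutoffs."""
    threshold = THRESHOLDS[level]
    return identity >= threshold.min_identity and overlap >= threshold.min_overlap


def is_uniref100_member(candidate: str, representative: str) -> bool:
    """Identical sequence, or an exact sub-fragment of 11 or more residues."""
    if candidate == representative:
        return True
    return len(candidate) >= 11 and candidate in representative


if __name__ == "__main__":
    print(may_join_cluster("UniRef90", identity=0.93, overlap=0.85))
    print(may_join_cluster("UniRef50", identity=0.45, overlap=0.95))
    print(is_uniref100_member("RMRGFIFTRHSQ", "ATGTRMRGFIFTRHSQTTAI"))
```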

Entry information

Entry name: F6RIS8_MOUSE
Accession: Primary (citable) accession number: F6RIS8
Entry history: Integrated into UniProtKB/TrEMBL: July 27, 2011
Last sequence update: July 27, 2011
Last modified: July 5, 2017
This is version 47 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)

Miscellaneous

Caution

Lacks conserved residue(s) required for the propagation of feature annotation. (PROSITE-ProRule annotation)
The sequence shown here is derived from an Ensembl automatic analysis pipeline and should be considered as preliminary data. (Imported)

Keywords - Technical term

Complete proteome, Proteomics identification (Combined sources), Reference proteome (Imported)