Protein
Submitted name:

Pulmonary surfactant-associated protein A2

Gene

SFTPA2

Organism
Homo sapiens (Human)
Status
Unreviewed - Annotation score: 1 out of 5 - Protein predicted

Names & Taxonomy

Protein names
Submitted name:
Pulmonary surfactant-associated protein A2 (Imported)
Gene names
Name: SFTPA2 (Imported)
Organism: Homo sapiens (Human) (Imported)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes
  • UP000005640 Component: Chromosome 10

Organism-specific databases

HGNC: HGNC:10799. SFTPA2.

PTM / Processing

Molecule processing

Feature key      Position(s)   Length   Description         Feature identifier
Signal peptide   1 – 19        19       Sequence analysis
Chain            20 – 98       79       Sequence analysis   PRO_5006757086

Proteomic databases

PaxDb: R4GMN3.
PeptideAtlas: R4GMN3.

Expression

Gene expression databases

ExpressionAtlas: R4GMN3. Baseline and differential.

Interaction

Protein-protein interaction databases

STRING: 9606.ENSP00000361400.

Family & Domains

Keywords - Domain

Signal (Sequence analysis)

Phylogenomic databases

eggNOG: KOG4297. Eukaryota.
        ENOG410XPJ1. LUCA.
GeneTree: ENSGT00700000104102.

Family and domain databases

InterPro: IPR008160. Collagen.
Pfam: PF01391. Collagen. 1 hit.

Sequence

Sequence status: Fragment.

R4GMN3-1

        10         20         30         40         50
MWLCPLALTL ILMAASGAAC EVKDVCVGSP GIPGTPGSHG LPGRDGRDGV
        60         70         80         90
KGDPGPPGPM GPPGETPCPP GNNGLPGAPG VPGERGEKGE AGERGPPG
Length: 98
Mass (Da): 9,411
Last modified: February 17, 2016 - v7
Checksum: 207A703FAADE3845
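
The positions in the feature table are 1-based and inclusive, so a feature's length is end - start + 1. A minimal Python sketch of that arithmetic, using the sequence and positions from this entry, follows. The helper name `feature` is hypothetical and is not part of any UniProt tool.

```python
# Sequence copied from this entry (R4GMN3-1), with the display spacing removed.
SEQUENCE = (
    "MWLCPLALTLILMAASGAACEVKDVCVGSPGIPGTPGSHGLPGRDGRDGV"
    "KGDPGPPGPMGPPGETPCPPGNNGLPGAPGVPGERGEKGEAGERGPPG"
)


def feature(seq, start, end):
    """Return the residues for a 1-based, inclusive position range."""
    return seq[start - 1:end]


# Positions taken from the "Molecule processing" table of this entry.
signal_peptide = feature(SEQUENCE, 1, 19)
chain = feature(SEQUENCE, 20, 98)

print(len(SEQUENCE), len(signal_peptide), len(chain))  # prints: 98 19 79
```

The signal peptide and chain together account for all 98 residues of the fragment.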

Experimental Info

Feature key            Position(s)   Length   Description
Non-terminal residue   98 – 98       1        Imported

Sequence databases

EMBL / GenBank / DDBJ: BX248123 Genomic DNA. No translation available.

Genome annotation databases

Ensembl: ENST00000492049; ENSP00000473275; ENSG00000185303.
UCSC: uc057uli.1. human.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ: BX248123 Genomic DNA. No translation available.

Protein-protein interaction databases

STRING: 9606.ENSP00000361400.

Proteomic databases

PaxDb: R4GMN3.
PeptideAtlas: R4GMN3.

Genome annotation databases

Ensembl: ENST00000492049; ENSP00000473275; ENSG00000185303.
UCSC: uc057uli.1. human.

Organism-specific databases

HGNC: HGNC:10799. SFTPA2.

Phylogenomic databases

eggNOG: KOG4297. Eukaryota.
        ENOG410XPJ1. LUCA.
GeneTree: ENSGT00700000104102.

Miscellaneous databases

ChiTaRS: SFTPA2. human.

Gene expression databases

ExpressionAtlas: R4GMN3. Baseline and differential.

Family and domain databases

InterPro: IPR008160. Collagen.
Pfam: PF01391. Collagen. 1 hit.

Publications

  1. "The DNA sequence and comparative analysis of human chromosome 10."
    Deloukas P., Earthrowl M.E., Grafham D.V., Rubenfield M., French L., Steward C.A., Sims S.K., Jones M.C., Searle S., Scott C., Howe K., Hunt S.E., Andrews T.D., Gilbert J.G.R., Swarbreck D., Ashurst J.L., Taylor A., Battles J., Bird C.P., Ainscough R., Almeida J.P., Ashwell R.I.S., Ambrose K.D., Babbage A.K., Bagguley C.L., Bailey J., Banerjee R., Bates K., Beasley H., Bray-Allen S., Brown A.J., Brown J.Y., Burford D.C., Burrill W., Burton J., Cahill P., Camire D., Carter N.P., Chapman J.C., Clark S.Y., Clarke G., Clee C.M., Clegg S., Corby N., Coulson A., Dhami P., Dutta I., Dunn M., Faulkner L., Frankish A., Frankland J.A., Garner P., Garnett J., Gribble S., Griffiths C., Grocock R., Gustafson E., Hammond S., Harley J.L., Hart E., Heath P.D., Ho T.P., Hopkins B., Horne J., Howden P.J., Huckle E., Hynds C., Johnson C., Johnson D., Kana A., Kay M., Kimberley A.M., Kershaw J.K., Kokkinaki M., Laird G.K., Lawlor S., Lee H.M., Leongamornlert D.A., Laird G., Lloyd C., Lloyd D.M., Loveland J., Lovell J., McLaren S., McLay K.E., McMurray A., Mashreghi-Mohammadi M., Matthews L., Milne S., Nickerson T., Nguyen M., Overton-Larty E., Palmer S.A., Pearce A.V., Peck A.I., Pelan S., Phillimore B., Porter K., Rice C.M., Rogosin A., Ross M.T., Sarafidou T., Sehra H.K., Shownkeen R., Skuce C.D., Smith M., Standring L., Sycamore N., Tester J., Thorpe A., Torcasso W., Tracey A., Tromans A., Tsolas J., Wall M., Walsh J., Wang H., Weinstock K., West A.P., Willey D.L., Whitehead S.L., Wilming L., Wray P.W., Young L., Chen Y., Lovering R.C., Moschonas N.K., Siebert R., Fechtel K., Bentley D., Durbin R., Hubbard T., Doucette-Stamm L., Beck S., Smith D.R., Rogers J.
    Nature 429:375-381(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  2. Ensembl
    Submitted (MAY-2013) to UniProtKB
    Cited for: IDENTIFICATION.

Entry information

Entry name: R4GMN3_HUMAN
Accession: Primary (citable) accession number: R4GMN3
Entry history
Integrated into UniProtKB/TrEMBL: June 26, 2013
Last sequence update: February 17, 2016
Last modified: July 6, 2016
This is version 17 of the entry and version 7 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Caution

The sequence shown here is derived from an Ensembl automatic analysis pipeline and should be considered as preliminary data. (Imported)

Keywords - Technical term

Complete proteome, Reference proteome (Imported)

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
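
The UniRef90 and UniRef50 criteria stated above can be written as a simple check, sketched here in Python. This only encodes the thresholds as described and does not reproduce the clustering procedure itself. The function `joins_cluster` and its arguments are hypothetical; `identity` and `overlap` are assumed to be fractions already computed against the cluster's seed sequence.

```python
# Minimum sequence identity per cluster level, as stated in the text above.
UNIREF_MIN_IDENTITY = {"UniRef90": 0.90, "UniRef50": 0.50}
# Minimum overlap with the seed (longest) sequence, as stated in the text above.
MIN_OVERLAP = 0.80


def joins_cluster(level, identity, overlap):
    """Return True if a sequence meets the stated thresholds for the given level."""
    return identity >= UNIREF_MIN_IDENTITY[level] and overlap >= MIN_OVERLAP


print(joins_cluster("UniRef90", 0.95, 0.85))  # identity and overlap both sufficient
print(joins_cluster("UniRef90", 0.85, 0.95))  # identity below the 90% threshold
```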