Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein
Submitted name:

Pulmonary surfactant-associated protein A

Gene

SFTPA1

Organism
Equus caballus (Horse)
Status
Unreviewed-Annotation score: Annotation score: 1 out of 5-Protein predictedi

Names & Taxonomyi

Protein namesi
Submitted name:
Pulmonary surfactant-associated protein AImported
Gene namesi
Name:SFTPA1Imported
OrganismiEquus caballus (Horse)Imported
Taxonomic identifieri9796 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaLaurasiatheriaPerissodactylaEquidaeEquus
ProteomesiUP000002281 Componenti: Chromosome 1

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Signal peptidei1 – 2020Sequence analysisAdd
BLAST
Chaini21 – 248228Sequence analysisPRO_5003353486Add
BLAST

Keywords - PTMi

Disulfide bondSAAS annotation

Proteomic databases

PaxDbiF7AUX9.

Interactioni

Protein-protein interaction databases

STRINGi9796.ENSECAP00000016397.

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini152 – 24796C-type lectinInterPro annotationAdd
BLAST

Sequence similaritiesi

Contains C-type lectin domain.SAAS annotation

Keywords - Domaini

SignalSequence analysis

Phylogenomic databases

eggNOGiENOG410XPJ1. LUCA.
KOG4297. Eukaryota.
GeneTreeiENSGT00700000104102.
OMAiECELKEV.
OrthoDBiEOG7HXCVB.
TreeFamiTF330481.

Family and domain databases

Gene3Di3.10.100.10. 1 hit.
InterProiIPR001304. C-type_lectin.
IPR016186. C-type_lectin-like.
IPR018378. C-type_lectin_CS.
IPR016187. C-type_lectin_fold.
[Graphical view]
PfamiPF00059. Lectin_C. 1 hit.
[Graphical view]
SMARTiSM00034. CLECT. 1 hit.
[Graphical view]
SUPFAMiSSF56436. SSF56436. 1 hit.
PROSITEiPS00615. C_TYPE_LECTIN_1. 1 hit.
PS50041. C_TYPE_LECTIN_2. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

F7AUX9-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MLLCSLTLTL ILLAVSGTKC DVKEFCAACS GVPGIPGSPG LPGRDGRDGV
60 70 80 90 100
KGDPGPPGPI GPPGGMPGSP GHDGLIGPPG PPGERGDKGE PGERGPPGPP
110 120 130 140 150
AYPDEELQTT LHDIRHQILQ LMGALSLQGS VLAVGEKVFS TNGQVVDFDA
160 170 180 190 200
IRESCARAGG RIAVPKSLEE NAAIASLVTK HNTYAYLGLE EGPTAGDFYY
210 220 230 240
LDGAPVNYTN WYPGEPRGRG KEKCVEMYTD GQWNDRSCLQ YRLAICEF
Length:248
Mass (Da):26,015
Last modified:July 27, 2011 - v1
Checksum:iB71133FB05C2A5D1
GO

Genome annotation databases

EnsembliENSECAT00000019996; ENSECAP00000016397; ENSECAG00000018767.

Cross-referencesi

3D structure databases

ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi9796.ENSECAP00000016397.

Proteomic databases

PaxDbiF7AUX9.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSECAT00000019996; ENSECAP00000016397; ENSECAG00000018767.

Phylogenomic databases

eggNOGiENOG410XPJ1. LUCA.
KOG4297. Eukaryota.
GeneTreeiENSGT00700000104102.
OMAiECELKEV.
OrthoDBiEOG7HXCVB.
TreeFamiTF330481.

Family and domain databases

Gene3Di3.10.100.10. 1 hit.
InterProiIPR001304. C-type_lectin.
IPR016186. C-type_lectin-like.
IPR018378. C-type_lectin_CS.
IPR016187. C-type_lectin_fold.
[Graphical view]
PfamiPF00059. Lectin_C. 1 hit.
[Graphical view]
SMARTiSM00034. CLECT. 1 hit.
[Graphical view]
SUPFAMiSSF56436. SSF56436. 1 hit.
PROSITEiPS00615. C_TYPE_LECTIN_1. 1 hit.
PS50041. C_TYPE_LECTIN_2. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

« Hide 'large scale' publications
  1. "Genome sequence, comparative analysis, and population genetics of the domestic horse."
    Broad Institute Genome Sequencing Platform, Broad Institute Whole Genome Assembly Team
    Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.
    , Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.
    Science 326:865-867(2009) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: ThoroughbredImported.
  2. Ensembl
    Submitted (JUL-2011) to UniProtKB
    Cited for: IDENTIFICATION.
    Strain: ThoroughbredImported.

Entry informationi

Entry nameiF7AUX9_HORSE
AccessioniPrimary (citable) accession number: F7AUX9
Entry historyi
Integrated into UniProtKB/TrEMBL: July 27, 2011
Last sequence update: July 27, 2011
Last modified: December 9, 2015
This is version 24 of the entry and version 1 of the sequence. [Complete history]
Entry statusiUnreviewed (UniProtKB/TrEMBL)

Miscellaneousi

Caution

The sequence shown here is derived from an Ensembl automatic analysis pipeline and should be considered as preliminary data.Imported

Keywords - Technical termi

Complete proteome, Reference proteomeImported

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.