Protein

Spermatogenesis-associated protein 31E1

Gene

SPATA31E1

Organism
Homo sapiens (Human)
Status
Reviewed. Annotation score: 4 out of 5. Experimental evidence at transcript level.

Function

May play a role in spermatogenesis (by similarity).

Keywords - Biological process

Differentiation, Spermatogenesis

Names & Taxonomy

Protein names
Recommended name:
Spermatogenesis-associated protein 31E1
Alternative name(s):
Protein FAM75E1
Gene names
Name: SPATA31E1
Synonyms: C9orf79, FAM75E1
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes
  • UP000005640 Component: Chromosome 9

Organism-specific databases

HGNC: HGNC:26672. SPATA31E1.

Subcellular location

Topology

Feature key      Position(s)   Description                    Length
Transmembrane    64 – 84       Helical (sequence analysis)    21
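
Feature coordinates in this entry are 1-based and inclusive, so a feature's length is end minus start plus one. The following is a minimal sketch of pulling the annotated transmembrane segment out of the sequence; the helper name `feature_segment` and the constant `N_TERMINAL_100` are illustrative only, and the residues are the first 100 of Q6ZUB1-1 as listed in the Sequence section.

```python
# First 100 residues of Q6ZUB1-1, copied from the Sequence section of this entry.
N_TERMINAL_100 = (
    "MGNLVIPLGKGRAGRVESGQRIPPPAPRPSVECTGDDIALQMEKMLFPLK"
    "SPSATWLSPSSTPWMMDFILTSVCGLVLLFLLLLYVHSDPPSPPPGRKRS"
)


def feature_segment(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive), as in the feature tables."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("feature coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


tm_helix = feature_segment(N_TERMINAL_100, 64, 84)
print(tm_helix, len(tm_helix))
```

The segment length should agree with the Length column of the table (84 - 64 + 1 = 21).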

Keywords - Cellular component

Membrane

Pathology & Biotech

Organism-specific databases

OpenTargets: ENSG00000177992.
PharmGKB: PA134886884.

Polymorphism and mutation databases

BioMuta: SPATA31E1.
DMDM: 71152415.

PTM / Processing

Molecule processing

Feature key   Identifier       Position(s)   Description                                Length
Chain         PRO_0000089716   1 – 1445      Spermatogenesis-associated protein 31E1    1445

Amino acid modifications

Feature key      Position(s)   Description                                 Length
Glycosylation    408           N-linked (GlcNAc...) (sequence analysis)    1
Glycosylation    819           N-linked (GlcNAc...) (sequence analysis)    1
Glycosylation    906           N-linked (GlcNAc...) (sequence analysis)    1
Glycosylation    1160          N-linked (GlcNAc...) (sequence analysis)    1
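
The four sites above are predictions from sequence analysis. The entry does not say which method was used; the sketch below assumes the commonly used N-X-[S/T] sequon rule (X being any residue except proline) and checks it against short fragments copied from the Sequence section. The function name `sequon_positions` and the fragment table are illustrative.

```python
import re
from typing import List

# N-X-[S/T] sequon with X != Pro. The lookahead reports overlapping motifs too.
SEQUON = re.compile(r"(?=N[^P][ST])")


def sequon_positions(fragment: str, first_position: int = 1) -> List[int]:
    """Return 1-based positions of Asn residues that start a sequon."""
    return [first_position + m.start() for m in SEQUON.finditer(fragment)]


# Fragments copied from the Sequence section, keyed by their first residue number.
FRAGMENTS = {
    401: "TTLNPFWNVS",             # residues 401-410
    811: "WRGGKAHVNTS",            # residues 811-821
    901: "NGPGDNRTTS",             # residues 901-910
    1151: "STSQSVSGKNMTASQGPCAL",  # residues 1151-1170
}

for first, fragment in FRAGMENTS.items():
    print(first, sequon_positions(fragment, first))
```

Under this assumed rule, N404 (followed by proline) and N901 (no serine or threonine two residues later) are not reported, while the four annotated positions are. A sequon match is only a candidate site, and a scan of the full sequence may find matches that are not annotated here.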

Keywords - PTM

Glycoprotein

Proteomic databases

EPD: Q6ZUB1.
PaxDb: Q6ZUB1.
PeptideAtlas: Q6ZUB1.
PRIDE: Q6ZUB1.

PTM databases

iPTMnet: Q6ZUB1.
PhosphoSitePlus: Q6ZUB1.

Expression

Gene expression databases

Bgee: ENSG00000177992.
CleanEx: HS_C9orf79.
Genevisible: Q6ZUB1. HS.

Organism-specific databases

HPA: HPA024253.

Interaction

Protein-protein interaction databases

BioGrid: 130337. 1 interactor.
IntAct: Q6ZUB1. 2 interactors.
MINT: MINT-7969886.

Structure

3D structure databases

ProteinModelPortal: Q6ZUB1.

Family & Domains

Compositional bias

Feature key          Position(s)   Description   Length
Compositional bias   168 – 289     Pro-rich      122

Sequence similarities

Belongs to the SPATA31 family (curated).

Keywords - Domain

Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOG: ENOG410IF6X. Eukaryota.
ENOG4111EH8. LUCA.
GeneTree: ENSGT00530000063191.
HOGENOM: HOG000203946.
InParanoid: Q6ZUB1.
OMA: EIKEGWI.
OrthoDB: EOG091G03NN.
PhylomeDB: Q6ZUB1.
TreeFam: TF338531.

Family and domain databases

InterPro: IPR027970. DUF4599.
Pfam: PF15371. DUF4599. 1 hit.

Sequence

Sequence status: Complete.

Q6ZUB1-1 [UniParc]

        10         20         30         40         50
MGNLVIPLGK GRAGRVESGQ RIPPPAPRPS VECTGDDIAL QMEKMLFPLK
60 70 80 90 100
SPSATWLSPS STPWMMDFIL TSVCGLVLLF LLLLYVHSDP PSPPPGRKRS
110 120 130 140 150
SREPQRERSG RSRSRKISAL KACRILLREL EETRDLNYLL ESHLRKLAGE
160 170 180 190 200
GSSHLPLGGD PLGDVCKPVP AKAHQPHGKC MQDPSPASLS PPAPPAPLAS
210 220 230 240 250
TLSPGPMTFS EPFGPHSTLS ASGPPEPLLP LKCPATQPHV VFPPSPQPHG
260 270 280 290 300
PLASSPPPPD SSLAGLQCGS TTCPVPQSSP LHNQVLPPPT RVISGLGCSS
310 320 330 340 350
DPIWDLYCWR EAATTWGLST YSHGKSQPRH LPDHTSEASF WGDPTPKHME
360 370 380 390 400
VGGCTFIHPD VQKLLETLIA KRALMKMWQE KERKRADHPH MTSLGKEWDI
410 420 430 440 450
TTLNPFWNVS TQPQQLPRPQ QVSDATTVGN HLQQKRSQLF WDLPSLNSES
460 470 480 490 500
LATTVWVSRN PSSQNAHSVP LDKASTSLPG EPEVEASSQL SQAPPQPHHM
510 520 530 540 550
AQPQHFTPAW PQSQPPPLAE IQTQAHLSPP VPSLGCSSPP QIRGCGASYP
560 570 580 590 600
TSQERTQSVI PTGKEYLEWP LKKRPKWKRV LPSLLKKSQA VLSQPTAHLP
610 620 630 640 650
QERPASWSPK SAPILPGVVT SPELPEHWWQ GRNAIHQEQS CGPPSRLQAS
660 670 680 690 700
GDLLQPDGEF PGRPQSQAED TQQALLPSQP SDFAGKGRKD VQKTGFRSSG
710 720 730 740 750
RFSDKGCLGS KLGPDPSRDQ GSGRTSVKAL DEDKEAEGDL RRSWKYQSVS
760 770 780 790 800
STPRDPDKEH LENKLQIHLA RKVGEIKEGW IPMPVRRSWL MAKCAVPKSD
810 820 830 840 850
THRKPGKLAS WRGGKAHVNT SQELSFLHPC TQQILEVHLV RFCVRHSWGT
860 870 880 890 900
DLQSLEPINV WSGEAQAPPF PQSTFTPWAS WVSRVESVPK VPIFLGKRPQ
910 920 930 940 950
NGPGDNRTTS KSVPTVSGPL AAPPPEQEGV QRPPRGSQSA DTHGRSEAFP
960 970 980 990 1000
TGHKGRGCSQ PPTCSLVGRT WQSRTVLESG KPKPRLEGSM GSEMAGNEAW
1010 1020 1030 1040 1050
LESESMSPGD PCSSRALQVL SIGSQWARAE DALQALKVGE KPPTWEVTLG
1060 1070 1080 1090 1100
ASVRASSGSV QEDLRSTGAL GTTGNPSASS VCVAQDPEQL HLKAQVVSEI
1110 1120 1130 1140 1150
ALIVQVDSEE QLPGRAPGIL LQDGATGLCL PGRHMDMLTA ADRLPTQAPL
1160 1170 1180 1190 1200
STSQSVSGKN MTASQGPCAL LWKGGDSPGQ QEPGSPKAKA PQKSQKTLGC
1210 1220 1230 1240 1250
ADKGEAHRRP RTGEQGHRSK GPRTSEASGR SHPAQAREIG DKQERKYNQL
1260 1270 1280 1290 1300
QLEKGQTPPE SHFQRKISHH PQGLHPRKGG TRWEDVLQKG KPGADAFQSW
1310 1320 1330 1340 1350
GSGPPRQFMD CMADKAWTIS RVVGQILVDK LGLQWGRGPS EVNRHKGDFR
1360 1370 1380 1390 1400
AQENVPSCCH RGHCHQERSR EMRALACSPK ATPKGHHCPV KNRGIRDRDS
1410 1420 1430 1440
SWAPPPREPV SPAGPHHHRP RMASTSGGPH PQLQELMSAQ RCLAS
Length: 1,445
Mass (Da): 157,136
Last modified: July 19, 2005 - v2
Checksum: 7461FE3603FBED82
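
The length and mass figures can be sanity-checked from the sequence text. The sketch below is one way to do so, assuming that the mass is the average isotopic mass of the unmodified chain (the sum of average residue masses plus one water). The entry does not state how its figure was calculated, so small differences are possible. The names `AVERAGE_RESIDUE_MASS`, `clean` and `average_mass` are illustrative.

```python
# Average residue masses in daltons (amino acid minus one water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini


def clean(sequence_text: str) -> str:
    """Drop the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(sequence_text.split())


def average_mass(sequence_text: str) -> float:
    """Assumed formula: sum of average residue masses plus one water."""
    residues = clean(sequence_text)
    return sum(AVERAGE_RESIDUE_MASS[r] for r in residues) + WATER


# First sequence line of this entry (residues 1-50), spaces included.
first_line = "MGNLVIPLGK GRAGRVESGQ RIPPPAPRPS VECTGDDIAL QMEKMLFPLK"
print(len(clean(first_line)), round(average_mass(first_line), 1))
```

Applied to all 29 residue lines of the sequence (with the position rulers left out), `len(clean(...))` should give 1,445, and the mass should land close to the listed value if the assumption about the formula holds.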

Sequence caution

The sequence BAC04087 differs from that shown. Reason: Erroneous initiation (curated).
The sequence CAD39098 differs from that shown. Reason: Erroneous initiation (curated).

Experimental Info

Feature key         Position(s)   Description                                       Length
Sequence conflict   806           G → E in BAC86315 (PubMed:14702039). Curated.     1
Sequence conflict   901           N → S in BAC86315 (PubMed:14702039). Curated.     1
Sequence conflict   1170          L → F in BAC04087 (PubMed:14702039). Curated.     1

Natural variant

Feature key       Identifier   Position(s)   Description              dbSNP        Length
Natural variant   VAR_062203   208           T → S                    rs28510722   1
Natural variant   VAR_022858   335           T → P (1 publication)    rs7850542    1
Natural variant   VAR_053943   409           V → M                    rs34946554   1
Natural variant   VAR_053944   586           K → E                    rs35232271   1
Natural variant   VAR_053945   671           T → M                    rs36079890   1
Natural variant   VAR_022859   682           D → E (1 publication)    rs4076795    1
Natural variant   VAR_053946   700           G → R                    rs34017995   1
Natural variant   VAR_022860   704           D → E (1 publication)    rs4076794    1
Natural variant   VAR_053947   736           A → V                    rs34791830   1
Natural variant   VAR_053948   924           P → L                    rs34051334   1
Natural variant   VAR_053949   1019          V → E (1 publication)    rs10868670   1
Natural variant   VAR_022861   1202          D → G (1 publication)    rs11789780   1
Natural variant   VAR_022862   1350          R → H (1 publication)    rs11142017   1
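
Each variant description has the form "reference residue → variant residue" at a 1-based position. A minimal sketch of applying one to a sequence fragment, with a check that the stated reference residue really occurs at that position, might look like this. The function name `apply_variant` is hypothetical, and the fragment is residues 201-210 copied from the Sequence section.

```python
import re

# Accepts "T → S" as printed in the table, or the ASCII form "T -> S".
VARIANT = re.compile(r"([A-Z])\s*(?:→|->)\s*([A-Z])\.?")


def apply_variant(fragment: str, first_position: int,
                  position: int, description: str) -> str:
    """Return the fragment with the substitution applied at a 1-based position."""
    match = VARIANT.fullmatch(description.strip())
    if match is None:
        raise ValueError(f"unrecognised variant description: {description!r}")
    reference, replacement = match.groups()
    index = position - first_position
    if not 0 <= index < len(fragment):
        raise ValueError("position lies outside the fragment")
    if fragment[index] != reference:
        raise ValueError(
            f"expected {reference} at {position}, found {fragment[index]}"
        )
    return fragment[:index] + replacement + fragment[index + 1:]


# VAR_062203: T → S at position 208, applied to residues 201-210.
print(apply_variant("TLSPGPMTFS", 201, 208, "T → S"))
```

The reference-residue check is useful when reading flattened tables like this one: a mismatch usually means the identifier and position columns were split in the wrong place.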

Sequence databases

EMBL, GenBank, DDBJ:
AK093185 mRNA. Translation: BAC04087.1. Different initiation.
AK125845 mRNA. Translation: BAC86315.1.
AL772337 Genomic DNA. Translation: CAI23630.2.
BC137349 mRNA. Translation: AAI37350.1.
AL834438 mRNA. Translation: CAD39098.1. Different initiation.
CCDS: CCDS6676.1.
RefSeq: NP_849150.3. NM_178828.4.
UniGene: Hs.130672.

Genome annotation databases

Ensembl: ENST00000325643; ENSP00000322640; ENSG00000177992.
GeneID: 286234.
KEGG: hsa:286234.
UCSC: uc004app.5. human.

Keywords - Coding sequence diversity

Polymorphism

Cross-references

Organism-specific databases

CTD: 286234.
GeneCards: SPATA31E1.
H-InvDB: HIX0021361.
HGNC: HGNC:26672. SPATA31E1.
HPA: HPA024253.
neXtProt: NX_Q6ZUB1.
OpenTargets: ENSG00000177992.
PharmGKB: PA134886884.

Miscellaneous databases

GenomeRNAi: 286234.
PRO: Q6ZUB1.

Entry information

Entry name: S31E1_HUMAN
Accession
Primary (citable) accession number: Q6ZUB1
Secondary accession number(s): B2RPB1, Q5SQC9, Q8NA41, Q8ND27
Entry history
Integrated into UniProtKB/Swiss-Prot: July 19, 2005
Last sequence update: July 19, 2005
Last modified: November 2, 2016
This is version 95 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Human chromosome 9
    Human chromosome 9: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (also known as the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
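
The membership criteria described above can be written down as simple predicates. This is a sketch of the thresholds as stated in the text only, not of the UniRef clustering pipeline itself. The function names are hypothetical, the reading of the UniRef100 rule (identical, or a sub-fragment of at least 11 residues) is an assumption, and identity and overlap are taken as already computed fractions between 0 and 1.

```python
# Identity thresholds quoted in the text; overlap is measured against the
# longest (seed) sequence of the cluster.
IDENTITY_THRESHOLD = {"UniRef90": 0.90, "UniRef50": 0.50}
MIN_OVERLAP = 0.80
MIN_FRAGMENT_LENGTH = 11


def fits_uniref100(candidate: str, representative: str) -> bool:
    """Assumed reading: identical, or a sub-fragment of 11 or more residues."""
    if candidate == representative:
        return True
    return len(candidate) >= MIN_FRAGMENT_LENGTH and candidate in representative


def fits_cluster(level: str, identity: float, overlap: float) -> bool:
    """Check precomputed identity and overlap against the stated thresholds."""
    return identity >= IDENTITY_THRESHOLD[level] and overlap >= MIN_OVERLAP


print(fits_uniref100("GRAGRVESGQRIPPP", "MGNLVIPLGKGRAGRVESGQRIPPPAPRPS"))
print(fits_cluster("UniRef90", identity=0.93, overlap=0.85))
print(fits_cluster("UniRef50", identity=0.62, overlap=0.70))
```

Computing identity and overlap requires a pairwise alignment, which is left out here on purpose.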