B8C7F4 (B8C7F4_THAPS) Unreviewed, UniProtKB/TrEMBL

Last modified May 14, 2014. Version 24.

Names and origin

Protein names
Gene names
ORF names: THAPSDRAFT_263451 (EMBL EED90971.1)
Organism: Thalassiosira pseudonana (Marine diatom) (Cyclotella nana) [Reference proteome]
Taxonomic identifier: 35128 [NCBI]
Taxonomic lineage: Eukaryota > Stramenopiles > Bacillariophyta > Coscinodiscophyceae > Thalassiosirophycidae > Thalassiosirales > Thalassiosiraceae > Thalassiosira

Protein attributes

Sequence length: 219 AA.
Sequence status: Fragment.
Protein existence: Inferred from homology

General annotation (Comments)

Sequence similarities

Belongs to the peptidase S1 family. RuleBase RU000360

Ontologies

Sequence annotation (Features)

Experimental info

Non-terminal residue: position 1, length 1 (EMBL EED90971.1)
Non-terminal residue: position 219, length 1 (EMBL EED90971.1)

Sequences

B8C7F4 [UniParc]
Length: 219 AA
Mass: 23,117 Da
Last modified March 3, 2009. Version 1.
Checksum: 5E633CFB9F5EC887

        10         20         30         40         50         60 
SRIIGGTVSS IGRYSYAVSL QDSQYNHFCG GSLIAPDVVL SAAHCGGVVA TVAVQRHNLN 

        70         80         90        100        110        120 
DRTVGDDVTV KYEVLHPQHD PRSTDNDFSL IFLSRSTTAD VDLVQLNKDK SVPMSGDDVT 

       130        140        150        160        170        180 
VMGWGDTVAF DSIQQLSDTL KEVEVTAISN AECESYQGQI TDNMLCAEDN GEDSCQGDSG 

       190        200        210 
GPLVLASSDE SGDVQVGVVS WGIGCANSSF PGVYSRVSA 
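
The sequence above is displayed in groups of ten residues under a position ruler. As a minimal sketch of how that layout can be turned back into a plain residue string, the following Python snippet drops the ruler lines and joins the groups. The function name `parse_sequence_block` is hypothetical and chosen only for illustration; the only fact taken from the entry is the sequence text and its stated length of 219 AA.

```python
# The sequence block as displayed in the entry, ruler lines included.
BLOCK = """\
        10         20         30         40         50         60
SRIIGGTVSS IGRYSYAVSL QDSQYNHFCG GSLIAPDVVL SAAHCGGVVA TVAVQRHNLN

        70         80         90        100        110        120
DRTVGDDVTV KYEVLHPQHD PRSTDNDFSL IFLSRSTTAD VDLVQLNKDK SVPMSGDDVT

       130        140        150        160        170        180
VMGWGDTVAF DSIQQLSDTL KEVEVTAISN AECESYQGQI TDNMLCAEDN GEDSCQGDSG

       190        200        210
GPLVLASSDE SGDVQVGVVS WGIGCANSSF PGVYSRVSA
"""


def parse_sequence_block(block: str) -> str:
    """Return the residues of a ruler-formatted sequence block as one string.

    Hypothetical helper: skips blank lines and lines made only of numbers
    (the position ruler), then concatenates the ten-residue groups.
    """
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        if not tokens or all(token.isdigit() for token in tokens):
            continue  # blank line or position ruler
        residues.extend(tokens)
    return "".join(residues)


sequence = parse_sequence_block(BLOCK)
print(len(sequence))  # should agree with the "219 AA" stated in the entry
```

Comparing the parsed length with the stated sequence length is a quick way to catch residues lost or duplicated during copy and paste.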

References

[1] "The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism."
Armbrust E.V., Berges J.A., Bowler C., Green B.R., Martinez D., Putnam N.H., Zhou S., Allen A.E., Apt K.E., Bechner M., Brzezinski M.A., Chaal B.K., Chiovitti A., Davis A.K., Demarest M.S., Detter J.C., Glavina T., Goodstein D., Hadi M.Z., Hellsten U., Hildebrand M., Jenkins B.D., Jurka J., Kapitonov V.V., Kroger N., Lau W.W., Lane T.W., Larimer F.W., Lippmeier J.C., Lucas S., Medina M., Montsant A., Obornik M., Parker M.S., Palenik B., Pazour G.J., Richardson P.M., Rynearson T.A., Saito M.A., Schwartz D.C., Thamatrakoln K., Valentin K., Vardi A., Wilkerson F.P., Rokhsar D.S.
Science 306:79-86(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: CCMP1335 (EMBL EED90971.1).
[2] "The Phaeodactylum genome reveals the evolutionary history of diatom genomes."
Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A., Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., Salamov A., Vandepoele K., Beszteri B., Gruber A., Heijde M., Katinka M., Mock T., Valentin K., Verret F., Berges J.A., Brownlee C., Cadoret J.P., Chiovitti A., Choi C.J., Coesel S., De Martino A., Detter J.C., Durkin C., Falciatore A., Fournet J., Haruta M., Huysman M.J., Jenkins B.D., Jiroutova K., Jorgensen R.E., Joubert Y., Kaplan A., Kroger N., Kroth P.G., La Roche J., Lindquist E., Lommer M., Martin-Jezequel V., Lopez P.J., Lucas S., Mangogna M., McGinnis K., Medlin L.K., Montsant A., Oudot-Le Secq M.P., Napoli C., Obornik M., Parker M.S., Petit J.L., Porcel B.M., Poulsen N., Robison M., Rychlewski L., Rynearson T.A., Schmutz J., Shapiro H., Siaut M., Stanley M., Sussman M.R., Taylor A.R., Vardi A., von Dassow P., Vyverman W., Willis A., Wyrwicz L.S., Rokhsar D.S., Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y., Grigoriev I.V.
Nature 456:239-244(2008)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: CCMP1335 (EMBL EED90971.1).

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: CM000644 Genomic DNA. Translation: EED90971.1.
RefSeq: XP_002292120.1. XM_002292084.1.

3D structure databases

ProteinModelPortal: B8C7F4.

Protein family/group databases

MEROPS: S01.A49.

Genome annotation databases

EnsemblProtists: Thaps263451; Thaps263451; Thaps263451.
GeneID: 7449873.
KEGG: tps:THAPSDRAFT_263451.

Phylogenomic databases

eggNOG: COG5640.
HOGENOM: HOG000251820.
KO: K01312.

Family and domain databases

InterPro: IPR001254. Peptidase_S1.
          IPR018114. Peptidase_S1_AS.
          IPR001314. Peptidase_S1A.
          IPR009003. Trypsin-like_Pept_dom.
Pfam: PF00089. Trypsin. 1 hit.
PRINTS: PR00722. CHYMOTRYPSIN.
SMART: SM00020. Tryp_SPc. 1 hit.
SUPFAM: SSF50494. SSF50494. 1 hit.
PROSITE: PS50240. TRYPSIN_DOM. 1 hit.
         PS00134. TRYPSIN_HIS. 1 hit.
         PS00135. TRYPSIN_SER. 1 hit.
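
The PROSITE cross-references above report one hit each for TRYPSIN_HIS and TRYPSIN_SER. As a rough illustration of what such a pattern hit means, the sketch below scans the entry's sequence with regular expressions. The two patterns are written from memory of the PROSITE signatures for PS00134 and PS00135 and are assumptions, not values taken from this entry; they should be checked against PROSITE before being relied on. The helper name `find_hits` is hypothetical.

```python
import re

# Sequence of B8C7F4 as given in the entry (219 residues, ruler removed).
SEQUENCE = (
    "SRIIGGTVSSIGRYSYAVSLQDSQYNHFCGGSLIAPDVVLSAAHCGGVVATVAVQRHNLN"
    "DRTVGDDVTVKYEVLHPQHDPRSTDNDFSLIFLSRSTTADVDLVQLNKDKSVPMSGDDVT"
    "VMGWGDTVAFDSIQQLSDTLKEVEVTAISNAECESYQGQITDNMLCAEDNGEDSCQGDSG"
    "GPLVLASSDESGDVQVGVVSWGIGCANSSFPGVYSRVSA"
)

# ASSUMPTION: regex translations of the PROSITE patterns as recalled,
# not copied from this entry. Verify against PROSITE before use.
PATTERNS = {
    "TRYPSIN_HIS": r"[LIVM][ST]A[STAG]HC",
    "TRYPSIN_SER": (
        r"[DNSTAGC][GSTAPIMVQH].{2}G[DE]SG[GS][SAPHV]"
        r"[LIVMFYWH][LIVMFYSTANQH]"
    ),
}


def find_hits(sequence, patterns):
    """Return {pattern name: [(start, end, matched text), ...]}.

    Positions are 1-based and inclusive, matching the numbering used
    in the entry's sequence display.
    """
    hits = {}
    for name, regex in patterns.items():
        hits[name] = [
            (match.start() + 1, match.end(), match.group())
            for match in re.finditer(regex, sequence)
        ]
    return hits


hits = find_hits(SEQUENCE, PATTERNS)
for name, found in hits.items():
    print(name, found)
```

Under these assumed patterns, each signature matches once, which is consistent with the "1 hit" counts listed for PS00134 and PS00135.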

Entry information

Entry name: B8C7F4_THAPS
Accession: Primary (citable) accession number: B8C7F4
Entry history:
Integrated into UniProtKB/TrEMBL: March 3, 2009
Last sequence update: March 3, 2009
Last modified: May 14, 2014
This is version 24 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)