Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

H2R1P1 (H2R1P1_PANTR) Unreviewed, UniProtKB/TrEMBL

Last modified April 16, 2014. Version 17. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein attributes

Sequence length719 AA.
Sequence statusComplete.
Protein existenceEvidence at transcript level

General annotation (Comments)

Catalytic activity

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone]. SAAS SAAS007728 RuleBase RU004542

Subcellular location

Nucleus By similarity RuleBase RU004542.

Sequence similarities

Contains 1 SET domain. RuleBase RU004538

Contains 1 pre-SET domain. RuleBase RU004542

Contains SET domain. SAAS SAAS001214

Contains pre-SET domain. SAAS SAAS001214

Sequences

Sequence LengthMass (Da)Tools
H2R1P1 [UniParc].

Last modified March 21, 2012. Version 1.
Checksum: 05DD253EA77F6EC5

FASTA71981,940
        10         20         30         40         50         60 
MGEKNGDAKT FWMELEDDGK VDFIFEQVQN VLQSLKQKIK DGSATNKEYI QAMILVNEAT 

        70         80         90        100        110        120 
IINSSTSIKG ASQKEVNAQS SDHMPVTQKE QENKSNAFPS TSCENSFPED CTFLTTENKE 

       130        140        150        160        170        180 
ILSLEDKVVD FREKDSSSNL SYQSHDCSGA CLMKMPLNLK GENPLQLPIK CHFQRRHAKT 

       190        200        210        220        230        240 
NSHSSALHVS YKTPCGRSLR NVEEVFRYLL ETECNFLFTD NFSFNTYVQL ARNYPKQKEV 

       250        260        270        280        290        300 
VSDVDISNGV ESVPISFCNE IDSRKLPQFK YRKTVWPRAY NLTNFSSMFT DSCDCSEGCI 

       310        320        330        340        350        360 
DITKCACLQL TARNAKTSPL SSDKITTGYK YKRLQRQIPT GIYECSLLCK CNRQLCQNRV 

       370        380        390        400        410        420 
VQHGPQVRLQ VFKTEQKGWG VRCLDDIDRG TFVCIYSGRL LSRANTEKSY GIDENGRDEN 

       430        440        450        460        470        480 
TMKNIFSKKR KLEVACSDCE VEVLPLGLET HPRTAKTEKC PPKFSNNPKE LTMETKYDNI 

       490        500        510        520        530        540 
SRIQYHSVIR DPESKTAIFQ HNGKKMEFVS SESVTPEDND GFKPPREHLN SKTKGAQKDS 

       550        560        570        580        590        600 
SSNHVDEFED NLLIESDVID ITKYREETPP RSRCNQATTL DNQNIKKAIE VQIQKPQEGR 

       610        620        630        640        650        660 
STACQRQQVF CDEELLSETK NTSSDSLTKF NKGNVFLLDA TKEGNVGRFL NHSCCPNLSV 

       670        680        690        700        710 
QNVFVETHNR NFPLVAFFTN RYVKARTELT WDYGYEAGTV PEKEIFCQCG VNKCRKKIL 

« Hide

References

« Hide 'large scale' references
[1]"Initial sequence of the chimpanzee genome and comparison with the human genome."
Chimpanzee sequencing and analysis consortium
Nature 437:69-87(2005) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[2]Ensembl
Submitted (FEB-2012) to UniProtKB
Cited for: IDENTIFICATION.
[3]"De novo assembly of the reference chimpanzee transcriptome from NextGen mRNA sequences."
Maudhoo M.D., Meehan D.T., Norgren R.B.Jr.
Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.
Tissue: Adipose stromal EMBL JAA05943.1, Skeletal muscle EMBL JAA38457.1, Skin EMBL JAA23377.1 and Smooth vascular EMBL JAA21689.1.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AACZ03092694 Genomic DNA. No translation available.
AACZ03092695 Genomic DNA. No translation available.
GABC01005395 mRNA. Translation: JAA05943.1.
GABF01000456 mRNA. Translation: JAA21689.1.
GABD01009723 mRNA. Translation: JAA23377.1.
GABE01006282 mRNA. Translation: JAA38457.1.
RefSeqXP_001153947.1. XM_001153947.3.

3D structure databases

ModBaseSearch...
MobiDBSearch...

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENSPTRT00000048180; ENSPTRP00000042655; ENSPTRG00000005873.
GeneID452719.
KEGGptr:452719.

Organism-specific databases

CTD83852.

Phylogenomic databases

GeneTreeENSGT00750000117355.
KOK11421.
OMAKCHFQRR.
OrthoDBEOG7ZD1TG.
TreeFamTF106411.

Family and domain databases

Gene3D3.30.890.10. 1 hit.
InterProIPR016177. DNA-bd_dom.
IPR001739. Methyl_CpG_DNA-bd.
IPR007728. Pre-SET_dom.
IPR001214. SET_dom.
[Graphical view]
PfamPF01429. MBD. 1 hit.
PF05033. Pre-SET. 1 hit.
PF00856. SET. 1 hit.
[Graphical view]
SMARTSM00317. SET. 1 hit.
[Graphical view]
SUPFAMSSF54171. SSF54171. 1 hit.
PROSITEPS50982. MBD. 1 hit.
PS50867. PRE_SET. 1 hit.
PS50280. SET. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameH2R1P1_PANTR
AccessionPrimary (citable) accession number: H2R1P1
Entry history
Integrated into UniProtKB/TrEMBL: March 21, 2012
Last sequence update: March 21, 2012
Last modified: April 16, 2014
This is version 17 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)