Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

A5PF05 (A5PF05_PIG) Unreviewed, UniProtKB/TrEMBL

Last modified April 16, 2014. Version 47. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein names
Gene names
Name:BAT8 EMBL CAN87702.1
ORF Names:SBAB-707F1.1-002 EMBL CAN87702.1
OrganismSus scrofa (Pig) EMBL CAN87702.1
Taxonomic identifier9823 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaLaurasiatheriaCetartiodactylaSuinaSuidaeSus

Protein attributes

Sequence length1178 AA.
Sequence statusComplete.
Protein existenceInferred from homology

General annotation (Comments)

Catalytic activity

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone]. RuleBase RU004542 SAAS SAAS001214

Subcellular location

Nucleus By similarity RuleBase RU004542.

Sequence similarities

Contains 1 SET domain. RuleBase RU004538

Contains 1 post-SET domain. RuleBase RU004541

Contains 1 pre-SET domain. RuleBase RU004542

Contains 6 ANK repeats. RuleBase RU003321

Contains SET domain. SAAS SAAS001214

Contains post-SET domain. SAAS SAAS001214

Contains pre-SET domain. SAAS SAAS001214

Sequences

Sequence LengthMass (Da)Tools
A5PF05 [UniParc].

Last modified July 10, 2007. Version 1.
Checksum: DE066366BC4B8DE2

FASTA1,178129,051
        10         20         30         40         50         60 
MAAAAGAAAA AAAEGEAPAE MGALVLEKEP RGATERVHGS LGDTPRSEEA LPKANPDSLE 

        70         80         90        100        110        120 
PAGPSSPASV TVTVGDEGAD TPIGATPLIG DEPENLEGDG DLHGGRILLG HATKSFPSSP 

       130        140        150        160        170        180 
SKGGACPSRA KMSMTGAGKS PPSVQSLAMR LLSMPGAQGA ATAAIPEPPP ATASPEGQPK 

       190        200        210        220        230        240 
VHRARKTMSK PGNGQPPVPE KRPPEVQHFR MSDDVHSLGK VTSDVAKRRK LNSGGGLSEE 

       250        260        270        280        290        300 
LGSARGSGDV SLEKGDPGSL EEWETVVGDD FSLYYDSYSV DERVDSDSKS EVEALAEQLS 

       310        320        330        340        350        360 
EEEEEEEEEE EEEEEEEEEE EEEEEDEESG NQSDRSGSSG RRKAKKKWRK DSPWVKPSRK 

       370        380        390        400        410        420 
RRKREPPRAK EPRGVSNDTS SLETERGFEE LPLCSCRMEA PKIDRISERA GHKCMATESV 

       430        440        450        460        470        480 
DGELSGCNAA ILKRETMRPS SRVALMVLCE THRARMVKHH CCPGCGYFCT AGTFLECHPD 

       490        500        510        520        530        540 
FRVAHRFHKA CVSQLNGMVF CPHCGEDASE AQEVTIPRGD GVTPPAGTAA PAPPPLAQDA 

       550        560        570        580        590        600 
PGRADTSQPS ARMRGHGEPR RPPCDPLADT IDSSGPSLTL PSGGCLSAVG LPPGPGREAL 

       610        620        630        640        650        660 
EKALVIQESE RRKKLRFHPR QLYLSVKQGE LQKVILMLLD NLDPNFQSDQ QSKRTPLHAA 

       670        680        690        700        710        720 
AQKGSVEICH VLLQAGANIN AVDKQQRTPL MEAVVNNHLE VARYMVQRGG CVYSKEEDGS 

       730        740        750        760        770        780 
TCLHHAAKIG NLEMVSLLLS TGQVDVNAQD SGGWTPIIWA AEHKHIEVIR MLLTRGADVT 

       790        800        810        820        830        840 
LTDNEENICL HWASFTGSAA IAEVLLNARC DLHAVNYHGD TPLHIAARES YHDCVLLFLS 

       850        860        870        880        890        900 
RGANPELRNK EGDTAWDLTP ERSDVWFALQ LNRKLRLGVG NRAIRTEKII CRDVARGYEN 

       910        920        930        940        950        960 
VPIPCVNGVD SEPCPEDYKY ISENCETSTM NIDRNITHLQ HCTCVDDCSS SNCLCGQLSI 

       970        980        990       1000       1010       1020 
RCWYDKDGRL LQEFNKIEPP LIFECNQACS CWRNCKNRVV QSGIKVRLQL YRTAKMGWGV 

      1030       1040       1050       1060       1070       1080 
RALQTIPQGT FICEYVGELI SDAEADVRED DSYLFDLDNK DGEVYCIDAR YYGNISRFIN 

      1090       1100       1110       1120       1130       1140 
HLCDPNIIPV RVFMLHQDLR FPRIAFFSSR DIRTGEELGF DYGDRFWDIK SKYFTCQCGS 

      1150       1160       1170 
EKCKHSAEAI ALEQSRLARL DPHPELLPEL SSLPPVNP 

« Hide

References

[1]Sehra H.
Submitted (JUN-2007) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AL773527 Genomic DNA. Translation: CAN87702.1.
RefSeqNP_001095293.1. NM_001101823.1.
UniGeneSsc.12499.

3D structure databases

ProteinModelPortalA5PF05.
SMRA5PF05. Positions 650-876, 890-1160.
ModBaseSearch...
MobiDBSearch...

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

GeneID100124382.
KEGGssc:100124382.

Organism-specific databases

CTD10919.

Phylogenomic databases

eggNOGCOG0666.
HOGENOMHOG000231216.
HOVERGENHBG028394.
KOK11420.

Family and domain databases

Gene3D1.25.40.20. 1 hit.
InterProIPR002110. Ankyrin_rpt.
IPR020683. Ankyrin_rpt-contain_dom.
IPR003616. Post-SET_dom.
IPR007728. Pre-SET_dom.
IPR003606. Pre-SET_Zn-bd_sub.
IPR001214. SET_dom.
[Graphical view]
PfamPF12796. Ank_2. 3 hits.
PF05033. Pre-SET. 1 hit.
PF00856. SET. 1 hit.
[Graphical view]
PRINTSPR01415. ANKYRIN.
SMARTSM00248. ANK. 6 hits.
SM00508. PostSET. 1 hit.
SM00468. PreSET. 1 hit.
SM00317. SET. 1 hit.
[Graphical view]
SUPFAMSSF48403. SSF48403. 1 hit.
PROSITEPS50297. ANK_REP_REGION. 1 hit.
PS50088. ANK_REPEAT. 5 hits.
PS50867. PRE_SET. 1 hit.
PS50280. SET. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameA5PF05_PIG
AccessionPrimary (citable) accession number: A5PF05
Entry history
Integrated into UniProtKB/TrEMBL: July 10, 2007
Last sequence update: July 10, 2007
Last modified: April 16, 2014
This is version 47 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)