Q9H5L6 (THAP9_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 73.

Names and origin

Protein names
  Recommended name: DNA transposase THAP9
    EC=2.7.7.-
  Alternative name(s): THAP domain-containing protein 9
    Short name=hTh9
Gene names
  Name: THAP9
Organism: Homo sapiens (Human) [Reference proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 903 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Active transposase that specifically recognizes the bipartite 5'-TXXGGGX(A/T)-3' consensus motif and mediates transposition. Ref.3 Ref.4
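
The consensus motif above can be expressed as a pattern for scanning a DNA string. The following is a minimal sketch, assuming that X in the consensus stands for any nucleotide and that only the given strand is searched; the function name `find_thap_sites` and the example sequence are illustrative and do not come from the entry.

```python
import re

# 5'-TXXGGGX(A/T)-3' as a regular expression.
# Assumption: X is read as "any of A, C, G, T".
# The lookahead lets overlapping sites be reported as well.
THAP_MOTIF = re.compile(r"(?=(T[ACGT]{2}GGG[ACGT][AT]))")


def find_thap_sites(dna: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched 8-mer) for each motif hit on the given strand."""
    dna = dna.upper()
    return [(m.start() + 1, m.group(1)) for m in THAP_MOTIF.finditer(dna)]


# Made-up example sequence, not taken from any genome.
print(find_thap_sites("ccTAAGGGCAgg"))  # [(3, 'TAAGGGCA')]
```

To cover both strands, the same function could be run on the reverse complement of the input.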

Miscellaneous

Able to mediate mobilization of P-elements when transfected in Drosophila (Ref.4).

Sequence similarities

Contains 1 THAP-type zinc finger.

Ontologies

Sequence annotation (Features)

Feature key        Position(s)  Length  Description                              Feature identifier

Molecule processing

Chain              1 – 903      903     DNA transposase THAP9                    PRO_0000317246

Regions

Zinc finger        1 – 89       89      THAP-type
Motif              123 – 126    4       HCFC1-binding motif (HBM) By similarity

Natural variations

Natural variant    284          1       M → I. Ref.1                             VAR_038486
                                        Corresponds to variant rs1031639 [dbSNP | Ensembl].
Natural variant    299          1       L → F. Ref.1                             VAR_038487
                                        Corresponds to variant rs897945 [dbSNP | Ensembl].
Natural variant    812          1       N → D. Ref.1                             VAR_038488
                                        Corresponds to variant rs6535411 [dbSNP | Ensembl].
Natural variant    833          1       V → I.                                   VAR_061842
                                        Corresponds to variant rs35532215 [dbSNP | Ensembl].

Experimental info

Sequence conflict  174          1       L → I in BAB15609. Ref.1
Sequence conflict  491          1       E → G in BAB15609. Ref.1
Sequence conflict  875          1       S → P in BAG52354. Ref.1
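
Feature positions in the table are 1-based and refer to the canonical sequence given under Sequences. As a rough illustration of how a natural variant maps onto that sequence, the sketch below applies two of the listed substitutions to residues 281 to 300 of the entry. The helper `apply_variant` is a hypothetical name introduced here, and working on a fragment with an explicit start offset is an assumption made only to keep the example short.

```python
def apply_variant(seq: str, seq_start: int, pos: int, ref: str, alt: str) -> str:
    """Substitute one residue at 1-based position `pos`.

    `seq_start` is the 1-based position of seq[0] in the full-length protein.
    Raises ValueError if the position is outside the fragment or the
    reference residue does not match.
    """
    idx = pos - seq_start
    if not 0 <= idx < len(seq):
        raise ValueError(f"position {pos} is outside the fragment")
    if seq[idx] != ref:
        raise ValueError(f"expected {ref} at {pos}, found {seq[idx]}")
    return seq[:idx] + alt + seq[idx + 1:]


# Residues 281-300 of Q9H5L6, copied from the sequence listing.
FRAGMENT_START = 281
fragment = "IKSMPLKQQLQWDPSSHSLQ"

variant = apply_variant(fragment, FRAGMENT_START, 284, "M", "I")  # VAR_038486
variant = apply_variant(variant, FRAGMENT_START, 299, "L", "F")   # VAR_038487
print(variant)  # IKSIPLKQQLQWDPSSHSFQ
```

The reference-residue check is a cheap guard against off-by-one errors when positions are copied from a table by hand.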

Sequences

Sequence: Q9H5L6 [UniParc].
Length: 903
Mass (Da): 103,411

Last modified February 5, 2008. Version 2.
Checksum: 64DA9DADA3D80353
        10         20         30         40         50         60 
MTRSCSAVGC STRDTVLSRE RGLSFHQFPT DTIQRSKWIR AVNRVDPRSK KIWIPGPGAI 

        70         80         90        100        110        120 
LCSKHFQESD FESYGIRRKL KKGAVPSVSL YKIPQGVHLK GKARQKILKQ PLPDNSQEVA 

       130        140        150        160        170        180 
TEDHNYSLKT PLTIGAEKLA EVQQMLQVSK KRLISVKNYR MIKKRKGLRL IDALVEEKLL 

       190        200        210        220        230        240 
SEETECLLRA QFSDFKWELY NWRETDEYSA EMKQFACTLY LCSSKVYDYV RKILKLPHSS 

       250        260        270        280        290        300 
ILRTWLSKCQ PSPGFNSNIF SFLQRRVENG DQLYQYCSLL IKSMPLKQQL QWDPSSHSLQ 

       310        320        330        340        350        360 
GFMDFGLGKL DADETPLASE TVLLMAVGIF GHWRTPLGYF FVNRASGYLQ AQLLRLTIGK 

       370        380        390        400        410        420 
LSDIGITVLA VTSDATAHSV QMAKALGIHI DGDDMKCTFQ HPSSSSQQIA YFFDSCHLLR 

       430        440        450        460        470        480 
LIRNAFQNFQ SIQFINGIAH WQHLVELVAL EEQELSNMER IPSTLANLKN HVLKVNSATQ 

       490        500        510        520        530        540 
LFSESVASAL EYLLSLDLPP FQNCIGTIHF LRLINNLFDI FNSRNCYGKG LKGPLLPETY 

       550        560        570        580        590        600 
SKINHVLIEA KTIFVTLSDT SNNQIIKGKQ KLGFLGFLLN AESLKWLYQN YVFPKVMPFP 

       610        620        630        640        650        660 
YLLTYKFSHD HLELFLKMLR QVLVTSSSPT CMAFQKAYYN LETRYKFQDE VFLSKVSIFD 

       670        680        690        700        710        720 
ISIARRKDLA LWTVQRQYGV SVTKTVFHEE GICQDWSHCS LSEALLDLSD HRRNLICYAG 

       730        740        750        760        770        780 
YVANKLSALL TCEDCITALY ASDLKASKIG SLLFVKKKNG LHFPSESLCR VINICERVVR 

       790        800        810        820        830        840 
THSRMAIFEL VSKQRELYLQ QKILCELSGH INLFVDVNKH LFDGEVCAIN HFVKLLKDII 

       850        860        870        880        890        900 
ICFLNIRAKN VAQNPLKHHS ERTDMKTLSR KHWSSVQDYK CSSFANTSSK FRHLLSNDGY 


PFK 
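
The listing above interleaves ruler lines (10, 20, ...) with residues in space-separated blocks of ten. A minimal sketch for turning that layout back into a plain residue string follows; `parse_numbered_sequence` is a hypothetical helper name, and only the first row of the listing is reproduced to keep the example short.

```python
def parse_numbered_sequence(block: str) -> str:
    """Join residue groups from a ruler-annotated sequence listing."""
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines, which hold only position numbers.
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


# First row of the listing (positions 1-60), copied from the entry.
first_row = """
        10         20         30         40         50         60
MTRSCSAVGC STRDTVLSRE RGLSFHQFPT DTIQRSKWIR AVNRVDPRSK KIWIPGPGAI
"""

seq = parse_numbered_sequence(first_row)
print(len(seq))  # 60
```

Applied to the complete listing, the resulting string length can be compared with the stated sequence length of 903 as a sanity check.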


References

[1] "Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA], VARIANTS ILE-284; PHE-299 AND ASP-812.
Tissue: Brain.
[2] "Homologs of Drosophila P transposons were mobile in zebrafish but have been domesticated in a common ancestor of chicken and human."
Hammer S.E., Strehl S., Hagemann S.
Mol. Biol. Evol. 22:833-844(2005)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 351-583.
[3] "THAP proteins target specific DNA sites through bipartite recognition of adjacent major and minor grooves."
Sabogal A., Lyubimov A.Y., Corn J.E., Berger J.M., Rio D.C.
Nat. Struct. Mol. Biol. 17:117-123(2010)
Cited for: FUNCTION, DNA-BINDING.
[4] "The human THAP9 gene encodes an active P-element DNA transposase."
Majumdar S., Singh A., Rio D.C.
Science 339:446-448(2013)
Cited for: FUNCTION.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
  AK026973 mRNA. Translation: BAB15609.1.
  AK091412 mRNA. Translation: BAG52354.1.
  AJ717666 Genomic DNA. Translation: CAG30691.1.
CCDS: CCDS3598.1.
RefSeq: NP_078948.3. NM_024672.4.
UniGene: Hs.582050.

3D structure databases

ProteinModelPortal: Q9H5L6.
SMR: Q9H5L6. Positions 1-88.

Protein-protein interaction databases

STRING: 9606.ENSP00000305533.

PTM databases

PhosphoSite: Q9H5L6.

Polymorphism databases

DMDM: 166987614.

Proteomic databases

MaxQB: Q9H5L6.
PaxDb: Q9H5L6.
PRIDE: Q9H5L6.

Protocols and materials databases

DNASU: 79725.

Genome annotation databases

Ensembl: ENST00000302236; ENSP00000305533; ENSG00000168152.
GeneID: 79725.
KEGG: hsa:79725.
UCSC: uc003hns.1. human.

Organism-specific databases

CTD: 79725.
GeneCards: GC04P083821.
H-InvDB: HIX0004338.
HGNC: HGNC:23192. THAP9.
HPA: HPA037421.
MIM: 612537. gene.
neXtProt: NX_Q9H5L6.
PharmGKB: PA134981371.

Phylogenomic databases

eggNOG: NOG72787.
HOGENOM: HOG000154567.
HOVERGEN: HBG101681.
InParanoid: Q9H5L6.
OMA: SLKWLYQ.
OrthoDB: EOG7MH11X.
PhylomeDB: Q9H5L6.
TreeFam: TF328542.

Gene expression databases

ArrayExpress: Q9H5L6.
Bgee: Q9H5L6.
CleanEx: HS_THAP9.
Genevestigator: Q9H5L6.

Family and domain databases

InterPro: IPR006612. Znf_C2CH.
Pfam: PF05485. THAP. 1 hit.
SMART: SM00692. DM3. 1 hit.
  SM00980. THAP. 1 hit.
PROSITE: PS50950. ZF_THAP. 1 hit.

Other

GenomeRNAi: 79725.
NextBio: 35469517.
PRO: Q9H5L6.

Entry information

Entry name: THAP9_HUMAN
Accession
  Primary (citable) accession number: Q9H5L6
  Secondary accession number(s): B3KRE2, Q59AC9
Entry history
  Integrated into UniProtKB/Swiss-Prot: February 5, 2008
  Last sequence update: February 5, 2008
  Last modified: July 9, 2014
  This is version 73 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

SIMILARITY comments

Index of protein domains and families

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human chromosome 4

Human chromosome 4: entries, gene names and cross-references to MIM