Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein
Submitted name:

Adenomatous polyposis protein

Gene

EGK_16751

Organism
Macaca mulatta (Rhesus macaque)
Status
Unreviewed-Annotation score: Annotation score: 2 out of 5-Protein predictedi

Functioni

GO - Molecular functioni

GO - Biological processi

Names & Taxonomyi

Protein namesi
Submitted name:
Adenomatous polyposis proteinImported
Gene namesi
ORF Names:EGK_16751Imported
OrganismiMacaca mulatta (Rhesus macaque)Imported
Taxonomic identifieri9544 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniCercopithecidaeCercopithecinaeMacaca

Subcellular locationi

Extracellular region or secreted Cytosol Plasma membrane Cytoskeleton Lysosome Endosome Peroxisome ER Golgi apparatus Nucleus Mitochondrion Manual annotation Automatic computational assertionGraphics by Christian Stolte; Source: COMPARTMENTS

Interactioni

GO - Molecular functioni

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini4 – 55APC_N_CCInterPro annotationAdd BLAST52
Repeati754 – 796ARMPROSITE-ProRule annotationAdd BLAST43
Domaini2319 – 2670APC_basicInterPro annotationAdd BLAST352
Domaini2765 – 2937EB1_bindingInterPro annotationAdd BLAST173

Coiled coil

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Coiled coili6 – 54Sequence analysisAdd BLAST49
Coiled coili134 – 168Sequence analysisAdd BLAST35
Coiled coili1965 – 1988Sequence analysisAdd BLAST24

Keywords - Domaini

Coiled coilSequence analysis

Phylogenomic databases

eggNOGiKOG2122. Eukaryota.
ENOG410XR2V. LUCA.

Family and domain databases

Gene3Di1.25.10.10. 2 hits.
InterProiView protein in InterPro
IPR026836. APC.
IPR009240. APC_15aa_rpt.
IPR009234. APC_basic_dom.
IPR026831. APC_dom.
IPR026818. Apc_fam.
IPR032038. APC_N.
IPR036149. APC_N_sf.
IPR009223. APC_rpt.
IPR011989. ARM-like.
IPR016024. ARM-type_fold.
IPR000225. Armadillo.
IPR009232. EB1-bd.
IPR009224. SAMP.
PANTHERiPTHR12607. PTHR12607. 2 hits.
PTHR12607:SF11. PTHR12607:SF11. 2 hits.
PfamiView protein in Pfam
PF05972. APC_15aa. 3 hits.
PF05956. APC_basic. 1 hit.
PF16689. APC_N_CC. 1 hit.
PF05923. APC_r. 7 hits.
PF00514. Arm. 2 hits.
PF05937. EB1_binding. 1 hit.
PF05924. SAMP. 3 hits.
SMARTiView protein in SMART
SM00185. ARM. 7 hits.
SUPFAMiSSF48371. SSF48371. 1 hit.
SSF58050. SSF58050. 1 hit.
SSF82931. SSF82931. 1 hit.
PROSITEiView protein in PROSITE
PS50176. ARM_REPEAT. 1 hit.

Sequencei

Sequence statusi: Complete.

G7MVJ3-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MAAASYDQLL KQVEALKMEN SNLRQELEDN SNHLTKLETE ASNMKEVLKQ
60 70 80 90 100
LQGSIEDEAM ASSGQIDLLE RLKELNLDSS NFPGVKLRSK MSLRSYGSRE
110 120 130 140 150
GSVSSRSGEC SPVPMGSFPR RGFVNGSRES TGYLEELEKE RSLLLADLDK
160 170 180 190 200
EEKEKDWYYA QLQNLTKRID SLPLTENFSL QTDMTRRQLE YEARQIRVAM
210 220 230 240 250
EEQLGTCQDM EKRAQVADKY IPETGQFIKE RGLRDLTVPR GWGILSIMVE
260 270 280 290 300
GKEEQVVSYM DGGRHVSKGW KSAHQFVNRT YYWDVGIGGV KRRIARIQQI
310 320 330 340 350
EKDILRIRQL LQSQATEAER SSQNKHETGS HDAERQNEGQ GVAEINMATS
360 370 380 390 400
GNGQGSTTRM DHETASVLSS SSTHSAPRRL TSHLGTKVEM VYSLLSMLGT
410 420 430 440 450
HDKDDMSRTL LAMSSSQDSC ISMRQSGCLP LLIQLLHGND KDSVLLGNSR
460 470 480 490 500
GSKEARARAS AALHNIIHSQ PDDKRGRREI RVLHLLEQIR AYCETCWEWQ
510 520 530 540 550
EAHEQGMDQD KNPMPAPVEH QICPAVCVLM KLSFDEEHRH AMNELGRKAT
560 570 580 590 600
RGISSQELGQ GLSGGLQAIA ELLQVDCEMY GLTNDHYSIT LRRYAGMALT
610 620 630 640 650
NLTFGDVANK ATLCSMKGCM RALVAQLKSE SEDLQQVIAS VLRNLSWRAD
660 670 680 690 700
VNSKKTLREV GSVKALMECA LEVKKESTLK SVLSALWNLS AHCTENKADI
710 720 730 740 750
CAVDGALAFL VGTLTYRSQT NTLAIIESGG GILRNVSSLI ATNEDHRQIL
760 770 780 790 800
RENNCLQTLL QHLKSHSLTI VSNACGTLWN LSARNPKDQE ALWDMGAVSM
810 820 830 840 850
LKNLIHSKHK MIAMGSAAAL RNLMANRPAK YKDANIMSPG SSLPSLHVRK
860 870 880 890 900
QKALEAELDA QHLSETFDNI DNLSPKASHR SKQRHKQSLY GDYVFDTNRH
910 920 930 940 950
EDNRSDNFNA GNMTVLSPYL NTTVLPSSSS SRGSLDSSRS EKDRSLERER
960 970 980 990 1000
GIGLGNYHPA TENPGTSSKR GLQISTTAAQ IAKVMEEVSA IHTSQEDRSS
1010 1020 1030 1040 1050
GSTTELHCVT DERNALRRSS AAHTHSNTYN FTKSENSNRT CSMPYAKLEY
1060 1070 1080 1090 1100
KRSSNDSLNS VSSSDGYGKR GQMKPSIESY SEDDESKFCS YGQYPADLAH
1110 1120 1130 1140 1150
KIHSANHMDD NDGELDTPIN YSLKYSDEQL NSGRQSPSQN ERWARPKHII
1160 1170 1180 1190 1200
EDEIKQSEQR QSRSQSTTYP VYTESTDDKH LKFQPHFGQQ ECVSPYRSRG
1210 1220 1230 1240 1250
ANGSETNRVG SNHGINQNVS QSLCQEDDYE DDKPTNYSER YSEEEQHEEE
1260 1270 1280 1290 1300
ERPTNYSIKY NEEKHHVDQP IDYSLKYATD IPSSQKQSFS FSKSSSGQST
1310 1320 1330 1340 1350
KTEHISSSSE NTSTPSSNAK RQNQLHPSSA QSRSGQTQKA ATCKVSSINQ
1360 1370 1380 1390 1400
ETIQTYCVED TPICFSRCSS LSSLSSAEDE IGCDQTTQEA DSANTLQIAE
1410 1420 1430 1440 1450
IKDKIGTRST EDPVSEVPAV SQHTRTKSSR LQGSSLSSES TRHKAVEFSS
1460 1470 1480 1490 1500
GAKSPSKSGA QTPKSPPEHY VQETPLMFSR CTSVSSLDSF ESRSIASSVQ
1510 1520 1530 1540 1550
SEPCSGMVSG IISPSDLPDS PGQTMPPSRS KTPPPPPQTA QTKREVPKNK
1560 1570 1580 1590 1600
TPTAEKRESG PKQAAVNAAV QRVQVLPDAD TLLHFATEST PDGFSCSSSL
1610 1620 1630 1640 1650
SALSLDEPFI QKDVELRIMP PVQENDNGNE TESEQPKESN ENQEKEAEKT
1660 1670 1680 1690 1700
IDSEKDLLDD SDDDDIEILE ECIISAMPTK SSRKAKKPAQ TASKLPPPVA
1710 1720 1730 1740 1750
RKPSQLPVYK LLPSQNRLQP QKHVSFTPGD DMPRVYCVEG TPINFSTATS
1760 1770 1780 1790 1800
LSDLTIESPP NELAAGEGVR AGAQSGEFEK RDTIPTEGRS TDEAQGGKTS
1810 1820 1830 1840 1850
SVTIPELDDN KAEEGDILAE CINSAMPKGK SHKPFRVKKI MDQVQQASAS
1860 1870 1880 1890 1900
SSATNKNQLD GKKKKPTSPV KPIPQNTEYR TRIRKNADSK NNLNAERVFS
1910 1920 1930 1940 1950
DNKDSKKQNL KNNSKDFNDK LPNNEDRVRG SFAFDSPHHY TPIEGTPYCF
1960 1970 1980 1990 2000
SRNDSLSSLD FDDDDVDLSR EKAELRKAKE NKESEAKVTS HTELTSNQQS
2010 2020 2030 2040 2050
ASKTQAIAKH PINRGQLKPI LQKQSTFPQS SKDIPDRGAA TDEKLQNFAI
2060 2070 2080 2090 2100
ENTPVCFSHN SSLSSLSDID QENNNNKENE PIKETEPPDS QGEPSKPQAS
2110 2120 2130 2140 2150
GYAPKSFHVE DTPVCFSRNS SLSSLSIDSE DDLLQECISS AMPKKKKPSR
2160 2170 2180 2190 2200
LKGDNEKHSP RNMGGMLAED LTLDLKDIQR PDSEHGLSPD SENFDWKAIQ
2210 2220 2230 2240 2250
EGANSIVSSL HQAAAAACLS RQASSDSDSI LSLKSGISLG SPFHLTPDQE
2260 2270 2280 2290 2300
EKPFTSNKGP RILKPGEKST LETKKIESES KGIKGGKKVY KSLITGKVRS
2310 2320 2330 2340 2350
NSEISGQMKQ PLQANMPSIS RGRTMIHIPG VRNSSSSTSP VSKKGPPLKT
2360 2370 2380 2390 2400
PASKSPSEGQ TATTSPRGAK PSVKSELSPV ARQTSQIGGS SKAPSRSGSR
2410 2420 2430 2440 2450
DSTPSRPAQQ PLSRPIQSPG RNSISPGRNG ISPPNKLSQL PRTSSPSTAS
2460 2470 2480 2490 2500
TKSSGSGKMS YTSPGRQMSQ QNLTKQTGLS KNASSIPRSE SASKGLNQVN
2510 2520 2530 2540 2550
NGNGANKKVE LSRMSSTKSS GSESDRSERP VLVRQSTFIK EAPSPTLRRK
2560 2570 2580 2590 2600
LEESASFESL SPSSRPASPT RSQAQTPVLS PSLPDMSLST HSSVQAGGWR
2610 2620 2630 2640 2650
KLPPNLSPTI EYNDGRPAKR HDIARSHSES PSRLPINRSG TWKREHSKHS
2660 2670 2680 2690 2700
SSLPRVSTWR RTGSSSSILS ASSESSEKAK SEDEKHVNSI SGTKQSKENQ
2710 2720 2730 2740 2750
VSAKGTWRKI KENEISPTNS TSQTVSSGAT NGAESKTLIY QMAPAVSKTE
2760 2770 2780 2790 2800
DVWVRIEDCP INNPRSGRSP TGNTPPVIDS VSEKGNPNKD SKDNQAKQNV
2810 2820 2830 2840 2850
GNGSVPMRTV GLENRLNSFI QVDAPDQKGT ETKPGQNNPV PVSETNESSI
2860 2870 2880 2890 2900
VERTPFSSSS SSKHSSPSGT VAARVTPFNY NPSPRKSSAD STSARPSQIP
2910 2920 2930
TPVNNNTKKR DSKTDSTESS GTQSPKRHSG SYLVTSV
Length:2,937
Mass (Da):322,134
Last modified:January 25, 2012 - v1
Checksum:i6CE1DBD750F798A9
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
CM001258 Genomic DNA. Translation: EHH26703.1.

Similar proteinsi

Entry informationi

Entry nameiG7MVJ3_MACMU
AccessioniPrimary (citable) accession number: G7MVJ3
Entry historyiIntegrated into UniProtKB/TrEMBL: January 25, 2012
Last sequence update: January 25, 2012
Last modified: October 25, 2017
This is version 32 of the entry and version 1 of the sequence. See complete history.
Entry statusiUnreviewed (UniProtKB/TrEMBL)