0 structures, 1 species, 0 interactions, 1 sequence, 1 architecture

Protein: APCL_HUMAN (O95996)

Summary

This is the summary of UniProt entry APCL_HUMAN (O95996).

Description: Adenomatous polyposis coli protein 2
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 2303 amino acids
Reference Proteome: ✓

Please note: at the start of each new Pfam data release, we take a copy of the UniProt sequence database, and this snapshot forms the basis of the overview shown here. UniProt entries that are removed after a Pfam release therefore remain in Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
Pfam APC_N_CC 6 57
coiled_coil n/a 8 28
disorder n/a 16 17
disorder n/a 19 45
disorder n/a 47 52
disorder n/a 94 119
Pfam Suppressor_APC 124 205
coiled_coil n/a 131 151
low_complexity n/a 143 151
disorder n/a 200 202
disorder n/a 213 214
disorder n/a 227 230
disorder n/a 238 270
disorder n/a 291 299
low_complexity n/a 325 339
disorder n/a 328 343
Pfam APC_rep 357 431
low_complexity n/a 392 408
disorder n/a 398 402
Pfam Arm 614 654
low_complexity n/a 626 639
Pfam Arm_APC_u3 697 959
disorder n/a 706 707
coiled_coil n/a 725 745
disorder n/a 736 765
low_complexity n/a 778 800
disorder n/a 782 793
disorder n/a 813 839
low_complexity n/a 836 851
coiled_coil n/a 840 860
low_complexity n/a 866 877
disorder n/a 868 935
disorder n/a 947 993
disorder n/a 1001 1038
disorder n/a 1041 1045
disorder n/a 1050 1051
disorder n/a 1053 1057
Pfam APC_r 1054 1076
low_complexity n/a 1064 1077
disorder n/a 1070 1154
low_complexity n/a 1081 1101
Pfam APC_r 1146 1168
low_complexity n/a 1156 1169
low_complexity n/a 1169 1185
disorder n/a 1173 1228
low_complexity n/a 1213 1225
Pfam APC_r 1258 1281
disorder n/a 1258 1262
low_complexity n/a 1274 1282
disorder n/a 1298 1301
low_complexity n/a 1303 1320
disorder n/a 1307 1339
Pfam SAMP 1335 1356
disorder n/a 1361 1362
disorder n/a 1371 1498
Pfam APC_r 1386 1409
disorder n/a 1504 1688
low_complexity n/a 1517 1525
low_complexity n/a 1547 1561
low_complexity n/a 1577 1595
low_complexity n/a 1608 1622
Pfam SAMP 1621 1642
low_complexity n/a 1637 1649
low_complexity n/a 1658 1674
disorder n/a 1695 2036
low_complexity n/a 1702 1715
Pfam APC_basic 1786 2118
low_complexity n/a 1818 1829
low_complexity n/a 1867 1886
low_complexity n/a 1895 1908
low_complexity n/a 1934 1946
low_complexity n/a 1960 1975
low_complexity n/a 1976 1988
low_complexity n/a 1999 2027
disorder n/a 2043 2232
low_complexity n/a 2046 2065
low_complexity n/a 2110 2130
disorder n/a 2235 2241
disorder n/a 2251 2303
low_complexity n/a 2287 2299
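
The rows above follow a simple four-column layout (source, domain name, start, end), so they are straightforward to process programmatically. The following is a minimal sketch, assuming whitespace-separated columns and 1-based inclusive coordinates; `Region`, `parse_regions`, and `covered_residues` are hypothetical helper names introduced for illustration and are not part of any Pfam tool.

```python
from typing import List, NamedTuple


class Region(NamedTuple):
    source: str  # e.g. "Pfam", "disorder", "coiled_coil", "low_complexity"
    name: str    # domain name, or "n/a" for unnamed regions
    start: int   # 1-based, inclusive (assumed)
    end: int     # 1-based, inclusive (assumed)


def parse_regions(text: str) -> List[Region]:
    """Parse 'Source Domain Start End' rows, skipping the header and blanks."""
    regions = []
    for line in text.splitlines():
        parts = line.split()
        # Keep only rows with four fields whose last two are integers.
        if len(parts) != 4 or not (parts[2].isdigit() and parts[3].isdigit()):
            continue
        regions.append(Region(parts[0], parts[1], int(parts[2]), int(parts[3])))
    return regions


def covered_residues(regions: List[Region], source: str) -> int:
    """Count distinct residues covered by regions from one source."""
    covered = set()
    for region in regions:
        if region.source == source:
            covered.update(range(region.start, region.end + 1))
    return len(covered)


# A few rows copied from the table above.
sample = """Source Domain Start End
Pfam APC_N_CC 6 57
coiled_coil n/a 8 28
disorder n/a 16 17
Pfam Suppressor_APC 124 205
Pfam SAMP 1335 1356
Pfam SAMP 1621 1642
"""

regions = parse_regions(sample)
pfam_regions = [r for r in regions if r.source == "Pfam"]
pfam_coverage = covered_residues(regions, "Pfam")
```

Using a set of positions keeps the coverage count correct when regions from the same source overlap, at the cost of some memory; for a 2303-residue protein that cost is negligible.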

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession O95996. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the copy stored here may differ from the one currently stored by UniProt.

Sequence:
   1 MASSVAPYEQ LVRQVEALKA ENSHLRQELR DNSSHLSKLE TETSGMKEVL 50
  51 KHLQGKLEQE ARVLVSSGQT EVLEQLKALQ MDITSLYNLK FQPPTLGPEP 100
 101 AARTPEGSPV HGSGPSKDSF GELSRATIRL LEELDRERCF LLNEIEKEEK 150
 151 EKLWYYSQLQ GLSKRLDELP HVETQFSMQM DLIRQQLEFE AQHIRSLMEE 200
 201 RFGTSDEMVQ RAQIRASRLE QIDKELLEAQ DRVQQTEPQA LLAVKSVPVD 250
 251 EDPETEVPTH PEDGTPQPGN SKVEVVFWLL SMLATRDQED TARTLLAMSS 300
 301 SPESCVAMRR SGCLPLLLQI LHGTEAAAGG RAGAPGAPGA KDARMRANAA 350
 351 LHNIVFSQPD QGLARKEMRV LHVLEQIRAY CETCWDWLQA RDGGPEGGGA 400
 401 GSAPIPIEPQ ICQATCAVMK LSFDEEYRRA MNELGGLQAV AELLQVDYEM 450
 451 HKMTRDPLNL ALRRYAGMTL TNLTFGDVAN KATLCARRGC MEAIVAQLAS 500
 501 DSEELHQVVS SILRNLSWRA DINSKKVLRE AGSVTALVQC VLRATKESTL 550
 551 KSVLSALWNL SAHSTENKAA ICQVDGALGF LVSTLTYKCQ SNSLAIIESG 600
 601 GGILRNVSSL VATREDYRQV LRDHNCLQTL LQHLTSHSLT IVSNACGTLW 650
 651 NLSARSARDQ ELLWDLGAVG MLRNLVHSKH KMIAMGSAAA LRNLLAHRPA 700
 701 KHQAAATAVS PGSCVPSLYV RKQRALEAEL DARHLAQALE HLEKQGPPAA 750
 751 EAATKKPLPP LRHLDGLAQD YASDSGCFDD DDAPSSLAAA AATGEPASPA 800
 801 ALSLFLGSPF LQGQALARTP PTRRGGKEAE KDTSGEAAVA AKAKAKLALA 850
 851 VARIDQLVED ISALHTSSDD SFSLSSGDPG QEAPREGRAQ SCSPCRGPEG 900
 901 GRREAGSRAH PLLRLKAAHA SLSNDSLNSG SASDGYCPRE HMLPCPLAAL 950
 951 ASRREDPRCG QPRPSRLDLD LPGCQAEPPA REATSADARV RTIKLSPTYQ 1000
1001 HVPLLEGASR AGAEPLAGPG ISPGARKQAW LPADHLSKVP EKLAAAPLSV 1050
1051 ASKALQKLAA QEGPLSLSRC SSLSSLSSAG RPGPSEGGDL DDSDSSLEGL 1100
1101 EEAGPSEAEL DSTWRAPGAT SLPVAIPAPR RNRGRGLGVE DATPSSSSEN 1150
1151 YVQETPLVLS RCSSVSSLGS FESPSIASSI PSEPCSGQGS GTISPSELPD 1200
1201 SPGQTMPPSR SKTPPLAPAP QGPPEATQFS LQWESYVKRF LDIADCRERC 1250
1251 RLPSELDAGS VRFTVEKPDE NFSCASSLSA LALHEHYVQQ DVELRLLPSA 1300
1301 CPERGGGAGG AGLHFAGHRR REEGPAPTGS RPRGAADQEL ELLRECLGAA 1350
1351 VPARLRKVAS ALVPGRRALP VPVYMLVPAP APAQEDDSCT DSAEGTPVNF 1400
1401 SSAASLSDET LQGPPRDQPG GPAGRQRPTG RPTSARQAMG HRHKAGGAGR 1450
1451 SAEQSRGAGK NRAGLELPLG RPPSAPADKD GSKPGRTRGD GALQSLCLTT 1500
1501 PTEEAVYCFY GNDSDEEPPA AAPTPTHRRT SAIPRAFTRE RPQGRKEAPA 1550
1551 PSKAAPAAPP PARTQPSLIA DETPPCYSLS SSASSLSEPE PSEPPAVHPR 1600
1601 GREPAVTKDP GPGGGRDSSP SPRAAEELLQ RCISSALPRR RPPVSGLRRR 1650
1651 KPRATRLDER PAEGSRERGE EAAGSDRASD LDSVEWRAIQ EGANSIVTWL 1700
1701 HQAAAATREA SSESDSILSF VSGLSVGSTL QPPKHRKGRQ AEGEMGSARR 1750
1751 PEKRGAASVK TSGSPRSPAG PEKPRGTQKT TPGVPAVLRG RTVIYVPSPA 1800
1801 PRAQPKGTPG PRATPRKVAP PCLAQPAAPA KVPSPGQQRS RSLHRPAKTS 1850
1851 ELATLSQPPR SATPPARLAK TPSSSSSQTS PASQPLPRKR PPVTQAAGAL 1900
1901 PGPGASPVPK TPARTLLAKQ HKTQRSPVRI PFMQRPARRG PPPLARAVPE 1950
1951 PGPRGRAGTE AGPGARGGRL GLVRVASALS SGSESSDRSG FRRQLTFIKE 2000
2001 SPGLRRRRSE LSSAESAASA PQGASPRRGR PALPAVFLCS SRCEELRAAP 2050
2051 RQGPAPARQR PPAARPSPGE RPARRTTSES PSRLPVRAPA ARPETVKRYA 2100
2101 SLPHISVARR PDGAVPAAPA SADAARRSSD GEPRPLPRVA APGTTWRRIR 2150
2151 DEDVPHILRS TLPATALPLR GSTPEDAPAG PPPRKTSDAV VQTEEVAAPK 2200
2201 TNSSTSPSLE TREPPGAPAG GQLSLLGSDV DGPSLAKAPI SAPFVHEGLG 2250
2251 VAVGGFPASR HGSPSRSARV PPFNYVPSPM VVAATTDSAA EKAPATASAT 2300
2301 LLE 2303
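
The sequence above is displayed in blocks of ten residues with position numbers, so it has to be cleaned before use. The sketch below keeps only the letters, which works regardless of where the position numbers and spaces fall, and then extracts a region by its 1-based inclusive coordinates, which is the convention the domain table appears to use. `clean_sequence` and `region_sequence` are hypothetical helper names introduced for illustration.

```python
def clean_sequence(block: str) -> str:
    """Drop position numbers, spaces and line breaks; keep residue letters."""
    return "".join(ch for ch in block if ch.isalpha()).upper()


def region_sequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"region {start}-{end} is outside 1-{len(seq)}")
    return seq[start - 1:end]


# The first two display rows of the sequence (residues 1 to 100).
first_rows = """
1 MASSVAPYEQ LVRQVEALKA ENSHLRQELR DNSSHLSKLE TETSGMKEVL 50
51 KHLQGKLEQE ARVLVSSGQT EVLEQLKALQ MDITSLYNLK FQPPTLGPEP 100
"""

seq = clean_sequence(first_rows)

# The coiled_coil region listed at positions 8-28 in the domain table.
coiled_coil = region_sequence(seq, 8, 28)
```

Applied to the full block, `len(clean_sequence(...))` should equal the stated length of 2303 amino acids, which is a quick way to confirm that nothing was lost or duplicated during copying.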
 

Checksums:
CRC64:7BF940183ACD643D
MD5:17805df8f88f5227b2a52f48a1fb45ac
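
Checksums like these can be used to confirm that a locally held copy of the sequence is identical to the stored one. The following is a rough sketch under two assumptions that the page does not state: that both checksums are computed over the plain, uppercase sequence with no whitespace, and that the CRC64 is the variant conventionally used for Swiss-Prot/UniProt entries (reflected polynomial 0xD800000000000000, zero initial value, no final XOR). The function names are hypothetical.

```python
import hashlib

# Assumed polynomial: the CRC64 variant used for Swiss-Prot/UniProt entries.
_POLY = 0xD800000000000000


def _build_table():
    """Precompute the 256-entry lookup table for the reflected CRC."""
    table = []
    for i in range(256):
        crc = i
        for _ in range(8):
            crc = (crc >> 1) ^ _POLY if crc & 1 else crc >> 1
        table.append(crc)
    return table


_TABLE = _build_table()


def crc64_hex(seq: str) -> str:
    """CRC64 of an ASCII sequence as 16 uppercase hex digits."""
    crc = 0
    for byte in seq.encode("ascii"):
        crc = _TABLE[(crc ^ byte) & 0xFF] ^ (crc >> 8)
    return f"{crc:016X}"


def md5_hex(seq: str) -> str:
    """MD5 of an ASCII sequence as 32 lowercase hex digits."""
    return hashlib.md5(seq.encode("ascii")).hexdigest()


def matches_checksums(seq: str, crc64: str, md5: str) -> bool:
    """Compare a cleaned sequence against the listed checksum values."""
    return crc64_hex(seq) == crc64.upper() and md5_hex(seq) == md5.lower()
```

For example, `matches_checksums(full_sequence, "7BF940183ACD643D", "17805df8f88f5227b2a52f48a1fb45ac")` would be expected to return `True` for the 2303-residue sequence if the assumptions above hold; a mismatch would point either to a copying error or to a different checksum convention.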

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.