Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: ERCC6_HUMAN (Q03468)

Summary

This is the summary of UniProt entry ERCC6_HUMAN (Q03468).

Description: DNA excision repair protein ERCC-6
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 1493 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
disorder n/a 1 38
disorder n/a 41 46
disorder n/a 58 61
disorder n/a 73 78
disorder n/a 165 174
coiled_coil n/a 171 191
disorder n/a 182 183
disorder n/a 209 229
disorder n/a 235 236
disorder n/a 239 240
disorder n/a 242 269
disorder n/a 285 328
low_complexity n/a 292 308
coiled_coil n/a 318 338
low_complexity n/a 323 335
disorder n/a 332 336
disorder n/a 339 453
low_complexity n/a 362 373
low_complexity n/a 368 392
low_complexity n/a 429 446
low_complexity n/a 450 461
disorder n/a 483 485
low_complexity n/a 483 495
Pfam SNF2-rel_dom 510 812
Pfam Helicase_C 840 952
disorder n/a 943 945
disorder n/a 1016 1027
disorder n/a 1032 1219
low_complexity n/a 1068 1079
disorder n/a 1223 1248
disorder n/a 1266 1269
disorder n/a 1276 1278
disorder n/a 1281 1284
disorder n/a 1291 1293
disorder n/a 1315 1379
low_complexity n/a 1377 1391
disorder n/a 1381 1388
disorder n/a 1400 1406
disorder n/a 1410 1419
disorder n/a 1450 1452

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q03468. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MPNEGIPHSS QTQEQDCLQS QPVSNNEEMA IKQESGGDGE VEEYLSFRSV
50
51
GDGLSTSAVG CASAAPRRGP ALLHIDRHQI QAVEPSAQAL ELQGLGVDVY
100
101
DQDVLEQGVL QQVDNAIHEA SRASQLVDVE KEYRSVLDDL TSCTTSLRQI
150
151
NKIIEQLSPQ AATSRDINRK LDSVKRQKYN KEQQLKKITA KQKHLQAILG
200
201
GAEVKIELDH ASLEEDAEPG PSSLGSMLMP VQETAWEELI RTGQMTPFGT
250
251
QIPQKQEKKP RKIMLNEASG FEKYLADQAK LSFERKKQGC NKRAARKAPA
300
301
PVTPPAPVQN KNKPNKKARV LSKKEERLKK HIKKLQKRAL QFQGKVGLPK
350
351
ARRPWESDMR PEAEGDSEGE ESEYFPTEEE EEEEDDEVEG AEADLSGDGT
400
401
DYELKPLPKG GKRQKKVPVQ EIDDDFFPSS GEEAEAASVG EGGGGGRKVG
450
451
RYRDDGDEDY YKQRLRRWNK LRLQDKEKRL KLEDDSEESD AEFDEGFKVP
500
501
GFLFKKLFKY QQTGVRWLWE LHCQQAGGIL GDEMGLGKTI QIIAFLAGLS
550
551
YSKIRTRGSN YRFEGLGPTV IVCPTTVMHQ WVKEFHTWWP PFRVAILHET
600
601
GSYTHKKEKL IRDVAHCHGI LITSYSYIRL MQDDISRYDW HYVILDEGHK
650
651
IRNPNAAVTL ACKQFRTPHR IILSGSPMQN NLRELWSLFD FIFPGKLGTL
700
701
PVFMEQFSVP ITMGGYSNAS PVQVKTAYKC ACVLRDTINP YLLRRMKSDV
750
751
KMSLSLPDKN EQVLFCRLTD EQHKVYQNFV DSKEVYRILN GEMQIFSGLI
800
801
ALRKICNHPD LFSGGPKNLK GLPDDELEED QFGYWKRSGK MIVVESLLKI
850
851
WHKQGQRVLL FSQSRQMLDI LEVFLRAQKY TYLKMDGTTT IASRQPLITR
900
901
YNEDTSIFVF LLTTRVGGLG VNLTGANRVV IYDPDWNPST DTQARERAWR
950
951
IGQKKQVTVY RLLTAGTIEE KIYHRQIFKQ FLTNRVLKDP KQRRFFKSND
1000
1001
LYELFTLTSP DASQSTETSA IFAGTGSDVQ TPKCHLKRRI QPAFGADHDV
1050
1051
PKRKKFPASN ISVNDATSSE EKSEAKGAEV NAVTSNRSDP LKDDPHMSSN
1100
1101
VTSNDRLGEE TNAVSGPEEL SVISGNGECS NSSGTGKTSM PSGDESIDEK
1150
1151
LGLSYKRERP SQAQTEAFWE NKQMENNFYK HKSKTKHHSV AEEETLEKHL
1200
1201
RPKQKPKNSK HCRDAKFEGT RIPHLVKKRR YQKQDSENKS EAKEQSNDDY
1250
1251
VLEKLFKKSV GVHSVMKHDA IMDGASPDYV LVEAEANRVA QDALKALRLS
1300
1301
RQRCLGAVSG VPTWTGHRGI SGAPAGKKSR FGKKRNSNFS VQHPSSTSPT
1350
1351
EKCQDGIMKK EGKDNVPEHF SGRAEDADSS SGPLASSSLL AKMRARNHLI
1400
1401
LPERLESESG HLQEASALLP TTEHDDLLVE MRNFIAFQAH TDGQASTREI
1450
1451
LQEFESKLSA SQSCVFRELL RNLCTFHRTS GGEGIWKLKP EYC       
1493
 

Show the unformatted sequence.

Checksums:
CRC64:285257E2AEC071AC
MD5:51d8852326bb529ecf898e9a10667209

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.

AlphaFold Structure Prediction

The protein structure below has been predicted by DeepMind with AlphaFold. For more information, please visit the AlphaFold page for this protein.

Model confidence scale

  Very High (pLDDT > 90)
  Confident (90 > pLDDT > 70)
  Low (70 > pLDDT > 50)
  Very Low (pLDDT < 50)
Highly accurate protein structure prediction with AlphaFold. John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David Silver, Oriol Vinyals, Andrew W. Senior, Koray Kavukcuoglu, Pushmeet Kohli & Demis Hassabis Nature 2021-07-15; DOI: 10.1038/s41586-021-03819-2;