Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: Q9C5J2_ARATH (Q9C5J2)

Summary

This is the summary of UniProt entry Q9C5J2_ARATH (Q9C5J2).

Description: Putative 3-methyladenine DNA glycosylase {ECO:0000313|EMBL:AAK25923.1}
Source organism: Arabidopsis thaliana (Mouse-ear cress) (NCBI taxonomy ID 3702)
View Pfam proteome data.
Length: 391 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
disorder n/a 1 119
low_complexity n/a 36 54
low_complexity n/a 156 167
Pfam HhH-GPD 172 317
disorder n/a 295 296
disorder n/a 298 300
low_complexity n/a 333 346
disorder n/a 340 344
disorder n/a 351 376
low_complexity n/a 356 371

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q9C5J2. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MGEHSPSQPS SHTLPPNQPE SPNHETPNPI PPETNDDDSA SSAGVSGSIV
50
51
SSTTIEAPQV TELGNVSSPP TKIPLRPRKI RKLSPDDDAS DGFNPEHNLS
100
101
QMTTTKPATK SKLSQSRTVT VPRIQARSLT CEGELEAALH HLRSVDPLLA
150
151
SLIDIHPPPT FETFQTPFLA LIRSILYQQL AAKAGNSIYT RFVALCGGEN
200
201
GVVPENVLPL TPQQLRQIGV SGRKASYLHD LARKYQNGIL SDSGIVNMDE
250
251
KSLFTMLTMV NGIGSWSVHM FMINSLHRPD VLPVNDLGVR KGVQMLNGME
300
301
DLPRPSKMEQ LCEKWRPYRS VASWYLWRLI ESKNTPPNAA AATAGAALSF
350
351
PQLEDIQQQE QEQQHQQHQQ QQPQLMDPLN NVFSIGAWGQ T         
391
 

Show the unformatted sequence.

Checksums:
CRC64:1947E2C994C23496
MD5:3429fe65afa8d15fbcdc017861b58940