Protein: DNAA_ECOLI (P03004)

Summary

This is the summary of UniProt entry DNAA_ECOLI (P03004).

Description: Chromosomal replication initiator protein DnaA
Source organism: Escherichia coli (strain K12) (NCBI taxonomy ID 83333)
Length: 467 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source          Domain       Start  End
Pfam            DnaA_N           3   65
disorder        n/a             77   83
disorder        n/a             87  126
low_complexity  n/a             95  110
disorder        n/a            128  129
Pfam            Bac_DnaA       132  350
disorder        n/a            152  156
low_complexity  n/a            169  182
coiled_coil     n/a            326  346
Pfam            Bac_DnaA_C     376  444
coiled_coil     n/a            444  464
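
As a rough illustration of how the region table can be used, the sketch below encodes its rows and computes the length of each Pfam domain and whether two regions overlap. The `Region` type and the helper names are hypothetical, introduced only for this example, and the coordinates are assumed to be 1-based and inclusive.

```python
from typing import List, NamedTuple


class Region(NamedTuple):
    source: str  # "Pfam", "disorder", "low_complexity" or "coiled_coil"
    name: str    # Pfam family name, or "n/a" for predicted features
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add one.
        return self.end - self.start + 1


# Rows copied from the table above.
REGIONS: List[Region] = [
    Region("Pfam", "DnaA_N", 3, 65),
    Region("disorder", "n/a", 77, 83),
    Region("disorder", "n/a", 87, 126),
    Region("low_complexity", "n/a", 95, 110),
    Region("disorder", "n/a", 128, 129),
    Region("Pfam", "Bac_DnaA", 132, 350),
    Region("disorder", "n/a", 152, 156),
    Region("low_complexity", "n/a", 169, 182),
    Region("coiled_coil", "n/a", 326, 346),
    Region("Pfam", "Bac_DnaA_C", 376, 444),
    Region("coiled_coil", "n/a", 444, 464),
]


def pfam_domains(regions: List[Region]) -> List[Region]:
    """Keep only the rows whose source is Pfam."""
    return [r for r in regions if r.source == "Pfam"]


def overlaps(a: Region, b: Region) -> bool:
    """True if the two inclusive ranges share at least one residue."""
    return a.start <= b.end and b.start <= a.end


if __name__ == "__main__":
    for domain in pfam_domains(REGIONS):
        print(f"{domain.name}: {domain.start}-{domain.end} ({domain.length} residues)")
```

With these definitions, the three Pfam domains cover 63, 219 and 69 residues of the 467-residue sequence.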

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession P03004. The sequence is stored in the Pfam database and updated only with each new Pfam release, so the sequence stored here may differ from the one currently held by UniProt.

Sequence:
  1 MSLSLWQQCL ARLQDELPAT EFSMWIRPLQ AELSDNTLAL YAPNRFVLDW  50
 51 VRDKYLNNIN GLLTSFCGAD APQLRFEVGT KPVTQTPQAA VTSNVAAPAQ 100
101 VAQTQPQRAA PSTRSGWDNV PAPAEPTYRS NVNVKHTFDN FVEGKSNQLA 150
151 RAAARQVADN PGGAYNPLFL YGGTGLGKTH LLHAVGNGIM ARKPNAKVVY 200
201 MHSERFVQDM VKALQNNAIE EFKRYYRSVD ALLIDDIQFF ANKERSQEEF 250
251 FHTFNALLEG NQQIILTSDR YPKEINGVED RLKSRFGWGL TVAIEPPELE 300
301 TRVAILMKKA DENDIRLPGE VAFFIAKRLR SNVRELEGAL NRVIANANFT 350
351 GRAITIDFVR EALRDLLALQ EKLVTIDNIQ KTVAEYYKIK VADLLSKRRS 400
401 RSVARPRQMA MALAKELTNH SLPEIGDAFG GRDHTTVLHA CRKIEQLREE 450
451 SHDIKEDFSN LIRTLSS                                     467

Checksums:
CRC64:607C8366A8CDCCED
MD5:cf87188c893421a9a13c9300b1c3cd68
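
A checksum such as the MD5 value above can be used to check that a local copy of the sequence matches the stored one. The following is a minimal sketch, assuming the digest is taken over the uppercase one-letter sequence with all whitespace and position numbers removed; the page does not state this, so a mismatch may simply mean the assumption is wrong. The CRC64 value is not recomputed here because the CRC-64 variant is not specified. The function names are hypothetical.

```python
import hashlib

# Sequence blocks copied from this page, without the position numbers.
SEQUENCE_BLOCKS = """
MSLSLWQQCL ARLQDELPAT EFSMWIRPLQ AELSDNTLAL YAPNRFVLDW
VRDKYLNNIN GLLTSFCGAD APQLRFEVGT KPVTQTPQAA VTSNVAAPAQ
VAQTQPQRAA PSTRSGWDNV PAPAEPTYRS NVNVKHTFDN FVEGKSNQLA
RAAARQVADN PGGAYNPLFL YGGTGLGKTH LLHAVGNGIM ARKPNAKVVY
MHSERFVQDM VKALQNNAIE EFKRYYRSVD ALLIDDIQFF ANKERSQEEF
FHTFNALLEG NQQIILTSDR YPKEINGVED RLKSRFGWGL TVAIEPPELE
TRVAILMKKA DENDIRLPGE VAFFIAKRLR SNVRELEGAL NRVIANANFT
GRAITIDFVR EALRDLLALQ EKLVTIDNIQ KTVAEYYKIK VADLLSKRRS
RSVARPRQMA MALAKELTNH SLPEIGDAFG GRDHTTVLHA CRKIEQLREE
SHDIKEDFSN LIRTLSS
"""

# MD5 value listed on this page.
LISTED_MD5 = "cf87188c893421a9a13c9300b1c3cd68"


def clean_sequence(raw: str) -> str:
    """Remove all whitespace so only residue letters remain."""
    return "".join(raw.split())


def md5_hex(sequence: str) -> str:
    """MD5 hex digest of the sequence (assumed input: plain ASCII letters)."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


SEQUENCE = clean_sequence(SEQUENCE_BLOCKS)

if __name__ == "__main__":
    digest = md5_hex(SEQUENCE)
    print("length:", len(SEQUENCE))
    print("computed MD5:", digest)
    print("matches listed MD5:", digest == LISTED_MD5)
```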

Structures

For sequences that have a structure in the Protein Data Bank, we use the mapping between the UniProt, PDB and Pfam coordinate systems from the PDBe SIFTS project to map Pfam domains onto three-dimensional structures. The table below shows the mapping between Pfam domains, this UniProt entry and the corresponding three-dimensional structures.

Pfam family  UniProt residues  PDB ID  PDB chain ID  PDB residues
Bac_DnaA_C   376 - 444         1J1V    A             376 - 444
DnaA_N       3 - 65            2E0G    A             2 - 64
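
The two rows above show that UniProt and PDB residue numbers need not coincide: for 2E0G the PDB numbering is shifted by one relative to UniProt. The sketch below converts a UniProt position to a PDB residue number, assuming a constant offset across each mapped range with no gaps or insertions. That assumption is made only for this example; real SIFTS mappings are residue-level and may be discontinuous. The `Mapping` type and `uniprot_to_pdb` function are hypothetical names.

```python
from typing import NamedTuple, Optional


class Mapping(NamedTuple):
    family: str
    pdb_id: str
    chain: str
    uniprot_start: int  # inclusive
    uniprot_end: int    # inclusive
    pdb_start: int      # inclusive
    pdb_end: int        # inclusive


# Rows copied from the table above.
MAPPINGS = [
    Mapping("Bac_DnaA_C", "1J1V", "A", 376, 444, 376, 444),
    Mapping("DnaA_N", "2E0G", "A", 3, 65, 2, 64),
]


def uniprot_to_pdb(mapping: Mapping, position: int) -> Optional[int]:
    """Convert a UniProt position to a PDB residue number.

    Returns None when the position lies outside the mapped range.
    Assumes a constant offset over the whole range (no gaps).
    """
    if not mapping.uniprot_start <= position <= mapping.uniprot_end:
        return None
    offset = mapping.pdb_start - mapping.uniprot_start
    return position + offset


if __name__ == "__main__":
    for m in MAPPINGS:
        first = uniprot_to_pdb(m, m.uniprot_start)
        print(f"{m.family}: UniProt {m.uniprot_start} -> {m.pdb_id} chain {m.chain} residue {first}")
```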