Summary: PE-PPE domain
The Pfam group coordinates the annotation of Pfam families in Wikipedia, but we have not yet assigned a Wikipedia article to this family. If you think that a particular Wikipedia article provides good annotation, please let us know.
This tab holds the annotation information that is stored in the Pfam database. As we move to using Wikipedia as our main source of annotation, the contents of this tab will be gradually replaced by the Wikipedia tab.
PE-PPE domain Provide feedback
External database links
This tab holds annotation information from the InterPro database.
InterPro entry IPR013228
The human pathogen Mycobacterium tuberculosis harbours a large number of genes that encode proteins whose N-termini contain the characteristic motifs Pro-Glu (PE) or Pro-Pro-Glu (PPE). A subgroup of the PE proteins contains polymorphic GC-rich sequences (PGRS), while a subgroup of the PPE proteins contains major polymorphic tandem repeats (MPTR). The function of most of these proteins remains unknown [PUBMED:19602151]. However, the PE_PGRS proteins from Mycobacterium marinum are secreted by components of the ESX-5 system that belongs to the recently defined type VII secretion systems [PUBMED:18981138]. It has also been reported that the PE_PGRS family of proteins contains multiple calcium-binding and glycine-rich sequence motifs GGXGXD/NXUX. This sequence repeat constitutes a calcium-binding parallel beta-roll or parallel beta-helix structure and is found in RTX toxins secreted by many Gram-negative bacteria [PUBMED:18267304].
This domain is found C-terminal to the PE (INTERPRO) and PPE (INTERPRO) domains. The secondary structure of this domain is predicted to be a mixture of alpha helices and beta strands [PUBMED:12711809].
- the number of sequences which exhibit this architecture
a textual description of the architecture, e.g. Gla, EGF x 2, Trypsin.
This example describes an architecture with one
Gladomain, followed by two consecutive
EGFdomains, and finally a single
- the UniProt description of the protein sequence
- the number of residues in the sequence
- the Pfam graphic itself.
Loading domain graphics...
This catalytic domain is found in a very wide range of enzymes.
The clan contains the following 67 members:Abhydro_lipase Abhydrolase_1 Abhydrolase_2 Abhydrolase_3 Abhydrolase_4 Abhydrolase_5 Abhydrolase_6 Abhydrolase_7 Abhydrolase_8 Acyl_transf_2 Arb2 AXE1 BAAT_C Chlorophyllase Chlorophyllase2 COesterase Cutinase DLH DUF1057 DUF1100 DUF1350 DUF1400 DUF1749 DUF2048 DUF2235 DUF2305 DUF2424 DUF2920 DUF2974 DUF3089 DUF3141 DUF3530 DUF452 DUF676 DUF726 DUF818 DUF829 DUF900 DUF915 EHN Esterase Esterase_phd FSH1 Hydrolase_4 LCAT LIP Lipase Lipase_2 Lipase_3 Ndr PAF-AH_p_II Palm_thioest PE-PPE Peptidase_S10 Peptidase_S15 Peptidase_S28 Peptidase_S37 Peptidase_S9 PGAP1 PhaC_N PHB_depo_C PhoPQ_related Ser_hydrolase Tannase Thioesterase UPF0227 VirJ
We make a range of alignments for each Pfam-A family:
- the curated alignment from which the HMM for the family is built
- the alignment generated by searching the sequence database using the HMM
- Representative Proteomes (RPs) at 15%, 35%, 55% and 75% co-membership thresholds
- alignment generated by searching the NCBI sequence database using the family HMM
- alignment generated by searching the metagenomics sequence database using the family HMM
You can see the alignments as HTML or in three different sequence viewers:
- Pfam viewer
- an HTML-based viewer that uses DAS to retrieve alignment fragments on request
1Cannot generate PP/Heatmap alignments for seeds; no PP data available
Key: available, not generated, — not available.
Format an alignment
If you find these logos useful in your own work, please consider citing the following article:
Note: You can also download the data file for the tree.
Curation and family details
|Author:||Mistry J, Adindla S|
|Number in seed:||32|
|Number in full:||715|
|Average length of the domain:||213.70 aa|
|Average identity of full alignment:||31 %|
|Average coverage of the sequence by the domain:||45.55 %|
|HMM build commands:||
build method: hmmbuild -o /dev/null HMM SEED
search method: hmmsearch -Z 23193494 -E 1000 --cpu 4 HMM pfamseq
|Family (HMM) version:||6|
|Download:||download the raw HMM for this family|
Weight segments by...
Change the size of the sunburst
selected sequences to HMM
a FASTA-format file
- 0 sequences
- 0 species
How the sunburst is generated
Colouring and labels
Anomalies in the taxonomy tree
Missing taxonomic levels
Unmapped species names
Too many species/sequences
The tree shows the occurrence of this domain across different species. More...
You can use the tree controls to manipulate how the interactive tree is displayed:
- show/hide the summary boxes
- highlight species that are represented in the seed alignment
- expand/collapse the tree or expand it to a given depth
- select a sub-tree or a set of species within the tree and view them graphically or as an alignment
- save a plain text representation of the tree