Summary: Tandem-repeating region of mucin, epiglycanin-like
Pfam includes annotations and additional family information from a range of different sources. These sources can be accessed via the tabs below.
The Pfam group coordinates the annotation of Pfam families in Wikipedia, but we have not yet assigned a Wikipedia article to this family. If you think that a particular Wikipedia article provides good annotation, please let us know.
This tab holds the annotation information that is stored in the Pfam database. As we move to using Wikipedia as our main source of annotation, the contents of this tab will be gradually replaced by the Wikipedia tab.
Tandem-repeating region of mucin, epiglycanin-like Provide feedback
The unusual mucin, epiglycanin, is membrane-bound at the C-terminus but has a long region of this tandem-repeat at the N-terminus [1]. It was the first mucin identified to be associated with the malignant behaviour of carcinoma cells [2]. Mouse Muc21/epiglycanin is thought to be a highly glycosylated molecule, which makes it likely that its function is dependent on its glycoforms. Cells expressing Muc21 are significantly less adherent to each other and to extracellular matrix components than control cells, and this loss of adhesion is mediated by the TR portion of Muc21 [3]. This family also now contains the repeat that was the C. elegans protein of unknown function (DUF801).
Literature references
-
Itoh Y, Kamata-Sakurai M, Denda-Nagai K, Nagai S, Tsuiji M, Ishii-Schrade K, Okada K, Goto A, Fukayama M, Irimura T;, Glycobiology. 2008;18:74-83.: Identification and expression of human epiglycanin/MUC21: a novel transmembrane mucin. PUBMED:17977904 EPMC:17977904
-
Toda M, Hisano R, Yurugi H, Akita K, Maruyama K, Inoue M, Adachi T, Tsubata T, Nakada H;, Biochem J. 2009;417:673-683.: Ligation of tumour-produced mucins to CD22 dramatically impairs splenic marginal zone B-cells. PUBMED:18925876 EPMC:18925876
-
Yi Y, Kamata-Sakurai M, Denda-Nagai K, Itoh T, Okada K, Ishii-Schrade K, Iguchi A, Sugiura D, Irimura T;, J Biol Chem. 2010;285:21233-21240.: Mucin 21/epiglycanin modulates cell adhesion. PUBMED:20388707 EPMC:20388707
This tab holds annotation information from the InterPro database.
InterPro entry IPR008519
The unusual mucin, epiglycanin, is membrane-bound at the C terminus but has a long region of this tandem-repeat at the N terminus [ PUBMED:17977904 ]. It was the first mucin identified to be associated with the malignant behaviour of carcinoma cells [ PUBMED:18925876 ]. Mouse Muc21/epiglycanin is thought to be a highly glycosylated molecule, which makes it likely that its function is dependent on its glycoforms. Cells expressing Muc21 are significantly less adherent to each other and to extracellular matrix components than control cells, and this loss of adhesion is mediated by the tandem-repeat portion of Muc21 [ PUBMED:20388707 ].
Domain organisation
Below is a listing of the unique domain organisations or architectures in which this domain is found. More...
Loading domain graphics...
Alignments
We store a range of different sequence alignments for families. As well as the seed alignment from which the family is built, we provide the full alignment, generated by searching the sequence database (reference proteomes) using the family HMM. We also generate alignments using four representative proteomes (RP) sets and the UniProtKB sequence database. More...
View options
We make a range of alignments for each Pfam-A family. You can see a description of each above. You can view these alignments in various ways but please note that some types of alignment are never generated while others may not be available for all families, most commonly because the alignments are too large to handle.
Seed (8) |
Full (453) |
Representative proteomes | UniProt (643) |
||||
---|---|---|---|---|---|---|---|
RP15 (52) |
RP35 (88) |
RP55 (206) |
RP75 (369) |
||||
Jalview | |||||||
HTML | |||||||
PP/heatmap | 1 |
1Cannot generate PP/Heatmap alignments for seeds; no PP data available
Key:
available,
not generated,
— not available.
Format an alignment
Download options
We make all of our alignments available in Stockholm format. You can download them here as raw, plain text files or as gzip-compressed files.
Seed (8) |
Full (453) |
Representative proteomes | UniProt (643) |
||||
---|---|---|---|---|---|---|---|
RP15 (52) |
RP35 (88) |
RP55 (206) |
RP75 (369) |
||||
Raw Stockholm | |||||||
Gzipped |
You can also download a FASTA format file containing the full-length sequences for all sequences in the full alignment.
HMM logo
HMM logos is one way of visualising profile HMMs. Logos provide a quick overview of the properties of an HMM in a graphical form. You can see a more detailed description of HMM logos and find out how you can interpret them here. More...
Trees
This page displays the phylogenetic tree for this family's seed alignment. We use FastTree to calculate neighbour join trees with a local bootstrap based on 100 resamples (shown next to the tree nodes). FastTree calculates approximately-maximum-likelihood phylogenetic trees from our seed alignment.
Note: You can also download the data file for the tree.
Curation and family details
This section shows the detailed information about the Pfam family. You can see the definitions of many of the terms in this section in the glossary and a fuller explanation of the scoring system that we use in the scores section of the help pages.
Curation
Seed source: | Pfam-B_1480 (release 8.0) Pfam-B_13922 (release 26.0) |
Previous IDs: | DUF801; |
Type: | Repeat |
Sequence Ontology: | SO:0001068 |
Author: |
Moxon SJ |
Number in seed: | 8 |
Number in full: | 453 |
Average length of the domain: | 60.5 aa |
Average identity of full alignment: | 48 % |
Average coverage of the sequence by the domain: | 79.62 % |
HMM information
HMM build commands: |
build method: hmmbuild -o /dev/null HMM SEED
search method: hmmsearch -Z 61295632 -E 1000 --cpu 4 HMM pfamseq
|
||||||||||||
Model details: |
|
||||||||||||
Model length: | 68 | ||||||||||||
Family (HMM) version: | 14 | ||||||||||||
Download: | download the raw HMM for this family |
Species distribution
Sunburst controls
HideWeight segments by...
Change the size of the sunburst
Colour assignments
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
Selections
Align selected sequences to HMM
Generate a FASTA-format file
Clear selection
This visualisation provides a simple graphical representation of the distribution of this family across species. You can find the original interactive tree in the adjacent tab. More...
Tree controls
HideThe tree shows the occurrence of this domain across different species. More...
Loading...
Please note: for large trees this can take some time. While the tree is loading, you can safely switch away from this tab but if you browse away from the family page entirely, the tree will not be loaded.
AlphaFold Structure Predictions
The list of proteins below match this family and have AlphaFold predicted structures. Click on the protein accession to view the predicted structure.
Protein | Predicted structure | External Information |
---|---|---|
A0A0G2JKD1 | View 3D Structure | Click here |
A0A140T8X8 | View 3D Structure | Click here |
O17084 | View 3D Structure | Click here |
Q5SSG8 | View 3D Structure | Click here |
trRosetta Structure
The structural model below was generated by the Baker group with the trRosetta software using the Pfam UniProt multiple sequence alignment.
The InterPro website shows the contact map for the Pfam SEED alignment. Hovering or clicking on a contact position will highlight its connection to other residues in the alignment, as well as on the 3D structure.
- View the contact map and structural model in InterPro
- Download the model in PDB format
- Download all the data from the Pfam FTP site