Protein: CATC_MOUSE (P97821)

Summary

This is the summary of UniProt entry CATC_MOUSE (P97821).

Description: Dipeptidyl peptidase 1 (EC 3.4.14.1)
Source organism: Mus musculus (Mouse) (NCBI taxonomy ID 10090)
Length: 462 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source          Domain          Start  End
sig_p           n/a                 1   24
low_complexity  n/a                11   17
Pfam            CathepsinC_exc     25  141
disorder        n/a               204  205
disorder        n/a               207  222
Pfam            Peptidase_C1      230  457
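
The region table above can be treated as a list of simple records. The following is a minimal Python sketch, not part of Pfam itself. It assumes the Start and End columns are 1-based and inclusive, which is not stated on this page. The `Region` class and the `pfam_domains` helper are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass

SEQUENCE_LENGTH = 462  # "Length: 462 amino acids" from the summary


@dataclass(frozen=True)
class Region:
    source: str  # e.g. "Pfam", "sig_p", "disorder"
    name: str    # domain name, or "n/a" when the table gives none
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        # Inclusive coordinates, so both end points are counted.
        return self.end - self.start + 1


# Rows copied from the table above.
REGIONS = [
    Region("sig_p", "n/a", 1, 24),
    Region("low_complexity", "n/a", 11, 17),
    Region("Pfam", "CathepsinC_exc", 25, 141),
    Region("disorder", "n/a", 204, 205),
    Region("disorder", "n/a", 207, 222),
    Region("Pfam", "Peptidase_C1", 230, 457),
]


def pfam_domains(regions):
    """Return only the rows whose source is Pfam."""
    return [r for r in regions if r.source == "Pfam"]


# Sanity check: every region must lie inside the sequence.
for region in REGIONS:
    assert 1 <= region.start <= region.end <= SEQUENCE_LENGTH

for domain in pfam_domains(REGIONS):
    print(domain.name, domain.start, domain.end, domain.length)
```

Under the inclusive-coordinate assumption, CathepsinC_exc spans 117 residues and Peptidase_C1 spans 228.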

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession P97821. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the copy stored here may differ from the one currently held by UniProt.

Sequence:
  1 MGPWTHSLRA VLLLVLLGVC TVRSDTPANC TYPDLLGTWV FQVGPRSSRS  50
 51 DINCSVMEAT EEKVVVHLKK LDTAYDELGN SGHFTLIYNQ GFEIVLNDYK 100
101 WFAFFKYEVR GHTAISYCHE TMTGWVHDVL GRNWACFVGK KVESHIEKVN 150
151 MNAAHLGGLQ ERYSERLYTH NHNFVKAINT VQKSWTATAY KEYEKMSLRD 200
201 LIRRSGHSQR IPRPKPAPMT DEIQQQILNL PESWDWRNVQ GVNYVSPVRN 250
251 QESCGSCYSF ASMGMLEARI RILTNNSQTP ILSPQEVVSC SPYAQGCDGG 300
301 FPYLIAGKYA QDFGVVEESC FPYTAKDSPC KPRENCLRYY SSDYYYVGGF 350
351 YGGCNEALMK LELVKHGPMA VAFEVHDDFL HYHSGIYHHT GLSDPFNPFE 400
401 LTNHAVLLVG YGRDPVTGIE YWIIKNSWGS NWGESGYFRI RRGTDECAIE 450
451 SIAVAAIPIP KL 462

Checksums:
CRC64:56574B38D7DF4710
MD5:bee2d5f4ee832b96f784aa7c0e9c4ecd
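
A checksum like the MD5 above can be recomputed locally to check that a copy of the sequence is intact. The sketch below is illustrative only. It assumes the digest is taken over the upper-case, one-letter sequence with all whitespace and position numbers removed; this page does not state the exact convention, so a mismatch may simply mean the assumption is wrong. CRC64 is left out because the Python standard library has no CRC64 implementation. The variable names are hypothetical.

```python
import hashlib

# Sequence blocks copied from this page, without the position numbers.
FORMATTED_SEQUENCE = """
MGPWTHSLRA VLLLVLLGVC TVRSDTPANC TYPDLLGTWV FQVGPRSSRS
DINCSVMEAT EEKVVVHLKK LDTAYDELGN SGHFTLIYNQ GFEIVLNDYK
WFAFFKYEVR GHTAISYCHE TMTGWVHDVL GRNWACFVGK KVESHIEKVN
MNAAHLGGLQ ERYSERLYTH NHNFVKAINT VQKSWTATAY KEYEKMSLRD
LIRRSGHSQR IPRPKPAPMT DEIQQQILNL PESWDWRNVQ GVNYVSPVRN
QESCGSCYSF ASMGMLEARI RILTNNSQTP ILSPQEVVSC SPYAQGCDGG
FPYLIAGKYA QDFGVVEESC FPYTAKDSPC KPRENCLRYY SSDYYYVGGF
YGGCNEALMK LELVKHGPMA VAFEVHDDFL HYHSGIYHHT GLSDPFNPFE
LTNHAVLLVG YGRDPVTGIE YWIIKNSWGS NWGESGYFRI RRGTDECAIE
SIAVAAIPIP KL
"""

LISTED_MD5 = "bee2d5f4ee832b96f784aa7c0e9c4ecd"  # value shown on this page

# str.split() with no arguments drops every run of whitespace,
# including the newlines between rows.
sequence = "".join(FORMATTED_SEQUENCE.split())

# Assumption: the digest covers the plain ASCII residue string.
md5_hex = hashlib.md5(sequence.encode("ascii")).hexdigest()

print("length:", len(sequence))
print("computed MD5:", md5_hex)
print("matches listed MD5:", md5_hex == LISTED_MD5)
```

The length check is the more robust of the two: the joined string should contain 462 residues, matching the length given in the summary.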

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.

AlphaFold Structure Prediction

The protein structure below has been predicted by DeepMind with AlphaFold. For more information, please visit the AlphaFold page for this protein.

Model confidence scale

  Very High (pLDDT > 90)
  Confident (90 > pLDDT > 70)
  Low (70 > pLDDT > 50)
  Very Low (pLDDT < 50)
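
The confidence scale above can be expressed as a small lookup function. This is a minimal sketch, assuming pLDDT values lie between 0 and 100. The scale uses strict inequalities and does not say which band the exact values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption made here. The function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score to the confidence band listed above."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very High"
    if plddt > 70:
        return "Confident"  # assumption: exactly 90 falls in this band
    if plddt > 50:
        return "Low"        # assumption: exactly 70 falls in this band
    return "Very Low"       # assumption: exactly 50 falls in this band


for score in (95.2, 80.0, 61.5, 34.0):
    print(score, plddt_band(score))
```

Such a function could, for example, be applied to the per-residue scores of a downloaded model to see how confidence varies along the sequence.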

Reference: John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David Silver, Oriol Vinyals, Andrew W. Senior, Koray Kavukcuoglu, Pushmeet Kohli & Demis Hassabis. Highly accurate protein structure prediction with AlphaFold. Nature, 2021-07-15. DOI: 10.1038/s41586-021-03819-2