
Protein: MUC5B_CHICK (Q98UI9)

Summary

This is the summary of UniProt entry MUC5B_CHICK (Q98UI9).

Description: Mucin-5B
Source organism: Gallus gallus (Chicken) (NCBI taxonomy ID 9031)
Length: 2108 amino acids
Reference Proteome: ✓

Please note: at the start of each new Pfam data release, we take a copy of the UniProt sequence database, and this snapshot forms the basis of the overview shown here. Entries removed from UniProt after a Pfam release therefore remain in Pfam until the next Pfam data release.

Pfam domains


Source          Domain  Start   End
sig_p           n/a         1    21
Pfam            VWD        38   181
disorder        n/a       203   210
Pfam            C8        228   300
Pfam            TIL       304   360
Pfam            VWD       400   551
Pfam            C8        593   660
Pfam            TIL       666   723
Pfam            TIL       764   825
Pfam            VWD       865  1014
Pfam            C8       1054  1121
Pfam            VWD      1431  1590
disorder        n/a      1587  1588
Pfam            C8       1633  1718
low_complexity  n/a      1930  1942
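
The domain table above can be loaded into a small data structure for further analysis. The following is a minimal sketch, not part of Pfam's own tooling: the `Region` type, the `parse_regions` helper and the `TABLE` constant are hypothetical names introduced for illustration, and the rows are copied from the table on this page.

```python
from collections import Counter
from typing import List, NamedTuple


class Region(NamedTuple):
    """One row of the domain table (hypothetical helper type)."""
    source: str
    domain: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Start and end are inclusive residue positions.
        return self.end - self.start + 1


# Rows copied from the domain table on this page.
TABLE = """\
sig_p n/a 1 21
Pfam VWD 38 181
disorder n/a 203 210
Pfam C8 228 300
Pfam TIL 304 360
Pfam VWD 400 551
Pfam C8 593 660
Pfam TIL 666 723
Pfam TIL 764 825
Pfam VWD 865 1014
Pfam C8 1054 1121
Pfam VWD 1431 1590
disorder n/a 1587 1588
Pfam C8 1633 1718
low_complexity n/a 1930 1942
"""


def parse_regions(text: str) -> List[Region]:
    """Parse whitespace-separated 'source domain start end' rows."""
    regions = []
    for line in text.splitlines():
        if not line.strip():
            continue
        source, domain, start, end = line.split()
        regions.append(Region(source, domain, int(start), int(end)))
    return regions


regions = parse_regions(TABLE)

# Count how often each Pfam family occurs in this protein.
pfam_counts = Counter(r.domain for r in regions if r.source == "Pfam")

for r in regions:
    print(f"{r.source:<15}{r.domain:<5}{r.start:>5}-{r.end:<5} ({r.length} aa)")
```

Because the parser splits on any whitespace, it accepts both single-space and column-aligned versions of the table.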


Sequence information

This is the amino acid sequence of UniProt entry Q98UI9 as stored in the Pfam database. Pfam refreshes its copy only at each new Pfam release, so the sequence shown here may differ from the one currently held by UniProt.

Sequence:
   1 MEIKKERSFW IFCLIWSFCK GKEPVQIVQV STVGRSECTT WGNFHFHTFD   50
  51 HVKFTFPGTC TYVFASHCND SYQDFNIKIR RSDKNSHLIY FTVTTDGVIL  100
 101 EVKETGITVN GNQIPLPFSL KSILIEDTCA YFQVTSKLGL TLKWNWADTL  150
 151 LLDLEETYKE KICGLCGNYD GNKKNDLILD GYKMHPRQFG NFHKVEDPSE  200
 201 KCPDVRPDDH TGRHPTEDDN RCSKYKKMCK KLLSRFGNCP KVVAFDDYVA  250
 251 TCTEDMCNCV VNSSQSDLVS SCICSTLNQY SRDCVLSKGD PGEWRTKELC  300
 301 YQECPSNMEY MECGNSCADT CADPERSKIC KAPCTDGCFC PPGTILDDLG  350
 351 GKKCVPRDSC PCMFQGKVYS SGGTYSTPCQ NCTCKGGHWS CISLPCSGSC  400
 401 SIDGGFHIKT FDNKKFNFHG NCHYVLAKNT DDTFVVIGEI IQCGTSKTMT  450
 451 CLKNVLVTLG RTTIKICSCG SIYMNNFIVK LPVSKDGITI FRPSTFFIKI  500
 501 LSSAGVQIRV QMKPVMQLSI TVDHSYQNRT SGLCGNFNNI QTDDFRTATG  550
 551 AVEDSAAAFG NSWKTRASCF DVEDSFEDPC SNSVDKEKFA QHWCALLSNT  600
 601 SSTFAACHSV VDPSVYIKRC MYDTCNAEKS EVALCSVLST YSRDCAAAGM  650
 651 TLKGWRQGIC DPSEECPETM VYNYSVKYCN QSCRSLDEPD PLCKVQIAPM  700
 701 EGCGCPEGTY LNDEEECVTP DDCPCYYKGK IVQPGNSFQE DKLLCKCIQG  750
 751 RLDCIGETVL VKDCPAPMYY FNCSSAGPGA IGSECQKSCK TQDMHCYVTE  800
 801 CVSGCMCPDG LVLDGSGGCI PKDQCPCVHG GHFYKPGETI RVDCNTCTCN  850
 851 KRQWNCTDNP CKGTCTVYGN GHYMSFDGEK FDFLGDCDYI LAQDFCPNNM  900
 901 DAGTFRIVIQ NNACGKSLSI CSLKITLIFE SSEIRLLEGR IQEIATDPGA  950
 951 EKNYKVDLRG GYIVIETTQG MSFMWDQKTT VVVHVTPSFQ GKVCGLCGDF 1000
1001 DGRSRNDFTT RGQSVEMSIQ EFGNSWKITS TCSNINMTDL CADQPFKSAL 1050
1051 GQKHCSIIKS SVFEACHSKV NPIPYYESCV SDFCGCDSVG DCECFCTSVA 1100
1101 AYARSCSTAG VCINWRTPAI CPVFCDYYNP PDKHEWFYKP CGAPCLKTCR 1150
1151 NPQGKCGNIL YSLEGCYPEC SPDKPYFDEE RRECVSLPDC TSCNPEEKLC 1200
1201 TEDSKDCLCC YNGKTYPLNE TIYSQTEGTK CGNAFCGPNG MIIETFIPCS 1250
1251 TLSVPAQEQL MQPVTSAPLL STEATPCFCT DNGQLIQMGE NVSLPMNISG 1300
1301 HCAYSICNAS CQIELIWAEC KVVQTEALET CEPNSEACPP TAAPNATSLV 1350
1351 PATALAPMSD CLGLIPPRKF NESWDFGNCQ IATCLGEENN IKLSSITCPP 1400
1401 QQLKLCVNGF PFMKHHDETG CCEVFECQCI CSGWGNEHYV TFDGTYYHFK 1450
1451 ENCTYVLVEL IQPSSEKFWI HIDNYYCGAA DGAICSMSLL IFHSNSLVIL 1500
1501 TQAKEHGKGT NLVLFNDKKV VPDISKNGIR ITSSGLYIIV EIPELEVYVS 1550
1551 YSRLAFYIKL PFGKYYNNTM GLCGTCTNQK SDDARKRNGE VTDSFKEMAL 1600
1601 DWKAPVSTNR YCNPGISEPV KIENYQHCEP SELCKIIWNL TECHRVVPPQ 1650
1651 PYYEACVASR CSQQHPSTEC QSMQTYAALC GLHGICVDWR GQTNGQCEAT 1700
1701 CARDQVYKPC GEAKRNTCFS REVIVDTLLS RNNTPVFVEG CYCPDGNILL 1750
1751 NEHDGICVSV CGCTAQDGSV KKPREAWEHD CQYCTCDEET LNISCFPRPC 1800
1801 AKSPPINCTK EGFVRKIKPR LDDPCCTETV CECDIKTCII NKTACDLGFQ 1850
1851 PVVAISEDGC CPIFSCIPKG VCVSEGVEFK PGAVVPKSSC EDCVCTDEQD 1900
1901 AVTGTNRIQC VPVKCQTTCQ QGFRYVEKEG QCCSQCQQVA CVANFPFGSV 1950
1951 TIEVGKSYKA PYDNCTQYTC TESGGQFSLT STVKVCLPFE ESNCVPGTVD 2000
2001 VTSDGCCKTC IDLPHKCKRS MKEQYIVHKH CKSAAPVPVP FCEGTCSTYS 2050
2051 VYSFENNEME HKCICCHEKK SHVEKVELVC SEHKTLKFSY VHVDECGCVE 2100
2101 TKCPMRRT                                                2108


Checksums:
CRC64:68B887CB781E6539
MD5:13fd022f7de3abfa3c6554bded0fe19b
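
A locally held copy of the sequence can be compared against the MD5 checksum listed above. The sketch below makes an assumption that this page does not confirm: that the MD5 is computed over the plain, upper-case, one-letter sequence with all position numbers and whitespace removed. The function names `unformat_sequence` and `md5_hex` are hypothetical, and the CRC64 value is not covered here.

```python
import hashlib


def unformat_sequence(formatted: str) -> str:
    """Keep only the residue letters, dropping position numbers and whitespace."""
    return "".join(ch for ch in formatted if ch.isalpha()).upper()


def md5_hex(sequence: str) -> str:
    """Return the hexadecimal MD5 digest of an ASCII sequence string."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


# Illustrative input: only the first ten residues of the sequence on this
# page, laid out with position counters as in the listing.
snippet = "1\nMEIKK ERSFW\n10\n"
plain = unformat_sequence(snippet)
print(plain, len(plain), md5_hex(plain))
```

To check the full entry, pass the complete sequence listing to `unformat_sequence`, confirm that the result is 2108 residues long, and compare `md5_hex` of that string with the MD5 value above. A mismatch may simply mean that the assumption about how the checksum is computed does not hold.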