
Protein: FREM2_HUMAN (Q5SZK8)

Summary

This is the summary of UniProt entry FREM2_HUMAN (Q5SZK8).

Description: FRAS1-related extracellular matrix protein 2
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 3169 amino acids
Reference Proteome: ✓

Please note: at the start of each new Pfam data release, we take a copy of the UniProt sequence database, and this snapshot forms the basis of the overview shown here. A UniProt entry that is removed after a Pfam release therefore remains in Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
disorder n/a 1 23
sig_p n/a 1 46
low_complexity n/a 24 45
low_complexity n/a 54 71
Pfam Frem_N 72 284
low_complexity n/a 159 175
Pfam Cadherin_3 302 416
disorder n/a 394 395
low_complexity n/a 405 418
disorder n/a 406 407
Pfam Cadherin_3 421 539
disorder n/a 452 456
disorder n/a 460 469
disorder n/a 530 532
Pfam Cadherin_3 543 677
disorder n/a 564 568
disorder n/a 582 583
disorder n/a 611 613
disorder n/a 622 623
disorder n/a 652 660
disorder n/a 665 678
Pfam Cadherin_3 683 809
disorder n/a 683 687
disorder n/a 704 706
disorder n/a 709 710
disorder n/a 736 759
disorder n/a 781 783
disorder n/a 785 787
Pfam Cadherin_3 812 921
disorder n/a 818 820
disorder n/a 832 836
disorder n/a 848 850
Pfam Cadherin_3 923 1039
disorder n/a 965 969
Pfam Cadherin_3 1050 1170
disorder n/a 1118 1120
Pfam Cadherin_3 1173 1284
Pfam Cadherin_3 1286 1401
Pfam Cadherin_3 1405 1514
low_complexity n/a 1438 1449
Pfam Cadherin_3 1516 1623
disorder n/a 1554 1555
disorder n/a 1603 1604
disorder n/a 1615 1616
Pfam Cadherin_3 1638 1754
Pfam Calx-beta 1762 1858
disorder n/a 1839 1840
Pfam Calx-beta 1871 1982
disorder n/a 1887 1888
Pfam Calx-beta 1996 2103
disorder n/a 1996 1997
low_complexity n/a 2038 2050
disorder n/a 2041 2052
disorder n/a 2054 2058
Pfam Calx-beta 2116 2220
disorder n/a 2173 2178
disorder n/a 2184 2189
low_complexity n/a 2201 2219
disorder n/a 2226 2234
Pfam Calx-beta 2238 2342
disorder n/a 2255 2258
disorder n/a 2271 2273
disorder n/a 2288 2291
disorder n/a 2296 2311
disorder n/a 3038 3055
disorder n/a 3058 3063
disorder n/a 3074 3099
transmembrane n/a 3108 3134
low_complexity n/a 3111 3124
disorder n/a 3141 3169
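
The table above is whitespace-separated, so it can be read programmatically. The following is a minimal Python sketch, not part of Pfam itself: the `Region` type and the `parse_regions` function are hypothetical names chosen for illustration, and the input is only a short excerpt of the rows listed above. It assumes that Start and End are 1-based, inclusive residue positions.

```python
from collections import Counter
from typing import NamedTuple


class Region(NamedTuple):
    source: str   # e.g. "Pfam", "disorder", "sig_p"
    domain: str   # Pfam family name, or "n/a" for non-Pfam rows
    start: int    # first residue (assumed 1-based, inclusive)
    end: int      # last residue (assumed 1-based, inclusive)


def parse_regions(text: str) -> list[Region]:
    """Parse whitespace-separated 'Source Domain Start End' rows."""
    regions = []
    for line in text.strip().splitlines():
        fields = line.split()
        if len(fields) != 4 or fields[0] == "Source":
            continue  # skip the header row and anything without four fields
        source, domain, start, end = fields
        regions.append(Region(source, domain, int(start), int(end)))
    return regions


# A short excerpt of the table above, not the full listing.
EXCERPT = """
Source Domain Start End
sig_p n/a 1 46
Pfam Frem_N 72 284
Pfam Cadherin_3 302 416
Pfam Cadherin_3 421 539
Pfam Calx-beta 1762 1858
transmembrane n/a 3108 3134
"""

regions = parse_regions(EXCERPT)
pfam_counts = Counter(r.domain for r in regions if r.source == "Pfam")
for domain, count in pfam_counts.items():
    span = sum(r.end - r.start + 1 for r in regions if r.domain == domain)
    print(f"{domain}: {count} region(s), {span} residues")
```

Feeding the full table to `parse_regions` instead of the excerpt would give the per-family counts for the whole protein.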

Sequence information

This is the amino acid sequence of the UniProt entry with the accession Q5SZK8. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the copy stored here may differ from the one currently held by UniProt.

Sequence:
1 MHSAGTPGLS SRRTGNSTSF QPGPPPPPRL LLLLLLLLSL VSRVPAQPAA 50
51 FGRALLSPGL AGAAGVPAEE AIVLANRGLR VPFGREVWLD PLHDLVLQVQ 100
101 PGDRCAVSVL DNDALAQRPG RLSPKRFPCD FGPGEVRYSH LGARSPSRDR 150
151 VRLQLRYDAP GGAVVLPLVL EVEVVFTQLE VVTRNLPLVV EELLGTSNAL 200
201 DARSLEFAFQ PETEECRVGI LSGLGALPRY GELLHYPQVP GGAREGGAPE 250
251 TLLMDCKAFQ ELGVRYRHTA ASRSPNRDWI PMVVELRSRG APVGSPALKR 300
301 EHFQVLVRIR GGAENTAPKP SFVAMMMMEV DQFVLTALTP DMLAAEDAES 350
351 PSDLLIFNLT SPFQPGQGYL VSTDDRSLPL SSFTQRDLRL LKIAYQPPSE 400
401 DSDQERLFEL ELEVVDLEGA ASDPFAFMVV VKPMNTMAPV VTRNTGLILY 450
451 EGQSRPLTGP AGSGPQNLVI SDEDDLEAVR LEVVAGLRHG HLVILGASSG 500
501 SSAPKSFTVA ELAAGQVVYQ HDDRDGSLSD NLVLRMVDGG GRHQVQFLFP 550
551 ITLVPVDDQP PVLNANTGLT LAEGETVPIL PLSLSATDMD SDDSLLLFVL 600
601 ESPFLTTGHL LLRQTHPPHE KQELLRGLWR KEGAFYERTV TEWQQQDITE 650
651 GRLFYRHSGP HSPGPVTDQF TFRVQDNHDP PNQSGLQRFV IRIHPVDRLP 700
701 PELGSGCPLR MVVQESQLTP LRKKWLRYTD LDTDDRELRY TVTQSPTDTD 750
751 ENHLPAPLGT LVLTDNPSVV VTHFTQAQIN HHKIAYRPPG QELGVATRVA 800
801 QFQFQVEDRA GNVAPGTFTL YLHPVDNQPP EILNTGFTIQ EKGHHILSET 850
851 ELHVNDVDTD VAHISFTLTQ APKHGHMRVS GQILHVGGLF HLEDIKQGRV 900
901 SYAHNGDKSL TDSCSLEVSD RHHVVPITLR VNVRPVDDEV PILSHPTGTL 950
951 ESYLDVLENG ATEITANVIK GTNEETDDLM LTFLLEDPPL YGEILVNGIP 1000
1001 AEQFTQRDIL EGSVVYTHTS GEIGLLPKAD SFNLSLSDMS QEWRIGGNTI 1050
1051 QGVTIWVTIL PVDSQAPEIF VGEQLIVMEG DKSVITSVHI SAEDVDSLND 1100
1101 DILCTIVIQP TSGYVENISP APGSEKSRAG IAISAFNLKD LRQGHINYVQ 1150
1151 SVHKGVEPVE DRFVFRCSDG INFSERQFFP IVIIPTNDEQ PEMFMREFMV 1200
1201 MEGMSLVIDT PILNAADADV PLDDLTFTIT QFPTHGHIMN QLINGTVLVE 1250
1251 SFTLDQIIES SSIIYEHDDS ETQEDSFVIK LTDGKHSVEK TVLIIVIPVD 1300
1301 DETPRMTINN GLEIEIGDTK IINNKILMAT DLDSEDKSLV YIIRYGPGHG 1350
1351 LLQRRKPTGA FENITLGMNF TQDEVDRNLI QYVHLGQEGI RDLIKFDVTD 1400
1401 GINPLIDRYF YVSIGSIDIV FPDVISKGVS LKEGGKVTLT TDLLSTSDLN 1450
1451 SPDENLVFTI TRAPMRGHLE CTDQPGVSIT SFTQLQLAGN KIYYIHTADD 1500
1501 EVKMDSFEFQ VTDGRNPVFR TFRISISDVD NKKPVVTIHK LVVSESENKL 1550
1551 ITPFELTVED RDTPDKLLKF TITQVPIHGH LLFNNTRPVM VFTKQDLNEN 1600
1601 LISYKHDGTE SSEDSFSFTV TDGTHTDFYV FPDTVFETRR PQVMKIQVLA 1650
1651 VDNSVPQIAV NKGASTLRTL ATGHLGFMIT SKILKVEDRD SLHISLRFIV 1700
1701 TEAPQHGYLL NLDKGNHSIT QFTQADIDDM KICYVLREGA NATSDMFYFA 1750
1751 VEDGGGNKLT YQNFRLNWAW ISFEKEYYLV NEDSKFLDVV LKRRGYLGET 1800
1801 SFISIGTRDR TAEKDKDFKG KAQKQVQFNP GQTRATWRVR ILSDGEHEQS 1850
1851 ETFQVVLSEP VLAALEFPTV ATVEIVDPGD EPTVFIPQSK YSVEEDVGEL 1900
1901 FIPIRRSGDV SQELMVVCYT QQGTATGTVP TSVLSYSDYI SRPEDHTSVV 1950
1951 RFDKDEREKL CRIVIIDDSL YEEEETFHVL LSMPMGGRIG SEFPGAQVTI 2000
2001 VPDKDDEPIF YFGDVEYSVD ESAGYVEVQV WRTGTDLSKS SSVTVRSRKT 2050
2051 DPPSADAGTD YVGISRNLDF APGVNMQPVR VVILDDLGQP ALEGIEKFEL 2100
2101 VLRMPMNAAL GEPSKATVSI NDSVSDLPKM QFKERIYTGS ESDGQIVTMI 2150
2151 HRTGDVQYRS SVRCYTRQGS AQVMMDFEER PNTDTSIITF LPGETEKPCI 2200
2201 LELMDDVLYE EVEELRLVLG TPQSNSPFGA AVGEQNETLI RIRDDADKTV 2250
2251 IKFGETKFSV TEPKEPGESV VIRIPVIRQG DTSKVSIVRV HTKDGSATSG 2300
2301 EDYHPVSEEI EFKEGETQHV VEIEVTFDGV REMREAFTVH LKPDENMIAE 2350
2351 MQLTKAIVYI EEMSSMADVT FPSVPQIVSL LMYDDTSKAK ESAEPMSGYP 2400
2401 VICITACNPK YSDYDKTGSI CASENINDTL TRYRWLISAP AGPDGVTSPM 2450
2451 REVDFDTFFT SSKMVTLDSI YFQPGSRVQC AARAVNTNGD EGLELMSPIV 2500
2501 TISREEGLCQ PRVPGVVGAE PFSAKLRYTG PEDADYTNLI KLTVTMPHID 2550
2551 GMLPVISTRE LSNFELTLSP DGTRVGNHKC SNLLDYTEVK THYGFLTDAT 2600
2601 KNPEIIGETY PYQYSLSIRG STTLRFYRNL NLEACLWEFV SYYDMSELLA 2650
2651 DCGGTIGTDG QVLNLVQSYV TLRVPLYVSY VFHSPVGVGG WQHFDLKSEL 2700
2701 RLTFVYDTAI LWNDGIGSPP EAELQGSLYP TSMRIGDEGR LAVHFKTEAQ 2750
2751 FHGLFVLSHP ASFTSSVIMS ADHPGLTFSL RLIRSEPTYN QPVQQWSFVS 2800
2801 DFAVRDYSGT YTVKLVPCTA PSHQEYRLPV TCNPREPVTF DLDIRFQQVS 2850
2851 DPVAAEFSLN TQMYLLSKKS LWLSDGSMGF GQESDVAFAE GDIIYGRVMV 2900
2901 DPVQNLGDSF YCSIEKVFLC TGADGYVPKY SPMNAEYGCL ADSPSLLYRF 2950
2951 KIVDKAQPET QATSFGNVLF NAKLAVDDPE AILLVNQPGS DGFKVDSTPL 3000
3001 FQVALGREWY IHTIYTVRSK DNANRGIGKR SVEYHSLVSQ GKPQSTTKSR 3050
3051 KKREIRSTPS LAWEIGAENS RGTNIQHIAL DRTKRQIPHG RAPPDGILPW 3100
3101 ELNSPSSAVS LVTVVGGTTV GLLTICLTVI AVLMCRGKES FRGKDAPKGS 3150
3151 SSSEPMVPPQ SHHNDSSEV 3169

Checksums:
CRC64: 4000FC02963417F7
MD5: 33fa06926baf93163662f4b134c1e5a9
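
To compare a local copy of the sequence with the MD5 checksum above, the position numbers and spacing in the formatted listing first have to be removed. The following Python sketch shows one way to do that. The helper names `unformat_sequence` and `md5_hex` are hypothetical. That the published MD5 is computed over the uppercase, whitespace-free residue string is an assumption, not something this page states. CRC64 is left out because the Python standard library does not provide it.

```python
import hashlib
import re


def unformat_sequence(block: str) -> str:
    """Drop position numbers, spaces and line breaks; keep residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def md5_hex(sequence: str) -> str:
    """Lowercase hexadecimal MD5 digest of the sequence's ASCII bytes."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


# First and last rows of the formatted sequence shown above.
first_row = "1\nMHSAGTPGLS SRRTGNSTSF QPGPPPPPRL LLLLLLLLSL VSRVPAQPAA\n50"
last_row = "3151\nSSSEPMVPPQ SHHNDSSEV\n3169"

print(len(unformat_sequence(first_row)))  # 50 residues
print(len(unformat_sequence(last_row)))   # 19 residues (3151 to 3169)
print(md5_hex(unformat_sequence(first_row)))  # digest of the excerpt only
```

Passing the complete formatted listing through `unformat_sequence` should give a string of 3169 residues. Its digest can then be compared with the MD5 value listed here, bearing in mind the assumption above about how that value was computed.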

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.