Protein: O77380_PLAF7 (O77380)

Summary

This is the summary of UniProt entry O77380_PLAF7 (O77380).

Description: CPSF (Cleavage and polyadenylation specific factor), subunit A, putative {ECO:0000313|EMBL:CAB11136.2}
Source organism: Plasmodium falciparum (isolate 3D7) (NCBI taxonomy ID 36329)
Length: 2870 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
low_complexity n/a 150 165
low_complexity n/a 196 281
disorder n/a 197 279
disorder n/a 319 321
low_complexity n/a 510 527
disorder n/a 524 563
low_complexity n/a 531 572
low_complexity n/a 571 593
disorder n/a 708 710
low_complexity n/a 862 875
low_complexity n/a 880 891
low_complexity n/a 955 972
low_complexity n/a 1024 1036
low_complexity n/a 1042 1046
low_complexity n/a 1042 1060
disorder n/a 1051 1061
coiled_coil n/a 1064 1084
disorder n/a 1070 1074
disorder n/a 1084 1085
low_complexity n/a 1085 1100
disorder n/a 1108 1128
low_complexity n/a 1117 1127
low_complexity n/a 1143 1154
low_complexity n/a 1199 1214
disorder n/a 1214 1217
disorder n/a 1220 1221
disorder n/a 1223 1225
disorder n/a 1244 1245
low_complexity n/a 1246 1254
disorder n/a 1251 1262
low_complexity n/a 1269 1284
disorder n/a 1271 1280
low_complexity n/a 1358 1372
low_complexity n/a 1368 1399
coiled_coil n/a 1429 1449
disorder n/a 1525 1527
low_complexity n/a 1525 1546
disorder n/a 1531 1560
low_complexity n/a 1551 1582
disorder n/a 1562 1563
low_complexity n/a 1657 1671
disorder n/a 1773 1774
disorder n/a 1777 1778
disorder n/a 1781 1782
disorder n/a 1785 1786
disorder n/a 1789 1790
disorder n/a 1793 1795
disorder n/a 1797 1799
disorder n/a 1801 1818
disorder n/a 1821 1822
disorder n/a 1825 1826
disorder n/a 1829 1831
disorder n/a 1833 1835
disorder n/a 1837 1913
low_complexity n/a 1892 1903
low_complexity n/a 1895 1914
disorder n/a 1916 1920
disorder n/a 1925 1976
low_complexity n/a 1934 1948
low_complexity n/a 1944 1955
low_complexity n/a 1949 1972
low_complexity n/a 1998 2007
disorder n/a 2025 2030
disorder n/a 2035 2036
low_complexity n/a 2081 2096
disorder n/a 2089 2090
disorder n/a 2094 2111
low_complexity n/a 2108 2138
disorder n/a 2113 2115
disorder n/a 2134 2158
low_complexity n/a 2149 2160
disorder n/a 2172 2194
low_complexity n/a 2172 2181
disorder n/a 2198 2204
disorder n/a 2207 2247
coiled_coil n/a 2207 2229
low_complexity n/a 2207 2233
low_complexity n/a 2228 2245
low_complexity n/a 2335 2348
Pfam CPSF_A 2503 2829
low_complexity n/a 2638 2650
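
The region table above is whitespace-delimited, so it can be loaded with a few lines of code. The following is a minimal Python sketch, not part of the Pfam site: the names `Region`, `parse_regions`, and `merged_coverage` are hypothetical, and the sketch assumes that Start and End are 1-based inclusive residue positions. It uses a few rows copied from the table to compute the length of the CPSF_A match and the number of residues covered by two overlapping low-complexity regions.

```python
from typing import List, NamedTuple


class Region(NamedTuple):
    source: str
    domain: str
    start: int
    end: int


def parse_regions(text: str) -> List[Region]:
    """Parse 'Source Domain Start End' rows, skipping the header and malformed lines."""
    regions = []
    for line in text.strip().splitlines():
        fields = line.split()
        if len(fields) != 4 or not (fields[2].isdigit() and fields[3].isdigit()):
            continue  # header row or unexpected layout
        source, domain, start, end = fields
        regions.append(Region(source, domain, int(start), int(end)))
    return regions


def merged_coverage(regions: List[Region]) -> int:
    """Count residues covered by at least one region (inclusive coordinates assumed)."""
    covered = 0
    cur_start = cur_end = None
    for r in sorted(regions, key=lambda r: (r.start, r.end)):
        if cur_end is None or r.start > cur_end:
            # Close the previous merged interval before starting a new one.
            if cur_end is not None:
                covered += cur_end - cur_start + 1
            cur_start, cur_end = r.start, r.end
        else:
            cur_end = max(cur_end, r.end)
    if cur_end is not None:
        covered += cur_end - cur_start + 1
    return covered


# A few rows copied from the table above.
SAMPLE = """Source Domain Start End
coiled_coil n/a 2207 2229
low_complexity n/a 2207 2233
low_complexity n/a 2228 2245
Pfam CPSF_A 2503 2829
"""

regions = parse_regions(SAMPLE)
pfam = [r for r in regions if r.source == "Pfam"]
low = [r for r in regions if r.source == "low_complexity"]

cpsf_a_length = pfam[0].end - pfam[0].start + 1
low_coverage = merged_coverage(low)
print(cpsf_a_length, low_coverage)
```

Merging before counting matters here because many low_complexity and disorder rows overlap, so summing the individual lengths would count some residues more than once.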

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession O77380. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the copy stored here may differ from the one currently held by UniProt.

Sequence:
   1 MSPYHFYNNV IDSKSVRSSV CCNIKGNEKK YLIYACNNHL NVCCIDKNGN 50
  51 TDDYSEHVLF AEVLELREYV PEKLVHSTYN KEKVKSYLFV LTRKYVLLLL 100
 101 EYDVKENDFI TLSKINLCEL NGLHLEEDVI FLLDERHKTI LFYGYKNILK 150
 151 YIYLDYDNFL NLNNVYTMRI DESLIIDIAF LGTHTMGCNK QLRNKQDDDD 200
 201 KINYGDNKNN YGDNKNNYGD NNNNYGDYNN NYDDNRSNYD DDKSNYDDDK 250
 251 SNYDDDKSNY DDDKSNYDDD KSNYDDDNKY YDRVHKSGNF LCDDIYKNEM 300
 301 LFNKEDLFLE HQNIYKKIKK EMSTDDQDDV PLINKTLYSI GNKSCLEIDK 350
 351 LLKKKSYDFV GTMNYDNNDC RPNSINKLKV EERGKRNYDV RNCNNFDHIS 400
 401 GGLLSQHPNS FESYYPKYNE YNKNIRRMLD GYYNIKGRQQ DDIKLAFHDS 450
 451 CNSTYNKNND EYMYSTICIL YDYKKSDTEF YERYIRIIPL CRMNDASIKF 500
 501 LGDDNSEFDF YDYSDNDKYY AYKYNKRNRD EDNGKDNHKD NHKDNHKDNH 550
 551 KDNHKDNDKD NDRDNLKYKY DNYEYNIYSN NKNINKKNSF VNNILDKGCL 600
 601 LPKYYKPLHV DPSINKILCL SKYKILLIGF QFINYINIVK EKRRSFFLSS 650
 651 EFRTITYIES INMNKYILSD DYGDLFILSF LPKKKNKDIF DEEDIDVHNN 700
 701 MEKKNDKYED DHMKDIQDKH DIEKKNEPYK GRYYDFYCKY GGNDTFMGDI 750
 751 SSMRVQFLGT CSRANVITRI YPDIIFLGSQ ISDSYLLRMH YFPIYEREDF 800
 801 EPVEYSPYCQ MLKEDETVSK EMNMYDVNKY VEDIWNKNKN VMVPIMNNNS 850
 851 RNIPHMENMK SMNNIYNVNN INNIFNFPAD LNNLYNNHYK NKEMHGLGKY 900
 901 CMNYNKTYFC TYPYNDNNIG RNKYHRDKND YYTSIINTSQ GDVNYSPNTN 950
 951 YIYDYNEGNK KNKIEKKKFY IEILSVIQNM GPILDMCVVK NKNNEYEIIT 1000
1001 CNSYGRTGCV SIIQSGLKTN ITCDLNFNKL NNFFVVKYVI YLKKKKKNTH 1050
1051 THIHTHATIT NDAMKKENDH KDCIEDNKDK VNKESLNINE NRNNILINDD 1100
1101 EGMMNDVCND DGTNLPRKKQ KTKNKKNKSV SEEPLHFEVF VSLNNKKNIK 1150
1151 IKNINLCFLN KYYFESEKEI NVLKYSNFIY FHIFMICVTY ANQTKIVGVS 1200
1201 RDIFKRRKIK KDSSSETLIN NKDNKFDNGT NHYPDNIMNC FTSDSHNNNN 1250
1251 NNNNNHMFGM QNGRDIQDDK KNNLNNPNFL NEEKKGDMKN KIPSESFLKD 1300
1301 VCNEIFLCEY ENTDIDMYSN TLYFNIIKNH PYLIQICNNH IRLLCCLSLK 1350
1351 LIYNLQVAYV YNYSIYNDYI YIYCKEGIKI YGIIENYIIH IYTYIIKENI 1400
1401 SSWSLYKNLL ACVFNNNEVV IYNINMNTLK EIKEENHKRE REAIDLDINV 1450
1451 EQGKGKKLHH IFNIVNYYKP EMSFFVYISD VELIMMNDNM YLFLGYSHGN 1500
1501 IEYFIMCAYN KKGKKKNMGA ECMCRNDDNK NGDKKEKKMK KKEHKKKSST 1550
1551 FSNNSDYSDN SNNSNNSDYS DYSDYSYGIN NSYEYVPENS NLKNFLIHTD 1600
1601 YKGLVRKRKE CNTLFKLHKE YLKRKELILK YIYKVCKDNV CNEDTGSRCK 1650
1651 YTYKLNMRKD LKKEKKIKKR KRNVLKYLSQ YNMNVFDFFD FDKISFNNMN 1700
1701 ELYKICNRSK DDVIYIDNEY YYNVNIESDN FVSLQKFLES DNEINKMDTG 1750
1751 SAVVDVASDD MIGSSCNNNI KENVGDNIKE NVGDNIKENV GDNIKENVGD 1800
1801 NIKENVGDNI KENVGDNIKE NVGDNIKEYV GDNIKENVGD NIKENVGDNI 1850
1851 KENVGDNIKE NVGDNIKENV GDNIKENVGD NIKENVGDNI KENVGDNDNV 1900
1901 SNNSYTNYSN SNTYDISHAY NNNCSDKNTL HFSQNNKGKK KGMIKNSNSV 1950
1951 KCNVKKEVDN NNNNNNNKKI LVKGEKNVKI HHKRARYFFN FFQVYGILSE 2000
2001 ESSDYDSSVD IMTFKRSIKP KEKYNDDTYI HRKKNHRKRI VYNTEGTIFD 2050
2051 ELHYGTLSPP SLYIHKYNKS IDKFSHYINK YSISDDDMLS SYDDNDDEVN 2100
2101 PLDVQDTRKN KKNKIKKKNI KYIKNIKYIK NIKNIKNNKT HRKAGRQKRD 2150
2151 ESITSTDSSS IYNSKMDECY QYDDDDDDDD DDNENFEEES QGDDHIFEQN 2200
2201 NILRTFIKNE NKNKNIKKKN NNNKKKKASN DNMNVFNNTS NVNHNDIKKR 2250
2251 KRDIKNLLYK KLKRQCKDIY LQENCKSDCS KYCNSNNMSS EIPSSFDNLN 2300
2301 NILFDEKIFL KNNILKSEKI FLINKRKINV CNDTIKFKKF VKVFSEKKKI 2350
2351 DINQSNIIKK YNFLFVCCES PIIIYSDLKK KINVSKLSLK NIYIVDIFND 2400
2401 FNYLNPFHNF LSFKKKNQNN FYFIFYDGSN IHISPLNQIK KTFLKKIPFH 2450
2451 RTVEKIAYHS DTGLLIAACP SEEKHKTNEM MKQIICFFDP YHDSIKYTYI 2500
2501 IPSKYTVSTI IIYDNEKLMK SNFDVTSFIF VGTCNSNEKY TEPTSGHIHI 2550
2551 FIAKKKANIF EIKHIYTHNI NYGGVTNLVP YDDKIVATIN NMVVILDINN 2600
2601 LIIKYEAFMD PQNLQPKIEG NNAIVELVSF TPSSWIMTVD VYGDYIVVGD 2650
2651 IMTSVTILQY DYENSQLFEV CRDYSNIWCT SLCALSKSHI VVSDMDANFI 2700
2701 ILQKSKFKYN DEDSYKLSSV SLFNHGSIIN KMLPLSNTNL IEEDYDKRNI 2750
2751 LTKNDGILCA SSEGSISVLI PFSSFANFKK ALCIEIAITD NISSIGNLSH 2800
2801 NAYREYKVNF RSKHCKGIVD GELLKMFFHM SFEKQYKTFI YAKWIAKKIN 2850
2851 CKFGSFNNFI LDLENMCSFL 2870

Checksums:
CRC64:F808CCDA52E31335
MD5:3f99163041b21c9738b6fb5ad0fcaacf
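
A locally held copy of the sequence can be compared against the MD5 checksum listed here. The sketch below is illustrative only: `unformat_sequence` and `sequence_md5` are hypothetical helper names, and it assumes that the checksum is computed over the uppercase one-letter sequence with all position numbers and whitespace removed. That assumption has not been verified against this entry, so a mismatch may reflect a different convention rather than a different sequence. The CRC64 value is not reproduced here because it depends on a specific CRC variant that the page does not describe.

```python
import hashlib


def unformat_sequence(formatted: str) -> str:
    """Drop position numbers, spaces, and line breaks, keeping only residue letters."""
    return "".join(ch for ch in formatted if ch.isalpha()).upper()


def sequence_md5(sequence: str) -> str:
    """Hex MD5 digest of the sequence string (assumed checksum convention)."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


# First block of the formatted sequence shown on this page.
sample = """1
MSPYHFYNNV IDSKSVRSSV CCNIKGNEKK YLIYACNNHL NVCCIDKNGN
50
"""

first_block = unformat_sequence(sample)
print(len(first_block), sequence_md5(first_block))
```

Applied to the full formatted sequence, `unformat_sequence` should return a string of 2870 residues, matching the length given in the summary, and `sequence_md5` of that string can then be compared with the MD5 value above.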