
Protein: ARI1B_HUMAN (Q8NFD5)

Summary

This is the summary of UniProt entry ARI1B_HUMAN (Q8NFD5).

Description: AT-rich interactive domain-containing protein 1B
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 2236 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
disorder n/a 1 330
low_complexity n/a 2 21
low_complexity n/a 18 58
low_complexity n/a 81 132
coiled_coil n/a 103 131
low_complexity n/a 141 152
low_complexity n/a 166 177
low_complexity n/a 213 228
low_complexity n/a 235 250
low_complexity n/a 260 280
low_complexity n/a 305 367
disorder n/a 360 387
low_complexity n/a 375 401
disorder n/a 389 393
disorder n/a 399 428
disorder n/a 430 478
low_complexity n/a 441 472
low_complexity n/a 481 495
disorder n/a 484 992
low_complexity n/a 537 559
low_complexity n/a 568 588
low_complexity n/a 594 618
low_complexity n/a 615 629
low_complexity n/a 684 700
low_complexity n/a 712 733
low_complexity n/a 736 749
low_complexity n/a 747 766
low_complexity n/a 798 809
low_complexity n/a 897 911
low_complexity n/a 905 923
low_complexity n/a 929 945
disorder n/a 995 1047
low_complexity n/a 1029 1040
Pfam ARID 1055 1140
disorder n/a 1075 1078
disorder n/a 1108 1121
disorder n/a 1147 1598
low_complexity n/a 1222 1236
low_complexity n/a 1328 1357
low_complexity n/a 1572 1588
disorder n/a 1600 1601
disorder n/a 1604 1607
disorder n/a 1610 1612
disorder n/a 1695 1772
low_complexity n/a 1717 1749
coiled_coil n/a 1725 1745
low_complexity n/a 1754 1768
disorder n/a 1790 1792
disorder n/a 1798 1863
low_complexity n/a 1828 1836
disorder n/a 1871 1894
disorder n/a 1901 1910
disorder n/a 1915 1919
Pfam BAF250_C 1927 2182
low_complexity n/a 1957 1968
disorder n/a 1976 1988
disorder n/a 2161 2162
disorder n/a 2165 2173
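
The Start and End columns above are 1-based, inclusive residue positions, so a domain's length is End - Start + 1 and two features overlap when each starts no later than the other ends. The following is a minimal sketch of reading such rows and listing the features that fall within each Pfam domain. The `Feature` type, the function names and the embedded excerpt of rows are illustrative assumptions, not part of Pfam's own tooling.

```python
from typing import List, NamedTuple


class Feature(NamedTuple):
    """One row of the domain table (hypothetical helper type)."""
    source: str
    name: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    @property
    def length(self) -> int:
        return self.end - self.start + 1


# A short excerpt of the rows listed above, copied verbatim.
ROWS = """\
low_complexity n/a 1029 1040
Pfam ARID 1055 1140
disorder n/a 1075 1078
disorder n/a 1108 1121
disorder n/a 1147 1598
Pfam BAF250_C 1927 2182
low_complexity n/a 1957 1968
"""


def parse_rows(text: str) -> List[Feature]:
    """Parse 'source name start end' rows, skipping anything else."""
    features = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 4 or not (parts[2].isdigit() and parts[3].isdigit()):
            continue  # header or malformed line
        features.append(Feature(parts[0], parts[1], int(parts[2]), int(parts[3])))
    return features


def overlapping(features: List[Feature], target: Feature) -> List[Feature]:
    """Return every other feature sharing at least one residue with target."""
    return [
        f for f in features
        if f is not target and f.start <= target.end and f.end >= target.start
    ]


features = parse_rows(ROWS)
pfam_domains = [f for f in features if f.source == "Pfam"]
for domain in pfam_domains:
    hits = [(f.source, f.start, f.end) for f in overlapping(features, domain)]
    print(domain.name, domain.length, hits)
```

With this excerpt, ARID spans 86 residues and BAF250_C spans 256 residues.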

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q8NFD5. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the copy stored here may differ from the one currently held by UniProt.

Sequence:
   1 MAHNAGAAAA AGTHSAKSGG SEAALKEGGS AAALSSSSSS SAAAAAASSS   50
  51 SSSGPGSAME TGLLPNHKLK TVGEAPAAPP HQQHHHHHHA HHHHHHAHHL  100
 101 HHHHALQQQL NQFQQQQQQQ QQQQQQQQQQ QHPISNNNSL GGAGGGAPQP  150
 151 GPDMEQPQHG GAKDSAAGGQ ADPPGPPLLS KPGDEDDAPP KMGEPAGGRY  200
 201 EHPGLGALGT QQPPVAVPGG GGGPAAVPEF NNYYGSAAPA SGGPGGRAGP  250
 251 CFDQHGGQQS PGMGMMHSAS AAAAGAPGSM DPLQNSHEGY PNSQCNHYPG  300
 301 YSRPGAGGGG GGGGGGGGGS GGGGGGGGAG AGGAGAGAVA AAAAAAAAAA  350
 351 GGGGGGGYGG SSAGYGVLSS PRQQGGGMMM GPGGGGAASL SKAAAGSAAG  400
 401 GFQRFAGQNQ HPSGATPTLN QLLTSPSPMM RSYGGSYPEY SSPSAPPPPP  450
 451 SQPQSQAAAA GAAAGGQQAA AGMGLGKDMG AQYAAASPAW AAAQQRSHPA  500
 501 MSPGTPGPTM GRSQGSPMDP MVMKRPQLYG MGSNPHSQPQ QSSPYPGGSY  550
 551 GPPGPQRYPI GIQGRTPGAM AGMQYPQQQM PPQYGQQGVS GYCQQGQQPY  600
 601 YSQQPQPPHL PPQAQYLPSQ SQQRYQPQQD MSQEGYGTRS QPPLAPGKPN  650
 651 HEDLNLIQQE RPSSLPDLSG SIDDLPTGTE ATLSSAVSAS GSTSSQGDQS  700
 701 NPAQSPFSPH ASPHLSSIPG GPSPSPVGSP VGSNQSRSGP ISPASIPGSQ  750
 751 MPPQPPGSQS ESSSHPALSQ SPMPQERGFM AGTQRNPQMA QYGPQQTGPS  800
 801 MSPHPSPGGQ MHAGISSFQQ SNSSGTYGPQ MSQYGPQGNY SRPPAYSGVP  850
 851 SASYSGPGPG MGISANNQMH GQGPSQPCGA VPLGRMPSAG MQNRPFPGNM  900
 901 SSMTPSSPGM SQQGGPGMGP PMPTVNRKAQ EAAAAVMQAA ANSAQSRQGS  950
 951 FPGMNQSGLM ASSSPYSQPM NNSSSLMNTQ APPYSMAPAM VNSSAASVGL 1000
1001 ADMMSPGESK LPLPLKADGK EEGTPQPESK SKKSSSSTTT GEKITKVYEL 1050
1051 GNEPERKLWV DRYLTFMEER GSPVSSLPAV GKKPLDLFRL YVCVKEIGGL 1100
1101 AQVNKNKKWR ELATNLNVGT SSSAASSLKK QYIQYLFAFE CKIERGEEPP 1150
1151 PEVFSTGDTK KQPKLQPPSP ANSGSLQGPQ TPQSTGSNSM AEVPGDLKPP 1200
1201 TPASTPHGQM TPMQGGRSST ISVHDPFSDV SDSSFPKRNS MTPNAPYQQG 1250
1251 MSMPDVMGRM PYEPNKDPFG GMRKVPGSSE PFMTQGQMPN SSMQDMYNQS 1300
1301 PSGAMSNLGM GQRQQFPYGA SYDRRHEPYG QQYPGQGPPS GQPPYGGHQP 1350
1351 GLYPQQPNYK RHMDGMYGPP AKRHEGDMYN MQYSSQQQEM YNQYGGSYSG 1400
1401 PDRRPIQGQY PYPYSRERMQ GPGQIQTHGI PPQMMGGPLQ SSSSEGPQQN 1450
1451 MWAARNDMPY PYQNRQGPGG PTQAPPYPGM NRTDDMMVPD QRINHESQWP 1500
1501 SHVSQRQPYM SSSASMQPIT RPPQPSYQTP PSLPNHISRA PSPASFQRSL 1550
1551 ENRMSPSKSP FLPSMKMQKV MPTVPTSQVT GPPPQPPPIR REITFPPGSV 1600
1601 EASQPVLKQR RKITSKDIVT PEAWRVMMSL KSGLLAESTW ALDTINILLY 1650
1651 DDSTVATFNL SQLSGFLELL VEYFRKCLID IFGILMEYEV GDPSQKALDH 1700
1701 NAARKDDSQS LADDSGKEEE DAECIDDDEE DEEDEEEDSE KTESDEKSSI 1750
1751 ALTAPDAAAD PKEKPKQASK FDKLPIKIVK KNNLFVVDRS DKLGRVQEFN 1800
1801 SGLLHWQLGG GDTTEHIQTH FESKMEIPPR RRPPPPLSSA GRKKEQEGKG 1850
1851 DSEEQQEKSI IATIDDVLSA RPGALPEDAN PGPQTESSKF PFGIQQAKSH 1900
1901 RNIKLLEDEP RSRDETPLCT IAHWQDSLAK RCICVSNIVR SLSFVPGNDA 1950
1951 EMSKHPGLVL ILGKLILLHH EHPERKRAPQ TYEKEEDEDK GVACSKDEWW 2000
2001 WDCLEVLRDN TLVTLANISG QLDLSAYTES ICLPILDGLL HWMVCPSAEA 2050
2051 QDPFPTVGPN SVLSPQRLVL ETLCKLSIQD NNVDLILATP PFSRQEKFYA 2100
2101 TLVRYVGDRK NPVCREMSMA LLSNLAQGDA LAARAIAVQK GSIGNLISFL 2150
2151 EDGVTMAQYQ QSQHNLMHMQ PPPLEPPSVD MMCRAAKALL AMARVDENRS 2200
2201 EFLLHEGRLL DISISAVLNS LVASVICDVL FQIGQL                2236

Checksums:
CRC64:4538B4747606C918
MD5:07910d61cbdb27146d3049cd3bf669a2
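
A checksum like the MD5 above can be used to check that a locally held copy of the sequence matches the stored one. The sketch below collapses a numbered, space-grouped listing into a plain residue string and hashes it with Python's standard `hashlib`. It assumes the checksum is taken over the uppercase residue letters with no whitespace or position numbers, which this page does not state. The function names are hypothetical, and CRC64 is left out because the standard library has no implementation of it.

```python
import hashlib
import re


def normalise_sequence(listing: str) -> str:
    """Drop position numbers and whitespace, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", listing).upper()


def md5_hex(sequence: str) -> str:
    """Hex MD5 digest of the residue string (assumed checksum input)."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


# First block of the listing above, used here only as a short demonstration.
listing = """
1
MAHNAGAAAA AGTHSAKSGG SEAALKEGGS AAALSSSSSS SAAAAAASSS
50
"""

sequence = normalise_sequence(listing)
print(len(sequence), md5_hex(sequence))
```

Running the same two functions over the full 2236-residue listing and comparing the digest with the MD5 value above would confirm, or refute, the assumption about how the checksum is computed.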

Structures

For sequences that have a structure in the Protein Data Bank, we use the mapping between UniProt, PDB and Pfam coordinate systems from the PDBe SIFTS project to map Pfam domains onto three-dimensional structures. The table below shows the mapping between Pfam domains, this UniProt entry and the corresponding three-dimensional structures.

Pfam family UniProt residues PDB ID PDB chain ID PDB residues
ARID 1055 - 1140 2CXY A 15 - 100
ARID 1055 - 1140 2EH9 A 15 - 100
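
In both rows above, UniProt residues 1055 to 1140 correspond to PDB residues 15 to 100. Both ranges cover 86 residues, which is consistent with a constant offset of 1040 between the two numbering schemes. The sketch below converts positions in either direction on the assumption that the mapping is linear and gap-free across the domain. The table does not guarantee this, and residue-level SIFTS data would be the authoritative source. The constant and function names are illustrative.

```python
# Ranges taken from the table above (1-based, inclusive).
UNIPROT_START, UNIPROT_END = 1055, 1140
PDB_START, PDB_END = 15, 100

# Assumption: numbering is contiguous in both schemes, so one offset suffices.
OFFSET = UNIPROT_START - PDB_START


def uniprot_to_pdb(position: int) -> int:
    """Map a UniProt residue number inside the ARID domain to PDB numbering."""
    if not UNIPROT_START <= position <= UNIPROT_END:
        raise ValueError(f"UniProt position {position} is outside the mapped range")
    return position - OFFSET


def pdb_to_uniprot(position: int) -> int:
    """Map a PDB residue number inside the mapped range to UniProt numbering."""
    if not PDB_START <= position <= PDB_END:
        raise ValueError(f"PDB position {position} is outside the mapped range")
    return position + OFFSET


print(OFFSET, uniprot_to_pdb(1100), pdb_to_uniprot(50))
```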

AlphaFold Structure Prediction

A structure for this protein has been predicted by DeepMind with AlphaFold. For more information, please visit the AlphaFold page for this protein.

Model confidence scale

  Very High (pLDDT > 90)
  Confident (90 > pLDDT > 70)
  Low (70 > pLDDT > 50)
  Very Low (pLDDT < 50)
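
The scale above can be applied to per-residue pLDDT values with a small lookup function, sketched below. The scale uses strict inequalities, so it does not say which band the values 90, 70 and 50 belong to. This sketch assumes that 90 counts as Confident and that 70 and 50 count as Low, which is a choice made here rather than something the scale specifies. The function name is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Return the confidence band for a pLDDT score between 0 and 100."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT must lie in [0, 100], got {plddt}")
    if plddt > 90:
        return "Very High"
    if plddt > 70:
        return "Confident"  # assumption: exactly 90 lands here
    if plddt >= 50:
        return "Low"        # assumption: exactly 70 and exactly 50 land here
    return "Very Low"


for score in (95.2, 90.0, 70.0, 50.0, 31.4):
    print(score, plddt_band(score))
```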
Highly accurate protein structure prediction with AlphaFold. John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David Silver, Oriol Vinyals, Andrew W. Senior, Koray Kavukcuoglu, Pushmeet Kohli & Demis Hassabis. Nature, 2021-07-15. DOI: 10.1038/s41586-021-03819-2