Query gi|254780621|ref|YP_003065034.1| small heat shock protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 157 No_of_seqs 114 out of 2566 Neff 6.6 Searched_HMMs 39220 Date Sun May 29 21:15:49 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780621.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK11597 heat shock chaperone 100.0 1.3E-41 0 277.7 15.2 138 1-142 1-139 (142) 2 PRK10743 heat shock protein Ib 100.0 3.3E-41 1.4E-45 275.2 15.4 136 1-138 1-137 (137) 3 COG0071 IbpA Molecular chapero 100.0 2.9E-30 7.5E-35 207.5 14.0 134 3-138 10-146 (146) 4 cd06470 ACD_IbpA-B_like Alpha- 100.0 1.1E-27 2.7E-32 191.7 12.3 90 35-124 1-90 (90) 5 pfam00011 HSP20 Hsp20/alpha cr 99.9 5.6E-23 1.4E-27 162.4 12.3 99 38-138 1-102 (102) 6 cd06472 ACD_ScHsp26_like Alpha 99.9 6.4E-22 1.6E-26 155.9 11.8 88 36-124 1-92 (92) 7 cd06471 ACD_LpsHSP_like Group 99.9 2.6E-21 6.6E-26 152.1 12.3 87 36-124 2-93 (93) 8 cd06526 metazoan_ACD Alpha-cry 99.7 2.7E-17 7E-22 127.2 9.4 78 41-124 3-83 (83) 9 cd06464 ACD_sHsps-like Alpha-c 99.7 2.4E-16 6.2E-21 121.4 10.8 85 39-124 2-88 (88) 10 KOG0710 consensus 99.7 2.7E-16 6.8E-21 121.1 7.2 107 31-138 81-195 (196) 11 KOG3591 consensus 99.6 1.6E-14 4.1E-19 110.1 10.2 103 38-147 66-171 (173) 12 cd06478 ACD_HspB4-5-6 Alpha-cr 99.6 2.1E-14 5.3E-19 109.4 9.0 78 41-124 3-83 (83) 13 cd06481 ACD_HspB9_like Alpha c 99.5 2.6E-14 6.7E-19 108.8 8.8 81 41-124 3-87 (87) 14 cd06498 ACD_alphaB-crystallin_ 99.5 5.1E-14 1.3E-18 107.0 9.0 79 41-125 3-84 (84) 15 cd06475 ACD_HspB1_like Alpha c 99.5 5.6E-14 1.4E-18 106.8 8.7 79 38-123 4-85 (86) 16 cd06497 ACD_alphaA-crystallin_ 99.5 2.2E-13 5.7E-18 103.0 9.6 80 38-124 4-86 (86) 17 cd06476 ACD_HspB2_like Alpha c 99.4 7.9E-13 2E-17 99.6 9.3 78 41-124 3-83 (83) 18 cd06480 ACD_HspB8_like Alpha-c 99.4 1.4E-12 3.5E-17 98.1 8.7 77 42-124 12-91 (91) 19 cd06479 ACD_HspB7_like Alpha c 99.3 1.1E-11 2.8E-16 92.6 8.9 77 38-124 2-81 (81) 20 cd06477 ACD_HspB3_Like Alpha c 99.3 1.1E-11 2.8E-16 92.6 8.6 76 41-122 3-81 (83) 21 cd00298 ACD_sHsps_p23-like Thi 99.0 3.2E-09 8.2E-14 77.3 9.5 74 44-124 5-80 (80) 22 cd06482 ACD_HspB10 Alpha cryst 98.9 8.7E-09 2.2E-13 74.6 8.3 77 44-123 7-86 (87) 23 pfam08190 PIH1 pre-RNA process 96.7 0.0076 1.9E-07 37.9 7.0 62 45-123 258-324 (325) 24 pfam04969 CS CS domain. The CS 96.3 0.077 2E-06 31.7 9.8 74 35-124 1-77 (77) 25 cd06463 p23_like Proteins cont 96.1 0.089 2.3E-06 31.3 9.3 72 41-127 3-76 (84) 26 cd06469 p23_DYX1C1_like p23_li 96.1 0.042 1.1E-06 33.3 7.6 68 40-127 2-71 (78) 27 cd06466 p23_CS_SGT1_like p23_l 95.2 0.24 6.1E-06 28.6 8.9 73 40-127 3-77 (84) 28 pfam05455 GvpH GvpH. This fami 95.2 0.11 2.7E-06 30.8 6.9 74 34-125 91-168 (177) 29 cd06489 p23_CS_hSgt1_like p23_ 91.5 1.2 3E-05 24.3 8.2 72 40-126 3-76 (84) 30 cd06467 p23_NUDC_like p23_like 90.6 1.4 3.7E-05 23.8 7.7 69 41-127 5-77 (85) 31 cd06488 p23_melusin_like p23_l 89.6 1.7 4.4E-05 23.3 8.5 77 36-127 2-80 (87) 32 cd06493 p23_NUDCD1_like p23_NU 87.8 2.3 5.9E-05 22.5 7.9 68 41-126 5-76 (85) 33 KOG1309 consensus 86.2 2.5 6.4E-05 22.3 5.8 42 35-77 4-45 (196) 34 cd06492 p23_mNUDC_like p23-lik 77.4 6 0.00015 20.0 6.9 63 46-125 9-77 (87) 35 cd06468 p23_CacyBP p23_like do 71.5 8.4 0.00021 19.1 8.5 69 41-126 8-84 (92) 36 cd06494 p23_NUDCD2_like p23-li 67.1 10 0.00026 18.5 8.5 69 40-125 11-81 (93) 37 cd06495 p23_NUDCD3_like p23-li 54.4 17 0.00044 17.1 7.3 73 40-125 10-85 (102) 38 cd02180 GH16_laminarinase Lami 49.2 21 0.00054 16.6 4.8 44 61-104 39-92 (238) 39 PRK05090 hypothetical protein; 46.1 19 0.00049 16.8 2.7 83 39-129 4-90 (95) 40 TIGR02350 prok_dnaK chaperone 42.7 26 0.00068 16.0 3.7 109 2-143 384-500 (598) 41 TIGR02856 spore_yqfC sporulati 41.8 27 0.0007 15.9 4.3 44 33-76 18-63 (87) 42 KOG3413 consensus 38.7 16 0.00041 17.3 1.4 13 111-123 76-88 (156) 43 TIGR02892 spore_yabP sporulati 38.2 26 0.00067 16.0 2.4 40 51-90 17-59 (88) 44 CHL00140 rpl6 ribosomal protei 36.5 33 0.00085 15.4 5.7 67 48-122 32-107 (178) 45 pfam01491 Frataxin_Cyay Fratax 36.3 33 0.00085 15.3 3.1 14 60-73 31-44 (108) 46 PTZ00179 60S ribosomal protein 32.4 33 0.00084 15.4 2.1 18 58-75 12-29 (189) 47 PRK05518 rpl6p 50S ribosomal p 31.6 36 0.00092 15.1 2.2 16 59-74 41-56 (181) 48 cd00237 p23 p23 binds heat sho 31.2 40 0.001 14.8 8.9 68 44-127 10-79 (106) 49 TIGR03653 arch_L6P archaeal ri 30.6 39 0.001 14.9 2.3 17 58-74 34-50 (170) 50 PTZ00027 60S ribosomal protein 30.2 40 0.001 14.8 2.3 18 58-75 13-30 (190) 51 TIGR03654 L6_bact ribosomal pr 30.2 42 0.0011 14.7 5.8 25 48-76 31-55 (175) 52 pfam07873 YabP YabP family. Th 30.0 36 0.00091 15.2 2.0 24 53-76 20-43 (66) 53 PRK05498 rplF 50S ribosomal pr 27.2 46 0.0012 14.5 2.2 25 48-76 32-56 (178) 54 cd06465 p23_hB-ind1_like p23_l 26.6 49 0.0012 14.3 10.1 72 40-127 6-79 (108) 55 COG2880 Uncharacterized protei 25.7 41 0.001 14.8 1.6 30 107-137 5-34 (67) 56 PRK00446 cyaY frataxin-like pr 24.6 53 0.0014 14.1 3.0 14 60-73 27-40 (104) 57 PRK05183 hscA chaperone protei 22.6 58 0.0015 13.9 5.1 38 103-142 468-508 (621) 58 pfam08308 PEGA PEGA domain. Th 22.5 58 0.0015 13.9 4.3 40 36-75 26-66 (71) 59 cd00503 Frataxin Frataxin is a 21.5 61 0.0016 13.7 2.9 14 60-73 29-42 (105) 60 pfam04972 BON Putative phospho 21.5 61 0.0016 13.7 3.2 26 54-79 12-37 (64) 61 TIGR03421 FeS_CyaY iron donor 20.3 65 0.0016 13.6 3.0 13 61-73 27-39 (102) 62 cd02175 GH16_lichenase Lichena 20.1 65 0.0017 13.5 5.5 55 56-112 29-87 (209) 63 PRK12700 flgH flagellar basal 20.1 65 0.0017 13.5 2.1 11 112-122 153-163 (230) No 1 >PRK11597 heat shock chaperone IbpB; Provisional Probab=100.00 E-value=1.3e-41 Score=277.71 Aligned_cols=138 Identities=36% Similarity=0.753 Sum_probs=126.2 Q ss_pred CC-CCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCC Q ss_conf 91-03411023567867889764541066777888843689948993999997011770214999962179999730332 Q gi|254780621|r 1 MR-LDVSRIYNSAVGYDTVLSMLDGLGAPSQTSAYPPYDIERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSE 79 (157) Q Consensus 1 m~-~~~~~~~~~~~gfD~~f~~~~~~~~~~~~~~~P~~dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~ 79 (157) |+ |||+||+|+++|||++|+.+++.. ...+||||||.+.++++|.|++|||||+++||+|+++++.|+|+|+++.. T Consensus 1 m~~~dlsp~~~~~vGFD~lf~~l~~~~---~~~~yPpYNI~k~~e~~y~IeiAvaGf~k~di~I~~~~~~LtI~g~~~~~ 77 (142) T PRK11597 1 MRNYDLSPLLRQWIGFDKLANALQNAG---ESQSFPPYNIEKSDDNHYRITLALAGFRQEDLDIQLEGTRLTVKGTPEQP 77 (142) T ss_pred CCCCCCCHHHHCCCCHHHHHHHHHHCC---CCCCCCCCEEEEECCCEEEEEEEECCCCHHHEEEEEECCEEEEECCCCCC T ss_conf 975537756745438779999986426---67899996699917972999999688775778999978999997157887 Q ss_pred CCCCCCEEEEEEECCEEEEEEECCCCCEEEEEEEECCEEEEEEECCCCHHHCCEEEEECCCCC Q ss_conf 355662134553023046889738661581038528859999740686011872887337887 Q gi|254780621|r 80 EKETVEYLHRGIAKRAFERRFQLADFVEVMSASLENGLLYLELLRSVPERMKPRRIEISQSPQ 142 (157) Q Consensus 80 ~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~A~~~nGiLtI~lpK~~pe~~~~r~I~I~~~~~ 142 (157) + +..+|+||||++|+|+|+|+|+++|++.+|+|+||+|+|+|+|.+||+.|||+|+|++++. T Consensus 78 ~-~~~~ylhrGIa~r~F~r~F~L~e~veV~~A~l~nGlL~I~L~~~iPE~~kPk~I~I~~~~~ 139 (142) T PRK11597 78 E-KEKKWLHQGLVNQPFSLSFTLAENMEVSGATFVNGLLHIDLIRNEPEAIAPQRIAISERPA 139 (142) T ss_pred C-CCCEEEEECCCCCCEEEEEECCCCEEECCEEEECCEEEEEEEEECCCCCCCEEEEECCCCC T ss_conf 6-4112677412037269999988873996549717889999974588645872877688756 No 2 >PRK10743 heat shock protein IbpA; Provisional Probab=100.00 E-value=3.3e-41 Score=275.20 Aligned_cols=136 Identities=47% Similarity=0.854 Sum_probs=125.5 Q ss_pred CC-CCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCC Q ss_conf 91-03411023567867889764541066777888843689948993999997011770214999962179999730332 Q gi|254780621|r 1 MR-LDVSRIYNSAVGYDTVLSMLDGLGAPSQTSAYPPYDIERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSE 79 (157) Q Consensus 1 m~-~~~~~~~~~~~gfD~~f~~~~~~~~~~~~~~~P~~dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~ 79 (157) |+ |||+||+|.++|||++|+.+++... ...++||||||.+.++++|.|++|||||+++||+|+++++.|+|+|+++.+ T Consensus 1 m~~~d~~pl~~~~vGFD~lfd~l~~~~~-~~~~~YPpyNI~k~~e~~y~Ie~AvAGf~k~di~I~~~~~~LtI~g~~~~~ 79 (137) T PRK10743 1 MRNFDLSPLYRSAIGFDRLFNLLENNQS-QSNGGYPPYNVELVDENHYRIAIAVAGFAESELEITAQDNLLVVKGAHADE 79 (137) T ss_pred CCCCCCCHHCCCCCCHHHHHHHHHHHHH-CCCCCCCCCEEEEECCCEEEEEEEECCCCHHHEEEEEECCEEEEEEECCCC T ss_conf 9644375100766387789999875500-467899995089906983999999778786893899988989999742775 Q ss_pred CCCCCCEEEEEEECCEEEEEEECCCCCEEEEEEEECCEEEEEEECCCCHHHCCEEEEEC Q ss_conf 35566213455302304688973866158103852885999974068601187288733 Q gi|254780621|r 80 EKETVEYLHRGIAKRAFERRFQLADFVEVMSASLENGLLYLELLRSVPERMKPRRIEIS 138 (157) Q Consensus 80 ~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~A~~~nGiLtI~lpK~~pe~~~~r~I~I~ 138 (157) . +..+|+||||++|+|+|+|+|+++|++.+|+|+||+|+|.|+|.+||++|||+|+|+ T Consensus 80 ~-~~~~ylhrGIa~R~F~r~F~La~~v~V~~A~l~nGlL~I~L~r~iPEe~KPk~I~IN 137 (137) T PRK10743 80 Q-KERTYLYQGIAERNFERKFQLAENIHVRGANLVNGLLYIDLERVIPEAKKPRRIEIN 137 (137) T ss_pred C-CCCCEEEECEECCCEEEEEECCCCEEECCEEEECCEEEEEEECCCCCCCCCEEEECC T ss_conf 7-666358806133537999996897299304960787999997618844498784039 No 3 >COG0071 IbpA Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones] Probab=99.97 E-value=2.9e-30 Score=207.47 Aligned_cols=134 Identities=37% Similarity=0.643 Sum_probs=116.8 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECC-CCC Q ss_conf 0341102356786788976454106677788884368994899399999701177021499996217999973033-235 Q gi|254780621|r 3 LDVSRIYNSAVGYDTVLSMLDGLGAPSQTSAYPPYDIERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKS-EEK 81 (157) Q Consensus 3 ~~~~~~~~~~~gfD~~f~~~~~~~~~~~~~~~P~~dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~-~~~ 81 (157) +++.+.++...+||++|+.+.+...... .+||+|||+++++ +|+|.++||||+++||+|.++++.|+|+|+++. .+. T Consensus 10 ~~~~~~~~~~~~fd~~~~~~~~~~~~~~-~~~P~vdi~e~~~-~~~I~~elPG~~kedI~I~~~~~~l~I~g~~~~~~~~ 87 (146) T COG0071 10 FDFFPLLRDSPGFDRLFREFGNLPESRP-TGTPPVDIEETDD-EYRITAELPGVDKEDIEITVEGNTLTIRGEREEEEEE 87 (146) T ss_pred HHHCHHHCCCCCCCHHHHHHHCCCCCCC-CCCCCEEEEECCC-EEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCC T ss_conf 2101233134332077776421244566-7788256898598-8999998788987993999979999999997564555 Q ss_pred CCCCEEEEEEECCEEEEEEECCCCCE--EEEEEEECCEEEEEEECCCCHHHCCEEEEEC Q ss_conf 56621345530230468897386615--8103852885999974068601187288733 Q gi|254780621|r 82 ETVEYLHRGIAKRAFERRFQLADFVE--VMSASLENGLLYLELLRSVPERMKPRRIEIS 138 (157) Q Consensus 82 ~~~~~~~~e~~~~~F~R~f~Lp~~ve--~~~A~~~nGiLtI~lpK~~pe~~~~r~I~I~ 138 (157) +...|+|+|+++|+|+|+|.||++++ .++|+|+||||+|+|||..|+..++++|+|+ T Consensus 88 ~~~~~~~~e~~~~~f~r~~~Lp~~v~~~~~~A~~~nGvL~I~lpk~~~~~~~~~~i~I~ 146 (146) T COG0071 88 EEEGYLRRERAYGEFERTFRLPEKVDPEVIKAKYKNGLLTVTLPKAEPEEKKPKRIEIE 146 (146) T ss_pred CCCCEEEEEEECCEEEEEEECCCCCCCCCCEEEEECCEEEEEEECCCCCCCCCCEEECC T ss_conf 55415873022223999998976356553067950878999986555554566014259 No 4 >cd06470 ACD_IbpA-B_like Alpha-crystallin domain (ACD) found in Escherichia coli inclusion body-associated proteins IbpA and IbpB, and similar proteins. IbpA and IbpB are 16 kDa small heat shock proteins (sHsps). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. IbpA and IbpB are produced during high-level production of various heterologous proteins, specifically human prorenin, renin and bovine insulin-like growth factor 2 (bIGF-2), and are strongly associated with inclusion bodies containing these heterologous proteins. IbpA and IbpB work as an integrated system to stabilize thermally aggregated proteins in a disaggregation competent state. The chaperone activity of IbpB is also significantly elevated as the temperature increases from normal to heat shock. The high temperature results in the disassociation of 2-3-MDa IbpB oligomers into smaller approximately 6 Probab=99.95 E-value=1.1e-27 Score=191.66 Aligned_cols=90 Identities=62% Similarity=0.993 Sum_probs=86.4 Q ss_pred CCEEEEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEEEEEE Q ss_conf 84368994899399999701177021499996217999973033235566213455302304688973866158103852 Q gi|254780621|r 35 PPYDIERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMSASLE 114 (157) Q Consensus 35 P~~dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~A~~~ 114 (157) |||||+++++++|+|+++|||++|+||+|+++++.|+|+|+++.+..++.+|+|+||++|+|+|+|+||+++++.+|+|+ T Consensus 1 Pp~nI~~~~e~~y~i~~~lPG~~kedi~V~~~~~~LtI~g~~~~~~~~~~~~~~re~~~~~F~Rsf~Lp~~v~~~~A~~~ 80 (90) T cd06470 1 PPYNIEKTGENNYRITLAVAGFSEDDLEIEVENNQLTVTGKKADEENEEREYLHRGIAKRAFERSFNLADHVKVKGAELE 80 (90) T ss_pred CCCCEEEECCCEEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEEEEEEEEEEEECCCCEECCEEEE T ss_conf 99528992797299999949998799599999999999998656355676179995576799999994876599966986 Q ss_pred CCEEEEEEEC Q ss_conf 8859999740 Q gi|254780621|r 115 NGLLYLELLR 124 (157) Q Consensus 115 nGiLtI~lpK 124 (157) ||||+|+|+| T Consensus 81 nGvL~I~L~k 90 (90) T cd06470 81 NGLLTIDLER 90 (90) T ss_pred CCEEEEEEEC T ss_conf 8889999989 No 5 >pfam00011 HSP20 Hsp20/alpha crystallin family. Probab=99.90 E-value=5.6e-23 Score=162.41 Aligned_cols=99 Identities=31% Similarity=0.568 Sum_probs=85.6 Q ss_pred EEEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEE--EEEEEC Q ss_conf 689948993999997011770214999962179999730332355662134553023046889738661581--038528 Q gi|254780621|r 38 DIERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVM--SASLEN 115 (157) Q Consensus 38 dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~--~A~~~n 115 (157) ||+++ +++|.|.++|||++++||+|+++++.|+|+|+++... .+..++++|+.+|+|.|+|.||++++.. +|+|+| T Consensus 1 Di~e~-~~~~~i~~~lPG~~~edi~V~v~~~~L~I~g~~~~~~-~~~~~~~~e~~~~~f~r~~~LP~~vd~~~i~A~~~n 78 (102) T pfam00011 1 DIKED-KDAFVVKLDVPGFKPEELKVKVEDNRVLVKGKHEEEE-EDDHGLRSERSYGSFSRKFTLPENADPDKVKASLKD 78 (102) T ss_pred CCEEC-CCEEEEEEECCCCCHHHCEEEEECCEEEEEECCCCCC-CCCCEEEEEEEEEEEEEEEECCCCEEEEEEEEEEEC T ss_conf 94897-9889999998999969919999599799996034334-566259999975599999988867130013999989 Q ss_pred CEEEEEEECCCCH-HHCCEEEEEC Q ss_conf 8599997406860-1187288733 Q gi|254780621|r 116 GLLYLELLRSVPE-RMKPRRIEIS 138 (157) Q Consensus 116 GiLtI~lpK~~pe-~~~~r~I~I~ 138 (157) |||+|++||..++ .++||+|+|. T Consensus 79 GvL~I~lPK~~~~~~~~~r~I~Ik 102 (102) T pfam00011 79 GVLTVTVPKLEPEEPKKERRIQIQ 102 (102) T ss_pred CEEEEEEECCCCCCCCCCCEEEEC T ss_conf 999999976777657998596569 No 6 >cd06472 ACD_ScHsp26_like Alpha crystallin domain (ACD) found in Saccharomyces cerevisiae (Sc) small heat shock protein (Hsp)26 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. ScHsp26 is temperature-regulated, it switches from an inactive to a chaperone-active form upon elevation in temperature. It associates into large 24-mers storage forms which upon heat shock disassociate into dimers. These dimers initiate the interaction with non-native substrate proteins and re-assemble into large globular assemblies having one monomer of substrate bound per dimer. This group also contains Arabidopsis thaliana (Ath) Hsp15.7, a peroxisomal matrix protein which can complement the morphological phenotype of S. cerevisiae mutants deficient in Hsps26. AthHsp15.7 is minimally expressed under normal conditions and is strongly induced by heat and oxidative st Probab=99.88 E-value=6.4e-22 Score=155.89 Aligned_cols=88 Identities=28% Similarity=0.454 Sum_probs=76.9 Q ss_pred CEEEEEECCCCEEEEEECCCCCHHHEEEEEECC-EEEEEEEECCC-CCCCCCEEEEEEECCEEEEEEECCCCCE--EEEE Q ss_conf 436899489939999970117702149999621-79999730332-3556621345530230468897386615--8103 Q gi|254780621|r 36 PYDIERTGENAYRITIAVSGFHPSEITIEVDSS-ILMVRGEKKSE-EKETVEYLHRGIAKRAFERRFQLADFVE--VMSA 111 (157) Q Consensus 36 ~~dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~-~LtI~g~~~~~-~~~~~~~~~~e~~~~~F~R~f~Lp~~ve--~~~A 111 (157) .+||++++ ++|+|.++|||++++||+|++++| .|+|+|+++.+ ..++..|+++|+.+|+|+|+|.||++++ .++| T Consensus 1 rvDi~E~~-~~y~v~~dlPGv~kedi~V~v~~~~~L~I~g~r~~~~~~~~~~~~~~Er~~g~f~R~~~LP~~vd~~~i~A 79 (92) T cd06472 1 RVDWKETP-EAHVFKADVPGVKKEDVKVEVEDGRVLRISGERKKEEEKKGDDWHRVERSSGRFVRRFRLPENADADEVKA 79 (92) T ss_pred CCCEEECC-CEEEEEEECCCCCHHHEEEEEECCCEEEEEEEECCCCCCCCCEEEEEEEECCCEEEEEECCCCCCCCEEEE T ss_conf 96569979-97999999899987859999974996999999834576779859999982672999999986726043599 Q ss_pred EEECCEEEEEEEC Q ss_conf 8528859999740 Q gi|254780621|r 112 SLENGLLYLELLR 124 (157) Q Consensus 112 ~~~nGiLtI~lpK 124 (157) +|+||||+|+||| T Consensus 80 ~~~nGvL~I~lPK 92 (92) T cd06472 80 FLENGVLTVTVPK 92 (92) T ss_pred EEECCEEEEEECC T ss_conf 9989999999559 No 7 >cd06471 ACD_LpsHSP_like Group of bacterial proteins containing an alpha crystallin domain (ACD) similar to Lactobacillus plantarum (Lp) small heat shock proteins (sHsp) HSP 18.5, HSP 18.55 and HSP 19.3. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Transcription of the genes encoding Lp HSP 18.5, 18.55 and 19.3 is regulated by a variety of stresses including heat, cold and ethanol. Early growing L. plantarum cells contain elevated levels of these mRNAs which rapidly fall of as the cells enter stationary phase. Also belonging to this group is Bifidobacterium breve (Bb) HSP20 and Oenococcus oenis (syn. Leuconostoc oenos) (Oo) HSP18. Transcription of the gene encoding BbHSP20 is strongly induced following heat or osmotic shock, and that of the gene encoding OoHSP18 following heat, ethanol or acid shock. OoHSP18 is peripherally associated with the cytoplasmic me Probab=99.87 E-value=2.6e-21 Score=152.14 Aligned_cols=87 Identities=25% Similarity=0.477 Sum_probs=76.7 Q ss_pred CEEEEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCC---CCCCCCEEEEEEECCEEEEEEECCCCC--EEEE Q ss_conf 43689948993999997011770214999962179999730332---355662134553023046889738661--5810 Q gi|254780621|r 36 PYDIERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSE---EKETVEYLHRGIAKRAFERRFQLADFV--EVMS 110 (157) Q Consensus 36 ~~dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~---~~~~~~~~~~e~~~~~F~R~f~Lp~~v--e~~~ 110 (157) ++||+++ +++|+|.++|||++++||+|+++++.|+|+|+++.. .++..+|+++|+.+|+|+|+|.|| ++ +.++ T Consensus 2 rvDV~E~-~~~~~i~~~lPG~~k~dI~V~~~~~~L~I~g~~~~~~~~~~~~~~~~~~E~~~g~f~R~~~lp-~vd~~~i~ 79 (93) T cd06471 2 KTDIKET-DDEYIVEADLPGFKKEDIKLDYKDGYLTISAKRDESKDEKDKKGNYIRRERYYGSFSRSFYLP-NVDEEEIK 79 (93) T ss_pred CEEEEEC-CCEEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCEECCCCCCEEEEEECCCEEEEEEECC-CCCCEEEE T ss_conf 6489988-996999999899987885999989899999996432210266772899997343699999999-98722469 Q ss_pred EEEECCEEEEEEEC Q ss_conf 38528859999740 Q gi|254780621|r 111 ASLENGLLYLELLR 124 (157) Q Consensus 111 A~~~nGiLtI~lpK 124 (157) |+|+||||+|+||| T Consensus 80 A~~~nGvL~I~lPK 93 (93) T cd06471 80 AKYENGVLKITLPK 93 (93) T ss_pred EEEECCEEEEEECC T ss_conf 99989999999759 No 8 >cd06526 metazoan_ACD Alpha-crystallin domain (ACD) of metazoan alpha-crystallin-type small(s) heat shock proteins (Hsps). sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is the Alpha-crystallin domain (ACD). sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps. Probab=99.72 E-value=2.7e-17 Score=127.24 Aligned_cols=78 Identities=32% Similarity=0.661 Sum_probs=65.6 Q ss_pred EECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEE--EEEEE-ECCE Q ss_conf 94899399999701177021499996217999973033235566213455302304688973866158--10385-2885 Q gi|254780621|r 41 RTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEV--MSASL-ENGL 117 (157) Q Consensus 41 ~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~--~~A~~-~nGi 117 (157) .+++++|.|.++||||+++||+|.++++.|+|+|+++...+.. ++.+|+|.|+|.||++|+. .+|.| +||+ T Consensus 3 ~e~~~~f~v~~dv~Gf~~edi~V~v~~~~L~I~g~~~~~~~~~------~~~~r~F~R~~~LP~~vd~~~v~a~l~~~Gv 76 (83) T cd06526 3 NDDDEKFQVTLDVKGFKPEELKVKVSDNKLVVEGKHEEREDEH------GYVSREFTRRYQLPEGVDPDSVTSSLSSDGV 76 (83) T ss_pred EECCCEEEEEEECCCCCHHHEEEEEECCEEEEEEEEEEECCCC------EEEEEEEEEEEECCCCCCHHHEEEEECCCCE T ss_conf 8879669999998999979929999999999999983474888------3999999999889877286791999878999 Q ss_pred EEEEEEC Q ss_conf 9999740 Q gi|254780621|r 118 LYLELLR 124 (157) Q Consensus 118 LtI~lpK 124 (157) |+|++|| T Consensus 77 L~I~~Pk 83 (83) T cd06526 77 LTIEAPK 83 (83) T ss_pred EEEEECC T ss_conf 9999679 No 9 >cd06464 ACD_sHsps-like Alpha-crystallin domain (ACD) of alpha-crystallin-type small(s) heat shock proteins (Hsps). sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is the Alpha-crystallin domain (ACD). sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps. Probab=99.69 E-value=2.4e-16 Score=121.36 Aligned_cols=85 Identities=35% Similarity=0.650 Sum_probs=75.5 Q ss_pred EEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEE--EEEEEECC Q ss_conf 8994899399999701177021499996217999973033235566213455302304688973866158--10385288 Q gi|254780621|r 39 IERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEV--MSASLENG 116 (157) Q Consensus 39 I~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~--~~A~~~nG 116 (157) +.+. +++|.|.++|||++++||+|+++++.|.|+|++.........+.++++.+|.|.|+|.||.+++. .+|.|.|| T Consensus 2 i~e~-~~~~~i~~~lpG~~~e~i~v~v~~~~l~I~g~~~~~~~~~~~~~~~~~~~~~f~r~~~LP~~vd~~~i~a~~~~G 80 (88) T cd06464 2 VYET-DDAYVVEADLPGFKKEDIKVEVEDGVLTISGEREEEEEEEENYLRRERSYGSFSRSFRLPEDVDPDKIKASLENG 80 (88) T ss_pred CEEC-CCEEEEEEECCCCCEEEEEEEEECCEEEEEEEECCCCCCCCEEEEEEECCCEEEEEEECCCCEEECCEEEEEECC T ss_conf 3895-987999999799840028999837899999998125655551899610030358989888888732079999899 Q ss_pred EEEEEEEC Q ss_conf 59999740 Q gi|254780621|r 117 LLYLELLR 124 (157) Q Consensus 117 iLtI~lpK 124 (157) +|+|++|| T Consensus 81 iL~I~~pk 88 (88) T cd06464 81 VLTITLPK 88 (88) T ss_pred EEEEEECC T ss_conf 99999709 No 10 >KOG0710 consensus Probab=99.65 E-value=2.7e-16 Score=121.13 Aligned_cols=107 Identities=30% Similarity=0.524 Sum_probs=87.1 Q ss_pred CCCCCCEEEEEECCCCEEEEEECCCCCHHHEEEEEECC-EEEEEEEECCCCC---CCCCEEEEEEECCEEEEEEECCCCC Q ss_conf 78888436899489939999970117702149999621-7999973033235---5662134553023046889738661 Q gi|254780621|r 31 TSAYPPYDIERTGENAYRITIAVSGFHPSEITIEVDSS-ILMVRGEKKSEEK---ETVEYLHRGIAKRAFERRFQLADFV 106 (157) Q Consensus 31 ~~~~P~~dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~-~LtI~g~~~~~~~---~~~~~~~~e~~~~~F~R~f~Lp~~v 106 (157) ..+.++.+|.+. ++.|.+.+++||+.++|++|+++++ +|+|+|++..+.+ ....+...|+.+|.|.|+|.||+++ T Consensus 81 ~~~~~~~~v~e~-~~~~~~~~~~Pgl~ke~iKv~~~~~~~l~isGe~~~e~e~~~~~~~~~~~E~~~g~F~r~~~lPenv 159 (196) T KOG0710 81 SEARVPWDVKES-PDAHEFKVDLPGLKKEDIKVEVEDEKVLTISGERKKEEEESGSGKKWKRVERKLGKFKRRFELPENV 159 (196) T ss_pred CCCCCCCCCCCC-CCCEEEEEECCCCCCHHCEEECCCCCEEEECCCCCCCCCCCCCCCCCEEEEECCCCEEEEEECCCCH T ss_conf 544566554357-7752788628999712311562578779762752444444567743156320036378876557402 Q ss_pred EEE--EEEEECCEEEEEEECCCCH--HHCCEEEEEC Q ss_conf 581--0385288599997406860--1187288733 Q gi|254780621|r 107 EVM--SASLENGLLYLELLRSVPE--RMKPRRIEIS 138 (157) Q Consensus 107 e~~--~A~~~nGiLtI~lpK~~pe--~~~~r~I~I~ 138 (157) +.+ +|.|+||||+|++||..+- ..+.+.|+|. T Consensus 160 ~~d~ikA~~~nGVL~VvvpK~~~~~~~~~v~~i~i~ 195 (196) T KOG0710 160 DVDEIKAEMENGVLTVVVPKLEPLLKKPKVRQIAIS 195 (196) T ss_pred HHHHHHHHHHCCCEEEEEECCCCCCCCCCCEEEECC T ss_conf 279898785478179999755655567862244424 No 11 >KOG3591 consensus Probab=99.58 E-value=1.6e-14 Score=110.12 Aligned_cols=103 Identities=26% Similarity=0.509 Sum_probs=84.2 Q ss_pred EEEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEE--EEE-E Q ss_conf 6899489939999970117702149999621799997303323556621345530230468897386615810--385-2 Q gi|254780621|r 38 DIERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMS--ASL-E 114 (157) Q Consensus 38 dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~--A~~-~ 114 (157) ++. .++++|.|.++|..|.+++|+|.+.++.|.|.|+|+..++++ |...|+|.|+|.||+++++.+ +++ . T Consensus 66 ~~~-~~~~~F~V~lDV~~F~PeEl~Vk~~~~~l~V~gkHeer~d~~------G~v~R~F~R~y~LP~~vdp~~V~S~LS~ 138 (173) T KOG3591 66 EIV-NDKDKFEVNLDVHQFKPEELKVKTDDNTLEVEGKHEEKEDEH------GYVSRSFVRKYLLPEDVDPTSVTSTLSS 138 (173) T ss_pred CCC-CCCCCEEEEEECCCCCCCCCEEEECCCEEEEEEEECCCCCCC------CEEEEEEEEEECCCCCCCHHHEEEEECC T ss_conf 421-189867999880247655118995798899986606461799------7698898887669999895062786379 Q ss_pred CCEEEEEEECCCCHHHCCEEEEECCCCCCCCCC Q ss_conf 885999974068601187288733788754311 Q gi|254780621|r 115 NGLLYLELLRSVPERMKPRRIEISQSPQESTKM 147 (157) Q Consensus 115 nGiLtI~lpK~~pe~~~~r~I~I~~~~~~~~~~ 147 (157) ||+|||+.||..+.....|.|+|...+..+.+. T Consensus 139 dGvLtI~ap~~~~~~~~er~ipI~~~~~~~~~~ 171 (173) T KOG3591 139 DGVLTIEAPKPPPKQDNERSIPIEQVGPSALSQ 171 (173) T ss_pred CCEEEEECCCCCCCCCCCEEEEEEECCCCCCCC T ss_conf 940899735888767654588673057765555 No 12 >cd06478 ACD_HspB4-5-6 Alpha-crystallin domain found in alphaA-crystallin (HspB4), alphaB-crystallin (HspB5), and the small heat shock protein (sHsp) HspB6, also known as Hsp20. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1. Only trace amounts of HspB4 are found in tissues other than the lens. HspB5 on the other hand is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer. HspB5's functions include effects on the apoptotic pathway and on metastasis. Phosphorylation of HspB5 reduces its ol Probab=99.56 E-value=2.1e-14 Score=109.41 Aligned_cols=78 Identities=26% Similarity=0.540 Sum_probs=66.2 Q ss_pred EECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEE--EEEE-ECCE Q ss_conf 948993999997011770214999962179999730332355662134553023046889738661581--0385-2885 Q gi|254780621|r 41 RTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVM--SASL-ENGL 117 (157) Q Consensus 41 ~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~--~A~~-~nGi 117 (157) +.++++|.|.++|.||+|+||+|.+.++.|+|.|+|+...++. |+..|+|.|+|.||++|+.. ++.| .||+ T Consensus 3 ~~d~d~F~v~lDv~~F~Peei~Vkv~~~~l~V~gkhee~~d~~------g~~~r~F~R~~~LP~~vd~~~v~s~Ls~dG~ 76 (83) T cd06478 3 RLDKDRFSVNLDVKHFSPEELSVKVLGDFVEIHGKHEERQDEH------GFISREFHRRYRLPPGVDPAAITSSLSADGV 76 (83) T ss_pred EECCCCEEEEEECCCCCCHHEEEEEECCEEEEEEEEEEECCCC------CEEEEEEEEEECCCCCCCHHHEEEEECCCCE T ss_conf 7868609999998898623369999899999999871074899------8899999997749999797896998589977 Q ss_pred EEEEEEC Q ss_conf 9999740 Q gi|254780621|r 118 LYLELLR 124 (157) Q Consensus 118 LtI~lpK 124 (157) |+|+.|| T Consensus 77 L~I~aPk 83 (83) T cd06478 77 LTISGPR 83 (83) T ss_pred EEEEECC T ss_conf 9999669 No 13 >cd06481 ACD_HspB9_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB9 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB9 is expressed exclusively in the normal testis and in various tumor samples and is a cancer/testis antigen. hHspB9 interacts with TCTEL1 (T-complex testis expressed protein -1), a subunit of dynein. hHspB9 and TCTEL1 are co-expressed in similar cells within the testis and in tumor cells. Included in this group is Xenopus Hsp30, a developmentally-regulated heat-inducible molecular chaperone. Probab=99.55 E-value=2.6e-14 Score=108.79 Aligned_cols=81 Identities=25% Similarity=0.537 Sum_probs=66.5 Q ss_pred EECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCC-CCCEEEEEEECCEEEEEEECCCCCEEE--EEEE-ECC Q ss_conf 948993999997011770214999962179999730332355-662134553023046889738661581--0385-288 Q gi|254780621|r 41 RTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKE-TVEYLHRGIAKRAFERRFQLADFVEVM--SASL-ENG 116 (157) Q Consensus 41 ~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~-~~~~~~~e~~~~~F~R~f~Lp~~ve~~--~A~~-~nG 116 (157) ++++++|.|.++|.||+|+||+|.+.++.|+|.|+++.+.++ +..+ .+.+|+|.|+|.||++++.. .++| .|| T Consensus 3 ~d~kd~F~v~lDV~~F~Peei~Vkv~g~~l~V~gk~e~~~~d~~g~~---s~~~reF~R~~~LP~~vd~~~v~s~Ls~dG 79 (87) T cd06481 3 KDGKEGFSLKLDVRGFSPEDLSVRVDGRKLVVTGKREKKNEDEKGSF---SYEYQEFVREAQLPEHVDPEAVTCSLSPSG 79 (87) T ss_pred CCCCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEEEEECCCCCCCE---EEEEEEEEEEEECCCCCCHHHCEEEECCCC T ss_conf 77652589999988988056599997999999987872305789854---887789999867889969768099637997 Q ss_pred EEEEEEEC Q ss_conf 59999740 Q gi|254780621|r 117 LLYLELLR 124 (157) Q Consensus 117 iLtI~lpK 124 (157) +|+|+.|| T Consensus 80 vL~I~aPk 87 (87) T cd06481 80 HLHIRAPR 87 (87) T ss_pred EEEEEECC T ss_conf 69998169 No 14 >cd06498 ACD_alphaB-crystallin_HspB5 Alpha-crystallin domain found in the small heat shock protein (sHsp) alphaB-crystallin (HspB5, 20kDa). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1. HspB4 does not belong to this group. HspB5 shows increased synthesis in response to stress. HspB5 is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer. Its functions include effects on the apoptotic pathway and on metastasis. Phosphorylation of HspB5 reduces its oligomerization and anti-apoptotic activ Probab=99.53 E-value=5.1e-14 Score=106.98 Aligned_cols=79 Identities=23% Similarity=0.524 Sum_probs=66.7 Q ss_pred EECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEE--EEEEE-CCE Q ss_conf 948993999997011770214999962179999730332355662134553023046889738661581--03852-885 Q gi|254780621|r 41 RTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVM--SASLE-NGL 117 (157) Q Consensus 41 ~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~--~A~~~-nGi 117 (157) +.++++|.|.++|.||+|+||+|.+.++.|+|.|+|+...+++ |+..|+|.|+|.||++++.. ++.|. ||+ T Consensus 3 ~~d~~~F~v~lDv~~F~Peei~Vkv~~~~l~V~gkheek~d~~------G~v~r~F~R~~~LP~~vd~~~v~S~Ls~dG~ 76 (84) T cd06498 3 RLEKDKFSVNLDVKHFSPEELKVKVLGDFIEIHGKHEERQDEH------GFISREFQRKYRIPADVDPLTITSSLSPDGV 76 (84) T ss_pred EECCCCEEEEEECCCCCCHHEEEEEECCEEEEEEEECCCCCCC------CEEEEEEEEEECCCCCCCHHHEEEEECCCCE T ss_conf 0857658999998888520229999799999998663273899------7799999997549999697793998589977 Q ss_pred EEEEEECC Q ss_conf 99997406 Q gi|254780621|r 118 LYLELLRS 125 (157) Q Consensus 118 LtI~lpK~ 125 (157) |+|+.||+ T Consensus 77 L~I~ApkK 84 (84) T cd06498 77 LTVCGPRK 84 (84) T ss_pred EEEEEECC T ss_conf 99998089 No 15 >cd06475 ACD_HspB1_like Alpha crystallin domain (ACD) found in mammalian small (s)heat shock protein (Hsp)-27 (also denoted HspB1 in human) and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Hsp27 shows enhanced synthesis in response to stress. It is a molecular chaperone which interacts with a large number of different proteins. It is found in many types of human cells including breast, uterus, cervix, platelets and cancer cells. Hsp27 has diverse cellular functions including, chaperoning, regulation of actin polymerization, keratinocyte differentiation, regulation of inflammatory pathways in keratinocytes, and protection from oxidative stress through modulating glutathione levels. It is also a subunit of AUF1-containing protein complexes. It has been linked to several transduction pathways regulating cellular functions including differentiat Probab=99.52 E-value=5.6e-14 Score=106.77 Aligned_cols=79 Identities=23% Similarity=0.561 Sum_probs=65.7 Q ss_pred EEEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEE--EEEE-E Q ss_conf 689948993999997011770214999962179999730332355662134553023046889738661581--0385-2 Q gi|254780621|r 38 DIERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVM--SASL-E 114 (157) Q Consensus 38 dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~--~A~~-~ 114 (157) +|.. ++++|.|.++|+||+|+||+|.+.++.|+|.|+|+...+++ |+..|+|.|+|.||++++.. ++.| . T Consensus 4 ei~~-~~~~F~v~lDV~~F~PeEi~Vk~~d~~l~V~gkheek~de~------G~v~r~F~R~~~LP~~vd~~~v~s~Ls~ 76 (86) T cd06475 4 EIRQ-TADRWKVSLDVNHFAPEELVVKTKDGVVEITGKHEEKQDEH------GFVSRCFTRKYTLPPGVDPTAVTSSLSP 76 (86) T ss_pred EEEE-CCCEEEEEEECCCCCCCCCEEEECCCEEEEEEEEEEECCCC------CEEEEEEEEEECCCCCCCHHHEEEEECC T ss_conf 2998-89889999998897652019999099999998770063899------7799999995339899897786997489 Q ss_pred CCEEEEEEE Q ss_conf 885999974 Q gi|254780621|r 115 NGLLYLELL 123 (157) Q Consensus 115 nGiLtI~lp 123 (157) ||+|+|+.| T Consensus 77 dG~L~I~AP 85 (86) T cd06475 77 DGILTVEAP 85 (86) T ss_pred CCEEEEEEC T ss_conf 988999854 No 16 >cd06497 ACD_alphaA-crystallin_HspB4 Alpha-crystallin domain found in the small heat shock protein (sHsp) alphaA-crystallin (HspB4, 20kDa). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1. Only trace amounts of HspB4 are found in tissues other than the lens. HspB5 does not belong to this group. Mutations inHspB4 have been associated with Autosomal Dominant Congenital Cataract (ADCC). The chaperone-like functions of HspB4 are considered important for maintaining lens transparency and preventing cataract. Probab=99.49 E-value=2.2e-13 Score=103.04 Aligned_cols=80 Identities=23% Similarity=0.516 Sum_probs=66.6 Q ss_pred EEEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEE--EEEE-E Q ss_conf 689948993999997011770214999962179999730332355662134553023046889738661581--0385-2 Q gi|254780621|r 38 DIERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVM--SASL-E 114 (157) Q Consensus 38 dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~--~A~~-~ 114 (157) +|. .++++|.|.++|.+|+|+||+|.+.++.|+|.|+|+...+++ |+..|+|.|+|.||++++.. ++.+ . T Consensus 4 ei~-~~~~~F~v~lDv~~F~PEEi~Vkv~~~~l~V~gkh~e~~d~~------G~~~r~F~R~~~LP~~vd~~~v~S~Ls~ 76 (86) T cd06497 4 EVR-SDRDKFTIYLDVKHFSPEDLTVKVLDDYVEIHGKHSERQDDH------GYISREFHRRYRLPSNVDQSAITCSLSA 76 (86) T ss_pred EEE-ECCCEEEEEEECCCCCCCCEEEEEECCEEEEEEEECCCCCCC------CEEEEEEEEEECCCCCCCHHHEEEEECC T ss_conf 599-889879999998787620059999899999997661382899------8899999997659999897797998389 Q ss_pred CCEEEEEEEC Q ss_conf 8859999740 Q gi|254780621|r 115 NGLLYLELLR 124 (157) Q Consensus 115 nGiLtI~lpK 124 (157) ||+|+|+.|| T Consensus 77 dG~L~I~APK 86 (86) T cd06497 77 DGMLTFSGPK 86 (86) T ss_pred CCEEEEEECC T ss_conf 9669998469 No 17 >cd06476 ACD_HspB2_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB2/heat shock 27kDa protein 2 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB2 is preferentially and constitutively expressed in skeletal muscle and heart. HspB2 shows homooligomeric activity and forms aggregates in muscle cytosol. Although its expression is not induced by heat shock, it redistributes to the insoluble fraction in response to heat shock. In the mouse heart, HspB2 plays a role in maintaining energetic balance, by protecting cardiac energetics during ischemia/reperfusion, and allowing for increased work during acute inotropic challenge. hHspB2 [previously also known as myotonic dystrophy protein kinase (DMPK) binding protein (MKBP)] is selectively up-regulated in skeletal muscles from myotonic dystrophy patients. Probab=99.44 E-value=7.9e-13 Score=99.63 Aligned_cols=78 Identities=27% Similarity=0.488 Sum_probs=66.4 Q ss_pred EECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEE--EEE-ECCE Q ss_conf 9489939999970117702149999621799997303323556621345530230468897386615810--385-2885 Q gi|254780621|r 41 RTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMS--ASL-ENGL 117 (157) Q Consensus 41 ~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~--A~~-~nGi 117 (157) .+++++|.|.++|-.|.|++|+|.+.++.|+|.|+++...+++ |+..|+|.|+|.||++++..+ +++ .||+ T Consensus 3 ~~~~d~F~v~lDV~~F~PeEi~Vk~~d~~l~V~gkHeek~d~~------G~v~R~F~R~y~LP~~vd~~~v~S~Ls~DGv 76 (83) T cd06476 3 ESEDDKYQVFLDVCHFTPDEITVRTVDNLLEVSARHPQRMDRH------GFVSREFTRTYILPMDVDPLLVRASLSHDGI 76 (83) T ss_pred EECCCCEEEEEECCCCCCCCEEEEEECCEEEEEEEECCCCCCC------CEEEEEEEEEEECCCCCCHHHEEEECCCCCE T ss_conf 7769708999998777532129999699999997746592799------8399999997638899597790996389974 Q ss_pred EEEEEEC Q ss_conf 9999740 Q gi|254780621|r 118 LYLELLR 124 (157) Q Consensus 118 LtI~lpK 124 (157) |+|+.|| T Consensus 77 L~I~APk 83 (83) T cd06476 77 LCIQAPR 83 (83) T ss_pred EEEEECC T ss_conf 9998369 No 18 >cd06480 ACD_HspB8_like Alpha-crystallin domain (ACD) found in mammalian 21.6 KDa small heat shock protein (sHsp) HspB8, also denoted as Hsp22 in humans, and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. A chaperone complex formed of HspB8 and Bag3 stimulates degradation of protein complexes by macroautophagy. HspB8 also forms complexes with Hsp27 (HspB1), MKBP (HspB2), HspB3, alphaB-crystallin (HspB5), Hsp20 (HspB6), and cvHsp (HspB7). These latter interactions may depend on phosphorylation of the respective partner sHsp. HspB8 may participate in the regulation of cell proliferation, cardiac hypertrophy, apoptosis, and carcinogenesis. Point mutations in HspB8 have been correlated with the development of several congenital neurological diseases, including Charcot Marie tooth disease and distal motor neuropathy type II. Probab=99.40 E-value=1.4e-12 Score=98.12 Aligned_cols=77 Identities=29% Similarity=0.583 Sum_probs=65.7 Q ss_pred ECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEE--EEE-ECCEE Q ss_conf 489939999970117702149999621799997303323556621345530230468897386615810--385-28859 Q gi|254780621|r 42 TGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMS--ASL-ENGLL 118 (157) Q Consensus 42 ~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~--A~~-~nGiL 118 (157) .++++|.|.++|..|+|++|+|.+.++.|+|.|+|+..++++ |+..|+|.|+|.||++++... +++ .||+| T Consensus 12 n~~d~F~V~LDV~~F~PEEi~Vk~~d~~l~V~gkHeek~de~------g~vsR~F~Rky~LP~~vd~~~v~S~LS~dGvL 85 (91) T cd06480 12 NSSEPWKVCVNVHSFKPEELTVKTKDGFVEVSGKHEEQQKEG------GIVSKNFTKKIQLPPEVDPVTVFASLSPEGLL 85 (91) T ss_pred CCCCCEEEEEECCCCCCCEEEEEEECCEEEEEEEECCCCCCC------CEEEEEEEEEECCCCCCCHHHEEEECCCCCEE T ss_conf 589788999987677763069999799999998865481799------97999999967089997975829963899639 Q ss_pred EEEEEC Q ss_conf 999740 Q gi|254780621|r 119 YLELLR 124 (157) Q Consensus 119 tI~lpK 124 (157) +|+.|| T Consensus 86 tI~APk 91 (91) T cd06480 86 IIEAPQ 91 (91) T ss_pred EEEECC T ss_conf 997179 No 19 >cd06479 ACD_HspB7_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB7, also known as cardiovascular small heat shock protein (cvHsp), and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB7 is a 25-kDa protein, preferentially expressed in heart and skeletal muscle. It binds the cytoskeleton protein alpha-filamin (also known as actin-binding protein 280). The expression of HspB7 is increased during rat muscle aging. Its expression is also modulated in obesity implicating this protein in this and related metabolic disorders. As the human gene encoding HspB7 is mapped to chromosome 1p36.23-p34.3 it is a positional candidate for several dystrophies and myopathies. Probab=99.32 E-value=1.1e-11 Score=92.61 Aligned_cols=77 Identities=27% Similarity=0.480 Sum_probs=62.4 Q ss_pred EEEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEE--EEE-E Q ss_conf 6899489939999970117702149999621799997303323556621345530230468897386615810--385-2 Q gi|254780621|r 38 DIERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMS--ASL-E 114 (157) Q Consensus 38 dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~--A~~-~ 114 (157) ||.. .+++|.+.+++-+|+|++|+|.+.+|.|.|-|+++. ++ |...|+|.|++.||++++..+ .+| . T Consensus 2 ~i~~-~~d~f~v~lDV~~F~PeEi~Vk~~d~~v~VH~Ek~d---e~------G~vsR~F~Rry~LP~~vdp~~v~SsLs~ 71 (81) T cd06479 2 NVKT-LGDTYQFAVDVSDFSPEDIIVTTSNNQIEVHAEKLA---SD------GTVMNTFTHKCQLPEDVDPTSVSSSLGE 71 (81) T ss_pred CEEE-CCCCEEEEEECCCCCCCEEEEEEECCEEEEEEEEEC---CC------CCEEEEEEEEECCCCCCCCCCEEEEECC T ss_conf 6548-797389999988888861699998999999862897---99------8599999996529999394065998479 Q ss_pred CCEEEEEEEC Q ss_conf 8859999740 Q gi|254780621|r 115 NGLLYLELLR 124 (157) Q Consensus 115 nGiLtI~lpK 124 (157) ||+|+|+.|| T Consensus 72 DGvLtI~Apr 81 (81) T cd06479 72 DGTLTIKARR 81 (81) T ss_pred CCEEEEEECC T ss_conf 9489998359 No 20 >cd06477 ACD_HspB3_Like Alpha crystallin domain (ACD) found in mammalian HspB3, also known as heat-shock protein 27-like protein (HSPL27, 17-kDa) and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB3 is expressed in adult skeletal muscle, smooth muscle, and heart, and in several other fetal tissues. In muscle cells HspB3 forms an oligomeric 150 kDa complex with myotonic dystrophy protein kinase-binding protein (MKBP/ HspB2), this complex may comprise one of two independent muscle-cell specific chaperone systems. The expression of HspB3 is induced during muscle differentiation controlled by the myogenic factor MyoD. HspB3 may also interact with Hsp22 (HspB8). Probab=99.31 E-value=1.1e-11 Score=92.58 Aligned_cols=76 Identities=34% Similarity=0.586 Sum_probs=64.8 Q ss_pred EECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEE--EEEE-ECCE Q ss_conf 948993999997011770214999962179999730332355662134553023046889738661581--0385-2885 Q gi|254780621|r 41 RTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVM--SASL-ENGL 117 (157) Q Consensus 41 ~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~--~A~~-~nGi 117 (157) ++++++|.|.++|-+|.|++|+|.+.++.|+|.|+|+...+++ |+..|+|.|+|.||++++.. .++| .||+ T Consensus 3 ~dgk~~F~V~lDV~~F~PeEi~Vkv~d~~l~V~gkHeek~de~------G~vsR~F~R~y~LP~~vd~~~v~S~Ls~DGv 76 (83) T cd06477 3 EEGKPMFQILLDVVQFRPEDIIIQVFEGWLLIKGQHGVRMDEH------GFISRSFTRQYQLPDGVEHKDLSAMLCHDGI 76 (83) T ss_pred CCCCCCEEEEEECCCCCCCCEEEEEECCEEEEEEEECCCCCCC------CEEEEEEEEEECCCCCCCHHHEEEEECCCCE T ss_conf 4576788999998675555149999699899997567882899------8798999980159899696691898679933 Q ss_pred EEEEE Q ss_conf 99997 Q gi|254780621|r 118 LYLEL 122 (157) Q Consensus 118 LtI~l 122 (157) |+|.. T Consensus 77 LtIEa 81 (83) T cd06477 77 LVVET 81 (83) T ss_pred EEEEE T ss_conf 99985 No 21 >cd00298 ACD_sHsps_p23-like This domain family includes the alpha-crystallin domain (ACD) of alpha-crystallin-type small heat shock proteins (sHsps) and a similar domain found in p23-like proteins. sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is this ACD. sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps. p23 is a cochaperone of the Hsp90 chaperoning pathway. It binds Hsp90 and participates in the folding of a number of Hsp90 clients including the progesterone receptor. p23 also has a passive chaperoning activity. p23 in addition may act as the cytosolic prostaglandin E2 synthase. Included in this family is the p23-like C-terminal CHORD-SGT1 (CS) domain of suppressor of G2 allele of Skp1 (Sgt1) and the p23-like domains of human butyrate-induced transcript 1 (hB-ind Probab=99.01 E-value=3.2e-09 Score=77.32 Aligned_cols=74 Identities=32% Similarity=0.591 Sum_probs=64.8 Q ss_pred CCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEE--EEEEECCEEEEE Q ss_conf 993999997011770214999962179999730332355662134553023046889738661581--038528859999 Q gi|254780621|r 44 ENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVM--SASLENGLLYLE 121 (157) Q Consensus 44 e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~--~A~~~nGiLtI~ 121 (157) .+.|.|.+.+||+.+++++|.+.++.|+|+|++..... ++...+.|.+.+.||..++.. .|.+.+|+|+|+ T Consensus 5 ~~~~~i~i~~~~~~~~~v~V~~~~~~l~v~~~~~~~~~-------~~~~~~~f~~~~~L~~~i~~~~~~~~~~~g~l~i~ 77 (80) T cd00298 5 DDEVVVTVDLPGVKKEDIKVEVEDNVLTISGKREEEEE-------RERSYGEFERSFELPEDVDPEKSKASLENGVLEIT 77 (80) T ss_pred CCEEEEEEECCCCCCCEEEEEEECCEEEEEEEECCCCC-------CCEECCCEEEEEECCCCEEHHHEEEEEECCEEEEE T ss_conf 88899999989995123799999999999999822456-------62861317999998557920142999989999999 Q ss_pred EEC Q ss_conf 740 Q gi|254780621|r 122 LLR 124 (157) Q Consensus 122 lpK 124 (157) +|| T Consensus 78 l~K 80 (80) T cd00298 78 LPK 80 (80) T ss_pred ECC T ss_conf 829 No 22 >cd06482 ACD_HspB10 Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB10, also known as sperm outer dense fiber protein (ODFP), and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB10 occurs exclusively in the axoneme of sperm cells and may have a cytoskeletal role. Probab=98.90 E-value=8.7e-09 Score=74.65 Aligned_cols=77 Identities=16% Similarity=0.212 Sum_probs=63.0 Q ss_pred CCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEE--EEEE-ECCEEEE Q ss_conf 993999997011770214999962179999730332355662134553023046889738661581--0385-2885999 Q gi|254780621|r 44 ENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVM--SASL-ENGLLYL 120 (157) Q Consensus 44 e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~--~A~~-~nGiLtI 120 (157) -......+++-||+||+|+|.+.++.+.|.|+|++..++++-+ ++.+++|.|.|+||.+|+.. ..+| -+|+|+| T Consensus 7 ~~~~l~slDV~~F~PEEl~Vktkdg~V~V~akhEer~d~~G~~---~~~~~~f~Rky~LPpgVd~~~VtsSLS~dGvL~I 83 (87) T cd06482 7 SSNVLASVDVCGFEPDQVKVKVKDGKVQVSAERENRYDCLGSK---KYSYMNICKEFSLPPGVDEKDVTYSYGLGSVVKI 83 (87) T ss_pred CCCEEEEEECCCCCCCEEEEEEECCEEEEEEEEECCCCCCCCE---EEEEEEEEEECCCCCCCCHHHEEEECCCCCEEEE T ss_conf 5365899873777755489998189899985610011467845---7776566641159999897784886088965998 Q ss_pred EEE Q ss_conf 974 Q gi|254780621|r 121 ELL 123 (157) Q Consensus 121 ~lp 123 (157) ..| T Consensus 84 eAP 86 (87) T cd06482 84 ETP 86 (87) T ss_pred EEC T ss_conf 607 No 23 >pfam08190 PIH1 pre-RNA processing PIH1/Nop17. This domain is involved in pre-rRNA processing. It has has been shown to be required either for nucleolar retention or correct assembly of the box C/D snoRNP in Saccharomyces cerevisiae. The C-terminal region of this family has similarity to the CS domain pfam04969. Probab=96.72 E-value=0.0076 Score=37.88 Aligned_cols=62 Identities=19% Similarity=0.289 Sum_probs=48.9 Q ss_pred CCEEEEEECCCCC-HHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEE--EEEEE--ECCEEE Q ss_conf 9399999701177-021499996217999973033235566213455302304688973866158--10385--288599 Q gi|254780621|r 45 NAYRITIAVSGFH-PSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEV--MSASL--ENGLLY 119 (157) Q Consensus 45 ~~y~i~~~lPG~~-kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~--~~A~~--~nGiLt 119 (157) +.++++++|||+. ..+++++|.+..|.+...+. .|...+.||-.|+. .+|.| +.+.|+ T Consensus 258 ~~Lvv~I~LP~l~S~~~~~LdV~e~~l~L~~~~~-----------------~Y~Ldl~LPy~Vd~~~~~AkFdkk~~~L~ 320 (325) T pfam08190 258 KELVLEIELPGLKSVAECDLDVSEDRLLLESRKP-----------------KYRLDLPLPYPVDEERGSAKFNKKKKTLT 320 (325) T ss_pred CEEEEEEECCCCCCCCCEEEEECCCEEEEEECCC-----------------CCEEEECCCCEECCCCCEEEEECCCCEEE T ss_conf 3899999888977512307997278899995698-----------------62688137864757875489832687899 Q ss_pred EEEE Q ss_conf 9974 Q gi|254780621|r 120 LELL 123 (157) Q Consensus 120 I~lp 123 (157) |||| T Consensus 321 VtlP 324 (325) T pfam08190 321 VTLP 324 (325) T ss_pred EEEC T ss_conf 9803 No 24 >pfam04969 CS CS domain. The CS and CHORD (pfam04968) are fused into a single polypeptide chain in metazoans but are found in separate proteins in plants; this is thought to be indicative of an interaction between CS and CHORD. It has been suggested that the CS domain is a binding module for HSP90, implying that CS domain-containing proteins are involved in recruiting heat shock proteins to multiprotein assemblies. Probab=96.26 E-value=0.077 Score=31.65 Aligned_cols=74 Identities=20% Similarity=0.249 Sum_probs=53.4 Q ss_pred CCEEEEEECCCCEEEEEECCCC-CHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEE--E Q ss_conf 8436899489939999970117-702149999621799997303323556621345530230468897386615810--3 Q gi|254780621|r 35 PPYDIERTGENAYRITIAVSGF-HPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMS--A 111 (157) Q Consensus 35 P~~dI~~~~e~~y~i~~~lPG~-~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~--A 111 (157) |.|+=+++++ ...|.+.+|+. .++|++|++..+.|.|+..-. +..| .....|...|+++. . T Consensus 1 ~~y~W~Qt~~-~V~i~i~~~~~~~~~dv~v~~~~~~l~v~~~~~-----~~~~----------~~~~~L~~~I~~~~s~w 64 (77) T pfam04969 1 PRYDWYQTLD-EVTITIPLKGVTKKKDVKVEIKPRSLKVSIKGP-----GKPY----------LLDGELFGPIDPEESSW 64 (77) T ss_pred CCCEEEECCC-EEEEEEECCCCCCCCEEEEEEECCEEEEEEECC-----CCCE----------EEEEEECCCCCCCCEEE T ss_conf 9954998989-999999822785556099999889999999569-----9867----------99768655544673099 Q ss_pred EEECCEEEEEEEC Q ss_conf 8528859999740 Q gi|254780621|r 112 SLENGLLYLELLR 124 (157) Q Consensus 112 ~~~nGiLtI~lpK 124 (157) ++.+|-|.|+|.| T Consensus 65 ~i~~~klei~L~K 77 (77) T pfam04969 65 KIEDTKVEITLKK 77 (77) T ss_pred EEECCEEEEEEEC T ss_conf 9989999999989 No 25 >cd06463 p23_like Proteins containing this p23_like domain include p23 and its Saccharomyces cerevisiae (Sc) homolog Sba1. Both are co-chaperones for the heat shock protein (Hsp) 90. p23 binds Hsp90 and participates in the folding of a number of Hsp90 clients, including the progesterone receptor. p23 also has a passive chaperoning activity and in addition may participate in prostaglandin synthesis. Both p23 and Sba1p can regulate telomerase activity. This group includes domains similar to the C-terminal CHORD-SGT1 (CS) domain of suppressor of G2 allele of Skp1 (Sgt1). Sgt1 interacts with multiple protein complexes and has the features of a co-chaperone. Human (h) Sgt1 interacts with both Hsp70 and Hsp90, and has been shown to bind Hsp90 through its CS domain. Saccharomyces cerevisiae (Sc) Sgt1 is a subunit of both core kinetochore and SCF (Skp1-Cul1-F-box) ubiquitin ligase complexes. Sgt1 is required for pathogen resistance in plants. This group also includes the p23_like domains of Probab=96.07 E-value=0.089 Score=31.28 Aligned_cols=72 Identities=15% Similarity=0.173 Sum_probs=53.7 Q ss_pred EECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEE--EEEEECCEE Q ss_conf 948993999997011770214999962179999730332355662134553023046889738661581--038528859 Q gi|254780621|r 41 RTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVM--SASLENGLL 118 (157) Q Consensus 41 ~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~--~A~~~nGiL 118 (157) ++ ++...|.+.+|+..++++.|.+.++.|+|+.... .+. .|.....|...|++. ..++.+|.| T Consensus 3 Qt-~~~V~v~i~~~~~~~~~~~v~~~~~~l~v~~~~~----~~~----------~~~~~~~L~~~I~~~~s~~~~~~~~i 67 (84) T cd06463 3 QT-LDEVTITIPLKDVTKKDVKVEFTPKSLTVSVKGG----GGK----------EYLLEGELFGPIDPEESKWTVEDRKI 67 (84) T ss_pred EC-CCEEEEEEEECCCCCHHCEEEEECCEEEEEEECC----CCC----------EEEEEEEECCCCCCCCCEEEEECCEE T ss_conf 36-8999999991897502359999899999998648----996----------07997154200622335999979999 Q ss_pred EEEEECCCC Q ss_conf 999740686 Q gi|254780621|r 119 YLELLRSVP 127 (157) Q Consensus 119 tI~lpK~~p 127 (157) .|.|.|..+ T Consensus 68 ~i~L~K~~~ 76 (84) T cd06463 68 EITLKKKEP 76 (84) T ss_pred EEEEEECCC T ss_conf 999998988 No 26 >cd06469 p23_DYX1C1_like p23_like domain found in proteins similar to dyslexia susceptibility 1 (DYX1) candidate 1 (C1) protein, DYX1C1. The human gene encoding this protein is a positional candidate gene for developmental dyslexia (DD), it is located on 15q21.3 by the DYX1 DD susceptibility locus (15q15-21). Independent association studies have reported conflicting results. However, association of short-term memory, which plays a role in DD, with a variant within the DYX1C1 gene has been reported. Most proteins belonging to this group contain a C-terminal tetratricopeptide repeat (TPR) protein binding region. Probab=96.06 E-value=0.042 Score=33.32 Aligned_cols=68 Identities=24% Similarity=0.380 Sum_probs=52.1 Q ss_pred EEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEE--EEEEEECCE Q ss_conf 994899399999701177021499996217999973033235566213455302304688973866158--103852885 Q gi|254780621|r 40 ERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEV--MSASLENGL 117 (157) Q Consensus 40 ~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~--~~A~~~nGi 117 (157) +++ ++...|.+-++|+.+++++|.+.+..|.|... .|.....|-+.|+. .++++-+|. T Consensus 2 ~Qt-~~~V~Itv~lkgv~~~~~dv~~se~~lkvn~p-------------------py~feldL~~~Id~~~S~~ki~~t~ 61 (78) T cd06469 2 SQT-DEDVKISVPLKGVKTSKVDIFCSDLYLKVNFP-------------------PYLFELDLAAPIDDEKSSAKIGNGV 61 (78) T ss_pred EEC-CCEEEEEEEECCCCCCCCCEEEEEEEEEECCC-------------------CEEEEEECCCCCCCHHCEEEEECCE T ss_conf 372-77999999957863676768976038997489-------------------9567777207466102579994888 Q ss_pred EEEEEECCCC Q ss_conf 9999740686 Q gi|254780621|r 118 LYLELLRSVP 127 (157) Q Consensus 118 LtI~lpK~~p 127 (157) +.|+|.|..| T Consensus 62 v~i~L~K~e~ 71 (78) T cd06469 62 LVFTLVKKEP 71 (78) T ss_pred EEEEEECCCC T ss_conf 9999983787 No 27 >cd06466 p23_CS_SGT1_like p23_like domain similar to the C-terminal CHORD-SGT1 (CS) domain of Sgt1 (suppressor of G2 allele of Skp1). Sgt1 interacts with multiple protein complexes and has the features of a cochaperone. Human (h) Sgt1 interacts with both Hsp70 and Hsp90, and has been shown to bind Hsp90 through its CS domain. Saccharomyces cerevisiae (Sc) Sgt1 is a subunit of both core kinetochore and SCF (Skp1-Cul1-F-box) ubiquitin ligase complexes. Sgt1 is required for pathogen resistance in plants. ScSgt1 is needed for the G1/S and G2/M cell-cycle transitions, and for assembly of the core kinetochore complex (CBF3) via activation of Ctf13, the F-box protein. Binding of Hsp82 (a yeast Hsp90 homologue) to ScSgt1, promotes the binding of Sgt1 to Skp1 and of Skp1 to Ctf13. Some proteins in this group have an SGT1-specific (SGS) domain at the extreme C-terminus. The ScSgt1-SGS domain binds adenylate cyclase. The hSgt1-SGS domain interacts with some S100 family proteins, and studies sug Probab=95.22 E-value=0.24 Score=28.61 Aligned_cols=73 Identities=14% Similarity=0.165 Sum_probs=55.8 Q ss_pred EEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEE--EEEECCE Q ss_conf 99489939999970117702149999621799997303323556621345530230468897386615810--3852885 Q gi|254780621|r 40 ERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMS--ASLENGL 117 (157) Q Consensus 40 ~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~--A~~~nGi 117 (157) ++++ +...|.+-+.|+.++++.|.++.+.|+|..... .+ ..|...+.|-..|.+.. .....+- T Consensus 3 yQt~-~~V~i~i~~k~~~~~~~~v~~~~~~l~v~~~~~----~~----------~~y~~~~~L~~~I~p~~s~~~v~~~K 67 (84) T cd06466 3 YQTD-TSVTVTIYAKNVDKEDVKVEFNEQSLSVSIILP----GG----------SEYQLELDLFGPIDPEQSKVSVLPTK 67 (84) T ss_pred CCCC-CEEEEEEEECCCCHHHCEEEEECCEEEEEEECC----CC----------CEEEEEEECCCCCCCHHEEEEEECCE T ss_conf 2479-899999992789968989999799999999869----99----------62899998234526037399996998 Q ss_pred EEEEEECCCC Q ss_conf 9999740686 Q gi|254780621|r 118 LYLELLRSVP 127 (157) Q Consensus 118 LtI~lpK~~p 127 (157) +.|+|.|..| T Consensus 68 IEI~L~K~~~ 77 (84) T cd06466 68 VEITLKKAEP 77 (84) T ss_pred EEEEEEECCC T ss_conf 9999991799 No 28 >pfam05455 GvpH GvpH. This family consists of archaeal GvpH proteins which are thought to be involved in gas vesicle synthesis. Probab=95.16 E-value=0.11 Score=30.79 Aligned_cols=74 Identities=19% Similarity=0.331 Sum_probs=50.0 Q ss_pred CCCEEEEEECCCCEEEEEECCCC-CHHHEEEEE--ECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCC-CCEEE Q ss_conf 88436899489939999970117-702149999--621799997303323556621345530230468897386-61581 Q gi|254780621|r 34 YPPYDIERTGENAYRITIAVSGF-HPSEITIEV--DSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLAD-FVEVM 109 (157) Q Consensus 34 ~P~~dI~~~~e~~y~i~~~lPG~-~kedi~V~v--~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~-~ve~~ 109 (157) .|.+.....++.+.+|.++|||+ +.+-++|.+ ..+.|+|+..... --++.|-+ ++++. T Consensus 91 s~HvEtR~~~dgelvViADLPgVtD~es~Dv~ldtd~~aL~i~~~d~v------------------v~R~~ld~p~~~It 152 (177) T pfam05455 91 SIHVETRDRDDGELVVVADLPGVTDEKSLDVALETDEPALTIRVDDDV------------------VERLELDEPDVEIT 152 (177) T ss_pred CEEEEEECCCCCCEEEEECCCCCCCCCCCCHHHHCCCCCEEEEECCCE------------------EEEEECCCCCCEEE T ss_conf 404785047888469983189986410000122058851699956865------------------87775389985788 Q ss_pred EEEEECCEEEEEEECC Q ss_conf 0385288599997406 Q gi|254780621|r 110 SASLENGLLYLELLRS 125 (157) Q Consensus 110 ~A~~~nGiLtI~lpK~ 125 (157) ..+|.|-||.|.|.+. T Consensus 153 d~~l~NqvLEiRl~~s 168 (177) T pfam05455 153 DMTLRNQVLEIRLTRS 168 (177) T ss_pred EEEEECEEEEEEEECC T ss_conf 8886130899996135 No 29 >cd06489 p23_CS_hSgt1_like p23_like domain similar to the C-terminal CS (CHORD-SGT1) domain of human (h) Sgt1 and related proteins. hSgt1 is a co-chaperone which has been shown to be elevated in HEp-2 cells as a result of stress conditions such as heat shock. It interacts with the heat shock proteins (HSPs) Hsp70 and Hsp90, and it expression pattern is synchronized with these two Hsps. The interaction with HSP90 has been shown to involve the hSgt1_CS domain, and appears to be required for correct kinetochore assembly and efficient cell division. Some proteins in this subgroup contain a tetratricopeptide repeat (TPR) HSP-binding domain N-terminal to this CS domain, and most proteins in this subgroup contain a Sgt1-specific (SGS) domain C-terminal to the CS domain. The SGS domain interacts with some S100 family proteins. Studies suggest that S100A6 modulates in a Ca2+ dependent manner the interactions of hSgt1 with Hsp90 and Hsp70. The yeast Sgt1 CS domain is not found in this subgroup. Probab=91.48 E-value=1.2 Score=24.33 Aligned_cols=72 Identities=15% Similarity=0.222 Sum_probs=52.0 Q ss_pred EEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEEEE--EECCE Q ss_conf 9948993999997011770214999962179999730332355662134553023046889738661581038--52885 Q gi|254780621|r 40 ERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMSAS--LENGL 117 (157) Q Consensus 40 ~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~A~--~~nGi 117 (157) ++++ +..+|.+-+-|++++++.|+++.+.|.|...-.. + ..|...+.|...|++...+ +-.=- T Consensus 3 yQT~-~~V~isi~~K~~~~~~~~V~~~~~~l~v~~~~~~----g----------~~y~~~l~L~~~I~pe~S~~~v~~tK 67 (84) T cd06489 3 YQTE-SQVVITILIKNVKPEDVSVEFEKRELSATVKLPS----G----------NDYSLKLHLLHPIVPEQSSYKILSTK 67 (84) T ss_pred CCCC-CEEEEEEEECCCCHHHCEEEEECCEEEEEEECCC----C----------CEEEEEEECCCCCCCHHEEEEEECCE T ss_conf 5579-9999999975899699799996999999998799----9----------74999867343117137099993718 Q ss_pred EEEEEECCC Q ss_conf 999974068 Q gi|254780621|r 118 LYLELLRSV 126 (157) Q Consensus 118 LtI~lpK~~ 126 (157) +.|+|.|.. T Consensus 68 vEI~LkK~~ 76 (84) T cd06489 68 IEIKLKKTE 76 (84) T ss_pred EEEEEECCC T ss_conf 999998379 No 30 >cd06467 p23_NUDC_like p23_like domain of NUD (nuclear distribution) C and similar proteins. Aspergillus nidulas (An) NUDC is needed for nuclear movement. AnNUDC is localized at the hyphal cortex, and binds NUDF at spindle pole bodies (SPBs) and in the cytoplasm at different stages in the cell cycle. At the SPBs it is part of the dynein molecular motor/NUDF complex that regulates microtubule dynamics. Mammalian(m) NUDC associates both with the dynein complex and also with an anti-inflammatory enzyme, platelet activating factor acetylhydrolase I, PAF-AH(I) complex, through binding mNUDF, the regulatory beta subunit of PAF-AH(I). mNUDC is important for cell proliferation both in normal and tumor tissues. Its expression is elevated in various cell types undergoing mitosis or stimulated to proliferate, with high expression levels observed in leukemic cells and tumors. For a leukemic cell line, human NUDC was shown to activate the thrombopoietin (TPO) receptor (Mpl) by binding to its ext Probab=90.60 E-value=1.4 Score=23.81 Aligned_cols=69 Identities=23% Similarity=0.249 Sum_probs=46.3 Q ss_pred EECCCCEEEEEECC-CCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEE--EEEEC-C Q ss_conf 94899399999701-17702149999621799997303323556621345530230468897386615810--38528-8 Q gi|254780621|r 41 RTGENAYRITIAVS-GFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMS--ASLEN-G 116 (157) Q Consensus 41 ~~~e~~y~i~~~lP-G~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~--A~~~n-G 116 (157) ++. +...|.+.|| |+.++||+|++....|+|+-+. ... +..|. |...|+++. =++++ + T Consensus 5 Qt~-~~V~v~i~lp~~~~~~di~v~~~~~~i~v~~~~-------~~~----il~g~------L~~~I~~des~Wtie~~~ 66 (85) T cd06467 5 QTL-DEVTVTIPLPEGTKSKDVKVEITPKHLKVGVKG-------GEP----LLDGE------LYAKVKVDESTWTLEDGK 66 (85) T ss_pred EEC-CEEEEEEECCCCCCCCEEEEEECCCEEEEEECC-------CCC----EECCC------CCCCEECCCCEEEEECCC T ss_conf 517-989999988898210249999746999999669-------970----25484------237574267699997899 Q ss_pred EEEEEEECCCC Q ss_conf 59999740686 Q gi|254780621|r 117 LLYLELLRSVP 127 (157) Q Consensus 117 iLtI~lpK~~p 127 (157) .|.|+|.|..+ T Consensus 67 ~l~i~L~K~~~ 77 (85) T cd06467 67 LLEITLEKRNE 77 (85) T ss_pred EEEEEEEECCC T ss_conf 99999998878 No 31 >cd06488 p23_melusin_like p23_like domain similar to the C-terminal (tail) domain of vertebrate Melusin and related proteins. Melusin's tail domain interacts with the cytoplasmic domain of beta1-A and beta1-D isoforms of beta1 integrin, it does not bind other integrin beta subunits. Melusin is a muscle-specific protein expressed in skeletal and cardiac muscles but not in smooth muscle or other tissues. It is needed for heart hypertrophy following mechanical overload. The integrin-binding portion of this domain appears to be sequestered in the full length melusin protein, Ca2+ may modulate the protein's conformation exposing this binding site. This group includes Chordc1, also known as Chp-1, which is conserved from vertebrates to humans. Mammalian Chordc1 interacts with the heat shock protein (HSP) Hsp90 and is implicated in circadian and/or homeostatic mechanisms in the brain. The N-terminal portions of proteins belonging to this group contain two cysteine and histidine rich domain (C Probab=89.63 E-value=1.7 Score=23.32 Aligned_cols=77 Identities=17% Similarity=0.287 Sum_probs=56.0 Q ss_pred CEEEEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEEEEEE- Q ss_conf 4368994899399999701177021499996217999973033235566213455302304688973866158103852- Q gi|254780621|r 36 PYDIERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMSASLE- 114 (157) Q Consensus 36 ~~dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~A~~~- 114 (157) .+|=|+++. ..+|.+-.-|+.+++..|++....|+|.-.... + ..|...+.|-..|++...++. T Consensus 2 R~DWyQT~~-~V~vsiyaK~~~~~~~~V~~~~~~l~v~~~~~~----~----------~~y~~~~~L~~~I~p~~s~~~v 66 (87) T cd06488 2 RHDWHQTGS-HVVVSVYAKNSNPELSVVEANSTVLTIHIVFEG----N----------KEFQLDIELWGVIDVEKSSVNM 66 (87) T ss_pred CCCEEECCC-EEEEEEEECCCCHHHCEEEECCCEEEEEEECCC----C----------CEEEEEEECCCEECHHHEEEEE T ss_conf 855268599-999999982699578799980889999998799----9----------6799761434335706739999 Q ss_pred -CCEEEEEEECCCC Q ss_conf -8859999740686 Q gi|254780621|r 115 -NGLLYLELLRSVP 127 (157) Q Consensus 115 -nGiLtI~lpK~~p 127 (157) ---+.|+|.|..| T Consensus 67 ~~tKvEI~L~K~e~ 80 (87) T cd06488 67 LPTKVEIKLRKAEP 80 (87) T ss_pred ECEEEEEEEECCCC T ss_conf 47779999985899 No 32 >cd06493 p23_NUDCD1_like p23_NUDCD1: p23-like NUD (nuclear distribution) C-like domain found in human NUD (nuclear distribution) C domain-containing protein 1, NUDCD1 (also known as CML66), and similar proteins. NUDCD1/CML66 is a broadly immunogenic tumor associated antigen, which is highly expressed in a variety of solid tumors and in leukemias. In normal tissues high expression of NUDCD1/CML66 is limited to testis and heart. Probab=87.78 E-value=2.3 Score=22.54 Aligned_cols=68 Identities=15% Similarity=0.178 Sum_probs=45.4 Q ss_pred EECCCCEEEEEECC-CCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEE--EEEECC- Q ss_conf 94899399999701-17702149999621799997303323556621345530230468897386615810--385288- Q gi|254780621|r 41 RTGENAYRITIAVS-GFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMS--ASLENG- 116 (157) Q Consensus 41 ~~~e~~y~i~~~lP-G~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~--A~~~nG- 116 (157) ++.+ ...|.+.|| |..++||.|.+..+.|+|.=+-+. -+ -.| .|...|+++. =++++| T Consensus 5 QT~~-eV~V~i~vP~~~~~kdi~v~i~~~~l~v~vk~~~------~~-----l~G------~L~~~V~~d~stWtied~k 66 (85) T cd06493 5 QTEE-DLTLTIRLPEDTTKEDIRIKFLPDHISIALKDQA------PL-----LEG------KLYSSIDHESSTWIIKENK 66 (85) T ss_pred EECC-EEEEEEECCCCCCCHHEEEEEECCEEEEEECCCC------EE-----ECC------CCCCCCCCCCCEEEEECCC T ss_conf 6478-6999999999985010999997299999989997------08-----827------4667516656679994799 Q ss_pred EEEEEEECCC Q ss_conf 5999974068 Q gi|254780621|r 117 LLYLELLRSV 126 (157) Q Consensus 117 iLtI~lpK~~ 126 (157) .|.|+|-|.. T Consensus 67 ~l~i~L~K~~ 76 (85) T cd06493 67 SLEVSLIKKD 76 (85) T ss_pred EEEEEEEECC T ss_conf 9999999976 No 33 >KOG1309 consensus Probab=86.19 E-value=2.5 Score=22.29 Aligned_cols=42 Identities=17% Similarity=0.152 Sum_probs=34.9 Q ss_pred CCEEEEEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEEC Q ss_conf 8436899489939999970117702149999621799997303 Q gi|254780621|r 35 PPYDIERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKK 77 (157) Q Consensus 35 P~~dI~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~ 77 (157) +.+|-+++. ...+|.+-.+|+.++|++|++.+++|.|.-+-+ T Consensus 4 ~r~DwyQt~-~~vvIti~~k~v~~~~v~v~~s~~~l~~~~~~~ 45 (196) T KOG1309 4 IRHDWYQTE-TSVVITIFAKNVPKEDVNVEISENTLSIVIQLP 45 (196) T ss_pred CCCEEECCC-CEEEEEEEECCCCCCCEEEEEECCEEEEEEECC T ss_conf 340000377-539999981489865424795045578999648 No 34 >cd06492 p23_mNUDC_like p23-like NUD (nuclear distribution) C-like domain of mammalian(m) NUDC and similar proteins. Mammalian(m) NUDC associates both with the dynein complex and also with an anti-inflammatory enzyme, platelet activating factor acetylhydrolase I, PAF-AH(I) complex, through binding mNUDF, the regulatory beta subunit of PAF-AH(I). mNUDC is important for cell proliferation both in normal and tumor tissues. Its expression is elevated in various cell types undergoing mitosis or stimulated to proliferate, with high expression levels observed in leukemic cells and tumors. For a leukemic cell line, human NUDC was shown to activate the thrombopoietin (TPO) receptor (Mpl) by binding to its extracellular domain, and promoting cell proliferation and differentiation. Probab=77.37 E-value=6 Score=19.97 Aligned_cols=63 Identities=16% Similarity=0.250 Sum_probs=40.2 Q ss_pred CEEEEEE--CC-CCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEE--EEEECC-EEE Q ss_conf 3999997--01-17702149999621799997303323556621345530230468897386615810--385288-599 Q gi|254780621|r 46 AYRITIA--VS-GFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMS--ASLENG-LLY 119 (157) Q Consensus 46 ~y~i~~~--lP-G~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~--A~~~nG-iLt 119 (157) ...|.+- +| |.+++||.|++..+.|.|.-+-+. -+ ..|. |...|+++. =+++|| .|. T Consensus 9 ev~v~vpl~~~~~~k~kdv~v~i~~~~l~v~vkg~~------~~-----l~G~------L~~~V~~des~WtlEdg~~l~ 71 (87) T cd06492 9 EVELKVPFKVSFRLKGKDVVVDIQRKHLKVGLKGQP------PI-----IDGE------LYNEVKVEESSWLIEDGKVVT 71 (87) T ss_pred EEEEEEECCCCCCCEEEEEEEEEECCEEEEEECCCC------CE-----EECC------CCCCCCCCCCEEEEECCCEEE T ss_conf 899999658999950016999984699999988998------28-----8350------668605656489997999999 Q ss_pred EEEECC Q ss_conf 997406 Q gi|254780621|r 120 LELLRS 125 (157) Q Consensus 120 I~lpK~ 125 (157) |+|.|. T Consensus 72 i~L~K~ 77 (87) T cd06492 72 VNLEKI 77 (87) T ss_pred EEEEEC T ss_conf 999988 No 35 >cd06468 p23_CacyBP p23_like domain found in proteins similar to Calcyclin-Binding Protein(CacyBP)/Siah-1-interacting protein (SIP). CacyBP/SIP interacts with S100A6 (calcyclin), with some other members of the S100 family, with tubulin, and with Siah-1 and Skp-1. The latter two are components of the ubiquitin ligase that regulates beta-catenin degradation. The beta-catenin gene is an oncogene participating in tumorigenesis in many different cancers. Overexpression of CacyBP/SIP, in part through its effect on the expression of beta-catenin, inhibits the proliferation, tumorigenicity, and invasion of gastric cancer cells. CacyBP/SIP is abundant in neurons and neuroblastoma NB2a cells. An extensive re-organization of microtubules accompanies the differentiation of NB2a cells. CacyBP/SIP may contribute to NB2a cell differentiation through binding to and increasing the oligomerization of tubulin. CacyBP/SIP is also implicated in differentiation of erythroid cells, rat neonatal cardiomyocytes Probab=71.46 E-value=8.4 Score=19.05 Aligned_cols=69 Identities=14% Similarity=0.280 Sum_probs=47.9 Q ss_pred EECCCCEEEEEECCCC---CHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEE---CCCCCEEEE--EE Q ss_conf 9489939999970117---702149999621799997303323556621345530230468897---386615810--38 Q gi|254780621|r 41 RTGENAYRITIAVSGF---HPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQ---LADFVEVMS--AS 112 (157) Q Consensus 41 ~~~e~~y~i~~~lPG~---~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~---Lp~~ve~~~--A~ 112 (157) ++ ++...|.+.++|+ .+++|.|++.++.|.|+-. . ..+.+| +|. |-..|++.. -+ T Consensus 8 Q~-~~~V~Iyi~lk~v~~~~ke~v~v~F~~~s~~v~v~-~---~~gkny------------~~~i~~L~~~I~p~~s~~k 70 (92) T cd06468 8 QS-DKFVKIYITLKGVHQLPKENIQVEFTERSFELKVH-D---LNGKNY------------RFTINRLLKKIDPEKSSFK 70 (92) T ss_pred CC-CCEEEEEEECCCCCCCCHHHEEEEECCCEEEEEEE-C---CCCCEE------------EEECHHHCCCCCCCCCEEE T ss_conf 47-99799999966546588101699942899999998-8---999707------------9863254286274414599 Q ss_pred EECCEEEEEEECCC Q ss_conf 52885999974068 Q gi|254780621|r 113 LENGLLYLELLRSV 126 (157) Q Consensus 113 ~~nGiLtI~lpK~~ 126 (157) .+.+-+.|+|-|.. T Consensus 71 Vk~~ki~I~L~K~~ 84 (92) T cd06468 71 VKTDRIVITLAKKK 84 (92) T ss_pred EECCEEEEEEEECC T ss_conf 96987999999689 No 36 >cd06494 p23_NUDCD2_like p23-like NUD (nuclear distribution) C-like found in human NUDC domain-containing protein 2 (NUDCD2) and similar proteins. Little is known about the function of the proteins in this subgroup. Probab=67.15 E-value=10 Score=18.49 Aligned_cols=69 Identities=17% Similarity=0.203 Sum_probs=45.6 Q ss_pred EEECCCCEEEEEECC-CCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEEEEEEC-CE Q ss_conf 994899399999701-1770214999962179999730332355662134553023046889738661581038528-85 Q gi|254780621|r 40 ERTGENAYRITIAVS-GFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMSASLEN-GL 117 (157) Q Consensus 40 ~~~~e~~y~i~~~lP-G~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~A~~~n-Gi 117 (157) .++.+ +..|.+.|| |...+||.|.+..+.|+|.-+ - .. +..|.|-+.+.-.+- .=+++| ++ T Consensus 11 tQTl~-EV~V~i~vp~gt~~kdv~v~i~~~~l~v~lk-g------~~-----il~GeL~~~V~~des----~WtlED~~~ 73 (93) T cd06494 11 YQTMD-EVFIEVNVPPGTRAKDVKCKLGSRDISLAVK-G------QE-----VLKGKLFDSVVADEC----TWTLEDRKL 73 (93) T ss_pred EEECC-EEEEEEECCCCCCEEEEEEEEECCEEEEEEC-C------CE-----EECCCCCCCCCCCCC----EEEEECCCE T ss_conf 75967-8999998999983044999996399999989-9------66-----893604676165456----799977999 Q ss_pred EEEEEECC Q ss_conf 99997406 Q gi|254780621|r 118 LYLELLRS 125 (157) Q Consensus 118 LtI~lpK~ 125 (157) |.|+|-|. T Consensus 74 l~I~L~K~ 81 (93) T cd06494 74 IRIVLTKS 81 (93) T ss_pred EEEEEEEC T ss_conf 99999985 No 37 >cd06495 p23_NUDCD3_like p23-like NUD (nuclear distribution) C-like domain found in human NUDC domain-containing protein 3 (NUDCD3) and similar proteins. Little is known about the function of the proteins in this subgroup. Probab=54.40 E-value=17 Score=17.10 Aligned_cols=73 Identities=15% Similarity=0.251 Sum_probs=45.8 Q ss_pred EEECCCCEEEEEECC-CC-CHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEEEEEECC- Q ss_conf 994899399999701-17-702149999621799997303323556621345530230468897386615810385288- Q gi|254780621|r 40 ERTGENAYRITIAVS-GF-HPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMSASLENG- 116 (157) Q Consensus 40 ~~~~e~~y~i~~~lP-G~-~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~A~~~nG- 116 (157) .++ -+...|.+.|| |+ +..|+.|++..+.|.|.-+-.. +... +..|.|...+...+- .=++++| T Consensus 10 tQT-l~ev~v~ipvp~~~~k~k~v~v~i~~~~lkv~lk~~~----~~~~----iidGel~~~Ik~des----~WtlEdgk 76 (102) T cd06495 10 SQD-YTDVEVRVPVPKDVVKGRQVSVDLQSSSIRVSVRDGG----GEKV----LMEGEFTHKINTENS----LWSLEPGK 76 (102) T ss_pred EEE-CCEEEEEEECCCCCCCCCEEEEEEEECEEEEEEECCC----CCCE----EEECCCCCCCCCCCE----EEEECCCC T ss_conf 738-7669999988999875423799998596999983588----9711----892505687542430----67963999 Q ss_pred EEEEEEECC Q ss_conf 599997406 Q gi|254780621|r 117 LLYLELLRS 125 (157) Q Consensus 117 iLtI~lpK~ 125 (157) .|.|+|.|. T Consensus 77 ~l~i~LeK~ 85 (102) T cd06495 77 CVLLSLSKC 85 (102) T ss_pred EEEEEEEEC T ss_conf 899999700 No 38 >cd02180 GH16_laminarinase Laminarinase, also known as glucan endo-1,3-beta-D-glucosidase, is a glycosyl hydrolase family 16 member that hydrolyzes 1,3-beta-D-glucosidic linkages in 1,3-beta-D-glucans such as laminarins, curdlans, paramylons, and pachymans, with very limited action on mixed-link (1,3-1,4-)-beta-D-glucans. Probab=49.19 E-value=21 Score=16.59 Aligned_cols=44 Identities=23% Similarity=0.314 Sum_probs=26.7 Q ss_pred EEEEEECCEEEEEEEECCCCCC--CCCE----EE----EEEECCEEEEEEECCC Q ss_conf 4999962179999730332355--6621----34----5530230468897386 Q gi|254780621|r 61 ITIEVDSSILMVRGEKKSEEKE--TVEY----LH----RGIAKRAFERRFQLAD 104 (157) Q Consensus 61 i~V~v~~~~LtI~g~~~~~~~~--~~~~----~~----~e~~~~~F~R~f~Lp~ 104 (157) -.|.++++.|+|++.++..... ...| +. ..+.||.|+=+..||. T Consensus 39 ~nv~v~~G~L~I~a~~~~~~~~~~~~~ytSg~i~T~~k~~f~yG~~E~Rak~P~ 92 (238) T cd02180 39 ANAYVENGNLVIEARKESYSDAYGGKEYTSARLKTQGKKSFKYGRIEARAKLPS 92 (238) T ss_pred CCEEECCCCEEEEEECCCCCCCCCCCCEEEEEEECCCCEEEECCEEEEEEECCC T ss_conf 657973995999995046556889810664788717954202778999999269 No 39 >PRK05090 hypothetical protein; Validated Probab=46.06 E-value=19 Score=16.82 Aligned_cols=83 Identities=11% Similarity=0.189 Sum_probs=53.6 Q ss_pred EEEECCCCEEEEEE-CCCCCHHHEEEEEECCEEEEEEEECCCC-CCCCCEEEEEEECCEEEEEEECCC-CCEEEE-EEEE Q ss_conf 89948993999997-0117702149999621799997303323-556621345530230468897386-615810-3852 Q gi|254780621|r 39 IERTGENAYRITIA-VSGFHPSEITIEVDSSILMVRGEKKSEE-KETVEYLHRGIAKRAFERRFQLAD-FVEVMS-ASLE 114 (157) Q Consensus 39 I~~~~e~~y~i~~~-lPG~~kedi~V~v~~~~LtI~g~~~~~~-~~~~~~~~~e~~~~~F~R~f~Lp~-~ve~~~-A~~~ 114 (157) |.+++| ...|.+- -||-+++.| +-++++.|.|+-.-...+ ..+...+ .-+...|.+|. +|.... ++-. T Consensus 4 is~~gd-~~~l~V~V~P~A~r~~i-~g~~~~~Lkv~v~ApPvdGkAN~ali------~~LA~~l~vpks~V~i~~G~tsR 75 (95) T PRK05090 4 VTRDGD-GLVLRLYIQPKASRDQI-VGLHGDELKVAITAPPVDGQANAHLV------KFLAKQFKVAKSQVVIEKGELGR 75 (95) T ss_pred EEEECC-EEEEEEEEEECCCCCEE-CCCCCCEEEEEEECCCCCCHHHHHHH------HHHHHHHCCCHHHEEEEECCCCC T ss_conf 468699-89999999528880458-01039989999965997872899999------99999859975658995258768 Q ss_pred CCEEEEEEECCCCHH Q ss_conf 885999974068601 Q gi|254780621|r 115 NGLLYLELLRSVPER 129 (157) Q Consensus 115 nGiLtI~lpK~~pe~ 129 (157) +=++.|+-|+..|.. T Consensus 76 ~K~i~I~~p~~lp~~ 90 (95) T PRK05090 76 HKQVRIINPQQIPPE 90 (95) T ss_pred CEEEEEECCCCCCHH T ss_conf 439998381309698 No 40 >TIGR02350 prok_dnaK chaperone protein DnaK; InterPro: IPR012725 Molecular chaperones are a diverse family of proteins that function to protect proteins in the intracellular milieu from irreversible aggregation during synthesis and in times of cellular stress. The bacterial molecular chaperone DnaK is an enzyme that couples cycles of ATP binding, hydrolysis, and ADP release by an N-terminal ATP-hydrolysing domain to cycles of sequestration and release of unfolded proteins by a C-terminal substrate binding domain. In prokaryotes, the grpE protein is a co-chaperone for DnaK, and acts as a nucleotide exchange factor, stimulating the rate of ADP release 5000-fold . DnaK is itself a weak ATPase; ATP hydrolysis by DnaK is stimulated by its interaction with another co-chaperone, DnaJ. Thus the co-chaperones DnaJ and GrpE are capable of tightly regulating the nucleotide-bound and substrate-bound state of DnaK in ways that are necessary for the normal housekeeping functions and stress-related functions of the DnaK molecular chaperone cycle. Members of this family are the chaperone DnaK, of the DnaK-DnaJ-GrpE chaperone system. All members of the seed alignment were taken from completely sequenced bacterial or archaeal genomes and (except for the Mycoplasma sequence) found clustered with other genes of this systems. This entry excludes DnaK homologues that are not DnaK itself, such as the heat shock cognate protein HscA (IPR010236 from INTERPRO). However, it is not designed to distinguish among DnaK paralogs in eukaryotes. Note that a number of DnaK genes have shadow ORFs in the same reverse (relative to dnaK) reading frame, a few of which have been assigned glutamate dehydrogenase activity. The significance of this observation is unclear; the lengths of such shadow ORFs are highly variable as if the presumptive protein product is not conserved.; GO: 0005524 ATP binding, 0051082 unfolded protein binding, 0006457 protein folding. Probab=42.74 E-value=26 Score=15.97 Aligned_cols=109 Identities=20% Similarity=0.265 Sum_probs=58.3 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCC--CEEEEEEC-CCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECC Q ss_conf 1034110235678678897645410667778888--43689948-99399999701177021499996217999973033 Q gi|254780621|r 2 RLDVSRIYNSAVGYDTVLSMLDGLGAPSQTSAYP--PYDIERTG-ENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKS 78 (157) Q Consensus 2 ~~~~~~~~~~~~gfD~~f~~~~~~~~~~~~~~~P--~~dI~~~~-e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~ 78 (157) +||+.|| ++|.+-+-.-+..+- .++++-| .-.|+-|- |+.--|++ -++.|||+- T Consensus 384 LLDVTPL---sLGiET~GGV~T~lI--~RNTTIP~~KsqvFSTA~DnQ~~V~I------------------~VlQGER~~ 440 (598) T TIGR02350 384 LLDVTPL---SLGIETLGGVMTKLI--ERNTTIPTKKSQVFSTAADNQPAVEI------------------HVLQGERPM 440 (598) T ss_pred EEECCCC---CCCHHHHHHHHHHHH--CCCCCCCCCCCCEEECCCCCCCEEEE------------------EEEECCHHH T ss_conf 8613565---110130100131110--26774475345203230568956799------------------986088133 Q ss_pred CCCCCCCEEEEEEECCEEEEEEE--CCCCCEEEEEEE---ECCEEEEEEECCCCHHHCCEEEEECCCCCC Q ss_conf 23556621345530230468897--386615810385---288599997406860118728873378875 Q gi|254780621|r 79 EEKETVEYLHRGIAKRAFERRFQ--LADFVEVMSASL---ENGLLYLELLRSVPERMKPRRIEISQSPQE 143 (157) Q Consensus 79 ~~~~~~~~~~~e~~~~~F~R~f~--Lp~~ve~~~A~~---~nGiLtI~lpK~~pe~~~~r~I~I~~~~~~ 143 (157) ..+.. +=|+|+..=- -|-++=.++.+| .||||+|+..-. ..-|...|.|+.+..- T Consensus 441 A~DNk--------~LG~F~L~GI~paPRG~PQIEVTFDIDANGIl~V~AkD~--~TgK~q~I~I~~~s~L 500 (598) T TIGR02350 441 AADNK--------SLGRFELTGIPPAPRGVPQIEVTFDIDANGILHVSAKDK--ATGKEQSITITASSGL 500 (598) T ss_pred HHHHH--------HHCCEEECCCCCCCCCCCCEEEEEEECCCCEEEEEEEEC--CCCCEEEEEEECCCCC T ss_conf 44024--------436621068888956887557876525884488988654--7785004788127898 No 41 >TIGR02856 spore_yqfC sporulation protein YqfC; InterPro: IPR009877 This family consists of several hypothetical bacterial proteins of around 90 residues in length. The function of this family is unknown.. Probab=41.82 E-value=27 Score=15.88 Aligned_cols=44 Identities=18% Similarity=0.379 Sum_probs=34.3 Q ss_pred CCCCEEEEEECCCCEEEEEE--CCCCCHHHEEEEEECCEEEEEEEE Q ss_conf 88843689948993999997--011770214999962179999730 Q gi|254780621|r 33 AYPPYDIERTGENAYRITIA--VSGFHPSEITIEVDSSILMVRGEK 76 (157) Q Consensus 33 ~~P~~dI~~~~e~~y~i~~~--lPG~~kedi~V~v~~~~LtI~g~~ 76 (157) ..|..|+.=.++.+..||=. +-=|+++.|.+....+.++|+|+. T Consensus 18 d~PrIni~~~G~~~i~iENHrGl~~f~~~~vkl~~~~G~i~i~G~~ 63 (87) T TIGR02856 18 DLPRINITLIGKEHIYIENHRGLVVFSPEEVKLNSTNGKITIEGKN 63 (87) T ss_pred CCCCEEEEEECCEEEEEECCEEEEEECCCEEEEECCCCEEEEEECC T ss_conf 8992659988885899852331687758658983378689997375 No 42 >KOG3413 consensus Probab=38.73 E-value=16 Score=17.31 Aligned_cols=13 Identities=31% Similarity=0.465 Sum_probs=6.2 Q ss_pred EEEECCEEEEEEE Q ss_conf 3852885999974 Q gi|254780621|r 111 ASLENGLLYLELL 123 (157) Q Consensus 111 A~~~nGiLtI~lp 123 (157) +.|.||||||.|+ T Consensus 76 v~y~~GVLTl~lg 88 (156) T KOG3413 76 VDYADGVLTLKLG 88 (156) T ss_pred CCCCCCEEEEEEC T ss_conf 3205653899736 No 43 >TIGR02892 spore_yabP sporulation protein YabP; InterPro: IPR012504 Check - See: . Probab=38.22 E-value=26 Score=16.00 Aligned_cols=40 Identities=23% Similarity=0.382 Sum_probs=26.7 Q ss_pred EECCCCCHHHEEEEEECCEEEEEEEE---CCCCCCCCCEEEEE Q ss_conf 97011770214999962179999730---33235566213455 Q gi|254780621|r 51 IAVSGFHPSEITIEVDSSILMVRGEK---KSEEKETVEYLHRG 90 (157) Q Consensus 51 ~~lPG~~kedi~V~v~~~~LtI~g~~---~~~~~~~~~~~~~e 90 (157) .+|-=|+.++|-|+.+=|.|+|+|+- +.-+-+++.+.-.| T Consensus 17 ~~V~SFd~eei~LeT~~G~L~ikG~~L~~~~LDv~~G~v~i~G 59 (88) T TIGR02892 17 KEVISFDDEEILLETVMGFLTIKGQELKMNKLDVENGQVIIKG 59 (88) T ss_pred EEEECCCCCEEEEEECCCEEEEECCCCEECCCCCCCCEEEEEE T ss_conf 5532148878998631455888227630047305556799877 No 44 >CHL00140 rpl6 ribosomal protein L6; Validated Probab=36.45 E-value=33 Score=15.36 Aligned_cols=67 Identities=19% Similarity=0.297 Sum_probs=32.9 Q ss_pred EEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCC--CCC------EEEEEEECCEEEEEEECC-CCCEEEEEEEECCEE Q ss_conf 99997011770214999962179999730332355--662------134553023046889738-661581038528859 Q gi|254780621|r 48 RITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKE--TVE------YLHRGIAKRAFERRFQLA-DFVEVMSASLENGLL 118 (157) Q Consensus 48 ~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~--~~~------~~~~e~~~~~F~R~f~Lp-~~ve~~~A~~~nGiL 118 (157) .++..+| ..|+|.++++.++|+......... ... -+-.|.+.| |++...|- -+. .|+++.-.| T Consensus 32 ~L~~~~~----~~v~v~~~~~~i~v~~~~~~k~~ka~~GT~RslI~NmI~GVt~G-f~k~LeivGvGy---ra~~~g~~L 103 (178) T CHL00140 32 TLSRTLP----EGIKIEQQNNTLFVKPVEDTKKARKLHGLYRTLINNMVIGVSEG-FEKKLELQGVGY---RAQLQGKDL 103 (178) T ss_pred EEEEECC----CCEEEEEECCEEEEECCCCHHHHHHHHHHHHHHHHHHHEECCCC-EEEEEEEEEEEE---EEEECCCEE T ss_conf 8999868----97399984998986046641789999999999985431222057-289999996128---987069779 Q ss_pred EEEE Q ss_conf 9997 Q gi|254780621|r 119 YLEL 122 (157) Q Consensus 119 tI~l 122 (157) .+.| T Consensus 104 ~l~L 107 (178) T CHL00140 104 ILNV 107 (178) T ss_pred EECC T ss_conf 9935 No 45 >pfam01491 Frataxin_Cyay Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain is currently unknown. It has been suggested that this family is involved in iron transport. Probab=36.33 E-value=33 Score=15.35 Aligned_cols=14 Identities=21% Similarity=0.496 Sum_probs=6.5 Q ss_pred HEEEEEECCEEEEE Q ss_conf 14999962179999 Q gi|254780621|r 60 EITIEVDSSILMVR 73 (157) Q Consensus 60 di~V~v~~~~LtI~ 73 (157) ||+++..+|+|+|+ T Consensus 31 diD~e~~~gvLti~ 44 (108) T pfam01491 31 DIDVERSGGVLTLT 44 (108) T ss_pred CEEEECCCCEEEEE T ss_conf 65775259889999 No 46 >PTZ00179 60S ribosomal protein L9; Provisional Probab=32.35 E-value=33 Score=15.37 Aligned_cols=18 Identities=33% Similarity=0.791 Sum_probs=10.6 Q ss_pred HHHEEEEEECCEEEEEEE Q ss_conf 021499996217999973 Q gi|254780621|r 58 PSEITIEVDSSILMVRGE 75 (157) Q Consensus 58 kedi~V~v~~~~LtI~g~ 75 (157) |++++|++.++.++|+|. T Consensus 12 PegV~vtv~~~~VtVkGp 29 (189) T PTZ00179 12 PEDVTVSVKDRIVTVKGK 29 (189) T ss_pred CCCCEEEEECCEEEEECC T ss_conf 999899998999999989 No 47 >PRK05518 rpl6p 50S ribosomal protein L6P; Reviewed Probab=31.55 E-value=36 Score=15.15 Aligned_cols=16 Identities=19% Similarity=0.389 Sum_probs=11.4 Q ss_pred HHEEEEEECCEEEEEE Q ss_conf 2149999621799997 Q gi|254780621|r 59 SEITIEVDSSILMVRG 74 (157) Q Consensus 59 edi~V~v~~~~LtI~g 74 (157) ..++|+++++.|+|+. T Consensus 41 ~~v~i~~~~~~i~v~~ 56 (181) T PRK05518 41 PGIKISVEDGKVVIET 56 (181) T ss_pred CCEEEEEECCEEEEEE T ss_conf 8589999099799980 No 48 >cd00237 p23 p23 binds heat shock protein (Hsp)90 and participates in the folding of a number of Hsp90 clients, including the progesterone receptor. p23 also has a passive chaperoning activity and in addition may participate in prostaglandin synthesis. Probab=31.25 E-value=40 Score=14.83 Aligned_cols=68 Identities=13% Similarity=0.162 Sum_probs=43.2 Q ss_pred CCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEEEEEEC--CEEEEE Q ss_conf 993999997011770214999962179999730332355662134553023046889738661581038528--859999 Q gi|254780621|r 44 ENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMSASLEN--GLLYLE 121 (157) Q Consensus 44 e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~A~~~n--GiLtI~ 121 (157) .+..-|.++|+. ..|++|.++++.|+.+|.-. ++.+ |.-.+.|-..|++....+.. .-+.+. T Consensus 10 ~d~vylTi~l~D--~kd~~i~l~~~~l~F~a~~~----~~~~----------y~~~leff~~I~pe~Sk~~~~~R~i~~~ 73 (106) T cd00237 10 RDYVFIEFCVED--SKDVKVDFEKSKLTFSCLNG----DNVK----------IYNEIELYDRVDPNDSKHKRTDRSILCC 73 (106) T ss_pred CCEEEEEEECCC--CCCCEEEEECCEEEEEEECC----CCCE----------EEEEEECCCCCCHHHCEEEECCCEEEEE T ss_conf 886999998278--88887987549799998879----9956----------7899887146780045587069609999 Q ss_pred EECCCC Q ss_conf 740686 Q gi|254780621|r 122 LLRSVP 127 (157) Q Consensus 122 lpK~~p 127 (157) |.|..+ T Consensus 74 l~K~e~ 79 (106) T cd00237 74 LRKGKE 79 (106) T ss_pred EEECCC T ss_conf 998977 No 49 >TIGR03653 arch_L6P archaeal ribosomal protein L6P. Members of this protein family are the archaeal ribosomal protein L6P. The top-scoring proteins not selected by this model are eukaryotic cytosolic ribosomal protein L9. Bacterial ribosomal protein L6 scores lower and is described by a distinct model. Probab=30.62 E-value=39 Score=14.92 Aligned_cols=17 Identities=18% Similarity=0.302 Sum_probs=11.9 Q ss_pred HHHEEEEEECCEEEEEE Q ss_conf 02149999621799997 Q gi|254780621|r 58 PSEITIEVDSSILMVRG 74 (157) Q Consensus 58 kedi~V~v~~~~LtI~g 74 (157) ..+++|.++++.|+|+- T Consensus 34 ~~~v~v~~~~~~i~v~~ 50 (170) T TIGR03653 34 YPGIEISVEDGKVVIET 50 (170) T ss_pred CCCEEEEECCCEEEEEE T ss_conf 87246897398899972 No 50 >PTZ00027 60S ribosomal protein L6; Provisional Probab=30.21 E-value=40 Score=14.83 Aligned_cols=18 Identities=33% Similarity=0.645 Sum_probs=11.6 Q ss_pred HHHEEEEEECCEEEEEEE Q ss_conf 021499996217999973 Q gi|254780621|r 58 PSEITIEVDSSILMVRGE 75 (157) Q Consensus 58 kedi~V~v~~~~LtI~g~ 75 (157) |++++|++.++.++|+|. T Consensus 13 PegV~vti~~~~VtVkGp 30 (190) T PTZ00027 13 PEGVTVSVKSRKVTVKGK 30 (190) T ss_pred CCCCEEEEECCEEEEECC T ss_conf 999799997889999918 No 51 >TIGR03654 L6_bact ribosomal protein L6, bacterial type. Probab=30.20 E-value=42 Score=14.72 Aligned_cols=25 Identities=16% Similarity=0.335 Sum_probs=17.9 Q ss_pred EEEEECCCCCHHHEEEEEECCEEEEEEEE Q ss_conf 99997011770214999962179999730 Q gi|254780621|r 48 RITIAVSGFHPSEITIEVDSSILMVRGEK 76 (157) Q Consensus 48 ~i~~~lPG~~kedi~V~v~~~~LtI~g~~ 76 (157) .+...+| ..|+++++++.+.|+... T Consensus 31 ~l~~~~~----~~i~i~~~~~~i~v~~~~ 55 (175) T TIGR03654 31 ELSRTLH----PGVTVKVEDGQLTVSRPN 55 (175) T ss_pred EEEEECC----CCEEEEEECCEEEEEECC T ss_conf 7989859----974999939999999889 No 52 >pfam07873 YabP YabP family. This family of proteins is involved in spore coat assembly during the process of sporulation. Probab=29.97 E-value=36 Score=15.17 Aligned_cols=24 Identities=33% Similarity=0.700 Sum_probs=19.6 Q ss_pred CCCCCHHHEEEEEECCEEEEEEEE Q ss_conf 011770214999962179999730 Q gi|254780621|r 53 VSGFHPSEITIEVDSSILMVRGEK 76 (157) Q Consensus 53 lPG~~kedi~V~v~~~~LtI~g~~ 76 (157) +--|+++.|.+....+.|+|+|+. T Consensus 20 I~~y~~~~I~l~t~~G~l~I~G~~ 43 (66) T pfam07873 20 VISFDEEEIRLETKQGVLIIKGEN 43 (66) T ss_pred EEEECCCEEEEEECCEEEEEECCC T ss_conf 899988899999285799999434 No 53 >PRK05498 rplF 50S ribosomal protein L6; Validated Probab=27.23 E-value=46 Score=14.45 Aligned_cols=25 Identities=16% Similarity=0.388 Sum_probs=17.5 Q ss_pred EEEEECCCCCHHHEEEEEECCEEEEEEEE Q ss_conf 99997011770214999962179999730 Q gi|254780621|r 48 RITIAVSGFHPSEITIEVDSSILMVRGEK 76 (157) Q Consensus 48 ~i~~~lPG~~kedi~V~v~~~~LtI~g~~ 76 (157) .++..++ ..|+|.++++.+.|.... T Consensus 32 ~l~~~~~----~~i~i~~~~~~I~v~~~~ 56 (178) T PRK05498 32 ELSRTLN----PDVTVKVEDNELTVTRPD 56 (178) T ss_pred EEEEECC----CCEEEEEECCEEEEEECC T ss_conf 7989869----984999969989997668 No 54 >cd06465 p23_hB-ind1_like p23_like domain found in human (h) butyrate-induced transcript 1 (B-ind1) and similar proteins. hB-ind1 participates in signaling by the small GTPase Rac1. It binds to Rac1 and enhances different Rac1 effects including activation of nuclear factor (NF) kappaB and activation of c-Jun N-terminal kinase (JNK). hB-ind1 also plays a part in the RNA replication and particle production of Hepatitis C virus (HCV) through its interaction with heat shock protein Hsp90, HCV nonstructural protein 5A (NS5A), and the immunophilin FKBP8. hB-ind1 is upregulated in the outer layer of Chinese hamster V79 cells grown as multicell spheroids, versus in the same cells grown as monolayers. This group includes the Saccharomyces cerevisiae Sba1, a co-chaperone of the Hsp90. Sba1 has been shown to be is required for telomere length maintenance, and may modulate telomerase DNA-binding activity. Probab=26.58 E-value=49 Score=14.33 Aligned_cols=72 Identities=8% Similarity=0.153 Sum_probs=47.5 Q ss_pred EEECCCCEEEEEECCCCCHHHEEEEEECCEEEEEEEECCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEEEEEE--CCE Q ss_conf 994899399999701177021499996217999973033235566213455302304688973866158103852--885 Q gi|254780621|r 40 ERTGENAYRITIAVSGFHPSEITIEVDSSILMVRGEKKSEEKETVEYLHRGIAKRAFERRFQLADFVEVMSASLE--NGL 117 (157) Q Consensus 40 ~~~~e~~y~i~~~lPG~~kedi~V~v~~~~LtI~g~~~~~~~~~~~~~~~e~~~~~F~R~f~Lp~~ve~~~A~~~--nGi 117 (157) .+.. +..-|.++|++. .|.+|.++++.|..+|.-.. ++. .|.-.+.|-..|.+...+++ .-- T Consensus 6 aQr~-~~v~ltI~l~d~--~d~~v~~~~~~l~F~~~~~~---~~~----------~Y~~~l~l~~~I~pe~Sk~~~~~r~ 69 (108) T cd06465 6 AQRS-DVVYLTIELPDA--KDPKIKLEPTSLSFKAKGGG---GGK----------KYEFDLEFYKEIDPEESKYKVTGRQ 69 (108) T ss_pred EECC-CEEEEEEEECCC--CCCEEEEECCEEEEEEECCC---CCC----------EEEEEEECCCCCCCCCCEEEECCCE T ss_conf 4779-989999995888--87589987288999988089---992----------6999998103438005269966861 Q ss_pred EEEEEECCCC Q ss_conf 9999740686 Q gi|254780621|r 118 LYLELLRSVP 127 (157) Q Consensus 118 LtI~lpK~~p 127 (157) +.++|.|..+ T Consensus 70 i~~~l~Kk~~ 79 (108) T cd06465 70 IEFVLRKKEA 79 (108) T ss_pred EEEEEEECCC T ss_conf 9999996777 No 55 >COG2880 Uncharacterized protein conserved in archaea [Function unknown] Probab=25.69 E-value=41 Score=14.81 Aligned_cols=30 Identities=30% Similarity=0.294 Sum_probs=18.2 Q ss_pred EEEEEEEECCEEEEEEECCCCHHHCCEEEEE Q ss_conf 5810385288599997406860118728873 Q gi|254780621|r 107 EVMSASLENGLLYLELLRSVPERMKPRRIEI 137 (157) Q Consensus 107 e~~~A~~~nGiLtI~lpK~~pe~~~~r~I~I 137 (157) +++.|-|+||||.--=+-..++..+ .+|.| T Consensus 5 kIIEaiYEnGVfKPLqkv~LkEGek-v~i~i 34 (67) T COG2880 5 KIIEAIYENGVLKPLKKVDLKEGEK-VKIVI 34 (67) T ss_pred HHEEEEEECCCCCCCCCCCCCCCCE-EEEEE T ss_conf 3032443067543154366888878-99998 No 56 >PRK00446 cyaY frataxin-like protein; Provisional Probab=24.59 E-value=53 Score=14.10 Aligned_cols=14 Identities=21% Similarity=0.437 Sum_probs=7.3 Q ss_pred HEEEEEECCEEEEE Q ss_conf 14999962179999 Q gi|254780621|r 60 EITIEVDSSILMVR 73 (157) Q Consensus 60 di~V~v~~~~LtI~ 73 (157) ||+++..+|+|+|+ T Consensus 27 DiD~~~~~gvLtl~ 40 (104) T PRK00446 27 DIDCERNGGVLTLT 40 (104) T ss_pred CEEEECCCCEEEEE T ss_conf 85565469989999 No 57 >PRK05183 hscA chaperone protein HscA; Provisional Probab=22.59 E-value=58 Score=13.86 Aligned_cols=38 Identities=16% Similarity=0.176 Sum_probs=23.2 Q ss_pred CCCCEEEEEEE---ECCEEEEEEECCCCHHHCCEEEEECCCCC Q ss_conf 86615810385---28859999740686011872887337887 Q gi|254780621|r 103 ADFVEVMSASL---ENGLLYLELLRSVPERMKPRRIEISQSPQ 142 (157) Q Consensus 103 p~~ve~~~A~~---~nGiLtI~lpK~~pe~~~~r~I~I~~~~~ 142 (157) |.++-.+..+| .||||+|+..-. ..-+...|.|+.+.. T Consensus 468 p~G~~~I~VtF~iD~nGiL~VsA~d~--~tg~~~~i~i~~~~g 508 (621) T PRK05183 468 AAGAARIRVTFQVDADGLLSVTAMEK--STGVEASIQVKPSYG 508 (621) T ss_pred CCCCCCEEEEEEECCCCEEEEEEEEC--CCCCEEEEEECCCCC T ss_conf 89998689999987996799999985--679713599847999 No 58 >pfam08308 PEGA PEGA domain. This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands [Adindla et al. Comparative and Functional Genomics 2004; 5:2-16]. Probab=22.54 E-value=58 Score=13.86 Aligned_cols=40 Identities=13% Similarity=0.223 Sum_probs=26.0 Q ss_pred CEEEEEECCCCEEEEEECCCCCHHHEEEEEEC-CEEEEEEE Q ss_conf 43689948993999997011770214999962-17999973 Q gi|254780621|r 36 PYDIERTGENAYRITIAVSGFHPSEITIEVDS-SILMVRGE 75 (157) Q Consensus 36 ~~dI~~~~e~~y~i~~~lPG~~kedi~V~v~~-~~LtI~g~ 75 (157) |+++.......|.|++..+||..-.-.|.+.. ....|... T Consensus 26 P~~~~~l~~G~h~v~v~~~GY~~~~~~v~V~~g~~~~v~~~ 66 (71) T pfam08308 26 PVTLSDLPAGTHTLRLEKEGYEDYSTTVTVTAGETVSVSLT 66 (71) T ss_pred CCCCCCCCCCCEEEEEEECCCEEEEEEEEECCCCEEEEEEE T ss_conf 87413258966899999599850799999969998999899 No 59 >cd00503 Frataxin Frataxin is a nuclear-encoded mitochondrial protein implicated in Friedreich's ataxia (FRDA), an human autosomal recessive neurodegenerative disease; Frataxin is found in eukaryotes and in purple bacteria; lack of frataxin causes iron to accumulate in the mitochondrial matrix suggesting that frataxin is involved in mitochondrial iron homeostasis and possibly in iron transport; the domain has an alpha-beta fold consisting of two helices flanking an antiparallel beta sheet. Probab=21.54 E-value=61 Score=13.73 Aligned_cols=14 Identities=21% Similarity=0.553 Sum_probs=6.0 Q ss_pred HEEEEEECCEEEEE Q ss_conf 14999962179999 Q gi|254780621|r 60 EITIEVDSSILMVR 73 (157) Q Consensus 60 di~V~v~~~~LtI~ 73 (157) |++++..+++|+|+ T Consensus 29 diD~e~~~gvLti~ 42 (105) T cd00503 29 DIDVETQGGVLTLT 42 (105) T ss_pred CEEEECCCCEEEEE T ss_conf 74445579979999 No 60 >pfam04972 BON Putative phospholipid-binding domain. This domain is found in a family of osmotic shock protection proteins. It is also found in some Secretins and a group of potential haemolysins. Its likely function is attachment to phospholipid membranes. Probab=21.53 E-value=61 Score=13.73 Aligned_cols=26 Identities=27% Similarity=0.572 Sum_probs=21.0 Q ss_pred CCCCHHHEEEEEECCEEEEEEEECCC Q ss_conf 11770214999962179999730332 Q gi|254780621|r 54 SGFHPSEITIEVDSSILMVRGEKKSE 79 (157) Q Consensus 54 PG~~kedi~V~v~~~~LtI~g~~~~~ 79 (157) +.++..+|+|.+.++..++.|.-... T Consensus 12 ~~~~~~~I~V~v~~g~V~L~G~v~s~ 37 (64) T pfam04972 12 PGLPASDIKVTVENGVVTLSGTVDSE 37 (64) T ss_pred CCCCCCCEEEEEECCEEEEEEECCCH T ss_conf 89985716999989699999985999 No 61 >TIGR03421 FeS_CyaY iron donor protein CyaY. Members of this protein family are the iron-sulfur cluster (FeS) metabolism protein CyaY, a homolog of eukaryotic frataxin. ISC is one of several bacterial systems for FeS assembly; we find by Partial Phylogenetic Profiling vs. the ISC system that CyaY most like work with the ISC system for FeS cluster biosynthesis. A study of of cyaY mutants in Salmonella enterica bears this out. Although the trusted cutoff is set low enough to include eukaryotic frataxin sequences, a narrower, exception-type model (TIGR03421) identifies identifies members of that specific set. Probab=20.28 E-value=65 Score=13.57 Aligned_cols=13 Identities=23% Similarity=0.353 Sum_probs=5.5 Q ss_pred EEEEEECCEEEEE Q ss_conf 4999962179999 Q gi|254780621|r 61 ITIEVDSSILMVR 73 (157) Q Consensus 61 i~V~v~~~~LtI~ 73 (157) |+++..+++|+|+ T Consensus 27 iD~~~~~gvLti~ 39 (102) T TIGR03421 27 IDCERAGGVLTLE 39 (102) T ss_pred EEEECCCCEEEEE T ss_conf 5566259989999 No 62 >cd02175 GH16_lichenase Lichenase, also known as 1,3-1,4-beta-glucanase, is a member of glycosyl hydrolase family 16, that specifically cleaves 1,4-beta-D-glucosidic bonds in mixed-linked beta glucans that also contain 1,3-beta-D-glucosidic linkages. Natural substrates of beta-glucanase are beta-glucans from grain endosperm cell walls or lichenan from the Islandic moss, Cetraria islandica. This protein is found not only in bacteria but also in anaerobic fungi. This domain includes two seven-stranded antiparallel beta-sheets that are adjacent to one another forming a compact, jellyroll beta-sandwich structure. Probab=20.08 E-value=65 Score=13.54 Aligned_cols=55 Identities=15% Similarity=0.227 Sum_probs=30.1 Q ss_pred CCHHHEEEEEECCEEEEEEEECCCCCCC---CCEE-EEEEECCEEEEEEECCCCCEEEEEE Q ss_conf 7702149999621799997303323556---6213-4553023046889738661581038 Q gi|254780621|r 56 FHPSEITIEVDSSILMVRGEKKSEEKET---VEYL-HRGIAKRAFERRFQLADFVEVMSAS 112 (157) Q Consensus 56 ~~kedi~V~v~~~~LtI~g~~~~~~~~~---~~~~-~~e~~~~~F~R~f~Lp~~ve~~~A~ 112 (157) .+++++ .++++.|.|+..++...... .++. ++-+.||+|+=+..+|..--+..|- T Consensus 29 ~~~~Nv--~v~dG~L~i~~~~~~~~~~~ytsg~i~T~~~f~YGr~EaR~K~p~g~G~~pAf 87 (209) T cd02175 29 WRANNV--KFSDGILKLSLDSDGSGQNKYACGEYRTTNRYGYGLYEVRMKAAKNSGLVSSF 87 (209) T ss_pred CCHHCE--EEECCEEEEEEECCCCCCCCEEEEEEEEECCCCCEEEEEEEECCCCCCEEEEE T ss_conf 267725--99799899999764579997567999873632231899999858999500132 No 63 >PRK12700 flgH flagellar basal body L-ring protein; Reviewed Probab=20.06 E-value=65 Score=13.54 Aligned_cols=11 Identities=36% Similarity=0.323 Sum_probs=5.1 Q ss_pred EEECCEEEEEE Q ss_conf 85288599997 Q gi|254780621|r 112 SLENGLLYLEL 122 (157) Q Consensus 112 ~~~nGiLtI~l 122 (157) -|-||-|.|.= T Consensus 153 VlpNGNL~I~G 163 (230) T PRK12700 153 VLPNGNLQIAG 163 (230) T ss_pred ECCCCCEEEEE T ss_conf 86699589999 Done!