Query gi|254780585|ref|YP_003064998.1| molecular chaperone DnaJ family protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 191 No_of_seqs 180 out of 6459 Neff 6.0 Searched_HMMs 39220 Date Mon May 30 00:16:33 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780585.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10767 chaperone protein Dna 99.7 8.5E-18 2.2E-22 137.5 6.4 57 131-187 3-59 (376) 2 COG0484 DnaJ DnaJ-class molecu 99.7 3E-17 7.6E-22 133.9 6.3 56 132-187 4-59 (371) 3 PRK10266 curved DNA-binding pr 99.7 1.5E-16 3.9E-21 129.3 6.2 56 131-187 3-58 (306) 4 PTZ00037 DnaJ_C chaperone prot 99.6 4.5E-16 1.2E-20 126.2 5.5 59 125-187 19-77 (422) 5 KOG0713 consensus 99.6 2.9E-15 7.3E-20 120.9 6.6 59 130-188 14-72 (336) 6 PRK09430 djlA Dna-J like membr 99.5 1.2E-13 3E-18 110.4 6.3 63 127-189 197-266 (269) 7 KOG0712 consensus 99.4 3.2E-13 8.1E-18 107.6 4.8 53 133-188 5-57 (337) 8 KOG0716 consensus 99.4 5.7E-13 1.5E-17 105.9 6.1 57 131-187 30-86 (279) 9 cd06257 DnaJ DnaJ domain or J- 99.4 9.9E-13 2.5E-17 104.3 6.4 55 133-187 1-55 (55) 10 pfam00226 DnaJ DnaJ domain. Dn 99.4 1E-12 2.6E-17 104.2 6.4 56 133-188 1-56 (63) 11 smart00271 DnaJ DnaJ molecular 99.3 3.1E-12 7.9E-17 101.1 6.3 56 133-188 2-58 (60) 12 KOG0715 consensus 99.3 2.1E-12 5.4E-17 102.1 5.3 55 133-188 44-98 (288) 13 KOG0691 consensus 99.3 7.4E-12 1.9E-16 98.6 6.8 59 131-189 4-62 (296) 14 KOG0717 consensus 99.3 1.2E-11 3E-16 97.3 6.8 60 128-187 4-64 (508) 15 COG2214 CbpA DnaJ-class molecu 99.1 1.1E-10 2.9E-15 90.8 6.8 59 130-188 4-63 (237) 16 KOG0721 consensus 99.1 6.4E-11 1.6E-15 92.5 5.4 61 127-187 94-154 (230) 17 KOG0719 consensus 99.0 9.3E-10 2.4E-14 84.9 5.6 57 130-186 12-70 (264) 18 KOG0718 consensus 98.9 2.8E-09 7.2E-14 81.7 6.8 59 129-187 6-67 (546) 19 KOG0722 consensus 98.8 2E-09 5.1E-14 82.7 3.4 61 127-188 28-88 (329) 20 KOG0720 consensus 98.8 3.6E-09 9.1E-14 81.0 4.1 57 131-188 234-290 (490) 21 KOG0550 consensus 98.8 6.9E-09 1.8E-13 79.2 5.2 56 132-187 373-429 (486) 22 PTZ00100 DnaJ chaperone protei 98.8 1.1E-08 2.8E-13 77.9 6.2 59 125-187 56-114 (114) 23 KOG0714 consensus 98.6 8.6E-08 2.2E-12 72.0 4.9 57 132-188 3-60 (306) 24 COG5407 SEC63 Preprotein trans 98.6 6.5E-08 1.7E-12 72.8 4.3 58 130-187 96-158 (610) 25 KOG1150 consensus 98.5 1.5E-07 3.9E-12 70.4 5.3 57 131-187 52-109 (250) 26 KOG0568 consensus 98.5 5.2E-07 1.3E-11 66.9 6.5 60 126-186 41-101 (342) 27 KOG0624 consensus 98.4 4.6E-07 1.2E-11 67.3 5.1 60 128-187 390-452 (504) 28 PRK01356 hscB co-chaperone Hsc 98.4 1.1E-06 2.9E-11 64.7 6.3 55 133-187 3-62 (166) 29 KOG1789 consensus 98.3 1.4E-06 3.7E-11 64.0 6.6 58 127-187 1276-1337(2235) 30 PRK05014 hscB co-chaperone Hsc 98.3 1.6E-06 4.2E-11 63.6 5.6 56 133-188 2-64 (171) 31 PRK00294 hscB co-chaperone Hsc 98.3 2.5E-06 6.5E-11 62.4 6.1 56 133-188 5-67 (173) 32 PRK01773 hscB co-chaperone Hsc 98.2 3.5E-06 9E-11 61.4 6.2 56 133-188 3-65 (173) 33 PRK03578 hscB co-chaperone Hsc 98.2 3.8E-06 9.7E-11 61.2 6.3 56 132-187 6-68 (176) 34 KOG0723 consensus 98.0 1.8E-05 4.5E-10 56.9 6.5 59 128-190 52-110 (112) 35 COG5269 ZUO1 Ribosome-associat 97.5 0.00024 6E-09 49.5 5.4 58 131-188 42-104 (379) 36 COG1076 DjlA DnaJ-domain-conta 97.3 0.00015 3.8E-09 50.8 3.0 54 132-185 113-173 (174) 37 TIGR02349 DnaJ_bact chaperone 96.5 0.00094 2.4E-08 45.6 1.2 42 129-187 15-56 (386) 38 KOG0431 consensus 96.3 0.0049 1.3E-07 40.9 4.1 51 138-188 394-451 (453) 39 COG1076 DjlA DnaJ-domain-conta 93.6 0.095 2.4E-06 32.5 3.8 51 138-188 7-64 (174) 40 pfam03656 Pam16 Pam16. The Pam 92.6 0.55 1.4E-05 27.5 6.5 41 127-168 53-93 (127) 41 KOG3192 consensus 92.3 0.32 8.2E-06 29.0 5.0 56 132-187 8-70 (168) 42 pfam00790 VHS VHS domain. Doma 77.3 4.4 0.00011 21.6 4.3 49 136-189 88-136 (136) 43 KOG3442 consensus 71.8 6.2 0.00016 20.6 3.9 52 128-180 55-113 (132) 44 cd03569 VHS_Hrs_Vps27p VHS dom 62.4 8.4 0.00021 19.7 3.0 43 143-190 95-137 (142) 45 KOG0100 consensus 60.3 7.5 0.00019 20.0 2.5 128 34-163 489-637 (663) 46 KOG3960 consensus 56.3 8.3 0.00021 19.8 2.1 15 174-188 129-143 (284) 47 smart00288 VHS Domain present 54.8 15 0.00038 18.1 3.2 43 142-189 91-133 (133) 48 cd03565 VHS_Tom1 VHS domain fa 53.0 20 0.0005 17.3 4.7 42 144-190 97-138 (141) 49 cd03568 VHS_STAM VHS domain fa 52.0 15 0.00038 18.1 2.9 41 145-190 93-133 (144) 50 pfam12005 DUF3499 Protein of u 46.5 22 0.00057 16.9 3.1 63 25-87 2-89 (125) 51 TIGR02983 SigE-fam_strep RNA p 46.3 11 0.00029 18.9 1.5 23 136-158 137-159 (165) 52 cd01801 Tsc13_N Tsc13_N N-te 43.1 9.9 0.00025 19.3 0.8 31 135-165 13-44 (77) 53 TIGR00873 gnd 6-phosphoglucona 38.6 22 0.00057 17.0 2.1 17 25-41 167-183 (480) 54 KOG2629 consensus 38.3 33 0.00084 15.8 4.9 37 57-97 22-59 (300) 55 TIGR02999 Sig-70_X6 RNA polyme 37.9 19 0.00048 17.4 1.6 28 137-164 51-79 (194) 56 KOG2217 consensus 36.9 35 0.00089 15.7 4.7 41 148-189 628-668 (705) 57 KOG2653 consensus 36.1 36 0.00091 15.6 3.0 53 24-82 169-223 (487) 58 PRK10203 hypothetical protein; 36.0 27 0.00068 16.5 2.1 20 70-89 4-23 (122) 59 pfam02467 Whib Transcription f 35.1 28 0.00071 16.3 2.1 26 54-79 39-64 (66) 60 COG5062 Uncharacterized membra 34.9 8.5 0.00022 19.7 -0.6 54 130-191 296-349 (429) 61 COG4989 Predicted oxidoreducta 34.2 36 0.00092 15.6 2.6 38 145-189 112-149 (298) 62 TIGR02980 SigBFG RNA polymeras 33.5 22 0.00055 17.0 1.3 16 60-75 20-36 (229) 63 pfam12031 DUF3518 Domain of un 29.2 31 0.00078 16.1 1.5 17 48-64 70-86 (257) 64 PRK13411 molecular chaperone D 28.5 35 0.00089 15.7 1.7 17 71-87 500-516 (655) 65 TIGR01057 topA_arch DNA topois 27.2 50 0.0013 14.6 2.8 33 125-157 282-314 (637) 66 PTZ00043 cytochrome c oxidase 24.4 56 0.0014 14.3 4.3 41 147-190 83-123 (270) 67 cd03567 VHS_GGA VHS domain fam 24.3 57 0.0014 14.3 3.7 40 143-190 97-136 (139) 68 pfam07800 DUF1644 Protein of u 23.0 58 0.0015 14.2 2.0 12 155-166 116-127 (152) 69 KOG3242 consensus 22.0 63 0.0016 14.0 2.9 34 153-186 157-193 (208) 70 pfam03343 SART-1 SART-1 family 21.6 64 0.0016 14.0 3.3 31 149-180 545-575 (577) 71 pfam12132 DUF3587 Protein of u 20.1 69 0.0017 13.8 2.8 24 40-63 153-177 (201) No 1 >PRK10767 chaperone protein DnaJ; Provisional Probab=99.72 E-value=8.5e-18 Score=137.48 Aligned_cols=57 Identities=37% Similarity=0.688 Sum_probs=54.2 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 976998617773288999999999999996578899798799999999999999996 Q gi|254780585|r 131 QFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 131 ~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~ 187 (191) +++.|+||||+.+||.+|||++||+|+++||||+|.||++||++|++|++||++|.. T Consensus 3 ~kDyY~iLGV~~~As~~eIKkAYRkLA~k~HPDkn~~d~~Ae~kFkeI~eAYevLsD 59 (376) T PRK10767 3 KQDYYEILGVSKTAEEREIKKAYKRLAMKYHPDRNQGDKEAEAKFKEIKEAYEVLTD 59 (376) T ss_pred CCCCHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 788188719899979999999999999985889899987999999999999998689 No 2 >COG0484 DnaJ DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones] Probab=99.69 E-value=3e-17 Score=133.93 Aligned_cols=56 Identities=45% Similarity=0.789 Sum_probs=53.2 Q ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 76998617773288999999999999996578899798799999999999999996 Q gi|254780585|r 132 FNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 132 ~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~ 187 (191) ++-|+||||+.+||.+|||++||+|+++||||+|.|++.|||||++|++||++|.. T Consensus 4 ~dyYeiLGV~k~As~~EIKkAYRKLA~kyHPD~n~g~~~AeeKFKEI~eAYEVLsD 59 (371) T COG0484 4 RDYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNPGDKEAEEKFKEINEAYEVLSD 59 (371) T ss_pred CCHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 54364209888999999999999999986889898987799999999999987478 No 3 >PRK10266 curved DNA-binding protein CbpA; Provisional Probab=99.66 E-value=1.5e-16 Score=129.29 Aligned_cols=56 Identities=27% Similarity=0.561 Sum_probs=51.9 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 976998617773288999999999999996578899798799999999999999996 Q gi|254780585|r 131 QFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 131 ~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~ 187 (191) .++-|+||||+.+||.+|||++||+|+++||||+|. +..||++|++|++||++|.. T Consensus 3 ~kDyY~vLGV~~~As~~EIKkAYRkLA~kyHPD~n~-d~~AeekFkeI~eAYeVLsD 58 (306) T PRK10266 3 LKDYYAIMGVKPTDDLKTIKTAYRRLARKYHPDVSK-EPDAEARFKEVAEAWEVLSD 58 (306) T ss_pred CCCHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCC-CCCHHHHHHHHHHHHHHHCC T ss_conf 677088759899989999999999999986879866-92899999999999998689 No 4 >PTZ00037 DnaJ_C chaperone protein; Provisional Probab=99.62 E-value=4.5e-16 Score=126.18 Aligned_cols=59 Identities=34% Similarity=0.643 Sum_probs=51.7 Q ss_pred CCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 158888976998617773288999999999999996578899798799999999999999996 Q gi|254780585|r 125 HRVGSMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 125 ~~~~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~ 187 (191) +....++.+-|+||||+++||.+|||++||+|+++||||+| +| +|+|++|++||++|.+ T Consensus 19 ~~~~v~~t~yY~iLGV~~~As~~EIKKAYRKLA~KyHPDKn-~d---~EKFKeIseAYEVLSD 77 (422) T PTZ00037 19 RKREVDNEKLYEVLNLSKDCTESEIKKAYRKLAIKHHPDKG-GD---PEKFKEISRAYEVLSD 77 (422) T ss_pred CCCCCCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCC-CC---HHHHHHHHHHHHHHCC T ss_conf 88878767656865989998999999999999998499989-88---7999999999987589 No 5 >KOG0713 consensus Probab=99.58 E-value=2.9e-15 Score=120.93 Aligned_cols=59 Identities=32% Similarity=0.603 Sum_probs=55.2 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 89769986177732889999999999999965788997987999999999999999962 Q gi|254780585|r 130 MQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKS 188 (191) Q Consensus 130 ~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~S 188 (191) ..++.|++|||+.+|+..+||++||+|+++||||+|.+|+.|.++|++|+.||++|+.. T Consensus 14 ~~rDfYelLgV~k~Asd~eIKkAYRKLALk~HPDkNpddp~A~e~F~~in~AYEVLsDp 72 (336) T KOG0713 14 AGRDFYELLGVPKNASDQEIKKAYRKLALKYHPDKNPDDPNANEKFKEINAAYEVLSDP 72 (336) T ss_pred CCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCH T ss_conf 37778998498778888999999999999858999989877889999999999885588 No 6 >PRK09430 djlA Dna-J like membrane chaperone protein; Provisional Probab=99.46 E-value=1.2e-13 Score=110.40 Aligned_cols=63 Identities=30% Similarity=0.583 Sum_probs=54.0 Q ss_pred CCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCC--CC-C----HHHHHHHHHHHHHHHHHHHCC Q ss_conf 888897699861777328899999999999999657889--97-9----879999999999999999628 Q gi|254780585|r 127 VGSMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDAN--GG-D----RGSEERFQAVIQAYKILKKSG 189 (191) Q Consensus 127 ~~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n--~g-~----~~~ee~~~~I~~Ay~~Lk~Sg 189 (191) ......+||.||||++++|.++||++||+||++||||+- .| . +.+++||++|+.||+.||+++ T Consensus 197 ~~~~l~~~Y~iLgv~~~a~~~eik~ayr~l~~~~HPDkl~~~g~~~~~~~~A~~k~~~i~~Aye~i~~~r 266 (269) T PRK09430 197 SGPTLEDAYKVLGVTESADDQEIKRAYRKLMSEHHPDKLVAKGLPPEMMEMAKEKAQEIQAAYELIKKQK 266 (269) T ss_pred CCCCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHHCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 8877667389969999979999999999999884977232079999999999999999999999999973 No 7 >KOG0712 consensus Probab=99.39 E-value=3.2e-13 Score=107.55 Aligned_cols=53 Identities=40% Similarity=0.721 Sum_probs=49.3 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 69986177732889999999999999965788997987999999999999999962 Q gi|254780585|r 133 NAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKS 188 (191) Q Consensus 133 ~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~S 188 (191) .-|+||||.++||.+|||++||+|+++||||+|.+ +.|||++|.+||++|.+. T Consensus 5 ~~y~il~v~~~As~~eikkayrkla~k~HpDkn~~---~~ekfkei~~AyevLsd~ 57 (337) T KOG0712 5 KLYDILGVSPDASEEEIKKAYRKLALKYHPDKNPD---AGEKFKEISQAYEVLSDP 57 (337) T ss_pred CCCEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCC---HHHHHHHHHHHHHHHCCH T ss_conf 62124145877578999999999988728887865---778888888899886498 No 8 >KOG0716 consensus Probab=99.39 E-value=5.7e-13 Score=105.88 Aligned_cols=57 Identities=32% Similarity=0.688 Sum_probs=53.6 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 976998617773288999999999999996578899798799999999999999996 Q gi|254780585|r 131 QFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 131 ~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~ 187 (191) ..+-|+||||+.+++.++|||+||+|+++||||++|.++++.++|++||.||.+|.+ T Consensus 30 ~~~LYdVLgl~k~at~d~IKKaYR~L~~k~HPD~~gd~P~~~dkf~eIN~Ay~ILsD 86 (279) T KOG0716 30 RLDLYDVLGLPKTATKDEIKKAYRKLALKYHPDKNGDNPEATDKFKEINTAYAILSD 86 (279) T ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 006999868985455689999999999983987688982067899999899998668 No 9 >cd06257 DnaJ DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification. Probab=99.37 E-value=9.9e-13 Score=104.33 Aligned_cols=55 Identities=40% Similarity=0.725 Sum_probs=51.7 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 6998617773288999999999999996578899798799999999999999996 Q gi|254780585|r 133 NAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 133 ~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~ 187 (191) +.|+||||++++|.++||++|++|+++||||+++++..++++|++|++||++|.+ T Consensus 1 d~Y~vLgl~~~at~~~Ik~ayr~l~~~~HPD~~~~~~~~~~~f~~i~~Ay~iL~d 55 (55) T cd06257 1 DYYDILGVPPDASDEEIKKAYRKLALKYHPDKNPDDPEAEEKFKEINEAYEVLSD 55 (55) T ss_pred CHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 9287779999939999999999999881998898817899999979999968438 No 10 >pfam00226 DnaJ DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens, although not in Prosite are confirmed as DnaJ containing domains from literature. Probab=99.37 E-value=1e-12 Score=104.22 Aligned_cols=56 Identities=43% Similarity=0.755 Sum_probs=52.6 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 69986177732889999999999999965788997987999999999999999962 Q gi|254780585|r 133 NAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKS 188 (191) Q Consensus 133 ~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~S 188 (191) +.|+||||+++++.++||++|++|++++|||+++++..++++|+.|++||++|.+. T Consensus 1 d~Y~vLgv~~~a~~~eIk~ay~~l~~~~HPDk~~~~~~~~~~f~~i~~Ay~iL~d~ 56 (63) T pfam00226 1 DYYEILGVPRDASDEEIKKAYRKLALKYHPDKNPGDPEAEEKFKEINEAYEVLSDP 56 (63) T ss_pred CHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCH T ss_conf 91877899999499999999999999869886899699999999999999996999 No 11 >smart00271 DnaJ DnaJ molecular chaperone homology domain. Probab=99.32 E-value=3.1e-12 Score=101.08 Aligned_cols=56 Identities=41% Similarity=0.748 Sum_probs=51.2 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC-HHHHHHHHHHHHHHHHHHHC Q ss_conf 699861777328899999999999999657889979-87999999999999999962 Q gi|254780585|r 133 NAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGD-RGSEERFQAVIQAYKILKKS 188 (191) Q Consensus 133 ~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~-~~~ee~~~~I~~Ay~~Lk~S 188 (191) +.|+||||++++|.++||++|++||++||||+++++ ..++++|++|++||++|.+. T Consensus 2 d~Y~iLgl~~~a~~~~Ik~aYr~l~~~~HPD~~~~~~~~~~~~f~~i~~Ay~~L~d~ 58 (60) T smart00271 2 DYYEILGVPRDASLDEIKKAYRKLALKYHPDKNPGDKEEAEEKFKEINEAYEVLSDP 58 (60) T ss_pred CCHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 726656989994999999999999998697889996098999999999999987788 No 12 >KOG0715 consensus Probab=99.32 E-value=2.1e-12 Score=102.15 Aligned_cols=55 Identities=29% Similarity=0.647 Sum_probs=51.5 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 69986177732889999999999999965788997987999999999999999962 Q gi|254780585|r 133 NAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKS 188 (191) Q Consensus 133 ~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~S 188 (191) +.|+||||+.+++..|||++|++|+|+||||.|.++ +++++|++|.+||++|... T Consensus 44 d~Y~vLgv~~~At~~EIK~Af~~LaKkyHPD~n~~~-~a~~kF~eI~~AYEiLsd~ 98 (288) T KOG0715 44 DYYKVLGVSRNATLSEIKSAFRKLAKKYHPDVNKDK-EASKKFKEISEAYEILSDE 98 (288) T ss_pred CCHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC-CHHHHHHHHHHHHHHHCCH T ss_conf 603300518888888999999999886399878874-2116678899999985348 No 13 >KOG0691 consensus Probab=99.29 E-value=7.4e-12 Score=98.63 Aligned_cols=59 Identities=31% Similarity=0.616 Sum_probs=55.2 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 97699861777328899999999999999657889979879999999999999999628 Q gi|254780585|r 131 QFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKSG 189 (191) Q Consensus 131 ~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~Sg 189 (191) ..+-|++||++++++..+|+++|+.++++||||+|.||+.+.++|+.+.+||++|-... T Consensus 4 ~~dyY~lLgi~~~at~~eIkKaYr~kaL~~HPDKNp~dP~A~ekFq~L~eAy~VL~D~~ 62 (296) T KOG0691 4 DTDYYDLLGISEDATDAEIKKAYRKKALQYHPDKNPGDPQAAEKFQELSEAYEVLSDEE 62 (296) T ss_pred CCHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCHH T ss_conf 20189983888788889999999988885398779998589999999999999864798 No 14 >KOG0717 consensus Probab=99.27 E-value=1.2e-11 Score=97.28 Aligned_cols=60 Identities=28% Similarity=0.493 Sum_probs=52.5 Q ss_pred CHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCC-CHHHHHHHHHHHHHHHHHHH Q ss_conf 8889769986177732889999999999999965788997-98799999999999999996 Q gi|254780585|r 128 GSMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGG-DRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 128 ~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g-~~~~ee~~~~I~~Ay~~Lk~ 187 (191) +...+-.|+||||..+++..+||+.||+|+++||||+|+. -..+.++|+.|+.||++|-. T Consensus 4 ~~~~~c~YE~L~v~~~a~d~eik~~YRklALq~HPDknpd~ieeat~~F~~i~aAYeVLSd 64 (508) T KOG0717 4 PFKKRCYYEVLGVERDADDDEIKKNYRKLALQYHPDKNPDRIEEATQQFQLIQAAYEVLSD 64 (508) T ss_pred CHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCC T ss_conf 3023577888602146887899999999888628887986389999999999999998568 No 15 >COG2214 CbpA DnaJ-class molecular chaperone [Posttranslational modification, protein turnover, chaperones] Probab=99.14 E-value=1.1e-10 Score=90.83 Aligned_cols=59 Identities=41% Similarity=0.684 Sum_probs=54.0 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHH-HHHHHHHHHHHHHHHHHC Q ss_conf 89769986177732889999999999999965788997987-999999999999999962 Q gi|254780585|r 130 MQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRG-SEERFQAVIQAYKILKKS 188 (191) Q Consensus 130 ~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~-~ee~~~~I~~Ay~~Lk~S 188 (191) .....|.||||.++++..+|+++|++|+++||||+|.++.. ++++|+.|++||.+|... T Consensus 4 ~~~~~y~iLgv~~~as~~eik~ayrkla~~~HPD~~~~~~~~a~~~f~~i~~Ay~vLsd~ 63 (237) T COG2214 4 DLLDYYEILGVPPNASLEEIKKAYRKLALKYHPDRNPGDPKVAEEKFKEINEAYEILSDP 63 (237) T ss_pred CCCCHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCH T ss_conf 233489981998888899999999999999786878875367999999998899984177 No 16 >KOG0721 consensus Probab=99.14 E-value=6.4e-11 Score=92.48 Aligned_cols=61 Identities=38% Similarity=0.631 Sum_probs=53.5 Q ss_pred CCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 8888976998617773288999999999999996578899798799999999999999996 Q gi|254780585|r 127 VGSMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 127 ~~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~ 187 (191) ......++|+||||++.++..|||++||.|.++||||++......|+.+..|+.||+.|-. T Consensus 94 ~~~~~fDPyEILGl~pgas~~eIKkaYR~LSik~HPDK~~~~~~~e~~~~~I~KAY~aLTD 154 (230) T KOG0721 94 RERQKFDPYEILGLDPGASEKEIKKAYRRLSIKYHPDKQPPEEGDEEFFEAIAKAYQALTD 154 (230) T ss_pred HHHHCCCCHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 8760589077618899988899999999866451877689862219999999999998514 No 17 >KOG0719 consensus Probab=98.97 E-value=9.3e-10 Score=84.89 Aligned_cols=57 Identities=35% Similarity=0.615 Sum_probs=51.3 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCC--CCCHHHHHHHHHHHHHHHHHH Q ss_conf 897699861777328899999999999999657889--979879999999999999999 Q gi|254780585|r 130 MQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDAN--GGDRGSEERFQAVIQAYKILK 186 (191) Q Consensus 130 ~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n--~g~~~~ee~~~~I~~Ay~~Lk 186 (191) ...+.|.||||..+++..+||++|++|+++||||++ -.+..+.++|+.++.||.+|- T Consensus 12 ~~~d~YevLGVer~a~~~eIrkAY~klal~~HPDk~~eed~~ea~~kFq~l~k~y~iLs 70 (264) T KOG0719 12 NKKDLYEVLGVERDATDKEIRKAYHKLALRLHPDKNHEEDKVEATEKFQQLQKAYQILS 70 (264) T ss_pred CCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 44577998621046878999999999988728876606667788999999999999862 No 18 >KOG0718 consensus Probab=98.93 E-value=2.8e-09 Score=81.72 Aligned_cols=59 Identities=31% Similarity=0.608 Sum_probs=51.3 Q ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCC---CHHHHHHHHHHHHHHHHHHH Q ss_conf 889769986177732889999999999999965788997---98799999999999999996 Q gi|254780585|r 129 SMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGG---DRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 129 ~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g---~~~~ee~~~~I~~Ay~~Lk~ 187 (191) ..+.+-|.+|||+.+||.++|+++||++.+-||||+... .+.||+.|+.|.+||++|-. T Consensus 6 ~~e~e~Ya~LNlpkdAt~eeI~~AYrr~~~lfHPDkh~dpd~K~~AE~~F~~i~~AyEVLsD 67 (546) T KOG0718 6 LDEIELYALLNLPKDATDEEIKKAYRRLSRLFHPDKHTDPDQKKAAEEKFQRIQRAYEVLSD 67 (546) T ss_pred CCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 41466899858995547799999999998710975457846788999999999999998558 No 19 >KOG0722 consensus Probab=98.84 E-value=2e-09 Score=82.68 Aligned_cols=61 Identities=30% Similarity=0.576 Sum_probs=53.8 Q ss_pred CCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 88889769986177732889999999999999965788997987999999999999999962 Q gi|254780585|r 127 VGSMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKS 188 (191) Q Consensus 127 ~~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~S 188 (191) +-...+++|++|||...++..+|.++|+.|+++||||++.+.++ .+.|+.|-.||.+|+.+ T Consensus 28 LYCG~enCYdVLgV~Rea~KseIakAYRqLARrhHPDr~r~~e~-k~~F~~iAtayeilkd~ 88 (329) T KOG0722 28 LYCGAENCYDVLGVAREANKSEIAKAYRQLARRHHPDRNRDPES-KKLFVKIATAYEILKDN 88 (329) T ss_pred HCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCHH-HHHHHHHHCCCCCCCCH T ss_conf 10460648999611132167899999999998739765699224-33564220132326650 No 20 >KOG0720 consensus Probab=98.82 E-value=3.6e-09 Score=81.04 Aligned_cols=57 Identities=30% Similarity=0.491 Sum_probs=53.5 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9769986177732889999999999999965788997987999999999999999962 Q gi|254780585|r 131 QFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKS 188 (191) Q Consensus 131 ~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~S 188 (191) .++||.+|||..+++.++||+.||+|+.-.|||+|. ...|+|-|+.+..||++|-.+ T Consensus 234 ~~daYsvlGl~~d~sd~~lKk~Yrk~A~LVhPDKn~-~~~A~Eafk~Lq~Afevig~~ 290 (490) T KOG0720 234 ILDAYSALGLPSDCSDADLKKNYRKKAMLVHPDKNM-IPRAEEAFKKLQVAFEVIGDS 290 (490) T ss_pred CCCCHHHCCCCCCCCHHHHHHHHHHHCEEECCCCCC-CHHHHHHHHHHHHHHHHHCCH T ss_conf 787243328888878999999877530575787667-764899999999999996354 No 21 >KOG0550 consensus Probab=98.81 E-value=6.9e-09 Score=79.16 Aligned_cols=56 Identities=38% Similarity=0.695 Sum_probs=51.4 Q ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC-HHHHHHHHHHHHHHHHHHH Q ss_conf 7699861777328899999999999999657889979-8799999999999999996 Q gi|254780585|r 132 FNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGD-RGSEERFQAVIQAYKILKK 187 (191) Q Consensus 132 ~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~-~~~ee~~~~I~~Ay~~Lk~ 187 (191) .+-|.|||+..+++.++|+++|++|++.||||+|.|+ .++|-+|++|-+||.+|.+ T Consensus 373 kd~ykilGi~~~as~~eikkayrk~AL~~Hpd~~agsq~eaE~kFkevgeAy~il~d 429 (486) T KOG0550 373 KDWYKILGISRNASDDEIKKAYRKLALVHHPDKNAGSQKEAEAKFKEVGEAYTILSD 429 (486) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 107988614411330125567787788737886742267789889888888887048 No 22 >PTZ00100 DnaJ chaperone protein; Provisional Probab=98.81 E-value=1.1e-08 Score=77.86 Aligned_cols=59 Identities=29% Similarity=0.333 Sum_probs=50.5 Q ss_pred CCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 158888976998617773288999999999999996578899798799999999999999996 Q gi|254780585|r 125 HRVGSMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 125 ~~~~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~ 187 (191) ...+....+||.||||+++++.++|+++||+|.++.|||+ ||... -..+||+|.++|.+ T Consensus 56 Fe~~Mt~~EA~~ILGl~~~as~~~I~~ahrkLM~knHPD~-GGS~y---lA~KINeAKd~LlK 114 (114) T PTZ00100 56 FEHKMSKSEACKILNIPPTANKERIREVHKQLMLRNHPDN-GGSTY---LASKVNEAKDFLLK 114 (114) T ss_pred CCCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCC-CCCHH---HHHHHHHHHHHHHC T ss_conf 7898599999998199999999999999999999709988-98289---99999999999749 No 23 >KOG0714 consensus Probab=98.57 E-value=8.6e-08 Score=72.00 Aligned_cols=57 Identities=32% Similarity=0.608 Sum_probs=49.8 Q ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC-HHHHHHHHHHHHHHHHHHHC Q ss_conf 7699861777328899999999999999657889979-87999999999999999962 Q gi|254780585|r 132 FNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGD-RGSEERFQAVIQAYKILKKS 188 (191) Q Consensus 132 ~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~-~~~ee~~~~I~~Ay~~Lk~S 188 (191) .++|+||+|..+++..+|+++|+.|+++||||+|.+. ..++.+|++|.+||++|... T Consensus 3 ~d~~~~l~i~~~as~~~i~ka~~~~a~~~hpdk~~~~~~~~~~~~~~~~ea~~~ls~~ 60 (306) T KOG0714 3 KDYYKILGIARSASEEDIKKAYRKLALKYHPDKNPSPKEVAEAKFKEIAEAYEVLSDP 60 (306) T ss_pred CCHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCH T ss_conf 4178883855111667888887888873389999733677888655555444431364 No 24 >COG5407 SEC63 Preprotein translocase subunit Sec63 [Intracellular trafficking and secretion] Probab=98.57 E-value=6.5e-08 Score=72.80 Aligned_cols=58 Identities=34% Similarity=0.641 Sum_probs=50.1 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCC-----CHHHHHHHHHHHHHHHHHHH Q ss_conf 89769986177732889999999999999965788997-----98799999999999999996 Q gi|254780585|r 130 MQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGG-----DRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 130 ~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g-----~~~~ee~~~~I~~Ay~~Lk~ 187 (191) ..-++|+|||++.+++..+||++||.|..++|||+... ....||+.+.|+.||..|-+ T Consensus 96 ~~fDPyEILGI~~~ts~rdik~~yr~Ls~KfhpdK~~~mvn~~rse~Ee~y~~ItkAY~~lTd 158 (610) T COG5407 96 RGFDPYEILGIDQDTSERDIKKRYRMLSMKFHPDKAPPMVNELRSEYEEKYKTITKAYGLLTD 158 (610) T ss_pred CCCCHHHHHCCCCCCCHHHHHHHHHHHEEECCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 088818861246787577899999866240173213777715789999999999999876301 No 25 >KOG1150 consensus Probab=98.52 E-value=1.5e-07 Score=70.37 Aligned_cols=57 Identities=35% Similarity=0.640 Sum_probs=51.9 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC-HHHHHHHHHHHHHHHHHHH Q ss_conf 97699861777328899999999999999657889979-8799999999999999996 Q gi|254780585|r 131 QFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGD-RGSEERFQAVIQAYKILKK 187 (191) Q Consensus 131 ~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~-~~~ee~~~~I~~Ay~~Lk~ 187 (191) -.++|++|.+.|.++.++||++||+|..-.|||+|..| ..|...|-.|..||.+|-+ T Consensus 52 nLNpfeVLqIdpev~~edikkryRklSilVHPDKN~Dd~~rAqkAFdivkKA~k~l~n 109 (250) T KOG1150 52 NLNPFEVLQIDPEVTDEDIKKRYRKLSILVHPDKNPDDAERAQKAFDIVKKAYKLLEN 109 (250) T ss_pred CCCHHHHHHCCCCCCHHHHHHHHHHHHEEECCCCCCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 5674787734877878999999875102336888954378899999999999999757 No 26 >KOG0568 consensus Probab=98.46 E-value=5.2e-07 Score=66.91 Aligned_cols=60 Identities=28% Similarity=0.631 Sum_probs=52.2 Q ss_pred CCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH-HHH Q ss_conf 5888897699861777328899999999999999657889979879999999999999-999 Q gi|254780585|r 126 RVGSMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYK-ILK 186 (191) Q Consensus 126 ~~~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~-~Lk 186 (191) +......+.|.+|||...++.++||.+|..|||++|||. |......++|+.|-+||. +|. T Consensus 41 kske~~~e~fril~v~e~~~adevr~af~~lakq~hpds-gs~~adaa~f~qideafrkvlq 101 (342) T KOG0568 41 KSKEKIMECFRILGVEEGADADEVREAFHDLAKQVHPDS-GSEEADAARFIQIDEAFRKVLQ 101 (342) T ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCC-CCCCCCHHHHHHHHHHHHHHHH T ss_conf 629999999998362026764579999999999709998-9753008999999999999999 No 27 >KOG0624 consensus Probab=98.40 E-value=4.6e-07 Score=67.25 Aligned_cols=60 Identities=28% Similarity=0.505 Sum_probs=51.7 Q ss_pred CHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC---HHHHHHHHHHHHHHHHHHH Q ss_conf 88897699861777328899999999999999657889979---8799999999999999996 Q gi|254780585|r 128 GSMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGD---RGSEERFQAVIQAYKILKK 187 (191) Q Consensus 128 ~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~---~~~ee~~~~I~~Ay~~Lk~ 187 (191) -+..++-|+||||..+++..+|-|+||+|+.+||||...+. +.+|.+|..|-.|-++|.+ T Consensus 390 qs~kRDYYKILGVkRnAsKqEI~KAYRKlAqkWHPDNFqdEeEKKkAEKKFIDIAAAKEVLsd 452 (504) T KOG0624 390 QSGKRDYYKILGVKRNASKQEITKAYRKLAQKWHPDNFQDEEEKKKAEKKFIDIAAAKEVLSD 452 (504) T ss_pred HHCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 755512788842003542778999999999835986444778887788756239988876269 No 28 >PRK01356 hscB co-chaperone HscB; Provisional Probab=98.36 E-value=1.1e-06 Score=64.68 Aligned_cols=55 Identities=27% Similarity=0.406 Sum_probs=45.4 Q ss_pred HHHHHHCCCCC--CCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHH---HHHHHHHHHHHH Q ss_conf 69986177732--88999999999999996578899798799999---999999999996 Q gi|254780585|r 133 NAFEILGLLSD--SSPEEIRGRYKDLVKKHHPDANGGDRGSEERF---QAVIQAYKILKK 187 (191) Q Consensus 133 ~Al~~Lgl~~~--~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~---~~I~~Ay~~Lk~ 187 (191) +-+.+|||++. .+...++++|++|-+++|||+..+....+..+ ..||.||++||+ T Consensus 3 NyFelf~l~~~f~iD~~~L~~~y~~LQ~~~HPDk~~~~~e~~~~~~~Ss~IN~AY~tLk~ 62 (166) T PRK01356 3 NYFQLLGLPQTYNIDLKILEKQYFAMQVKYHPDKAKTLQEKEQNLIIASELNKAYSTLKD 62 (166) T ss_pred CHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 789883999646779999999999999987978678817999999999999999999789 No 29 >KOG1789 consensus Probab=98.34 E-value=1.4e-06 Score=63.99 Aligned_cols=58 Identities=36% Similarity=0.450 Sum_probs=47.4 Q ss_pred CCHHHHHHHHHHCCCCC----CCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 88889769986177732----88999999999999996578899798799999999999999996 Q gi|254780585|r 127 VGSMQFNAFEILGLLSD----SSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 127 ~~~~~~~Al~~Lgl~~~----~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~ 187 (191) ....+-+|++||+|+-+ ...+.||++|++|+.+||||+|..-+ |+|-.||.||+.|.. T Consensus 1276 ~~mS~d~A~eiL~i~l~n~~hD~~~KirrqY~kLA~kYHPDKNPEGR---emFe~VnKAYE~L~~ 1337 (2235) T KOG1789 1276 ATMSVDLAREILSVDLTNEEHDKPAKIRRQYYKLAAKYHPDKNPEGR---EMFERVNKAYELLSS 1337 (2235) T ss_pred CCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHH---HHHHHHHHHHHHHHH T ss_conf 75665666777106467888765899999999999972999992478---999999999999888 No 30 >PRK05014 hscB co-chaperone HscB; Provisional Probab=98.28 E-value=1.6e-06 Score=63.62 Aligned_cols=56 Identities=27% Similarity=0.468 Sum_probs=45.7 Q ss_pred HHHHHHCCCCC--CCHHHHHHHHHHHHHHHCCCCCCCCHHHH-----HHHHHHHHHHHHHHHC Q ss_conf 69986177732--88999999999999996578899798799-----9999999999999962 Q gi|254780585|r 133 NAFEILGLLSD--SSPEEIRGRYKDLVKKHHPDANGGDRGSE-----ERFQAVIQAYKILKKS 188 (191) Q Consensus 133 ~Al~~Lgl~~~--~~~~~ik~~yk~lvk~~HPD~n~g~~~~e-----e~~~~I~~Ay~~Lk~S 188 (191) +.+++|||++. .+...+.++|++|.+++|||+..+....| ..-..||.||++|++. T Consensus 2 nyF~lf~l~~~f~iD~~~L~~~y~~Lq~~~HPDkf~~~~~~e~~~a~~~ss~iN~AY~tLk~p 64 (171) T PRK05014 2 NYFTLFGLPARYDIDTQLLSLRYQELQRQVHPDKFANASEREQLAAVQQAAQINDAYQTLKHP 64 (171) T ss_pred CHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCH T ss_conf 777666999878879999999999999986977405898999999999999999999997488 No 31 >PRK00294 hscB co-chaperone HscB; Provisional Probab=98.25 E-value=2.5e-06 Score=62.40 Aligned_cols=56 Identities=27% Similarity=0.477 Sum_probs=44.8 Q ss_pred HHHHHHCCCC--CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHH-----HHHHHHHHHHHHHC Q ss_conf 6998617773--28899999999999999657889979879999-----99999999999962 Q gi|254780585|r 133 NAFEILGLLS--DSSPEEIRGRYKDLVKKHHPDANGGDRGSEER-----FQAVIQAYKILKKS 188 (191) Q Consensus 133 ~Al~~Lgl~~--~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~-----~~~I~~Ay~~Lk~S 188 (191) +.+.+|||++ +.+...+.++|+.|.+++|||+..+....|.+ -..||.||++|++. T Consensus 5 ~hF~lf~l~~~f~iD~~~L~~~y~~LQ~~~HPDrf~~~s~~e~~~a~~~sa~iN~AY~tLk~p 67 (173) T PRK00294 5 CHFALFDLQPSFRLDLDQLATRYRELAREVHPDRFADAPEREQRLALERSAALNEAYQTLKSA 67 (173) T ss_pred CHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCH T ss_conf 664644999767679999999999999986957367989999999999999999999997599 No 32 >PRK01773 hscB co-chaperone HscB; Provisional Probab=98.22 E-value=3.5e-06 Score=61.45 Aligned_cols=56 Identities=27% Similarity=0.373 Sum_probs=45.0 Q ss_pred HHHHHHCCCC--CCCHHHHHHHHHHHHHHHCCCCCCCCHHH-----HHHHHHHHHHHHHHHHC Q ss_conf 6998617773--28899999999999999657889979879-----99999999999999962 Q gi|254780585|r 133 NAFEILGLLS--DSSPEEIRGRYKDLVKKHHPDANGGDRGS-----EERFQAVIQAYKILKKS 188 (191) Q Consensus 133 ~Al~~Lgl~~--~~~~~~ik~~yk~lvk~~HPD~n~g~~~~-----ee~~~~I~~Ay~~Lk~S 188 (191) +.+++|||++ +.+...++++|++|-+++|||+..+.... .+.-..||.||.+||+. T Consensus 3 NyFelf~L~~~f~ID~~~L~~~y~~LQ~~~HPDkf~~~s~~Ek~~s~~~Ss~iN~AY~tLkdp 65 (173) T PRK01773 3 NPFALFDLPVDFQLDQALLSERYLALQKSLHPDNFANSSAQEQRLAMQKSAEVNDALQILKDP 65 (173) T ss_pred CHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCH T ss_conf 837762999777679999999999999986968566899999999999999999999997488 No 33 >PRK03578 hscB co-chaperone HscB; Provisional Probab=98.21 E-value=3.8e-06 Score=61.23 Aligned_cols=56 Identities=18% Similarity=0.343 Sum_probs=45.5 Q ss_pred HHHHHHHCCCC--CCCHHHHHHHHHHHHHHHCCCCCCCCHH-----HHHHHHHHHHHHHHHHH Q ss_conf 76998617773--2889999999999999965788997987-----99999999999999996 Q gi|254780585|r 132 FNAFEILGLLS--DSSPEEIRGRYKDLVKKHHPDANGGDRG-----SEERFQAVIQAYKILKK 187 (191) Q Consensus 132 ~~Al~~Lgl~~--~~~~~~ik~~yk~lvk~~HPD~n~g~~~-----~ee~~~~I~~Ay~~Lk~ 187 (191) .+.+.+|||++ +.+...++++|+.|.+++|||+..+... +...-..||.||++|++ T Consensus 6 ~nyF~lf~lp~~f~iD~~~L~~~y~~LQ~~~HPDkf~~~s~~e~~~a~~~sa~iN~AY~tLk~ 68 (176) T PRK03578 6 DDHFSLFGLPARFALDEAALDAAYRTVQAQVHPDRFAAAGDAEKRVAMQWATRANEAYRTLRD 68 (176) T ss_pred CCHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 567888299976777999999999999998796757788999999999999999999999759 No 34 >KOG0723 consensus Probab=98.02 E-value=1.8e-05 Score=56.87 Aligned_cols=59 Identities=20% Similarity=0.322 Sum_probs=50.9 Q ss_pred CHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 888976998617773288999999999999996578899798799999999999999996289 Q gi|254780585|r 128 GSMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKSGF 190 (191) Q Consensus 128 ~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~Sg~ 190 (191) ....++|..||||.+.++.+.||.+.|++....|||+ ||.+--.. +||+|+++|-.++. T Consensus 52 kMsr~EA~lIL~v~~s~~k~KikeaHrriM~~NHPD~-GGSPYlAs---KINEAKdlLe~~~~ 110 (112) T KOG0723 52 KMSRREAALILGVTPSLDKDKIKEAHRRIMLANHPDR-GGSPYLAS---KINEAKDLLEGTSL 110 (112) T ss_pred CCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCC-CCCHHHHH---HHHHHHHHHHCCCC T ss_conf 2156889988278754568899999999998079867-99887999---98899999844346 No 35 >COG5269 ZUO1 Ribosome-associated chaperone zuotin [Translation, ribosomal structure and biogenesis / Posttranslational modification, protein turnover, chaperones] Probab=97.48 E-value=0.00024 Score=49.50 Aligned_cols=58 Identities=31% Similarity=0.563 Sum_probs=49.6 Q ss_pred HHHHHHHHCCCC---CCCHHHHHHHHHHHHHHHCCCCC--CCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 976998617773---28899999999999999657889--97987999999999999999962 Q gi|254780585|r 131 QFNAFEILGLLS---DSSPEEIRGRYKDLVKKHHPDAN--GGDRGSEERFQAVIQAYKILKKS 188 (191) Q Consensus 131 ~~~Al~~Lgl~~---~~~~~~ik~~yk~lvk~~HPD~n--~g~~~~ee~~~~I~~Ay~~Lk~S 188 (191) ..+-|.+|||+- -++..+|.++-++.|.+||||+. ||+....+-|+.|..||++|-.+ T Consensus 42 ~~DlYa~lgLskyR~ka~~~qi~kah~kkv~kyHPDk~aa~g~~~~d~fFk~iqkA~evL~D~ 104 (379) T COG5269 42 KVDLYALLGLSKYRTKAIPPQILKAHKKKVYKYHPDKTAAGGNKGCDEFFKLIQKAREVLGDR 104 (379) T ss_pred HHHHHHHHCHHHHHCCCCCHHHHHHHHHHHHHHCCCCHHCCCCCCCHHHHHHHHHHHHHHCCH T ss_conf 566999854276525789589999999999873865021058877289999999999985158 No 36 >COG1076 DjlA DnaJ-domain-containing proteins 1 [Posttranslational modification, protein turnover, chaperones] Probab=97.35 E-value=0.00015 Score=50.85 Aligned_cols=54 Identities=26% Similarity=0.538 Sum_probs=47.2 Q ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCC--CC-----HHHHHHHHHHHHHHHHH Q ss_conf 76998617773288999999999999996578899--79-----87999999999999999 Q gi|254780585|r 132 FNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANG--GD-----RGSEERFQAVIQAYKIL 185 (191) Q Consensus 132 ~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~--g~-----~~~ee~~~~I~~Ay~~L 185 (191) .++|.+|++....+..+|+++|+.|+.++|||+-. |. ..+++++++|++||..+ T Consensus 113 ~~~l~~l~~~~~~~~~~i~~~~r~l~~e~~~d~a~~~~~~~e~~~~~~~~~~~~~~a~~~~ 173 (174) T COG1076 113 EDALKVLGVEIKADQDAIKKAYRKLLSEQHPDKAAAKGLKLEFIEKLKEKLQEIQEAYEDI 173 (174) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 5799984684023099999999999874499999863783999999999999999998744 No 37 >TIGR02349 DnaJ_bact chaperone protein DnaJ; InterPro: IPR012724 Molecular chaperones are a diverse family of proteins that function to protect proteins in the intracellular milieu from irreversible aggregation during synthesis and in times of cellular stress. The bacterial molecular chaperone DnaK is an enzyme that couples cycles of ATP binding, hydrolysis, and ADP release by an N-terminal ATP-hydrolizing domain to cycles of sequestration and release of unfolded proteins by a C-terminal substrate binding domain. Dimeric GrpE is the co-chaperone for DnaK, and acts as a nucleotide exchange factor, stimulating the rate of ADP release 5000-fold . DnaK is itself a weak ATPase; ATP hydrolysis by DnaK is stimulated by its interaction with another co-chaperone, DnaJ. Thus the co-chaperones DnaJ and GrpE are capable of tightly regulating the nucleotide-bound and substrate-bound state of DnaK in ways that are necessary for the normal housekeeping functions and stress-related functions of the DnaK molecular chaperone cycle. Besides stimulating the ATPase activity of DnaK through its J-domain, DnaJ also associates with unfolded polypeptide chains and prevents their aggregation . Thus, DnaK and DnaJ may bind to one and the same polypeptide chain to form a ternary complex. The formation of a ternary complex may result in cis-interaction of the J-domain of DnaJ with the ATPase domain of DnaK. An unfolded polypeptide may enter the chaperone cycle by associating first either with ATP-liganded DnaK or with DnaJ. DnaK interacts with both the backbone and side chains of a peptide substrate; it thus shows binding polarity and admits only L-peptide segments. In contrast, DnaJ has been shown to bind both L- and D-peptides and is assumed to interact only with the side chains of the substrate. DnaJ comprises a 70-residue N-terminal domain (the J-domain); a 30-residue glycine-rich region (the G-domain); a central domain containing 4 repeats of a CxxCxGxG motif (the CRR-domain); and a 120-170 residue C-terminal region. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins , either together or separately. This entry mostly consists of bacterial forms of DnaJ. The three components of the DnaK-DnaJ-GrpE system are typically encoded by consecutive genes. DnaJ homologues occur in many genomes, typically not encoded near DnaK and GrpE-like genes. Most such proteins are not included in this family, though some chloroplast homologues are included; GO: 0005524 ATP binding, 0051082 unfolded protein binding, 0006457 protein folding, 0009408 response to heat. Probab=96.50 E-value=0.00094 Score=45.57 Aligned_cols=42 Identities=21% Similarity=0.372 Sum_probs=29.8 Q ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 88976998617773288999999999999996578899798799999999999999996 Q gi|254780585|r 129 SMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKK 187 (191) Q Consensus 129 ~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~ 187 (191) ...++||+-|-...=-|. ||+ .+++||||||+|++||+||-+ T Consensus 15 ~EIKKAYRkLA~kYHPD~--------------Nkg---n~~~AEeKFKEi~EAYeVLSD 56 (386) T TIGR02349 15 EEIKKAYRKLAKKYHPDR--------------NKG---NDKEAEEKFKEINEAYEVLSD 56 (386) T ss_pred HHHHHHHHHHHHHCCCCC--------------CCC---CCCCHHHHHHHHHHHHHCCCC T ss_conf 999999998888517888--------------888---883756765788877530689 No 38 >KOG0431 consensus Probab=96.35 E-value=0.0049 Score=40.89 Aligned_cols=51 Identities=22% Similarity=0.396 Sum_probs=38.2 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC-HH------HHHHHHHHHHHHHHHHHC Q ss_conf 1777328899999999999999657889979-87------999999999999999962 Q gi|254780585|r 138 LGLLSDSSPEEIRGRYKDLVKKHHPDANGGD-RG------SEERFQAVIQAYKILKKS 188 (191) Q Consensus 138 Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~-~~------~ee~~~~I~~Ay~~Lk~S 188 (191) .+|..-.+...||++|++-|+..|||+..+. .+ +|+-|-...+||+.-... T Consensus 394 VsltDLVtp~~VKKaYrKA~L~VHPDKlqq~gas~~qK~Iaekvfd~l~eawn~f~~~ 451 (453) T KOG0431 394 VSLTDLVTPAQVKKAYRKAVLCVHPDKLQQKGASLEQKYIAEKVFDALSEAWNKFNQQ 451 (453) T ss_pred CCHHHCCCHHHHHHHHHHHHHEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 7566412789999998864411276423688654999999999999999999862214 No 39 >COG1076 DjlA DnaJ-domain-containing proteins 1 [Posttranslational modification, protein turnover, chaperones] Probab=93.59 E-value=0.095 Score=32.47 Aligned_cols=51 Identities=25% Similarity=0.421 Sum_probs=37.1 Q ss_pred HCCCCC--CCHHHHHHHHHHHHHHHCCCCCCCCHHH-----HHHHHHHHHHHHHHHHC Q ss_conf 177732--8899999999999999657889979879-----99999999999999962 Q gi|254780585|r 138 LGLLSD--SSPEEIRGRYKDLVKKHHPDANGGDRGS-----EERFQAVIQAYKILKKS 188 (191) Q Consensus 138 Lgl~~~--~~~~~ik~~yk~lvk~~HPD~n~g~~~~-----ee~~~~I~~Ay~~Lk~S 188 (191) .|+... ...+.++..|+.+.+.+|||..+..... -+++..++.||.+|+.. T Consensus 7 ~~~~~~f~~~~~~l~~~~~~~~~~~~~dr~~~~~~~~~~~~l~~~~~~~~a~~tLk~~ 64 (174) T COG1076 7 FGLPRAFQIDLDALKLQYRELQRAYHPDRFGKASEAEQRKALQQSAEVNPAYQTLKDP 64 (174) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHH T ss_conf 0327889998888677799999985831235744699999999999857757889889 No 40 >pfam03656 Pam16 Pam16. The Pam16 protein is the fifth essential subunit of the pre-sequence translocase-associated protein import motor (PAM). In Saccharomyces cerevisiae, Pam16 is required for preprotein translocation into the matrix, but not for protein insertion into the inner membrane. Pam16 has a degenerate J domain. J-domain proteins play important regulatory roles as co-chaperones, recruiting Hsp70 partners and accelerating the ATP-hydrolysis step of the chaperone cycle. Pam16's J-like domain strongly interacts with Pam18's J domain, leading to a productive interaction of Pam18 with mtHsp70 at the mitochondria import channel. Pam18 stimulates the ATPase activity of mtHsp70. Probab=92.60 E-value=0.55 Score=27.49 Aligned_cols=41 Identities=27% Similarity=0.388 Sum_probs=36.0 Q ss_pred CCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 888897699861777328899999999999999657889979 Q gi|254780585|r 127 VGSMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGD 168 (191) Q Consensus 127 ~~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~ 168 (191) .+....+|..||+|+.+.+.++|-++|..|....-|.+ ||. T Consensus 53 ~gmtldEA~qILNV~~~~~~EeI~k~y~~LF~~Ndk~k-GGS 93 (127) T pfam03656 53 GGISLDESCQILNVEEDLNMEEVNKRYEHLFEVNDKSK-GGS 93 (127) T ss_pred CCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCC-CCC T ss_conf 78489999988099999999999999999986379888-986 No 41 >KOG3192 consensus Probab=92.29 E-value=0.32 Score=28.99 Aligned_cols=56 Identities=34% Similarity=0.637 Sum_probs=39.3 Q ss_pred HHHHHHHCCC--CCCCHHHHHHHHHHHHHHHCCCCCC----CCH-HHHHHHHHHHHHHHHHHH Q ss_conf 7699861777--3288999999999999996578899----798-799999999999999996 Q gi|254780585|r 132 FNAFEILGLL--SDSSPEEIRGRYKDLVKKHHPDANG----GDR-GSEERFQAVIQAYKILKK 187 (191) Q Consensus 132 ~~Al~~Lgl~--~~~~~~~ik~~yk~lvk~~HPD~n~----g~~-~~ee~~~~I~~Ay~~Lk~ 187 (191) .+-+.++|.. ...+..-+.-.|..-.+++|||..+ |++ .+.|.-.++|+||++|++ T Consensus 8 ~~ff~~Fg~e~~~~~~p~~l~~~~~~~skkL~~d~~~~~~~~~~d~a~eqSa~lnkAY~TLk~ 70 (168) T KOG3192 8 SRFFDIFGMELSFKIDPDKLKEKYTDISKKLHPDRPGLSFAGDTDQASEQSAELNKAYDTLKD 70 (168) T ss_pred HHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 889999666657789961143788889986271123662224512678878999999999875 No 42 >pfam00790 VHS VHS domain. Domain present in VPS-27, Hrs and STAM. Probab=77.32 E-value=4.4 Score=21.59 Aligned_cols=49 Identities=16% Similarity=0.370 Sum_probs=36.7 Q ss_pred HHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 861777328899999999999999657889979879999999999999999628 Q gi|254780585|r 136 EILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKSG 189 (191) Q Consensus 136 ~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~Sg 189 (191) +++.-........||.+-.+|+..|.=+ ..++++ +..|..+|+.||..| T Consensus 88 kli~~~~~~~~~~Vk~kil~li~~Wa~~-Fk~~p~----l~~i~~~y~~Lk~~G 136 (136) T pfam00790 88 KLIPNKPYGKPAKVKKKILELIQEWADA-FKNDPD----LKYIRDMYKLLKRKG 136 (136) T ss_pred HHHCCCCCCCHHHHHHHHHHHHHHHHHH-HCCCCC----CHHHHHHHHHHHHCC T ss_conf 9973788997289999999999999998-667966----569999999998772 No 43 >KOG3442 consensus Probab=71.78 E-value=6.2 Score=20.62 Aligned_cols=52 Identities=25% Similarity=0.391 Sum_probs=40.0 Q ss_pred CHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC-----H--HHHHHHHHHHH Q ss_conf 88897699861777328899999999999999657889979-----8--79999999999 Q gi|254780585|r 128 GSMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGD-----R--GSEERFQAVIQ 180 (191) Q Consensus 128 ~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~-----~--~~ee~~~~I~~ 180 (191) .....+|..||+|+++.+.++|-++|-.|....-+-+ ||. + -+.|++..... T Consensus 55 ~iTlqEa~qILnV~~~ln~eei~k~yehLFevNdksk-GGSFYLQSKVfRAkErld~El~ 113 (132) T KOG3442 55 KITLQEAQQILNVKEPLNREEIEKRYEHLFEVNDKSK-GGSFYLQSKVFRAKERLDEELK 113 (132) T ss_pred CCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCC-CCCEEEHHHHHHHHHHHHHHHH T ss_conf 6119998657188998899999999999985367455-8604403788799999999998 No 44 >cd03569 VHS_Hrs_Vps27p VHS domain family, Hrs and Vps27p subfamily; composed of Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and its yeast homolog Vps27p (vacuolar protein sorting). The VHS domain, an essential part of Hrs/Vps27p, has a superhelical structure similar to the structure of ARM (Armadillo) repeats and is present at the N-termini of proteins. Hrs also contains a FYVE (Fab1p, YOTB, Vac1p, and EEA1) zinc finger domain C-terminal to VHS, as well as two coiled-coil domains. Hrs has been proposed to play a role in at least three vesicle trafficking events: exocytosis, endocytosis, and endosome to lysosome trafficking. Hrs is involved in promoting rapid recycling of endocytosed signaling receptors to the plasma membrane. Probab=62.35 E-value=8.4 Score=19.74 Aligned_cols=43 Identities=23% Similarity=0.426 Sum_probs=33.5 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 288999999999999996578899798799999999999999996289 Q gi|254780585|r 143 DSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKSGF 190 (191) Q Consensus 143 ~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~Sg~ 190 (191) ..+...||.+-.+|+..|.=. ..++. .+..|..+|+.|+..|+ T Consensus 95 ~~~~~~Vk~kil~li~~Wa~~-Fk~~p----~l~~i~~~y~~Lk~~G~ 137 (142) T cd03569 95 TTKNEEVRQKILELIQAWALA-FRNKP----QLKYVVDTYQILKAEGH 137 (142) T ss_pred CCCCHHHHHHHHHHHHHHHHH-HCCCC----CCHHHHHHHHHHHHCCC T ss_conf 489889999999999999998-28896----65799999999998589 No 45 >KOG0100 consensus Probab=60.32 E-value=7.5 Score=20.04 Aligned_cols=128 Identities=22% Similarity=0.144 Sum_probs=65.8 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHH-----HHHHHHHCCC---------CCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 576585217888887443589999-----9998620248---------73015998999999963113776555565421 Q gi|254780585|r 34 CQCIGEYRAPVGSGAEEQFFLFCL-----DHVKKYNKGY---------NYFLGLSDDEVGRYQKEGVTGERFTWTAHLYA 99 (191) Q Consensus 34 C~~~G~yrAPksr~~l~~y~wFCl-----eHvreyNk~w---------nyF~gms~~ei~~~~r~~~tw~RPTw~~~~~~ 99 (191) -+..|.-+||++--+..-. |-+ =||.+-.|.- |=-.-+++++|+..++++--..-...++.... T Consensus 489 FdltGipPAPRGvpqIEVt--FevDangiL~VsAeDKgtg~~~kitItNd~~rLt~EdIerMv~eAekFAeeDk~~Keki 566 (663) T KOG0100 489 FDLTGIPPAPRGVPQIEVT--FEVDANGILQVSAEDKGTGKKEKITITNDKGRLTPEDIERMVNEAEKFAEEDKKLKEKI 566 (663) T ss_pred CCCCCCCCCCCCCCCEEEE--EEECCCCEEEEEEECCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 3236799888898627999--99726745898766067787441788667787798999999998998766668899887 Q ss_pred CCCCHHHC-------CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 01210110-------00013333333321000158888976998617773288999999999999996578 Q gi|254780585|r 100 ERYPSNSS-------FFQDHRSSYGHFADRPDHRVGSMQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPD 163 (191) Q Consensus 100 ~~~~~~~~-------~~~d~~~~~~~~~~~~~~~~~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD 163 (191) ..+....+ -+.|.....+.........+.....++++-|+...+++.++.+.+.++|-...||= T Consensus 567 eaRN~LE~YayslKnqi~dkekLg~Kl~~edKe~~e~av~e~~eWL~~n~~a~~Ee~~ek~kele~vv~Pi 637 (663) T KOG0100 567 EARNELESYAYSLKNQIGDKEKLGGKLSDEDKETIEDAVEEALEWLESNQDASKEEFKEKKKELEAVVQPI 637 (663) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 76778999998765103766675565886678999999999999876166656778999998899998899 No 46 >KOG3960 consensus Probab=56.33 E-value=8.3 Score=19.78 Aligned_cols=15 Identities=33% Similarity=0.620 Sum_probs=10.6 Q ss_pred HHHHHHHHHHHHHHC Q ss_conf 999999999999962 Q gi|254780585|r 174 RFQAVIQAYKILKKS 188 (191) Q Consensus 174 ~~~~I~~Ay~~Lk~S 188 (191) ||++||+|+++||.. T Consensus 129 RLkKVNEAFE~LKRr 143 (284) T KOG3960 129 RLKKVNEAFETLKRR 143 (284) T ss_pred HHHHHHHHHHHHHHH T ss_conf 899999999999862 No 47 >smart00288 VHS Domain present in VPS-27, Hrs and STAM. Unpublished observations. Domain of unknown function. Probab=54.79 E-value=15 Score=18.11 Aligned_cols=43 Identities=19% Similarity=0.375 Sum_probs=32.4 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 328899999999999999657889979879999999999999999628 Q gi|254780585|r 142 SDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKSG 189 (191) Q Consensus 142 ~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~Sg 189 (191) +..+...||.+-.+|+..|.=.- .+++ .+..|..+|+.|+..| T Consensus 91 ~~~~~~~Vk~kil~li~~W~~~f-~~~p----~l~~i~~~y~~Lk~~G 133 (133) T smart00288 91 PKYPLPLVKKRILELIQEWADAF-KNDP----DLSQIVDVYDLLKKKG 133 (133) T ss_pred CCCCCHHHHHHHHHHHHHHHHHH-CCCC----CCHHHHHHHHHHHHCC T ss_conf 78997899999999999999984-8896----6659999999998772 No 48 >cd03565 VHS_Tom1 VHS domain family, Tom1 subfamily; The VHS domain is an essential part of Tom1 (Target of myb1 - retroviral oncogene) protein. The VHS domain has a superhelical structure similar to the structure of the ARM repeats and is present at the very N-termini of proteins. It is a right-handed superhelix of eight alpha helices. The VHS domain has been found in a number of proteins, some of which have been implicated in intracellular trafficking and sorting. The VHS domain of the Tom1 protein is essential for the negative regulation of Interleukin-1 and Tumor Necrosis Factor-induced signaling pathways. Probab=53.01 E-value=20 Score=17.30 Aligned_cols=42 Identities=21% Similarity=0.418 Sum_probs=32.8 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 88999999999999996578899798799999999999999996289 Q gi|254780585|r 144 SSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKSGF 190 (191) Q Consensus 144 ~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~Sg~ 190 (191) .....||.+-.+|+..|.=.- .++. .|..|..+|+.|+..|+ T Consensus 97 ~~~~~Vk~kil~li~~Wa~~f-~~~p----~l~~i~~~Y~~Lk~~G~ 138 (141) T cd03565 97 NPPTIVQEKVLALIQAWADAF-RGSP----DLTGVVEVYEELKKKGI 138 (141) T ss_pred CCHHHHHHHHHHHHHHHHHHH-CCCC----CCHHHHHHHHHHHHCCC T ss_conf 871999999999999999983-8886----45499999999998779 No 49 >cd03568 VHS_STAM VHS domain family, STAM subfamily; members include STAM (Signal Transducing Adaptor Molecule), EAST (EGFR-associated protein with SH3 and TAM domains) and Hbp (Hrs-binding protein). Collectively, they are referred to as STAM. All STAMs have at their N-termini a VHS domain, which is involved in cytokine-mediated intracellular signal transduction and has a superhelical structure similar to the structure of ARM (Armadillo) repeats, followed by a SH3 (Src homology 3) domain, a well-established protein-protein interaction domain. At the C-termini of most vertebrate STAMS, an ITAM (Immunoreceptor Tyrosine-based Activation) motif is present, which mediates the binding of HRS (hepatocyte growth factor-regulated tyrosine kinase substrate) in endocytic and exocytic machineries. Probab=52.00 E-value=15 Score=18.12 Aligned_cols=41 Identities=20% Similarity=0.345 Sum_probs=33.1 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 8999999999999996578899798799999999999999996289 Q gi|254780585|r 145 SPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKSGF 190 (191) Q Consensus 145 ~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~Sg~ 190 (191) +...||.+-.+|+..|.=+- .++++ |..|..+|+.|+..|. T Consensus 93 ~~~~Vk~kil~li~~W~~~F-~~dp~----l~~i~~~Y~~Lk~~G~ 133 (144) T cd03568 93 VHPTVKEKLREVVKQWADEF-KNDPS----LSLMSDLYKKLKNEGP 133 (144) T ss_pred CCHHHHHHHHHHHHHHHHHH-CCCCC----CHHHHHHHHHHHHHCC T ss_conf 98799999999999999984-67977----5699999999998589 No 50 >pfam12005 DUF3499 Protein of unknown function (DUF3499). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 125 to 163 amino acids in length. Probab=46.46 E-value=22 Score=16.94 Aligned_cols=63 Identities=16% Similarity=0.315 Sum_probs=41.6 Q ss_pred CCCCCCCCCCCCCCCC------------CCCCCCCCCCHHHHHHHHHHHHH--HCCCCCC--------CCCCHHHHH--- Q ss_conf 8876786475765852------------17888887443589999999862--0248730--------159989999--- Q gi|254780585|r 25 KSSICQWDNCQCIGEY------------RAPVGSGAEEQFFLFCLDHVKKY--NKGYNYF--------LGLSDDEVG--- 79 (191) Q Consensus 25 ~~r~C~~~gC~~~G~y------------rAPksr~~l~~y~wFCleHvrey--Nk~wnyF--------~gms~~ei~--- 79 (191) ..|.|..+||...+.. -.|.+....-.-|=+|.+|..-. -++|+-- .+.+++++. T Consensus 2 ~~R~Csr~gC~~~AvaTLty~YadstAV~GPLa~~~ePh~yDLC~~Ha~rlTaP~GW~vvr~~~~~~~~~p~~dDL~ALA 81 (125) T pfam12005 2 SPRRCSRPGCGRPAVATLTYVYADSTAVVGPLATYAEPHSYDLCEEHAERLTAPLGWEVVRLAGEFAPAEPDEDDLVALA 81 (125) T ss_pred CCCCCCCCCCCCCCCEEEEEEEECCEEEECCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCHHHHHHHH T ss_conf 97653756788761146888740330552367677788404658889863568887577860477777897778899999 Q ss_pred HHHHHHCC Q ss_conf 99963113 Q gi|254780585|r 80 RYQKEGVT 87 (191) Q Consensus 80 ~~~r~~~t 87 (191) +..|++.. T Consensus 82 dAVREaar 89 (125) T pfam12005 82 DAVREAAR 89 (125) T ss_pred HHHHHHCC T ss_conf 99987424 No 51 >TIGR02983 SigE-fam_strep RNA polymerase sigma-70 factor, sigma-E family; InterPro: IPR014325 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This group of similar sigma-70 factors includes the sigE factor from Streptomyces coelicolor . The protein appears to include a paralogous expansion in the Streptomycetes lineage, while related Actinomycetales have at most two representatives.. Probab=46.25 E-value=11 Score=18.89 Aligned_cols=23 Identities=17% Similarity=0.307 Sum_probs=13.2 Q ss_pred HHHCCCCCCCHHHHHHHHHHHHH Q ss_conf 86177732889999999999999 Q gi|254780585|r 136 EILGLLSDSSPEEIRGRYKDLVK 158 (191) Q Consensus 136 ~~Lgl~~~~~~~~ik~~yk~lvk 158 (191) ++||++..+=...+-+.-.+|-. T Consensus 137 ~~LG~SvGTVKS~~sRal~~LR~ 159 (165) T TIGR02983 137 EVLGISVGTVKSRLSRALARLRE 159 (165) T ss_pred HHHCCCCHHHHHHHHHHHHHHHH T ss_conf 88199932289989999999887 No 52 >cd01801 Tsc13_N Tsc13_N N-terminal domain of Tsc13. Tsc13 is an enoyl reductase involved in elongation of long chain fatty acids that localizes to the endoplasmic reticulum and is highly enriched in a novel structure marking nuclear-vacuolar junctions. Probab=43.08 E-value=9.9 Score=19.26 Aligned_cols=31 Identities=6% Similarity=0.069 Sum_probs=25.6 Q ss_pred HHHHCC-CCCCCHHHHHHHHHHHHHHHCCCCC Q ss_conf 986177-7328899999999999999657889 Q gi|254780585|r 135 FEILGL-LSDSSPEEIRGRYKDLVKKHHPDAN 165 (191) Q Consensus 135 l~~Lgl-~~~~~~~~ik~~yk~lvk~~HPD~n 165 (191) +-.+-+ ++++|..+||..+.++..+++|++. T Consensus 13 ~~~~kv~s~~aTi~dlK~~~~k~~p~~yp~RQ 44 (77) T cd01801 13 IGKLKVSSGDATIADLKKLIAKSSPQLTVNRQ 44 (77) T ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCCCHHE T ss_conf 78975069986399999999976889881040 No 53 >TIGR00873 gnd 6-phosphogluconate dehydrogenase, decarboxylating; InterPro: IPR006113 6-Phosphogluconate dehydrogenase (1.1.1.44 from EC) (6PGD) is an oxidative carboxylase that catalyses the decarboxylating reduction of 6-phosphogluconate into ribulose 5-phosphate in the presence of NADP. This reaction is a component of the hexose mono-phosphate shunt and pentose phosphate pathways (PPP) , . Prokaryotic and eukaryotic 6PGD are proteins of about 470 amino acids whose sequence are highly conserved . The protein is a homodimer in which the monomers act independently : each contains a large, mainly alpha-helical domain and a smaller beta-alpha-beta domain, containing a mixed parallel and anti-parallel 6-stranded beta sheet . NADP is bound in a cleft in the small domain, the substrate binding in an adjacent pocket . This model does not specify whether the cofactor is NADP only, NAD only, or both.; GO: 0004616 phosphogluconate dehydrogenase (decarboxylating) activity, 0050661 NADP binding, 0006098 pentose-phosphate shunt. Probab=38.60 E-value=22 Score=16.95 Aligned_cols=17 Identities=24% Similarity=0.560 Sum_probs=9.3 Q ss_pred CCCCCCCCCCCCCCCCC Q ss_conf 88767864757658521 Q gi|254780585|r 25 KSSICQWDNCQCIGEYR 41 (191) Q Consensus 25 ~~r~C~~~gC~~~G~yr 41 (191) .+++|+|-|=+=.|=|. T Consensus 167 g~PCC~wiG~~GAGHyV 183 (480) T TIGR00873 167 GEPCCTWIGPDGAGHYV 183 (480) T ss_pred CCCCEEEECCCCCCCEE T ss_conf 29422437867984305 No 54 >KOG2629 consensus Probab=38.35 E-value=33 Score=15.83 Aligned_cols=37 Identities=16% Similarity=0.432 Sum_probs=26.8 Q ss_pred HHHHHHHHCCCCCCCCCCHHHHH-HHHHHHCCCCCCCCCCCC Q ss_conf 99998620248730159989999-999631137765555654 Q gi|254780585|r 57 LDHVKKYNKGYNYFLGLSDDEVG-RYQKEGVTGERFTWTAHL 97 (191) Q Consensus 57 leHvreyNk~wnyF~gms~~ei~-~~~r~~~tw~RPTw~~~~ 97 (191) |.|=|+|=++ +|++++||. +|.+..++..-++..-++ T Consensus 22 li~kr~FLks----KGLT~eEI~eAfk~~gi~~~d~s~~~p~ 59 (300) T KOG2629 22 LIKKREFLKS----KGLTEEEIQEAFKRDGIPAQDVSKQIPT 59 (300) T ss_pred HHHHHHHHHH----CCCCHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 7999999985----5988999999998548862244556898 No 55 >TIGR02999 Sig-70_X6 RNA polymerase sigma factor, TIGR02999 family; InterPro: IPR011517 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This entry represents a group of sigma factors that are able to regulate extra cellular function (ECF) . Eubacteria display considerable genetic diversity between ECF-sigma factors, but all retain two features: the ability to respond to extra-cytoplasmic functions; and regulation by anti-sigma and anti-anti-sigma factors . This family show sequence similarity to IPR007627 from INTERPRO and IPR007630 from INTERPRO.; GO: 0003677 DNA binding, 0003700 transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, 0006355 regulation of transcription DNA-dependent. Probab=37.95 E-value=19 Score=17.45 Aligned_cols=28 Identities=21% Similarity=0.342 Sum_probs=15.6 Q ss_pred HHCCCCCCCHHHHHHHHHHHHH-HHCCCC Q ss_conf 6177732889999999999999-965788 Q gi|254780585|r 137 ILGLLSDSSPEEIRGRYKDLVK-KHHPDA 164 (191) Q Consensus 137 ~Lgl~~~~~~~~ik~~yk~lvk-~~HPD~ 164 (191) +|...-+++..-|.-+|-+|+. .=|+.- T Consensus 51 ~~~~ci~stT~LV~EAylrL~~t~~~~~~ 79 (194) T TIGR02999 51 VLAVCIDSTTALVHEAYLRLVDTQDEQEW 79 (194) T ss_pred HHHHHHCCHHHHHHHHHHHHHCCCCHHHH T ss_conf 67751046678899876642123202322 No 56 >KOG2217 consensus Probab=36.86 E-value=35 Score=15.68 Aligned_cols=41 Identities=22% Similarity=0.379 Sum_probs=32.1 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 999999999999657889979879999999999999999628 Q gi|254780585|r 148 EIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKSG 189 (191) Q Consensus 148 ~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~Sg 189 (191) .-|-+||.|.-++|- ++.|....|.+|++|.+-=...+.+- T Consensus 628 tpKEAyR~LShKFHG-kgsGK~K~EKR~kr~~ee~~~k~m~s 668 (705) T KOG2217 628 TPKEAYRLLSHKFHG-KGSGKKKTEKRLKRIEEEERLKKMSS 668 (705) T ss_pred CHHHHHHHHHHHHCC-CCCCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 889999997676327-89854679999999888875421467 No 57 >KOG2653 consensus Probab=36.13 E-value=36 Score=15.61 Aligned_cols=53 Identities=26% Similarity=0.383 Sum_probs=28.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCC-CCCCCHHHHHHHH-HHHHHHCCCCCCCCCCHHHHHHHH Q ss_conf 4887678647576585217888-8874435899999-998620248730159989999999 Q gi|254780585|r 24 PKSSICQWDNCQCIGEYRAPVG-SGAEEQFFLFCLD-HVKKYNKGYNYFLGLSDDEVGRYQ 82 (191) Q Consensus 24 ~~~r~C~~~gC~~~G~yrAPks-r~~l~~y~wFCle-HvreyNk~wnyF~gms~~ei~~~~ 82 (191) .+.++|+|-|=.-.|-|.-=.- --.-++-..-|.- ||-.- --|||.+||...- T Consensus 169 ~~epCc~wvG~~GaGhfVKMVHNGIEYGDMqLI~EaY~vlk~------~~gls~~eia~vF 223 (487) T KOG2653 169 DGEPCCDWVGEGGAGHFVKMVHNGIEYGDMQLICEAYDVLKS------VLGLSNDEIAEVF 223 (487) T ss_pred CCCCCEEEECCCCCCCCHHHHCCCCCCCHHHHHHHHHHHHHH------HCCCCHHHHHHHH T ss_conf 999883544688876301453267431138999999999998------4586678899999 No 58 >PRK10203 hypothetical protein; Provisional Probab=36.02 E-value=27 Score=16.46 Aligned_cols=20 Identities=10% Similarity=0.069 Sum_probs=8.9 Q ss_pred CCCCCHHHHHHHHHHHCCCC Q ss_conf 01599899999996311377 Q gi|254780585|r 70 FLGLSDDEVGRYQKEGVTGE 89 (191) Q Consensus 70 F~gms~~ei~~~~r~~~tw~ 89 (191) |+.+.+.-|...+...-.-+ T Consensus 4 ~d~~aE~~I~eA~~~Gefdn 23 (122) T PRK10203 4 LDQWAERHIIEAQAKGEFDN 23 (122) T ss_pred HHHHHHHHHHHHHHCCCCCC T ss_conf 99999999999998389668 No 59 >pfam02467 Whib Transcription factor WhiB. WhiB is a putative transcription factor in Actinobacteria, required for differentiation and sporulation. Probab=35.13 E-value=28 Score=16.31 Aligned_cols=26 Identities=27% Similarity=0.390 Sum_probs=22.5 Q ss_pred HHHHHHHHHHHCCCCCCCCCCHHHHH Q ss_conf 99999998620248730159989999 Q gi|254780585|r 54 LFCLDHVKKYNKGYNYFLGLSDDEVG 79 (191) Q Consensus 54 wFCleHvreyNk~wnyF~gms~~ei~ 79 (191) -=||+|.-+.+-.|--.-||++.|-. T Consensus 39 ~~Cl~~Al~~~e~~GVWGG~t~~ER~ 64 (66) T pfam02467 39 QECAAHALEHREEYGVWGGLSERERR 64 (66) T ss_pred HHHHHHHHHCCCCCEEECCCCHHHHH T ss_conf 99999999769976337898999985 No 60 >COG5062 Uncharacterized membrane protein [Function unknown] Probab=34.87 E-value=8.5 Score=19.71 Aligned_cols=54 Identities=15% Similarity=0.060 Sum_probs=32.7 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 89769986177732889999999999999965788997987999999999999999962898 Q gi|254780585|r 130 MQFNAFEILGLLSDSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKSGFC 191 (191) Q Consensus 130 ~~~~Al~~Lgl~~~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~Sg~c 191 (191) ...+|++++++-. +..--| -+|-++||-- ..+-+.--|.-.+.+++.+--.++| T Consensus 296 Tr~~~wk~~~~~~-af~lci-----ylVfnf~s~s--sRRlaNlpfv~wi~~lh~f~lt~y~ 349 (429) T COG5062 296 TRKKAWKIIILYN-AFFLCV-----YLVFNFYSTS--SRRLANLPFVMWIMLLHTFHLTVYE 349 (429) T ss_pred CHHHHHHHHHHHH-HHHHHH-----HHHHHHCCCC--HHHHCCCCHHHHHHHHHHHHHHEEE T ss_conf 5177889999998-999999-----9998644422--0232376179999999998764046 No 61 >COG4989 Predicted oxidoreductase [General function prediction only] Probab=34.17 E-value=36 Score=15.58 Aligned_cols=38 Identities=34% Similarity=0.469 Sum_probs=17.4 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 899999999999999657889979879999999999999999628 Q gi|254780585|r 145 SPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKSG 189 (191) Q Consensus 145 ~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~Sg 189 (191) +...++.-|-.+.+-|.||.-.... +|-+|++.|.+|| T Consensus 112 SL~~L~tDylD~LLiHRPDpLmd~e-------eVAeAf~~L~~sG 149 (298) T COG4989 112 SLINLKTDYLDLLLIHRPDPLMDAE-------EVAEAFTHLHKSG 149 (298) T ss_pred HHHHHCCCHHHHHHCCCCCCCCCHH-------HHHHHHHHHHHCC T ss_conf 9997272155554304886567979-------9999999998638 No 62 >TIGR02980 SigBFG RNA polymerase sigma-70 factor, sigma-B/F/G subfamily; InterPro: IPR014322 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This group of similar sigma-70 factors includes clades found in Bacilli (including the sporulation factors SigF:IPR014236 from INTERPRO and SigG:IPR014212 from INTERPRO as well as SigB:IPR014288 from INTERPRO), and the high GC Gram-positive bacteria (Actinobacteria) where a variable number of them are found depending on the lineage.; GO: 0003677 DNA binding, 0003700 transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, 0006355 regulation of transcription DNA-dependent. Probab=33.48 E-value=22 Score=17.03 Aligned_cols=16 Identities=13% Similarity=0.380 Sum_probs=8.6 Q ss_pred HHHH-HCCCCCCCCCCH Q ss_conf 9862-024873015998 Q gi|254780585|r 60 VKKY-NKGYNYFLGLSD 75 (191) Q Consensus 60 vrey-Nk~wnyF~gms~ 75 (191) |+=| |+++.+++++-+ T Consensus 20 A~RF~~RGe~~~EDL~Q 36 (229) T TIGR02980 20 ARRFRSRGEPHLEDLVQ 36 (229) T ss_pred HHHHHCCCCCCCCHHHH T ss_conf 99851378874311578 No 63 >pfam12031 DUF3518 Domain of unknown function (DUF3518). This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is about 260 amino acids in length. This domain is found associated with pfam01388. Probab=29.15 E-value=31 Score=16.06 Aligned_cols=17 Identities=12% Similarity=0.583 Sum_probs=13.7 Q ss_pred CCCHHHHHHHHHHHHHH Q ss_conf 74435899999998620 Q gi|254780585|r 48 AEEQFFLFCLDHVKKYN 64 (191) Q Consensus 48 ~l~~y~wFCleHvreyN 64 (191) ...++||-||+++||-- T Consensus 70 s~~ewWwd~L~~lREN~ 86 (257) T pfam12031 70 SKDEWWWDCLEVLRENT 86 (257) T ss_pred HHHHHHHHHHHHHHHCC T ss_conf 07779999999986140 No 64 >PRK13411 molecular chaperone DnaK; Provisional Probab=28.54 E-value=35 Score=15.69 Aligned_cols=17 Identities=35% Similarity=0.493 Sum_probs=12.6 Q ss_pred CCCCHHHHHHHHHHHCC Q ss_conf 15998999999963113 Q gi|254780585|r 71 LGLSDDEVGRYQKEGVT 87 (191) Q Consensus 71 ~gms~~ei~~~~r~~~t 87 (191) .|+|++||+..++++.- T Consensus 500 ~~Ls~~ei~~m~~~a~~ 516 (655) T PRK13411 500 GGLSSSEIERMRQEAEK 516 (655) T ss_pred CCCCHHHHHHHHHHHHH T ss_conf 99799999999999997 No 65 >TIGR01057 topA_arch DNA topoisomerase I; InterPro: IPR005739 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type I topoisomerases are ATP-independent enzymes (except for reverse gyrase), and can be subdivided according to their structure and reaction mechanisms: type IA (bacterial and archaeal topoisomerase I, topoisomerase III and reverse gyrase) and type IB (eukaryotic topoisomerase I and topoisomerase V). These enzymes are primarily responsible for relaxing positively and/or negatively supercoiled DNA, except for reverse gyrase, which can introduce positive supercoils into DNA. This entry describes topoisomerase I from archaea, which is more closely related to bacterial than to eukaryotic topoisomerase I . Topoisomerase I is the major enzyme for relaxing negatively supercoiled DNA, and its presence is balanced by reverse gyrase, which can introduce negative supercoils. More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003916 DNA topoisomerase activity, 0006265 DNA topological change, 0006268 DNA unwinding during replication, 0005694 chromosome. Probab=27.15 E-value=50 Score=14.65 Aligned_cols=33 Identities=18% Similarity=0.460 Sum_probs=26.9 Q ss_pred CCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHH Q ss_conf 158888976998617773288999999999999 Q gi|254780585|r 125 HRVGSMQFNAFEILGLLSDSSPEEIRGRYKDLV 157 (191) Q Consensus 125 ~~~~~~~~~Al~~Lgl~~~~~~~~ik~~yk~lv 157 (191) ..++..|++||.+||++|--|.+--..=|.+-- T Consensus 282 FdLGtLQ~EAy~~fG~sPk~Tq~IAq~LY~~al 314 (637) T TIGR01057 282 FDLGTLQREAYRVFGFSPKKTQEIAQELYEEAL 314 (637) T ss_pred CCCCHHHHHHHHHCCCCCHHHHHHHHHHHHCCC T ss_conf 875525899998607996446899999864235 No 66 >PTZ00043 cytochrome c oxidase subunit; Provisional Probab=24.41 E-value=56 Score=14.32 Aligned_cols=41 Identities=22% Similarity=0.299 Sum_probs=29.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99999999999996578899798799999999999999996289 Q gi|254780585|r 147 EEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKSGF 190 (191) Q Consensus 147 ~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~Sg~ 190 (191) .-++|.|+.|++..---... .. .|.|--+.+-|++||.|.| T Consensus 83 ~lp~ksykqla~dmgmqiv~--e~-~ehm~gllelyeylk~s~f 123 (270) T PTZ00043 83 KLPKKSYKQLAVDLGMQIVN--EP-SEHMLGLLELYEYLKSSPF 123 (270) T ss_pred HCCHHHHHHHHHHHCCEECC--CC-HHHHHHHHHHHHHHHCCCC T ss_conf 60256799999973865247--95-5989999999999723875 No 67 >cd03567 VHS_GGA VHS domain family, GGA subfamily; GGA (Golgi-localized, Gamma-ear-containing, Arf-binding) comprise a subfamily of ubiquitously expressed, monomeric, motif-binding cargo/clathrin adaptor proteins. The VHS domain has a superhelical structure similar to the structure of the ARM (Armadillo) repeats and is present at the N-termini of proteins. GGA proteins have a multidomain structure consisting of an N-terminal VHS domain linked by a short proline-rich linker to a GAT (GGA and TOM) domain, which is followed by a long flexible linker to the C-terminal appendage, GAE (gamma-adaptin ear) domain. The VHS domain of GGA proteins binds to the acidic-cluster dileucine (DxxLL) motif found on the cytoplasmic tails of cargo proteins trafficked between the trans-Golgi network and the endosomal system. Probab=24.32 E-value=57 Score=14.31 Aligned_cols=40 Identities=20% Similarity=0.407 Sum_probs=26.4 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 288999999999999996578899798799999999999999996289 Q gi|254780585|r 143 DSSPEEIRGRYKDLVKKHHPDANGGDRGSEERFQAVIQAYKILKKSGF 190 (191) Q Consensus 143 ~~~~~~ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~Ay~~Lk~Sg~ 190 (191) +.+...||.+-.+|+..|.=+- ..++ .+.+||+.||..|. T Consensus 97 ~~t~~~Vk~kileli~~Wa~~f-~~~p-------~~~~~y~~Lk~~Gv 136 (139) T cd03567 97 SRTSEKVKTKIIELLYSWTLEL-PHEP-------KIKEAYDMLKKQGI 136 (139) T ss_pred CCCCHHHHHHHHHHHHHHHHHH-CCCC-------HHHHHHHHHHHCCC T ss_conf 8788899999999999999983-4586-------69999999998688 No 68 >pfam07800 DUF1644 Protein of unknown function (DUF1644). This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, and is probably a zinc-binding domain. Probab=23.03 E-value=58 Score=14.24 Aligned_cols=12 Identities=25% Similarity=0.465 Sum_probs=5.9 Q ss_pred HHHHHHCCCCCC Q ss_conf 999996578899 Q gi|254780585|r 155 DLVKKHHPDANG 166 (191) Q Consensus 155 ~lvk~~HPD~n~ 166 (191) +-|+.-||.... T Consensus 116 kHar~~HP~~rP 127 (152) T pfam07800 116 KHARSEHPGARP 127 (152) T ss_pred HHHHHHCCCCCC T ss_conf 988751898886 No 69 >KOG3242 consensus Probab=21.97 E-value=63 Score=14.01 Aligned_cols=34 Identities=18% Similarity=0.373 Sum_probs=19.7 Q ss_pred HHHHHHHHCCCCCCCC--HH-HHHHHHHHHHHHHHHH Q ss_conf 9999999657889979--87-9999999999999999 Q gi|254780585|r 153 YKDLVKKHHPDANGGD--RG-SEERFQAVIQAYKILK 186 (191) Q Consensus 153 yk~lvk~~HPD~n~g~--~~-~ee~~~~I~~Ay~~Lk 186 (191) -++|+++|+||...-+ +. +...|-.|.+.-..|+ T Consensus 157 IkeL~~Rw~P~~~~~aPkK~~~HrAldDI~ESI~ELq 193 (208) T KOG3242 157 IKELARRWYPDIKARAPKKKATHRALDDIRESIKELQ 193 (208) T ss_pred HHHHHHHHCCHHHCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 9999998590322007210014010889999999999 No 70 >pfam03343 SART-1 SART-1 family. SART-1 is a protein involved in cell cycle arrest and pre-mRNA splicing. It has been shown to be a component of U4/U6 x U5 tri-snRNP complex in human, Schizosaccharomyces pombe and Saccharomyces cerevisiae. SART-1 is a known tumour antigen in a range of cancers recognized by T cells. Probab=21.56 E-value=64 Score=13.96 Aligned_cols=31 Identities=23% Similarity=0.390 Sum_probs=23.5 Q ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 99999999999657889979879999999999 Q gi|254780585|r 149 IRGRYKDLVKKHHPDANGGDRGSEERFQAVIQ 180 (191) Q Consensus 149 ik~~yk~lvk~~HPD~n~g~~~~ee~~~~I~~ 180 (191) -|-+||.|.-+||= +..|....|.+|+.|.+ T Consensus 545 ~KEAFk~LSHkFHG-KgsGK~K~EKrlKK~ee 575 (577) T pfam03343 545 PKEAFKYLSHKFHG-KGSGKMKTEKRLKKIEE 575 (577) T ss_pred HHHHHHHHHHCCCC-CCCCCCHHHHHHHHHHH T ss_conf 89999998741158-68870369999998871 No 71 >pfam12132 DUF3587 Protein of unknown function (DUF3587). This protein is found in viruses. Proteins in this family are typically between 209 and 248 amino acids in length. Probab=20.10 E-value=69 Score=13.76 Aligned_cols=24 Identities=25% Similarity=0.623 Sum_probs=15.4 Q ss_pred CCCC-CCCCCCCHHHHHHHHHHHHH Q ss_conf 2178-88887443589999999862 Q gi|254780585|r 40 YRAP-VGSGAEEQFFLFCLDHVKKY 63 (191) Q Consensus 40 yrAP-ksr~~l~~y~wFCleHvrey 63 (191) +..| ...=.-+.|+.||-.||.-. T Consensus 153 f~~p~~~~C~~gHFHHyCs~HV~~W 177 (201) T pfam12132 153 FVKPSVDECPYGHFHHYCSQHVNSW 177 (201) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 5689878888898566679999999 Done!