Query gi|254780764|ref|YP_003065177.1| hypothetical protein CLIBASIA_03275 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 194 No_of_seqs 163 out of 924 Neff 7.7 Searched_HMMs 39220 Date Sun May 29 18:49:47 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780764.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG3807 Uncharacterized protei 100.0 4.1E-41 1.4E-45 270.8 11.6 151 42-194 21-171 (171) 2 COG3103 SH3 domain protein [Si 99.7 1.1E-18 2.8E-23 133.4 3.0 121 56-184 26-149 (205) 3 pfam06347 SH3_4 Bacterial SH3 99.5 8.5E-14 2.2E-18 103.4 7.1 54 133-186 2-55 (55) 4 pfam08239 SH3_3 Bacterial SH3 99.4 1.4E-12 3.6E-17 95.9 7.3 50 135-184 2-51 (52) 5 smart00287 SH3b Bacterial SH3 99.2 7.8E-11 2E-15 85.3 6.8 58 58-116 4-62 (63) 6 PRK10884 putative signal trans 99.2 1.8E-10 4.6E-15 83.1 8.2 63 56-120 26-90 (206) 7 pfam08239 SH3_3 Bacterial SH3 99.1 3.3E-10 8.3E-15 81.5 6.2 52 63-116 1-52 (52) 8 smart00287 SH3b Bacterial SH3 99.0 6.1E-10 1.6E-14 79.8 6.8 57 128-184 3-61 (63) 9 pfam06347 SH3_4 Bacterial SH3 99.0 6.7E-10 1.7E-14 79.5 6.1 55 61-117 1-55 (55) 10 COG3103 SH3 domain protein [Si 98.7 6.7E-08 1.7E-12 67.3 6.5 58 127-184 25-85 (205) 11 PRK10884 putative signal trans 98.6 1.4E-07 3.7E-12 65.2 6.2 57 129-185 27-86 (206) 12 COG4991 Uncharacterized protei 98.6 3.4E-08 8.7E-13 69.1 2.7 54 62-117 39-94 (155) 13 COG4991 Uncharacterized protei 98.5 2E-07 5.2E-12 64.3 4.7 55 130-184 36-92 (155) 14 PRK13545 tagH teichoic acids e 97.4 0.00044 1.1E-08 43.9 6.3 59 59-118 350-414 (549) 15 KOG4226 consensus 97.4 0.0004 1E-08 44.2 5.4 103 80-185 125-247 (379) 16 COG3807 Uncharacterized protei 97.3 0.00012 3E-09 47.4 2.4 55 133-187 41-97 (171) 17 PRK13545 tagH teichoic acids e 97.0 0.0015 3.8E-08 40.6 5.0 65 128-192 348-419 (549) 18 pfam08460 SH3_5 Bacterial SH3 96.4 0.013 3.3E-07 34.9 6.5 48 133-180 12-64 (65) 19 KOG4226 consensus 95.9 0.012 3.2E-07 35.0 4.2 104 83-186 24-161 (379) 20 smart00326 SH3 Src homology 3 95.5 0.028 7.2E-07 32.8 4.9 36 149-184 18-55 (58) 21 pfam07653 SH3_2 Variant SH3 do 95.4 0.023 5.8E-07 33.4 4.2 35 150-184 16-51 (53) 22 cd00174 SH3 Src homology 3 dom 94.9 0.054 1.4E-06 31.1 5.0 35 150-184 16-52 (54) 23 KOG1428 consensus 93.5 0.53 1.3E-05 25.0 8.1 118 56-174 2167-2334(3738) 24 KOG4278 consensus 93.1 0.062 1.6E-06 30.7 2.3 44 83-127 110-155 (1157) 25 PRK13914 invasion associated s 92.0 0.45 1.1E-05 25.5 5.4 57 60-117 82-139 (475) 26 pfam08460 SH3_5 Bacterial SH3 91.6 0.49 1.3E-05 25.2 5.3 47 63-110 13-63 (65) 27 KOG0515 consensus 91.5 0.062 1.6E-06 30.7 0.6 39 152-191 702-744 (752) 28 smart00326 SH3 Src homology 3 91.1 0.26 6.7E-06 26.9 3.5 36 80-116 20-56 (58) 29 KOG4225 consensus 88.9 0.64 1.6E-05 24.5 4.0 102 80-183 248-357 (489) 30 KOG1029 consensus 84.7 1.5 3.9E-05 22.2 4.1 34 152-185 832-865 (1118) 31 KOG2070 consensus 78.8 1.2 3.1E-05 22.8 1.7 32 84-116 38-70 (661) 32 pfam00018 SH3_1 SH3 domain. SH 76.9 3.3 8.3E-05 20.2 3.5 29 152-180 16-46 (47) 33 KOG4792 consensus 71.6 6.9 0.00017 18.2 4.2 119 64-184 124-281 (293) 34 TIGR02162 torC trimethylamine- 70.8 2.4 6.1E-05 21.0 1.6 111 58-186 212-323 (394) 35 KOG2996 consensus 69.8 3.2 8.2E-05 20.2 2.1 35 150-184 822-859 (865) 36 KOG3557 consensus 63.4 2.7 6.8E-05 20.7 0.7 38 151-188 518-556 (721) 37 COG3056 Uncharacterized lipopr 59.9 12 0.0003 16.8 5.0 134 12-159 4-155 (204) 38 KOG2546 consensus 56.7 7.2 0.00018 18.1 1.9 31 154-184 444-475 (483) 39 KOG0162 consensus 55.2 7.2 0.00018 18.1 1.7 47 137-184 1056-1103(1106) 40 pfam11191 DUF2782 Protein of u 51.0 9.4 0.00024 17.3 1.7 50 62-114 38-89 (104) 41 PRK03147 thiol-disulfide oxido 49.1 16 0.00042 15.9 2.7 40 48-99 36-75 (176) 42 PRK10559 p-hydroxybenzoic acid 46.8 19 0.00048 15.5 6.7 31 56-88 39-69 (310) 43 PRK13697 cytochrome c6; Provis 34.6 29 0.00074 14.3 2.1 26 19-45 2-27 (111) 44 COG5368 Uncharacterized protei 33.9 14 0.00036 16.3 0.4 19 172-191 380-398 (451) 45 KOG3523 consensus 31.6 6.5 0.00017 18.3 -1.6 37 149-185 624-663 (695) 46 COG4787 FlgF Flagellar basal b 29.6 36 0.00091 13.8 5.2 48 68-116 59-106 (251) 47 KOG2222 consensus 29.1 29 0.00073 14.4 1.4 33 152-184 567-600 (848) 48 KOG3837 consensus 26.7 13 0.00033 16.5 -0.8 29 65-93 425-453 (523) 49 PRK13861 type IV secretion sys 24.1 45 0.0011 13.2 4.5 52 55-111 32-88 (293) 50 TIGR02924 ICDH_alpha isocitrat 23.6 46 0.0012 13.1 1.7 27 65-94 92-119 (481) 51 KOG3669 consensus 22.8 47 0.0012 13.0 3.7 45 146-192 353-406 (705) 52 KOG3601 consensus 22.5 12 0.0003 16.8 -1.7 34 154-187 184-218 (222) 53 PRK04405 prsA peptidylprolyl i 22.2 49 0.0012 13.0 3.3 24 149-172 209-233 (298) 54 TIGR00316 cdhC CO dehydrogenas 22.1 49 0.0013 13.0 2.9 70 84-182 274-359 (496) No 1 >COG3807 Uncharacterized protein conserved in bacteria [Function unknown] Probab=100.00 E-value=4.1e-41 Score=270.82 Aligned_cols=151 Identities=50% Similarity=0.962 Sum_probs=141.3 Q ss_pred CCCCCCCCCCCCCCEEEEEECCEEEEEECCCCCCCCEEEEEECCCEEEEEEECCCEEEEEECCCEEEEEECCCCCCCCCC Q ss_conf 00012333555552167830563688626898863027998369625554201212567532530576402334654433 Q gi|254780764|r 42 LSHEKEIFEKKPLPRFVTIKASRANSRIGPGIMYTVVCTYLTKGLPVEVVKEYENWRQIRDFDGTIGWINKSLLSGKRSA 121 (194) Q Consensus 42 ~s~~~~~~t~~~~pr~~sik~~~vNvR~GP~~~y~i~~~~~~~g~~v~vi~~~~~W~kV~~~~g~~Gwv~~~~ls~~~~~ 121 (194) .+......+|+|+|||||+|+++||.|.||+++|.+.|.|.++|+|++|+.|+++||+|+|.||..||||++++|++|++ T Consensus 21 ~~g~tig~sgLplPRfVSlKs~~VN~R~GP~~~yav~W~y~k~GlPVEIvqEy~~WRrirDadG~egWv~qsllsG~Rta 100 (171) T COG3807 21 AQGTTIGASGLPLPRFVSLKSAEVNARVGPGTDYAVEWVYLKKGLPVEIVQEYDNWRRIRDADGTEGWVHQSLLSGKRTA 100 (171) T ss_pred HCCCCCCCCCCCCCCEEEECCCCEECCCCCCCCCEEEEEEECCCCCEEHHHHHHHHHHEECCCCCCEEEEEECCCCCCEE T ss_conf 33574278898455157731110431438984401578663069853315566436521078887102360010675217 Q ss_pred CCCCCCCCCCCCCCEEEECCCCCCCCEEEEECCCCEEEEEEEECCEEEEEECCEEEEEECCCEEEECCCCEEC Q ss_conf 3334333434433113331799877537997389889999520998999989989999714122056676039 Q gi|254780764|r 122 IVSPWNRKTNNPIYINLYKKPDIQSIIVAKVEPGVLLTIRECSGEWCFGYNLDTEGWIKKQKIWGIYPGEVFK 194 (194) Q Consensus 122 ~~~~~~~~~~~~~~lnvR~~Ps~~s~iv~~l~~g~~v~i~~~~~~W~kV~~~g~~GWV~k~~iwGv~~~e~~~ 194 (194) +..+..... ...+|+|+.|...+.+++++++|++..+.+|+++||+++.+|.+||+++.-|||+||+|.|| T Consensus 101 i~apw~~~k--g~~i~l~k~~~~~a~V~A~lepgv~~sl~~C~g~wC~~~~~g~~GWi~q~eIWGaypge~~k 171 (171) T COG3807 101 IIAPWMRDK--GVEINLRKSADDGARVVAKLEPGVVGSLKKCKGQWCRLTAKGYSGWISQGEIWGAYPGERFK 171 (171) T ss_pred EECCCCCCC--CEEEEEEECCCCCCEEEEEECCCEEEEECCCCCCEEEEECCCCCCEEECCEEECCCCCCCCC T ss_conf 960011696--40687750788887078996485488700056612787716744403213140357641259 No 2 >COG3103 SH3 domain protein [Signal transduction mechanisms] Probab=99.73 E-value=1.1e-18 Score=133.42 Aligned_cols=121 Identities=24% Similarity=0.352 Sum_probs=106.8 Q ss_pred EEEEEECCEEEEEECCCCCCCCEEEEEECCCEEEEEEECCC--EEEEEECCCEEEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 16783056368862689886302799836962555420121--2567532530576402334654433333433343443 Q gi|254780764|r 56 RFVTIKASRANSRIGPGIMYTVVCTYLTKGLPVEVVKEYEN--WRQIRDFDGTIGWINKSLLSGKRSAIVSPWNRKTNNP 133 (194) Q Consensus 56 r~~sik~~~vNvR~GP~~~y~i~~~~~~~g~~v~vi~~~~~--W~kV~~~~g~~Gwv~~~~ls~~~~~~~~~~~~~~~~~ 133 (194) ||++ ..+..|||+|||.+|.|...+. +|+.++|+++.++ |++|++.+|..|||-.++++...+.. ..+.. T Consensus 26 ~~v~-~~~~~~vRsGpg~~y~I~~~i~-~Ge~vtvl~~~~~~~~~qI~~~~g~t~wi~~~~lt~e~s~~------~~~p~ 97 (205) T COG3103 26 RYVS-DKLNTNVRSGPGDQYRIVGSIK-AGEKVTVLGTDGNTGYYQIRDSSGRTGWILSKNLTSEPSSN------ERVPD 97 (205) T ss_pred HHEE-CCCEEEEECCCCCCEEEEEEEC-CCCEEEEEEECCCCCEEEEEECCCCEEEEECHHHCCCCCCC------EECCC T ss_conf 0001-2004453027875401456651-78589999872763379999668735888452420462010------24475 Q ss_pred CCEEEECCCCCCCCEEEEECCCCEEEEEEEE-CCEEEEEECCEEEEEECCCE Q ss_conf 3113331799877537997389889999520-99899998998999971412 Q gi|254780764|r 134 IYINLYKKPDIQSIIVAKVEPGVLLTIRECS-GEWCFGYNLDTEGWIKKQKI 184 (194) Q Consensus 134 ~~lnvR~~Ps~~s~iv~~l~~g~~v~i~~~~-~~W~kV~~~g~~GWV~k~~i 184 (194) ..+|||+.|.+++.+++..++|+.+.....+ .+|++++++|..|||+++++ T Consensus 98 ~~~nVr~~t~t~~~v~g~~~~~t~~~~~sk~~~~w~~~~~~~k~~~vs~q~~ 149 (205) T COG3103 98 LELNVRELTKTLSNVDGTWKQGTAVMQISKTQNNWYIITLKGKNGYVSGQLV 149 (205) T ss_pred HHHHHHHCCCCHHHHHHHHHCCCEEEEEEECCCCCHHHHHHCCCCHHHHHHE T ss_conf 4442311333034210133168579987412363136666354432333320 No 3 >pfam06347 SH3_4 Bacterial SH3 domain. This family consists of several hypothetical bacterial proteins of unknown function. These are composed of SH3-like domains. Probab=99.48 E-value=8.5e-14 Score=103.42 Aligned_cols=54 Identities=33% Similarity=0.757 Sum_probs=52.3 Q ss_pred CCCEEEECCCCCCCCEEEEECCCCEEEEEEEECCEEEEEECCEEEEEECCCEEE Q ss_conf 331133317998775379973898899995209989999899899997141220 Q gi|254780764|r 133 PIYINLYKKPDIQSIIVAKVEPGVLLTIRECSGEWCFGYNLDTEGWIKKQKIWG 186 (194) Q Consensus 133 ~~~lnvR~~Ps~~s~iv~~l~~g~~v~i~~~~~~W~kV~~~g~~GWV~k~~iwG 186 (194) .+.+|||++|+++|++++.+++|..+++.+|+++||+|+.+|.+|||++++||| T Consensus 2 ~~~vnlr~~P~~~s~i~a~~~~gv~~~v~~c~~~Wc~v~~~g~~GWv~~~~lwG 55 (55) T pfam06347 2 KKRVNLRKGPSPDAKVIAYLEPGVPVRVVKCNGNWCRVRADGATGWIYQSLLWG 55 (55) T ss_pred CCEEEEECCCCCCCCEEEEECCCCEEEEEEECCCEEEEECCCCEEEEECCCCCC T ss_conf 864345059999987428973998899998049789997199278678353409 No 4 >pfam08239 SH3_3 Bacterial SH3 domain. Probab=99.38 E-value=1.4e-12 Score=95.93 Aligned_cols=50 Identities=16% Similarity=0.290 Sum_probs=48.5 Q ss_pred CEEEECCCCCCCCEEEEECCCCEEEEEEEECCEEEEEECCEEEEEECCCE Q ss_conf 11333179987753799738988999952099899998998999971412 Q gi|254780764|r 135 YINLYKKPDIQSIIVAKVEPGVLLTIRECSGEWCFGYNLDTEGWIKKQKI 184 (194) Q Consensus 135 ~lnvR~~Ps~~s~iv~~l~~g~~v~i~~~~~~W~kV~~~g~~GWV~k~~i 184 (194) .||||++|+++++++++|++|+.+++++|+++||+|+++|++|||+++|| T Consensus 2 ~lNvR~~p~t~~~il~~l~~G~~v~v~~~~~~W~~V~~~g~~G~V~~~Yl 51 (52) T pfam08239 2 NLNVRSGPGTSSKIIGQLPKGTKVTVLGETNGWYKIEYNGKTGYVSSDYV 51 (52) T ss_pred CEEEECCCCCCCHHEEEECCCCEEEEEEEECCEEEEEECCEEEEEEEEEC T ss_conf 58883899999841499949999999948485899998999999992637 No 5 >smart00287 SH3b Bacterial SH3 domain homologues. Probab=99.17 E-value=7.8e-11 Score=85.28 Aligned_cols=58 Identities=24% Similarity=0.403 Sum_probs=45.0 Q ss_pred EEEECCEEEEEECCCCCCCCEEEEEECCCEEEEEEECC-CEEEEEECCCEEEEEECCCCC Q ss_conf 78305636886268988630279983696255542012-125675325305764023346 Q gi|254780764|r 58 VTIKASRANSRIGPGIMYTVVCTYLTKGLPVEVVKEYE-NWRQIRDFDGTIGWINKSLLS 116 (194) Q Consensus 58 ~sik~~~vNvR~GP~~~y~i~~~~~~~g~~v~vi~~~~-~W~kV~~~~g~~Gwv~~~~ls 116 (194) ..++.+.+|||+||+++|+|+.++. +|..++++++.+ +|++|++.+|..|||+.++++ T Consensus 4 ~~Vt~~~vNvR~gP~t~~~ii~~l~-~G~~v~vl~~~~~~W~~i~~~~G~~Gwv~~~~v~ 62 (63) T smart00287 4 AVVTGDGLNVRSGPGTSSPIIGTLK-KGDKVKVLGVDGQDWAKITYGSGQRGYVPGYVVN 62 (63) T ss_pred EEEECCEEEECCCCCCCCCEEEEEC-CCCEEEEEECCCCCEEEEECCCCCEEEEEEEEEC T ss_conf 9996786896169999863479986-9999999532699799999189989999927871 No 6 >PRK10884 putative signal transduction protein; Provisional Probab=99.15 E-value=1.8e-10 Score=83.06 Aligned_cols=63 Identities=33% Similarity=0.477 Sum_probs=53.0 Q ss_pred EEEEEECCEEEEEECCCCCCCCEEEEEECCCEEEEEEECC--CEEEEEECCCEEEEEECCCCCCCCC Q ss_conf 1678305636886268988630279983696255542012--1256753253057640233465443 Q gi|254780764|r 56 RFVTIKASRANSRIGPGIMYTVVCTYLTKGLPVEVVKEYE--NWRQIRDFDGTIGWINKSLLSGKRS 120 (194) Q Consensus 56 r~~sik~~~vNvR~GP~~~y~i~~~~~~~g~~v~vi~~~~--~W~kV~~~~g~~Gwv~~~~ls~~~~ 120 (194) ||+|=.-. +.||+|||.+|.|+..+. .|++|+++++.+ +|.+|++.+|..|||-..+++...+ T Consensus 26 rYVSD~l~-v~~RsGpg~~yRIv~~l~-sGt~V~lLe~~~~~gys~Vr~~~G~eGWV~s~yLs~~ps 90 (206) T PRK10884 26 RYVSDELN-TWVRSGPGDQYRLVGTVN-AGEEVTLLQTDANTNYAQIKDSSGRTAWIPLKQLSTTPS 90 (206) T ss_pred EEEEEEEE-EEEECCCCCCCEEEEEEC-CCCEEEEEEECCCCCEEEEECCCCCEEEEEHHHCCCCCC T ss_conf 57752598-876428998754898866-998789977548888589983999788766876246886 No 7 >pfam08239 SH3_3 Bacterial SH3 domain. Probab=99.06 E-value=3.3e-10 Score=81.46 Aligned_cols=52 Identities=31% Similarity=0.560 Sum_probs=47.8 Q ss_pred CEEEEEECCCCCCCCEEEEEECCCEEEEEEECCCEEEEEECCCEEEEEECCCCC Q ss_conf 636886268988630279983696255542012125675325305764023346 Q gi|254780764|r 63 SRANSRIGPGIMYTVVCTYLTKGLPVEVVKEYENWRQIRDFDGTIGWINKSLLS 116 (194) Q Consensus 63 ~~vNvR~GP~~~y~i~~~~~~~g~~v~vi~~~~~W~kV~~~~g~~Gwv~~~~ls 116 (194) +.+|||+||+++++|+.++. +|+.++|+++.++|++|+. +|..|||++.|++ T Consensus 1 s~lNvR~~p~t~~~il~~l~-~G~~v~v~~~~~~W~~V~~-~g~~G~V~~~Ylt 52 (52) T pfam08239 1 SNLNVRSGPGTSSKIIGQLP-KGTKVTVLGETNGWYKIEY-NGKTGYVSSDYVS 52 (52) T ss_pred CCEEEECCCCCCCHHEEEEC-CCCEEEEEEEECCEEEEEE-CCEEEEEEEEECC T ss_conf 95888389999984149994-9999999948485899998-9999999926371 No 8 >smart00287 SH3b Bacterial SH3 domain homologues. Probab=99.04 E-value=6.1e-10 Score=79.78 Aligned_cols=57 Identities=14% Similarity=0.225 Sum_probs=50.3 Q ss_pred CCCCCCCCEEEECCCCCCCCEEEEECCCCEEEEEEEEC-CEEEEEE-CCEEEEEECCCE Q ss_conf 34344331133317998775379973898899995209-9899998-998999971412 Q gi|254780764|r 128 RKTNNPIYINLYKKPDIQSIIVAKVEPGVLLTIRECSG-EWCFGYN-LDTEGWIKKQKI 184 (194) Q Consensus 128 ~~~~~~~~lnvR~~Ps~~s~iv~~l~~g~~v~i~~~~~-~W~kV~~-~g~~GWV~k~~i 184 (194) ..+++++.+|||++|+++++++++|.+|+.+++++.++ +|++|++ +|.+|||+..++ T Consensus 3 t~~Vt~~~vNvR~gP~t~~~ii~~l~~G~~v~vl~~~~~~W~~i~~~~G~~Gwv~~~~v 61 (63) T smart00287 3 TAVVTGDGLNVRSGPGTSSPIIGTLKKGDKVKVLGVDGQDWAKITYGSGQRGYVPGYVV 61 (63) T ss_pred EEEEECCEEEECCCCCCCCCEEEEECCCCEEEEEECCCCCEEEEECCCCCEEEEEEEEE T ss_conf 79996786896169999863479986999999953269979999918998999992787 No 9 >pfam06347 SH3_4 Bacterial SH3 domain. This family consists of several hypothetical bacterial proteins of unknown function. These are composed of SH3-like domains. Probab=99.01 E-value=6.7e-10 Score=79.54 Aligned_cols=55 Identities=45% Similarity=0.713 Sum_probs=50.6 Q ss_pred ECCEEEEEECCCCCCCCEEEEEECCCEEEEEEECCCEEEEEECCCEEEEEECCCCCC Q ss_conf 056368862689886302799836962555420121256753253057640233465 Q gi|254780764|r 61 KASRANSRIGPGIMYTVVCTYLTKGLPVEVVKEYENWRQIRDFDGTIGWINKSLLSG 117 (194) Q Consensus 61 k~~~vNvR~GP~~~y~i~~~~~~~g~~v~vi~~~~~W~kV~~~~g~~Gwv~~~~ls~ 117 (194) +++.+|||+||+.+|+|+..+. .|.+++|.+..++|++|+ .+|..|||++++|.+ T Consensus 1 k~~~vnlr~~P~~~s~i~a~~~-~gv~~~v~~c~~~Wc~v~-~~g~~GWv~~~~lwG 55 (55) T pfam06347 1 KKKRVNLRKGPSPDAKVIAYLE-PGVPVRVVKCNGNWCRVR-ADGATGWIYQSLLWG 55 (55) T ss_pred CCCEEEEECCCCCCCCEEEEEC-CCCEEEEEEECCCEEEEE-CCCCEEEEECCCCCC T ss_conf 9864345059999987428973-998899998049789997-199278678353409 No 10 >COG3103 SH3 domain protein [Signal transduction mechanisms] Probab=98.66 E-value=6.7e-08 Score=67.28 Aligned_cols=58 Identities=21% Similarity=0.353 Sum_probs=53.0 Q ss_pred CCCCCCCCCEEEECCCCCCCCEEEEECCCCEEEEEEEECC--EEEEEE-CCEEEEEECCCE Q ss_conf 3343443311333179987753799738988999952099--899998-998999971412 Q gi|254780764|r 127 NRKTNNPIYINLYKKPDIQSIIVAKVEPGVLLTIRECSGE--WCFGYN-LDTEGWIKKQKI 184 (194) Q Consensus 127 ~~~~~~~~~lnvR~~Ps~~s~iv~~l~~g~~v~i~~~~~~--W~kV~~-~g~~GWV~k~~i 184 (194) ...++..+.+|||++|+.+|.|++++..|+.|++++.+++ |++|.+ +|.+|||...++ T Consensus 25 ~~~v~~~~~~~vRsGpg~~y~I~~~i~~Ge~vtvl~~~~~~~~~qI~~~~g~t~wi~~~~l 85 (205) T COG3103 25 TRYVSDKLNTNVRSGPGDQYRIVGSIKAGEKVTVLGTDGNTGYYQIRDSSGRTGWILSKNL 85 (205) T ss_pred HHHEECCCEEEEECCCCCCEEEEEEECCCCEEEEEEECCCCCEEEEEECCCCEEEEECHHH T ss_conf 4000120044530278754014566517858999987276337999966873588845242 No 11 >PRK10884 putative signal transduction protein; Provisional Probab=98.57 E-value=1.4e-07 Score=65.24 Aligned_cols=57 Identities=14% Similarity=0.201 Sum_probs=48.8 Q ss_pred CCCCCCCEEEECCCCCCCCEEEEECCCCEEEEEEEEC--CEEEEEE-CCEEEEEECCCEE Q ss_conf 4344331133317998775379973898899995209--9899998-9989999714122 Q gi|254780764|r 129 KTNNPIYINLYKKPDIQSIIVAKVEPGVLLTIRECSG--EWCFGYN-LDTEGWIKKQKIW 185 (194) Q Consensus 129 ~~~~~~~lnvR~~Ps~~s~iv~~l~~g~~v~i~~~~~--~W~kV~~-~g~~GWV~k~~iw 185 (194) .+...-.+.+|+||+.++.|++.++.|+.|++++.++ +|.+|.. +|.+|||..+||- T Consensus 27 YVSD~l~v~~RsGpg~~yRIv~~l~sGt~V~lLe~~~~~gys~Vr~~~G~eGWV~s~yLs 86 (206) T PRK10884 27 YVSDELNTWVRSGPGDQYRLVGTVNAGEEVTLLQTDANTNYAQIKDSSGRTAWIPLKQLS 86 (206) T ss_pred EEEEEEEEEEECCCCCCCEEEEEECCCCEEEEEEECCCCCEEEEECCCCCEEEEEHHHCC T ss_conf 775259887642899875489886699878997754888858998399978876687624 No 12 >COG4991 Uncharacterized protein with a bacterial SH3 domain homologue [Function unknown] Probab=98.56 E-value=3.4e-08 Score=69.09 Aligned_cols=54 Identities=26% Similarity=0.533 Sum_probs=45.8 Q ss_pred CCEEEEEECCCCCCCCEEEEEECCCEEEEEE--ECCCEEEEEECCCEEEEEECCCCCC Q ss_conf 5636886268988630279983696255542--0121256753253057640233465 Q gi|254780764|r 62 ASRANSRIGPGIMYTVVCTYLTKGLPVEVVK--EYENWRQIRDFDGTIGWINKSLLSG 117 (194) Q Consensus 62 ~~~vNvR~GP~~~y~i~~~~~~~g~~v~vi~--~~~~W~kV~~~~g~~Gwv~~~~ls~ 117 (194) ...+|+|+|||++|+++.... .|..+.|-+ +..+|++|. +.|..||++..++.. T Consensus 39 ~a~vN~RsGPgT~Yp~vg~Ip-~G~~~~i~GC~~g~~wc~v~-~~g~~gW~~s~y~~~ 94 (155) T COG4991 39 TANVNVRSGPGTAYPAVGVIP-SGSAATIYGCLEGSTWCTVS-YGGGRGWAYSRYLQA 94 (155) T ss_pred CCCCEEECCCCCCCCEEEEEC-CCCEECCHHHHCCCCCEEEE-ECCCCCHHHHHHHHH T ss_conf 667423148898886145754-78550231120699847887-558430068888875 No 13 >COG4991 Uncharacterized protein with a bacterial SH3 domain homologue [Function unknown] Probab=98.47 E-value=2e-07 Score=64.31 Aligned_cols=55 Identities=20% Similarity=0.369 Sum_probs=49.9 Q ss_pred CCCCCCEEEECCCCCCCCEEEEECCCCEEEEEEEE--CCEEEEEECCEEEEEECCCE Q ss_conf 34433113331799877537997389889999520--99899998998999971412 Q gi|254780764|r 130 TNNPIYINLYKKPDIQSIIVAKVEPGVLLTIRECS--GEWCFGYNLDTEGWIKKQKI 184 (194) Q Consensus 130 ~~~~~~lnvR~~Ps~~s~iv~~l~~g~~v~i~~~~--~~W~kV~~~g~~GWV~k~~i 184 (194) ..+..++|+|++|+|.+++++.++.|+.+.|-+|- .+||+|++.|..||++.+|+ T Consensus 36 a~~~a~vN~RsGPgT~Yp~vg~Ip~G~~~~i~GC~~g~~wc~v~~~g~~gW~~s~y~ 92 (155) T COG4991 36 ASATANVNVRSGPGTAYPAVGVIPSGSAATIYGCLEGSTWCTVSYGGGRGWAYSRYL 92 (155) T ss_pred HHCCCCCEEECCCCCCCCEEEEECCCCEECCHHHHCCCCCEEEEECCCCCHHHHHHH T ss_conf 213667423148898886145754785502311206998478875584300688888 No 14 >PRK13545 tagH teichoic acids export protein ATP-binding subunit; Provisional Probab=97.44 E-value=0.00044 Score=43.87 Aligned_cols=59 Identities=17% Similarity=0.263 Sum_probs=50.8 Q ss_pred EEECCEEEEEECCCCCCCCEEEEEECCCEEEEEEECC------CEEEEEECCCEEEEEECCCCCCC Q ss_conf 8305636886268988630279983696255542012------12567532530576402334654 Q gi|254780764|r 59 TIKASRANSRIGPGIMYTVVCTYLTKGLPVEVVKEYE------NWRQIRDFDGTIGWINKSLLSGK 118 (194) Q Consensus 59 sik~~~vNvR~GP~~~y~i~~~~~~~g~~v~vi~~~~------~W~kV~~~~g~~Gwv~~~~ls~~ 118 (194) .++++...+|+.|+.+...+.+... |+++.|-++.. +|.++.+.+|..||+...++..- T Consensus 350 ~vn~~~~~ir~~~~~~s~~~~~~~f-g~~f~v~~~~k~~~~~~~w~~~~~~~ge~gwis~~~v~~~ 414 (549) T PRK13545 350 IVNSNGISIREEADASSKRLAIANF-GDIFTISDSNKNEKKDVEWIQITLSNGEIGWISTKFIEPF 414 (549) T ss_pred EEECCCEEEECCCCCCCCEEEEECC-CCEEEEECCCCCCCCCCEEEEEEECCCCCCEEEEEEEEEE T ss_conf 8722761575177867551588627-9779980566431158605999824886330210156641 No 15 >KOG4226 consensus Probab=97.37 E-value=0.0004 Score=44.16 Aligned_cols=103 Identities=17% Similarity=0.309 Sum_probs=63.3 Q ss_pred EEEECCCEEEEEEE-CCCEEEEEECCCEEEEEECCCCCCCCCC---------------CCCCCCCCCCCCCCEEEECCCC Q ss_conf 99836962555420-1212567532530576402334654433---------------3334333434433113331799 Q gi|254780764|r 80 TYLTKGLPVEVVKE-YENWRQIRDFDGTIGWINKSLLSGKRSA---------------IVSPWNRKTNNPIYINLYKKPD 143 (194) Q Consensus 80 ~~~~~g~~v~vi~~-~~~W~kV~~~~g~~Gwv~~~~ls~~~~~---------------~~~~~~~~~~~~~~lnvR~~Ps 143 (194) .+. +|.-+.|.++ .++|++=. ++|..||.-..|....--. .........+-.-.+-|+.-.+ T Consensus 125 sLt-KGtrv~vmEKssDGWWrG~-~ng~VGWFPSNYv~E~~ds~~gd~~s~~~~~~~A~a~n~~~s~vl~vVvaLYsFss 202 (379) T KOG4226 125 SLT-KGTRVTVMEKSSDGWWRGS-YNGQVGWFPSNYVTEEVDSAAGDSPSFLSLRKAASASNGQGSRVLHVVVALYSFSS 202 (379) T ss_pred CCC-CCCEEEEEEECCCCCEECC-CCCEECCCCCCCEEHHCCCCCCCCCCCEECCHHHCCCCCCCCEEEEEEEEEECCCC T ss_conf 332-4757999983468602144-58800101554340102444568865000351120247877458999999831467 Q ss_pred CCCCEEEEECCCCEEEEEEE---ECCEEEEE-ECCEEEEEECCCEE Q ss_conf 87753799738988999952---09989999-89989999714122 Q gi|254780764|r 144 IQSIIVAKVEPGVLLTIREC---SGEWCFGY-NLDTEGWIKKQKIW 185 (194) Q Consensus 144 ~~s~iv~~l~~g~~v~i~~~---~~~W~kV~-~~g~~GWV~k~~iw 185 (194) .+.+- -..++|++++|++. +.+|++-. .+|+.|-|.+.|+- T Consensus 203 sndeE-LsFeKGerleivd~Pe~DPdWwkarn~~G~vGLVPrNYv~ 247 (379) T KOG4226 203 SNDEE-LSFEKGERLEIVDKPENDPDWWKARNARGQVGLVPRNYVV 247 (379) T ss_pred CCHHH-CCCCCCCEEEECCCCCCCCHHHHHCCCCCCCCEEECCEEE T ss_conf 97222-1544585557424887880677640457713201033279 No 16 >COG3807 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.34 E-value=0.00012 Score=47.43 Aligned_cols=55 Identities=18% Similarity=0.274 Sum_probs=47.9 Q ss_pred CCCEEEECCCCCCCCEE-EEECCCCEEEEEEEECCEEEEE-ECCEEEEEECCCEEEE Q ss_conf 33113331799877537-9973898899995209989999-8998999971412205 Q gi|254780764|r 133 PIYINLYKKPDIQSIIV-AKVEPGVLLTIRECSGEWCFGY-NLDTEGWIKKQKIWGI 187 (194) Q Consensus 133 ~~~lnvR~~Ps~~s~iv-~~l~~g~~v~i~~~~~~W~kV~-~~g~~GWV~k~~iwGv 187 (194) ...+|.|.+|+++|.+. -.+.+|.-|+|....++|-+|. .+|..|||+++-+.|- T Consensus 41 s~~VN~R~GP~~~yav~W~y~k~GlPVEIvqEy~~WRrirDadG~egWv~qsllsG~ 97 (171) T COG3807 41 SAEVNARVGPGTDYAVEWVYLKKGLPVEIVQEYDNWRRIRDADGTEGWVHQSLLSGK 97 (171) T ss_pred CCCEECCCCCCCCCEEEEEEECCCCCEEHHHHHHHHHHEECCCCCCEEEEEECCCCC T ss_conf 110431438984401578663069853315566436521078887102360010675 No 17 >PRK13545 tagH teichoic acids export protein ATP-binding subunit; Provisional Probab=96.97 E-value=0.0015 Score=40.64 Aligned_cols=65 Identities=22% Similarity=0.201 Sum_probs=53.6 Q ss_pred CCCCCCCCEEEECCCCCCCCEEEEECCCCEEEEEEEE------CCEEEEEE-CCEEEEEECCCEEEECCCCE Q ss_conf 3434433113331799877537997389889999520------99899998-99899997141220566760 Q gi|254780764|r 128 RKTNNPIYINLYKKPDIQSIIVAKVEPGVLLTIRECS------GEWCFGYN-LDTEGWIKKQKIWGIYPGEV 192 (194) Q Consensus 128 ~~~~~~~~lnvR~~Ps~~s~iv~~l~~g~~v~i~~~~------~~W~kV~~-~g~~GWV~k~~iwGv~~~e~ 192 (194) ..+++.....+|+.|+.+++.++.++=|+.+.+.+.+ -+|.++.+ +|.+|||+..++--+--+-. T Consensus 348 ~~~vn~~~~~ir~~~~~~s~~~~~~~fg~~f~v~~~~k~~~~~~~w~~~~~~~ge~gwis~~~v~~~~~~~~ 419 (549) T PRK13545 348 KYIVNSNGISIREEADASSKRLAIANFGDIFTISDSNKNEKKDVEWIQITLSNGEIGWISTKFIEPFKSNNN 419 (549) T ss_pred EEEEECCCEEEECCCCCCCCEEEEECCCCEEEEECCCCCCCCCCEEEEEEECCCCCCEEEEEEEEEECCCCC T ss_conf 478722761575177867551588627977998056643115860599982488633021015664146861 No 18 >pfam08460 SH3_5 Bacterial SH3 domain. Probab=96.39 E-value=0.013 Score=34.87 Aligned_cols=48 Identities=6% Similarity=0.124 Sum_probs=33.0 Q ss_pred CCCEEEECCCCCCCCEEEEECCCCEEEEEE--EECC--EEEE-EECCEEEEEE Q ss_conf 331133317998775379973898899995--2099--8999-9899899997 Q gi|254780764|r 133 PIYINLYKKPDIQSIIVAKVEPGVLLTIRE--CSGE--WCFG-YNLDTEGWIK 180 (194) Q Consensus 133 ~~~lnvR~~Ps~~s~iv~~l~~g~~v~i~~--~~~~--W~kV-~~~g~~GWV~ 180 (194) ...+|||.+|++++++++.+++|+.|.--+ +.++ |..- .++|.+-||. T Consensus 12 ~~~~~Ir~~p~~ssp~~~~~~~G~~V~YD~v~~~dGy~WisY~s~sG~r~Yv~ 64 (65) T pfam08460 12 TKTTPIKGGPSLSAPVIGTLYKGDTVYYDQVLTADGYVWLSYTSYSGVRRYVP 64 (65) T ss_pred CCCCEEECCCCCCCCEEEEECCCCEEEECEEEECCCCEEEEEECCCCEEEEEE T ss_conf 63006854876789468998999999838899809909999988899899988 No 19 >KOG4226 consensus Probab=95.86 E-value=0.012 Score=34.98 Aligned_cols=104 Identities=15% Similarity=0.270 Sum_probs=67.5 Q ss_pred ECCCEEEEEEECCCEEEEEECCCEEEEEECCCCCCCCCCCCCCC------CCCCCCCCCEEEE---CCCCCCCC------ Q ss_conf 36962555420121256753253057640233465443333343------3343443311333---17998775------ Q gi|254780764|r 83 TKGLPVEVVKEYENWRQIRDFDGTIGWINKSLLSGKRSAIVSPW------NRKTNNPIYINLY---KKPDIQSI------ 147 (194) Q Consensus 83 ~~g~~v~vi~~~~~W~kV~~~~g~~Gwv~~~~ls~~~~~~~~~~------~~~~~~~~~lnvR---~~Ps~~s~------ 147 (194) ++++.+-.++....|++|+..-+.+|+|-..|+..+......+- ..+..+...--.| ..|++++. T Consensus 24 kkner~~llddsk~wwrvrns~n~tgyvpsnyverkn~~~~~si~knlkd~~g~kt~~k~s~~~~~~sp~~d~~~ppd~~ 103 (379) T KOG4226 24 KKNERLWLLDDSKTWWRVRNSANRTGYVPSNYVERKNSLKKGSIVKNLKDTLGLKTRRKTSARDASPSPSTDAEYPPDRI 103 (379) T ss_pred CCCCEEEEECCCCCCEEEECCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCHH T ss_conf 23615899717734033303445577666104311463135678876652336311468984446789875455893124 Q ss_pred -------EE-----------EEECCCCEEEEEE-EECCEEEEEECCEEEEEECCCEEE Q ss_conf -------37-----------9973898899995-209989999899899997141220 Q gi|254780764|r 148 -------IV-----------AKVEPGVLLTIRE-CSGEWCFGYNLDTEGWIKKQKIWG 186 (194) Q Consensus 148 -------iv-----------~~l~~g~~v~i~~-~~~~W~kV~~~g~~GWV~k~~iwG 186 (194) ++ -.|-+|+.|.+++ |.++|.+=+++|+.||-.+.|+-- T Consensus 104 ~~~~tpAvVKf~Y~a~~eDELsLtKGtrv~vmEKssDGWWrG~~ng~VGWFPSNYv~E 161 (379) T KOG4226 104 YDLNTPAVVKFNYVAEREDELSLTKGTRVTVMEKSSDGWWRGSYNGQVGWFPSNYVTE 161 (379) T ss_pred HHCCCCEEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCEECCCCCEECCCCCCCEEH T ss_conf 3338842899851124665003324757999983468602144588001015543401 No 20 >smart00326 SH3 Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations. Probab=95.50 E-value=0.028 Score=32.80 Aligned_cols=36 Identities=19% Similarity=0.281 Sum_probs=27.6 Q ss_pred EEEECCCCEEEEEEE-ECCEEEEEEC-CEEEEEECCCE Q ss_conf 799738988999952-0998999989-98999971412 Q gi|254780764|r 149 VAKVEPGVLLTIREC-SGEWCFGYNL-DTEGWIKKQKI 184 (194) Q Consensus 149 v~~l~~g~~v~i~~~-~~~W~kV~~~-g~~GWV~k~~i 184 (194) --.+.+|+.+.|++. +++|++++.. |..||++.+|+ T Consensus 18 eL~~~~Gd~v~v~~~~~~~W~~~~~~~g~~G~vP~~~v 55 (58) T smart00326 18 ELSFKKGDIITVLEKSDDGWWKGRLGRGKEGLFPSNYV 55 (58) T ss_pred EEEECCCCEEEEEEEECCCEEEEEECCCCEEEEEHHHE T ss_conf 87383899999999818997999979999999907789 No 21 >pfam07653 SH3_2 Variant SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel. Probab=95.41 E-value=0.023 Score=33.38 Aligned_cols=35 Identities=14% Similarity=0.334 Sum_probs=29.8 Q ss_pred EEECCCCEEEEEEE-ECCEEEEEECCEEEEEECCCE Q ss_conf 99738988999952-099899998998999971412 Q gi|254780764|r 150 AKVEPGVLLTIREC-SGEWCFGYNLDTEGWIKKQKI 184 (194) Q Consensus 150 ~~l~~g~~v~i~~~-~~~W~kV~~~g~~GWV~k~~i 184 (194) -.+.+|+++.|++. +++|++.+.+|+.||++.+|| T Consensus 16 L~f~~Gdii~v~~~~~~~Ww~g~~~g~~G~fP~~yV 51 (53) T pfam07653 16 LTLKKGDVVKVLDKDDGGWWEGERGGRRGLVPSSYV 51 (53) T ss_pred CCCCCCCEEEEEEECCCCEEEEEECCEEEEEEHHHC T ss_conf 789899999999954999389998997899858988 No 22 >cd00174 SH3 Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies. Probab=94.94 E-value=0.054 Score=31.07 Aligned_cols=35 Identities=17% Similarity=0.326 Sum_probs=27.6 Q ss_pred EEECCCCEEEEEEE-ECCEEEEEECC-EEEEEECCCE Q ss_conf 99738988999952-09989999899-8999971412 Q gi|254780764|r 150 AKVEPGVLLTIREC-SGEWCFGYNLD-TEGWIKKQKI 184 (194) Q Consensus 150 ~~l~~g~~v~i~~~-~~~W~kV~~~g-~~GWV~k~~i 184 (194) -.+.+|+.+.|++. +++|++++..+ .+||++.+|+ T Consensus 16 L~~~~Gd~v~v~~~~~~~W~~~~~~~~~~G~~P~~yv 52 (54) T cd00174 16 LSFKKGDIIEVLEKSDDGWWEGRLLGGKRGLFPSNYV 52 (54) T ss_pred EEECCCCEEEEEEECCCCEEEEEECCCCEEEEEHHHE T ss_conf 5185899999989847997999989998999945979 No 23 >KOG1428 consensus Probab=93.49 E-value=0.53 Score=25.02 Aligned_cols=118 Identities=16% Similarity=0.228 Sum_probs=74.4 Q ss_pred EEEEEECCEEEEEECCCCCCCCEEEEEECCCEEEEEEECC----CEEEEEECCCE-----EEEEECCCCCCC----CCCC Q ss_conf 1678305636886268988630279983696255542012----12567532530-----576402334654----4333 Q gi|254780764|r 56 RFVTIKASRANSRIGPGIMYTVVCTYLTKGLPVEVVKEYE----NWRQIRDFDGT-----IGWINKSLLSGK----RSAI 122 (194) Q Consensus 56 r~~sik~~~vNvR~GP~~~y~i~~~~~~~g~~v~vi~~~~----~W~kV~~~~g~-----~Gwv~~~~ls~~----~~~~ 122 (194) .|++--...+.+|.+|+++..-++.+.. |..++-|+|-+ -|.+..|..-. -|+-..+-|+-. +..+ T Consensus 2167 ~~~akD~aGlRiR~~~sLqSeqiGiv~~-~~~itFIdEi~NDDG~WlRLnDET~~~Y~~N~G~~eaWCLsf~q~L~ksLl 2245 (3738) T KOG1428 2167 KFVAKDSAGLRIRSHPSLQSEQIGIVKV-NGTITFIDEIHNDDGVWLRLNDETIKKYVPNMGYTEAWCLSFNQHLGKSLL 2245 (3738) T ss_pred EEECCCCCCCEEECCCCCCCCEEEEEEC-CCEEEEHHHHCCCCCEEEEECHHHHHHHCCCCCCCHHHHHHCCHHCCCCCC T ss_conf 0012466550653377432224537970-877986456327886399844688987367877212133010101063435 Q ss_pred C---------------------------------CCCCCCCCCCCCEEEECCCCCCCCEEEEECCCCEEEEEE----EEC Q ss_conf 3---------------------------------343334344331133317998775379973898899995----209 Q gi|254780764|r 123 V---------------------------------SPWNRKTNNPIYINLYKKPDIQSIIVAKVEPGVLLTIRE----CSG 165 (194) Q Consensus 123 ~---------------------------------~~~~~~~~~~~~lnvR~~Ps~~s~iv~~l~~g~~v~i~~----~~~ 165 (194) + ..........+..|+|+-|+..+-++|.+--|..+...+ ..| T Consensus 2246 ~~~~esD~~K~in~~~~q~at~qe~d~~~~~~~~~~~k~v~t~~s~~ni~S~Pn~~~i~~G~~vlG~kv~a~G~vtN~~G 2325 (3738) T KOG1428 2246 VPVDESDFFKDINSCCPQEATMQEQDMPFLRGGPGMYKVVKTGPSGHNIRSCPNLRGIPIGMLVLGNKVKAVGEVTNSEG 2325 (3738) T ss_pred CCCCHHHHHHHHHHHCCHHHHHHCCCCCCCCCCCCEEEEEECCCCCCCEECCCCCCCCCCCCEEECCEEEEEEEEECCCC T ss_conf 65550577777765082666642268662137887047984499856200389735674420343560123145623773 Q ss_pred CEEEEEECC Q ss_conf 989999899 Q gi|254780764|r 166 EWCFGYNLD 174 (194) Q Consensus 166 ~W~kV~~~g 174 (194) -|.+..-+. T Consensus 2326 tWvqL~~~s 2334 (3738) T KOG1428 2326 TWVQLDQNS 2334 (3738) T ss_pred EEEEECCCE T ss_conf 599953735 No 24 >KOG4278 consensus Probab=93.14 E-value=0.062 Score=30.71 Aligned_cols=44 Identities=25% Similarity=0.525 Sum_probs=28.7 Q ss_pred ECCCEEEEEEE--CCCEEEEEECCCEEEEEECCCCCCCCCCCCCCCC Q ss_conf 36962555420--1212567532530576402334654433333433 Q gi|254780764|r 83 TKGLPVEVVKE--YENWRQIRDFDGTIGWINKSLLSGKRSAIVSPWN 127 (194) Q Consensus 83 ~~g~~v~vi~~--~~~W~kV~~~~g~~Gwv~~~~ls~~~~~~~~~~~ 127 (194) .+|+.+.|++- .+.|...+..+|. |||-..|+....+.....+. T Consensus 110 tKGeklRvLGYN~NgEWcEartKNGq-GWVPSNyItPvNSLeKhsWY 155 (1157) T KOG4278 110 TKGEKLRVLGYNKNGEWCEARTKNGQ-GWVPSNYITPVNSLEKHSWY 155 (1157) T ss_pred ECCCEEEEEEECCCCCEEEEECCCCC-CCCCCCCCCCCCCHHHCCCC T ss_conf 03765788641478860000025887-55664444655213220300 No 25 >PRK13914 invasion associated secreted endopeptidase; Provisional Probab=91.95 E-value=0.45 Score=25.47 Aligned_cols=57 Identities=28% Similarity=0.506 Sum_probs=45.4 Q ss_pred EECCEEEEEECCCCCCCCEEEEEECCCEEEEE-EECCCEEEEEECCCEEEEEECCCCCC Q ss_conf 30563688626898863027998369625554-20121256753253057640233465 Q gi|254780764|r 60 IKASRANSRIGPGIMYTVVCTYLTKGLPVEVV-KEYENWRQIRDFDGTIGWINKSLLSG 117 (194) Q Consensus 60 ik~~~vNvR~GP~~~y~i~~~~~~~g~~v~vi-~~~~~W~kV~~~~g~~Gwv~~~~ls~ 117 (194) ...+.+|+|.|++++...+..+. .|..+.|. .+...|.++...+|..+|+...++.. T Consensus 82 ~s~t~~~vksg~~~~~~~~ssi~-~g~~~tv~~~~~~~w~~is~~~g~t~~v~~~~~~~ 139 (475) T PRK13914 82 VSATWLNVRSGAGVDNSIITSIK-GGTKVTVETTESNGWHKITYNDGKTGFVNGKYLTD 139 (475) T ss_pred CCCCCCEECCCCCCCCCCEEEEE-CCCEEEEEECCCCCCCCEECCCCCCEEECCCEEEC T ss_conf 66754033046653455237873-58379997401344200100565301422420202 No 26 >pfam08460 SH3_5 Bacterial SH3 domain. Probab=91.59 E-value=0.49 Score=25.19 Aligned_cols=47 Identities=19% Similarity=0.271 Sum_probs=29.5 Q ss_pred CEEEEEECCCCCCCCEEEEEECCCEEEEEEE----CCCEEEEEECCCEEEEE Q ss_conf 6368862689886302799836962555420----12125675325305764 Q gi|254780764|r 63 SRANSRIGPGIMYTVVCTYLTKGLPVEVVKE----YENWRQIRDFDGTIGWI 110 (194) Q Consensus 63 ~~vNvR~GP~~~y~i~~~~~~~g~~v~vi~~----~~~W~kV~~~~g~~Gwv 110 (194) ..+|+|.+|+++.|++..+. +|+.+.-.+. -.-|..=...+|..-|+ T Consensus 13 ~~~~Ir~~p~~ssp~~~~~~-~G~~V~YD~v~~~dGy~WisY~s~sG~r~Yv 63 (65) T pfam08460 13 KTTPIKGGPSLSAPVIGTLY-KGDTVYYDQVLTADGYVWLSYTSYSGVRRYV 63 (65) T ss_pred CCCEEECCCCCCCCEEEEEC-CCCEEEECEEEECCCCEEEEEECCCCEEEEE T ss_conf 30068548767894689989-9999983889980990999998889989998 No 27 >KOG0515 consensus Probab=91.46 E-value=0.062 Score=30.73 Aligned_cols=39 Identities=26% Similarity=0.550 Sum_probs=30.7 Q ss_pred ECCCCEEEEEEEEC----CEEEEEECCEEEEEECCCEEEECCCC Q ss_conf 73898899995209----98999989989999714122056676 Q gi|254780764|r 152 VEPGVLLTIRECSG----EWCFGYNLDTEGWIKKQKIWGIYPGE 191 (194) Q Consensus 152 l~~g~~v~i~~~~~----~W~kV~~~g~~GWV~k~~iwGv~~~e 191 (194) ...|+.++|+..++ +|.--..+|.+|||.+.|| |+||.= T Consensus 702 f~eGd~lTvirr~d~~eteWWwa~lng~eGyVPRnyl-gLyPri 744 (752) T KOG0515 702 FDEGDELTVIRRDDEVETEWWWARLNGEEGYVPRNYL-GLYPRI 744 (752) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHCCCCCCCCHHHH-HCCCCC T ss_conf 4678636887427840334466764685665531232-027545 No 28 >smart00326 SH3 Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations. Probab=91.09 E-value=0.26 Score=26.89 Aligned_cols=36 Identities=19% Similarity=0.339 Sum_probs=30.9 Q ss_pred EEEECCCEEEEEEE-CCCEEEEEECCCEEEEEECCCCC Q ss_conf 99836962555420-12125675325305764023346 Q gi|254780764|r 80 TYLTKGLPVEVVKE-YENWRQIRDFDGTIGWINKSLLS 116 (194) Q Consensus 80 ~~~~~g~~v~vi~~-~~~W~kV~~~~g~~Gwv~~~~ls 116 (194) .+. +|..+.|+++ .++|++++..+|..||+..+++. T Consensus 20 ~~~-~Gd~v~v~~~~~~~W~~~~~~~g~~G~vP~~~v~ 56 (58) T smart00326 20 SFK-KGDIITVLEKSDDGWWKGRLGRGKEGLFPSNYVE 56 (58) T ss_pred EEC-CCCEEEEEEEECCCEEEEEECCCCEEEEEHHHEE T ss_conf 383-8999999998189979999799999999077899 No 29 >KOG4225 consensus Probab=88.88 E-value=0.64 Score=24.51 Aligned_cols=102 Identities=16% Similarity=0.172 Sum_probs=49.8 Q ss_pred EEEECCCEEEEEEE-CCCEEEEEECCCEEEEEECCCCCCCCCCCCCCCCC--CCCCCCCEEEECCCCC--CCCEEEEECC Q ss_conf 99836962555420-12125675325305764023346544333334333--4344331133317998--7753799738 Q gi|254780764|r 80 TYLTKGLPVEVVKE-YENWRQIRDFDGTIGWINKSLLSGKRSAIVSPWNR--KTNNPIYINLYKKPDI--QSIIVAKVEP 154 (194) Q Consensus 80 ~~~~~g~~v~vi~~-~~~W~kV~~~~g~~Gwv~~~~ls~~~~~~~~~~~~--~~~~~~~lnvR~~Ps~--~s~iv~~l~~ 154 (194) .+. +|..+-|+.+ -++|+... ++|..|.+-.+|+-............ -......---++.++. ++.+--.+.+ T Consensus 248 ~~~-kGDIVyI~rkvD~nWyeGE-hhGr~GifP~sYvE~~~~~e~~qP~~~~P~q~~~~g~a~A~y~F~~~s~~Els~~k 325 (489) T KOG4225 248 PFN-KGDIVYILRKVDQNWYEGE-HHGRVGIFPASYVEILTPREKAQPARPPPQQVLEQGEAIAKYNFNADSPVELSLRK 325 (489) T ss_pred CCC-CCCEEEEEEECCCCEEEEE-ECCEECCEECHHEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCC T ss_conf 647-8888999760247521011-15621110010135347300137677996555555530014787777750001036 Q ss_pred CCEEEEEE-EECCEEEEEEC--CEEEEEECCC Q ss_conf 98899995-20998999989--9899997141 Q gi|254780764|r 155 GVLLTIRE-CSGEWCFGYNL--DTEGWIKKQK 183 (194) Q Consensus 155 g~~v~i~~-~~~~W~kV~~~--g~~GWV~k~~ 183 (194) |+.+.+.. .+++|+.=... |..|--.-+| T Consensus 326 ge~v~L~r~vd~nw~eG~i~g~~rqgifP~Sy 357 (489) T KOG4225 326 GERVTLTRQVDENWYEGKIPGTNRQGIFPASY 357 (489) T ss_pred CCEEEEEEECCCCEEECCCCCCCCCCCCCHHH T ss_conf 85678877415754530023546456565467 No 30 >KOG1029 consensus Probab=84.73 E-value=1.5 Score=22.21 Aligned_cols=34 Identities=26% Similarity=0.458 Sum_probs=26.2 Q ss_pred ECCCCEEEEEEEECCEEEEEECCEEEEEECCCEE Q ss_conf 7389889999520998999989989999714122 Q gi|254780764|r 152 VEPGVLLTIRECSGEWCFGYNLDTEGWIKKQKIW 185 (194) Q Consensus 152 l~~g~~v~i~~~~~~W~kV~~~g~~GWV~k~~iw 185 (194) ..+|+.++|++.++.|..=+..|.+||-.++|+= T Consensus 832 Fskgd~I~VlekqemwW~G~v~g~~GwFPksYVk 865 (1118) T KOG1029 832 FSKGDTITVLEKQEMWWFGEVAGEIGWFPKSYVK 865 (1118) T ss_pred CCCCCEEEEEHHCCCEECCCCCCCCCCCCHHHHH T ss_conf 4678846532000563004536766757377652 No 31 >KOG2070 consensus Probab=78.78 E-value=1.2 Score=22.84 Aligned_cols=32 Identities=22% Similarity=0.522 Sum_probs=17.0 Q ss_pred CCCEEEEEEE-CCCEEEEEECCCEEEEEECCCCC Q ss_conf 6962555420-12125675325305764023346 Q gi|254780764|r 84 KGLPVEVVKE-YENWRQIRDFDGTIGWINKSLLS 116 (194) Q Consensus 84 ~g~~v~vi~~-~~~W~kV~~~~g~~Gwv~~~~ls 116 (194) +|..+.|... .|+|+.=. .+|.+||.-..|+. T Consensus 38 KgDvItVTq~eeGGWWEGT-lng~TGWFPsnYV~ 70 (661) T KOG2070 38 KGDVITVTQVEEGGWWEGT-LNGRTGWFPSNYVR 70 (661) T ss_pred CCCEEEEEEECCCCCEECC-CCCCCCCCCHHHHH T ss_conf 6887899875037611122-06846746258888 No 32 >pfam00018 SH3_1 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel. Probab=76.92 E-value=3.3 Score=20.16 Aligned_cols=29 Identities=24% Similarity=0.347 Sum_probs=16.5 Q ss_pred ECCCCEEEEEEE-ECCEEEEEE-CCEEEEEE Q ss_conf 738988999952-099899998-99899997 Q gi|254780764|r 152 VEPGVLLTIREC-SGEWCFGYN-LDTEGWIK 180 (194) Q Consensus 152 l~~g~~v~i~~~-~~~W~kV~~-~g~~GWV~ 180 (194) +.+|+.+.|++. +.+|++++. +|.+||++ T Consensus 16 ~~~Gd~i~v~~~~~~~Ww~g~~~~~~~G~~P 46 (47) T pfam00018 16 FKKGDIIIVLEKSDDGWWKGRLKGGGEGLIP 46 (47) T ss_pred CCCCCEEEEEEECCCCEEEEEECCCCEEEEC T ss_conf 9899999994862999189997999999972 No 33 >KOG4792 consensus Probab=71.56 E-value=6.9 Score=18.19 Aligned_cols=119 Identities=14% Similarity=0.130 Sum_probs=66.6 Q ss_pred EEEEE---ECCCCCCCCEEEEEECCCEEEEEEE-CCCEEEEEECCCEEEEEECCCCCCCCC------CCCCC-------- Q ss_conf 36886---2689886302799836962555420-121256753253057640233465443------33334-------- Q gi|254780764|r 64 RANSR---IGPGIMYTVVCTYLTKGLPVEVVKE-YENWRQIRDFDGTIGWINKSLLSGKRS------AIVSP-------- 125 (194) Q Consensus 64 ~vNvR---~GP~~~y~i~~~~~~~g~~v~vi~~-~~~W~kV~~~~g~~Gwv~~~~ls~~~~------~~~~~-------- 125 (194) .+-+| .-+|.+-. ..-+ ++|+.++|+++ .+.|+.-+..+|..|.|-..|+..-+. ++... T Consensus 124 ~~~vr~~fdF~G~dee-DLPF-kkGeiL~I~~K~eeqWW~Arns~Gk~GmIPvpYVe~~~~~~~~~~~~~~ga~e~si~q 201 (293) T KOG4792 124 AEYVRALFDFNGNDEE-DLPF-KKGEILRIRDKPEEQWWNARNSEGKRGMIPVPYVEKYRPASASVSALIGGASESSIPQ 201 (293) T ss_pred HEEEEEEECCCCCCCC-CCCC-CCCCEEEEECCCHHHHHHHHCCCCCCCCEECHHHHHHHHHHCCCCCCCCCCCCCCCCC T ss_conf 1001234203798634-4885-5675889845967877554156885352304388875142112013567753133788 Q ss_pred CC------CC-CCCCCC-E-EEEC-----------CCCCCCCEEEEECCCCEEEEEEEE-CCEEEEEECCEEEEEECCCE Q ss_conf 33------34-344331-1-3331-----------799877537997389889999520-99899998998999971412 Q gi|254780764|r 126 WN------RK-TNNPIY-I-NLYK-----------KPDIQSIIVAKVEPGVLLTIRECS-GEWCFGYNLDTEGWIKKQKI 184 (194) Q Consensus 126 ~~------~~-~~~~~~-l-nvR~-----------~Ps~~s~iv~~l~~g~~v~i~~~~-~~W~kV~~~g~~GWV~k~~i 184 (194) .. .. -...+. + +.+. -|+.--+..-.|+.|+.|++.+.. .+-++-+.||+.|-+.-.++ T Consensus 202 ~~g~~e~~s~a~~s~~~~l~l~~~lPa~Arv~q~RVPnAYDkTaL~levGdiVkVTk~ninGqwegelnGk~G~fPfThv 281 (293) T KOG4792 202 SGGGAERFSSASTSSDTPLPLQQNLPAYARVIQKRVPNAYDKTALALEVGDIVKVTKKNINGQWEGELNGKIGHFPFTHV 281 (293) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCHHEEEEHHCCCCCCCHHHHHHHCCCEEEEEEECCCCEEEEEECCCCCCCCCEEE T ss_conf 88988877887744577675555787010110110887557145311028689997602474365444674366552358 No 34 >TIGR02162 torC trimethylamine-N-oxide reductase c-type cytochrome TorC; InterPro: IPR009154 This family includes TorC, a pentahemic c-type cytochrome subunit of periplasmic reductases for trimethylamine-N-oxide (TMAO). The N-terminal half is closely related to tetrahemic NapC (or NirT) subunits of periplasmic nitrate (or nitrite) reductases; some species have both TMAO and nitrate reductase complexes .; GO: 0005506 iron ion binding, 0009055 electron carrier activity, 0020037 heme binding, 0006118 electron transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=70.80 E-value=2.4 Score=21.01 Aligned_cols=111 Identities=18% Similarity=0.248 Sum_probs=58.2 Q ss_pred EEEECCEEEEEECCCCCCC-CEEEEEECCCEEEEEEECCCEEEEEECCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 7830563688626898863-027998369625554201212567532530576402334654433333433343443311 Q gi|254780764|r 58 VTIKASRANSRIGPGIMYT-VVCTYLTKGLPVEVVKEYENWRQIRDFDGTIGWINKSLLSGKRSAIVSPWNRKTNNPIYI 136 (194) Q Consensus 58 ~sik~~~vNvR~GP~~~y~-i~~~~~~~g~~v~vi~~~~~W~kV~~~~g~~Gwv~~~~ls~~~~~~~~~~~~~~~~~~~l 136 (194) +|+..-.+.. ..|+-+-. -.+.|. .=.||+|+.+.+||.+|.- .||-...-.. | ++....+.. -.+. T Consensus 212 yS~~~~~~Y~-~~PsG~~~~~~G~L~-~A~evkvl~~~gD~~qvei----~GW~~s~G~~--R-v~~~~~Gk~---I~~A 279 (394) T TIGR02162 212 YSVRIIPVYK-DKPSGEKKKAAGRLL-PASEVKVLKEKGDWLQVEI----TGWRKSKGFG--R-VIYEKFGKR---IFVA 279 (394) T ss_pred CHHHHHHHHC-CCCCCHHHHHCCCCC-CCCCEEEEEECCCEEEEEE----ECCCCCCCCC--C-CEECCCCCH---HHHH T ss_conf 5055567530-388606898558887-8874378841688789984----0530013662--0-010278860---5686 Q ss_pred EEECCCCCCCCEEEEECCCCEEEEEEEECCEEEEEECCEEEEEECCCEEE Q ss_conf 33317998775379973898899995209989999899899997141220 Q gi|254780764|r 137 NLYKKPDIQSIIVAKVEPGVLLTIRECSGEWCFGYNLDTEGWIKKQKIWG 186 (194) Q Consensus 137 nvR~~Ps~~s~iv~~l~~g~~v~i~~~~~~W~kV~~~g~~GWV~k~~iwG 186 (194) -|+.....+.+.|..++ .-..-.+...|-+|+.. -||.+..+-+ T Consensus 280 ~L~~~~~~~~~~v~~l~---~~~d~~T~~~W~~vs~~---~W~~k~~l~~ 323 (394) T TIGR02162 280 SLAKEAAQSDASVTTLE---KKVDPDTGLTWEQVSAT---VWMKKEDLVN 323 (394) T ss_pred HHHHHHHCCCCEEECCC---CEECCCCCCCHHHHHHH---HHHHHHHHHH T ss_conf 63177733787152176---20115667846666777---6643553230 No 35 >KOG2996 consensus Probab=69.81 E-value=3.2 Score=20.20 Aligned_cols=35 Identities=17% Similarity=0.368 Sum_probs=30.1 Q ss_pred EEECCCCEEEEEEE---ECCEEEEEECCEEEEEECCCE Q ss_conf 99738988999952---099899998998999971412 Q gi|254780764|r 150 AKVEPGVLLTIREC---SGEWCFGYNLDTEGWIKKQKI 184 (194) Q Consensus 150 ~~l~~g~~v~i~~~---~~~W~kV~~~g~~GWV~k~~i 184 (194) -.+..|++|.|+.. +.+|.+=+.+|..||-...|+ T Consensus 822 LSlk~GDvV~i~~k~g~d~GWWkGevngrvGwFPstYV 859 (865) T KOG2996 822 LSLKEGDVVKIYDKVGEDQGWWKGEVNGRVGWFPSTYV 859 (865) T ss_pred CCCCCCCEEEEEHHCCCCCCEECCEECCCCCCCCCCCC T ss_conf 56545878997021366576110213571155310026 No 36 >KOG3557 consensus Probab=63.41 E-value=2.7 Score=20.70 Aligned_cols=38 Identities=16% Similarity=0.286 Sum_probs=29.0 Q ss_pred EECCCCEEEEEEEECCEEEEEE-CCEEEEEECCCEEEEC Q ss_conf 9738988999952099899998-9989999714122056 Q gi|254780764|r 151 KVEPGVLLTIRECSGEWCFGYN-LDTEGWIKKQKIWGIY 188 (194) Q Consensus 151 ~l~~g~~v~i~~~~~~W~kV~~-~g~~GWV~k~~iwGv~ 188 (194) .+.+|+.++++.-..+|.+++. .|+.|||...-|-++- T Consensus 518 sV~k~E~LEvl~d~R~WW~~kn~~G~~GyvP~nIL~~~~ 556 (721) T KOG3557 518 SVKKGEVLEVLDDGRKWWKVKNGHGRAGYVPSNILAPLQ 556 (721) T ss_pred HHHCCCHHHEECCCCCCEECCCCCCCCCCCCHHHHCCCC T ss_conf 331120110001354502214776776776255416688 No 37 >COG3056 Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=59.92 E-value=12 Score=16.80 Aligned_cols=134 Identities=16% Similarity=0.066 Sum_probs=58.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEE-EEECCEEEEEECCCCCCCCEEEEEECCCEEEE Q ss_conf 44678999999999999999999975331000012333555552167-83056368862689886302799836962555 Q gi|254780764|r 12 LDLRKYMPKILQNSLIFTLAIYFYLAPILALSHEKEIFEKKPLPRFV-TIKASRANSRIGPGIMYTVVCTYLTKGLPVEV 90 (194) Q Consensus 12 ~~~~~~~~k~~~~~ll~~l~~~~~~~~~~~~s~~~~~~t~~~~pr~~-sik~~~vNvR~GP~~~y~i~~~~~~~g~~v~v 90 (194) ..+..++.+....++++.++.++++..|.+.+..-.-.-..|+|-.- ++....+-+-..-..+-.-+..+.+.|..+++ T Consensus 4 ~~~~~i~~k~~t~k~L~~laa~~lLagC~a~~~tl~~tpp~~~~sqn~v~~~~~vsit~kD~R~~q~vA~v~~~~~lv~L 83 (204) T COG3056 4 AALCVIESKNMTKKILFPLAAIFLLAGCAAPPTTLIVTPPAPLPSQNPVLMGVTVSITGKDQRSDQAVAKVTRDNQLVKL 83 (204) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCCHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEE T ss_conf 58988761788999999999999997537997503136997664437233141899970550166788762047978983 Q ss_pred EEECC---------------CEEEEEECCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCC--CCEEEEEC Q ss_conf 42012---------------12567532530576402334654433333433343443311333179987--75379973 Q gi|254780764|r 91 VKEYE---------------NWRQIRDFDGTIGWINKSLLSGKRSAIVSPWNRKTNNPIYINLYKKPDIQ--SIIVAKVE 153 (194) Q Consensus 91 i~~~~---------------~W~kV~~~~g~~Gwv~~~~ls~~~~~~~~~~~~~~~~~~~lnvR~~Ps~~--s~iv~~l~ 153 (194) ....+ .-++|-..++..-|+.-..+- .....-|+|..=++. -.|.++.. T Consensus 84 ~as~~vr~llq~vl~~q~~srGfrig~n~~~n~~i~V~qly--------------a~V~qgnlryni~tkv~i~I~A~~~ 149 (204) T COG3056 84 TASRDVRFLLQEVLEKQMTSRGFRVGPNGEVNLQITVDQLY--------------ADVEQGNLRYNIDTKVDITITATAA 149 (204) T ss_pred CCCCCHHHHHHHHHHHHHHHCCEEECCCCCCEEEEEHHHHH--------------HHHHHCCEEECCCCEEEEEEEEECC T ss_conf 47813999999999998653436753689831899849966--------------3556331233025458899999737 Q ss_pred CCCEEE Q ss_conf 898899 Q gi|254780764|r 154 PGVLLT 159 (194) Q Consensus 154 ~g~~v~ 159 (194) +|..++ T Consensus 150 ~Gk~~K 155 (204) T COG3056 150 NGKMVK 155 (204) T ss_pred CCCEEE T ss_conf 871553 No 38 >KOG2546 consensus Probab=56.71 E-value=7.2 Score=18.06 Aligned_cols=31 Identities=16% Similarity=0.314 Sum_probs=24.7 Q ss_pred CCCEEEEEEE-ECCEEEEEECCEEEEEECCCE Q ss_conf 8988999952-099899998998999971412 Q gi|254780764|r 154 PGVLLTIREC-SGEWCFGYNLDTEGWIKKQKI 184 (194) Q Consensus 154 ~g~~v~i~~~-~~~W~kV~~~g~~GWV~k~~i 184 (194) .|.++.+++. +++||..-.+|.+|.-.++|+ T Consensus 444 E~ailyv~kknddgw~EgV~~~VTglFpgnyv 475 (483) T KOG2546 444 EGAILYVLKKNDDGWYEGVQDGVTGLFPGNYV 475 (483) T ss_pred CCCEEEEEEECCCCCHHHEECCCCEECCCCCC T ss_conf 36379999725885021000574131457665 No 39 >KOG0162 consensus Probab=55.16 E-value=7.2 Score=18.06 Aligned_cols=47 Identities=19% Similarity=0.306 Sum_probs=33.6 Q ss_pred EEECCCCCCCCEEEEECCCCEEEEEEEE-CCEEEEEECCEEEEEECCCE Q ss_conf 3331799877537997389889999520-99899998998999971412 Q gi|254780764|r 137 NLYKKPDIQSIIVAKVEPGVLLTIRECS-GEWCFGYNLDTEGWIKKQKI 184 (194) Q Consensus 137 nvR~~Ps~~s~iv~~l~~g~~v~i~~~~-~~W~kV~~~g~~GWV~k~~i 184 (194) -++.-|+.+. ---.+.+|++++|...+ .+|..-..+|++||+...|+ T Consensus 1056 A~Y~y~gq~~-dEls~~~~diIei~~edpSGWw~gk~~~keG~~P~~Yv 1103 (1106) T KOG0162 1056 ALYDYPGQDV-DELSFKKGDIIEIMREDPSGWWLGKLNGKEGLFPGNYV 1103 (1106) T ss_pred EECCCCCCCC-CCCCCCCCCEEEEECCCCCCCHHHCCCCCCCCCCCCCC T ss_conf 2235789980-30462578778872268874201024886665554342 No 40 >pfam11191 DUF2782 Protein of unknown function (DUF2782). This is a bacterial family of proteins whose function is unknown. Probab=51.05 E-value=9.4 Score=17.35 Aligned_cols=50 Identities=14% Similarity=0.114 Sum_probs=34.8 Q ss_pred CCEEEEEECCCCCCCCEEEEEECC--CEEEEEEECCCEEEEEECCCEEEEEECCC Q ss_conf 563688626898863027998369--62555420121256753253057640233 Q gi|254780764|r 62 ASRANSRIGPGIMYTVVCTYLTKG--LPVEVVKEYENWRQIRDFDGTIGWINKSL 114 (194) Q Consensus 62 ~~~vNvR~GP~~~y~i~~~~~~~g--~~v~vi~~~~~W~kV~~~~g~~Gwv~~~~ 114 (194) .-++-++.+-+. .+..|.-+| +.++|+-+.+-||...+.+|...|..... T Consensus 38 epevtI~~~~~~---~ieEYRvnG~ly~ikV~P~~g~pYyl~d~dg~g~~~~~~~ 89 (104) T pfam11191 38 EPEVTIIRDGGA---TIEEYRVNGQLYMIKVQPKVGPPYYLVDADGDGQYSESDA 89 (104) T ss_pred CCEEEEEECCCC---EEEEEEECCEEEEEEEECCCCCCEEEEECCCCCCEEECCC T ss_conf 960799958982---8999977899999998679999869881589876375687 No 41 >PRK03147 thiol-disulfide oxidoreductase; Provisional Probab=49.10 E-value=16 Score=15.89 Aligned_cols=40 Identities=23% Similarity=0.343 Sum_probs=19.6 Q ss_pred CCCCCCCCEEEEEECCEEEEEECCCCCCCCEEEEEECCCEEEEEEECCCEEE Q ss_conf 3355555216783056368862689886302799836962555420121256 Q gi|254780764|r 48 IFEKKPLPRFVTIKASRANSRIGPGIMYTVVCTYLTKGLPVEVVKEYENWRQ 99 (194) Q Consensus 48 ~~t~~~~pr~~sik~~~vNvR~GP~~~y~i~~~~~~~g~~v~vi~~~~~W~k 99 (194) ...|.+.|.|.-...+ |..+..- . .+|.++ ++.-+..|+- T Consensus 36 ~~iG~~aP~f~l~~l~--------G~~v~ls-d--~kGK~v-ll~FWAtWC~ 75 (176) T PRK03147 36 MQVGKEAPNFVLTDLE--------GKKIELK-D--LKGKGV-FLNFWGTWCK 75 (176) T ss_pred CCCCCCCCCCEEECCC--------CCEEEHH-H--HCCCEE-EEEEECCCCC T ss_conf 3379908971627899--------8987189-9--699979-9999789792 No 42 >PRK10559 p-hydroxybenzoic acid efflux subunit AaeA; Provisional Probab=46.79 E-value=19 Score=15.49 Aligned_cols=31 Identities=13% Similarity=0.066 Sum_probs=17.3 Q ss_pred EEEEEECCEEEEEECCCCCCCCEEEEEECCCEE Q ss_conf 167830563688626898863027998369625 Q gi|254780764|r 56 RFVTIKASRANSRIGPGIMYTVVCTYLTKGLPV 88 (194) Q Consensus 56 r~~sik~~~vNvR~GP~~~y~i~~~~~~~g~~v 88 (194) +-+.+.++.+++ .|-.+..|...+.+.|..| T Consensus 39 ddAyV~a~iv~V--sp~VsG~V~eV~V~dnq~V 69 (310) T PRK10559 39 RDARFSADVVAI--APDVSGLITQVNVHDNQLV 69 (310) T ss_pred CCEEEEEEEEEE--ECCCCEEEEEEECCCCCEE T ss_conf 773898579999--3667469999991794987 No 43 >PRK13697 cytochrome c6; Provisional Probab=34.61 E-value=29 Score=14.34 Aligned_cols=26 Identities=23% Similarity=0.259 Sum_probs=8.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999999999999997533100001 Q gi|254780764|r 19 PKILQNSLIFTLAIYFYLAPILALSHE 45 (194) Q Consensus 19 ~k~~~~~ll~~l~~~~~~~~~~~~s~~ 45 (194) .+++...++.+++++ +..++++++++ T Consensus 2 k~l~~~~l~~~~~~~-~~~~~pA~Aad 27 (111) T PRK13697 2 KKILKLVLLTLLLLT-FAFTSPAFAAD 27 (111) T ss_pred HHHHHHHHHHHHHHH-HHCCCHHHHHH T ss_conf 789999999999999-98035544416 No 44 >COG5368 Uncharacterized protein conserved in bacteria [Function unknown] Probab=33.92 E-value=14 Score=16.30 Aligned_cols=19 Identities=37% Similarity=0.706 Sum_probs=15.0 Q ss_pred ECCEEEEEECCCEEEECCCC Q ss_conf 89989999714122056676 Q gi|254780764|r 172 NLDTEGWIKKQKIWGIYPGE 191 (194) Q Consensus 172 ~~g~~GWV~k~~iwGv~~~e 191 (194) ++|.+|||++.++ |+..+- T Consensus 380 ~~g~~GWVs~~~~-gidQGi 398 (451) T COG5368 380 YNGTTGWVSKTHL-GIDQGI 398 (451) T ss_pred CCCCCCCEECCEE-ECCCCC T ss_conf 8898664224402-014422 No 45 >KOG3523 consensus Probab=31.62 E-value=6.5 Score=18.34 Aligned_cols=37 Identities=14% Similarity=0.272 Sum_probs=29.5 Q ss_pred EEEECCCCEEEEEEEE-CCEEEEE--ECCEEEEEECCCEE Q ss_conf 7997389889999520-9989999--89989999714122 Q gi|254780764|r 149 VAKVEPGVLLTIRECS-GEWCFGY--NLDTEGWIKKQKIW 185 (194) Q Consensus 149 v~~l~~g~~v~i~~~~-~~W~kV~--~~g~~GWV~k~~iw 185 (194) --+|+.++++.|+... ++|++=. .+|-.||...+|+- T Consensus 624 el~Le~~Dvv~v~~k~~DGWl~GeRl~Dge~GWFP~~~ve 663 (695) T KOG3523 624 ELTLELADVVNVLQKTPDGWLEGERLRDGERGWFPSSYVE 663 (695) T ss_pred EEEEEHHHHHHHHHCCCCCCEECCCCCCCCCCCCHHHHHH T ss_conf 0010130254554237884320403466766756199999 No 46 >COG4787 FlgF Flagellar basal body rod protein [Cell motility and secretion] Probab=29.61 E-value=36 Score=13.81 Aligned_cols=48 Identities=19% Similarity=0.372 Sum_probs=34.6 Q ss_pred EECCCCCCCCEEEEEECCCEEEEEEECCCEEEEEECCCEEEEEECCCCC Q ss_conf 6268988630279983696255542012125675325305764023346 Q gi|254780764|r 68 RIGPGIMYTVVCTYLTKGLPVEVVKEYENWRQIRDFDGTIGWINKSLLS 116 (194) Q Consensus 68 R~GP~~~y~i~~~~~~~g~~v~vi~~~~~W~kV~~~~g~~Gwv~~~~ls 116 (194) -+-||++.. -+.+..-|-|+.|--+.++|.-|++.+|.++|...--+. T Consensus 59 ~s~pg~d~s-pG~l~~TgR~LDvaiq~DGwlaVq~~dG~EaYTRnG~~q 106 (251) T COG4787 59 ASTPGTDMS-PGSLDYTGRPLDVAIQGDGWLAVQDADGSEAYTRNGNIQ 106 (251) T ss_pred CCCCCCCCC-CCCCCCCCCCCEEEECCCCEEEEECCCCCCHHEECCCEE T ss_conf 146765678-861013688530797368569998688750111157467 No 47 >KOG2222 consensus Probab=29.15 E-value=29 Score=14.38 Aligned_cols=33 Identities=18% Similarity=0.309 Sum_probs=26.4 Q ss_pred ECCCCEEEEEEEECCEEEEE-ECCEEEEEECCCE Q ss_conf 73898899995209989999-8998999971412 Q gi|254780764|r 152 VEPGVLLTIRECSGEWCFGY-NLDTEGWIKKQKI 184 (194) Q Consensus 152 l~~g~~v~i~~~~~~W~kV~-~~g~~GWV~k~~i 184 (194) ..+++.++++...++-|-|. .||.+||-....+ T Consensus 567 frkndiitiisekdehcwvgelnglrgwfpakfv 600 (848) T KOG2222 567 FRKNDIITIISEKDEHCWVGELNGLRGWFPAKFV 600 (848) T ss_pred CCCCCEEEEEECCCCCEEEECCCCCCCCCHHHHH T ss_conf 5436678985237762156302663455619999 No 48 >KOG3837 consensus Probab=26.74 E-value=13 Score=16.50 Aligned_cols=29 Identities=24% Similarity=0.370 Sum_probs=18.1 Q ss_pred EEEEECCCCCCCCEEEEEECCCEEEEEEE Q ss_conf 68862689886302799836962555420 Q gi|254780764|r 65 ANSRIGPGIMYTVVCTYLTKGLPVEVVKE 93 (194) Q Consensus 65 vNvR~GP~~~y~i~~~~~~~g~~v~vi~~ 93 (194) +|+|.|||++..-...+.+.|...+|..+ T Consensus 425 lni~rg~~~nr~fqR~fkr~g~kfeifhk 453 (523) T KOG3837 425 LNIRRGPGLNREFQRRFKRLGKKFEIFHK 453 (523) T ss_pred EECCCCCCCCHHHHHHHHHCCEEEEEEEC T ss_conf 65057876348999999855736888521 No 49 >PRK13861 type IV secretion system protein VirB9; Provisional Probab=24.14 E-value=45 Score=13.20 Aligned_cols=52 Identities=15% Similarity=0.144 Sum_probs=25.6 Q ss_pred CEEEEEEC---CEEEEEECCCCCCCCEEEEEECCCEEEEEEE--CCCEEEEEECCCEEEEEE Q ss_conf 21678305---6368862689886302799836962555420--121256753253057640 Q gi|254780764|r 55 PRFVTIKA---SRANSRIGPGIMYTVVCTYLTKGLPVEVVKE--YENWRQIRDFDGTIGWIN 111 (194) Q Consensus 55 pr~~sik~---~~vNvR~GP~~~y~i~~~~~~~g~~v~vi~~--~~~W~kV~~~~g~~Gwv~ 111 (194) ||..++.. +.+.++..+|....| ++. .|+.++-++- ..+| +|.. .|..-++. T Consensus 32 ~Riq~v~Ynp~~V~~V~~~~G~~T~I--~F~-~dE~I~~v~~Gd~~~W-~v~~-~gN~lfiK 88 (293) T PRK13861 32 PRMRYLAYNPDQVVRLSTAVGATLVV--TFG-ANETVTAVAVSNSKDL-AALP-RGNYLFFK 88 (293) T ss_pred CCCEEEEECCCCEEEEEEECCEEEEE--EEC-CCCEEEEECCCCCCCE-EEEC-CCCEEEEE T ss_conf 77179880888889999987469999--988-9987998615882014-8962-78779996 No 50 >TIGR02924 ICDH_alpha isocitrate dehydrogenase; InterPro: IPR014273 This entry represents a group of isocitrate dehydrogenases found mainly in the alphaproteobacteria. Many of the species containing these proteins appear to have a TCA cycle lacking only a determined isocitrate dehydrogenase. The precise identity of the cofactor (NADH -- 1.1.1.41 from EC vs. NADPH -- 1.1.1.42 from EC) is unclear.. Probab=23.57 E-value=46 Score=13.13 Aligned_cols=27 Identities=37% Similarity=0.543 Sum_probs=13.0 Q ss_pred EEEEECCCCCC-CCEEEEEECCCEEEEEEEC Q ss_conf 68862689886-3027998369625554201 Q gi|254780764|r 65 ANSRIGPGIMY-TVVCTYLTKGLPVEVVKEY 94 (194) Q Consensus 65 vNvR~GP~~~y-~i~~~~~~~g~~v~vi~~~ 94 (194) +|+| |..+| |++.+.. .++-+.|+-|. T Consensus 92 ANiR--PCvsyhP~i~t~~-p~ldiVivREN 119 (481) T TIGR02924 92 ANIR--PCVSYHPVIETKY-PNLDIVIVREN 119 (481) T ss_pred EECC--CCCCCCCEECCCC-CCCCEEEEEEC T ss_conf 4126--6530035241677-98346999835 No 51 >KOG3669 consensus Probab=22.79 E-value=47 Score=13.04 Aligned_cols=45 Identities=22% Similarity=0.377 Sum_probs=29.7 Q ss_pred CCEEEEECCCCEEEEEEEECCEEEEEECCEEEEEE---------CCCEEEECCCCE Q ss_conf 75379973898899995209989999899899997---------141220566760 Q gi|254780764|r 146 SIIVAKVEPGVLLTIRECSGEWCFGYNLDTEGWIK---------KQKIWGIYPGEV 192 (194) Q Consensus 146 s~iv~~l~~g~~v~i~~~~~~W~kV~~~g~~GWV~---------k~~iwGv~~~e~ 192 (194) ..|-.+.+.|..-+.+.|.+.|.-|...|. |+. +-.+-|.|.||. T Consensus 353 ~GVt~~~P~Gk~w~liqc~~~wisv~~sgv--~i~s~k~~~~~~kgS~~G~l~n~~ 406 (705) T KOG3669 353 EGVTDKLPMGKWWQLIQCQPSWISVNNSGV--WISSGKNEFHVAKGSLIGTLWNHV 406 (705) T ss_pred CCCCCCCCCCCCEEEEECCCEEEEEECCEE--EEEECCCCEEECCCCCCEEEECCC T ss_conf 066656765510221104742676504407--998057712311043100551576 No 52 >KOG3601 consensus Probab=22.46 E-value=12 Score=16.78 Aligned_cols=34 Identities=21% Similarity=0.327 Sum_probs=15.7 Q ss_pred CCCEEEEEEEE-CCEEEEEECCEEEEEECCCEEEE Q ss_conf 89889999520-99899998998999971412205 Q gi|254780764|r 154 PGVLLTIRECS-GEWCFGYNLDTEGWIKKQKIWGI 187 (194) Q Consensus 154 ~g~~v~i~~~~-~~W~kV~~~g~~GWV~k~~iwGv 187 (194) .|+..+++... ..|-.=+..|..|--...|+-|. T Consensus 184 ~gq~~~v~~~ss~~ww~Gs~lg~agiFpagyv~p~ 218 (222) T KOG3601 184 RGQQIQVLDSSSPFWWFGSKLGRAGIFPAGYVAPS 218 (222) T ss_pred CCCCCEEECCCCCCHHHCCCCCCEEEECCCCCCCC T ss_conf 58843660578831221233674034057654554 No 53 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=22.25 E-value=49 Score=12.97 Aligned_cols=24 Identities=13% Similarity=-0.078 Sum_probs=12.9 Q ss_pred EEEECCCCEEE-EEEEECCEEEEEE Q ss_conf 79973898899-9952099899998 Q gi|254780764|r 149 VAKVEPGVLLT-IRECSGEWCFGYN 172 (194) Q Consensus 149 v~~l~~g~~v~-i~~~~~~W~kV~~ 172 (194) .-.|..|.... ....+-+|+=|+. T Consensus 209 af~Lk~geis~~PvkT~~GYhIIK~ 233 (298) T PRK04405 209 AFKLKNGEYTTTPVKTTYGYEVIKM 233 (298) T ss_pred HHCCCCCCCCCCCEECCCCEEEEEE T ss_conf 9748899816884521673399998 No 54 >TIGR00316 cdhC CO dehydrogenase/acetyl-CoA synthase complex, beta subunit; InterPro: IPR004461 The carbon monoxide dehydrogenase alpha subunit (1.2.99.2 from EC) catalyses the interconversion of CO and CO2 and the synthesis of acteyl-coA from the methylated corrinoid/iron sulphur protein, CO and CoA. Nomenclature follows the description for Methanosarcina thermophila. The complex is also found in Archaeoglobus fulgidus, not considered a methanogen, but is otherwise generally associated with methanogenesis.; GO: 0018492 carbon-monoxide dehydrogenase (acceptor) activity, 0006084 acetyl-CoA metabolic process. Probab=22.08 E-value=49 Score=12.95 Aligned_cols=70 Identities=16% Similarity=0.297 Sum_probs=39.5 Q ss_pred CCCEEEEEEECC--CEEEEEECCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEEECCCCEE--- Q ss_conf 696255542012--125675325305764023346544333334333434433113331799877537997389889--- Q gi|254780764|r 84 KGLPVEVVKEYE--NWRQIRDFDGTIGWINKSLLSGKRSAIVSPWNRKTNNPIYINLYKKPDIQSIIVAKVEPGVLL--- 158 (194) Q Consensus 84 ~g~~v~vi~~~~--~W~kV~~~~g~~Gwv~~~~ls~~~~~~~~~~~~~~~~~~~lnvR~~Ps~~s~iv~~l~~g~~v--- 158 (194) -++|.+-.+=|+ ..|--. -|| .||+|+.|--.....++-+ ..-|+-.=|.+| T Consensus 274 f~~PhTSCGCFEai~FYiPE-vDG-iG~vhR~y~getP~GlpFS---------------------tlAGQcsGGKQV~GF 330 (496) T TIGR00316 274 FEYPHTSCGCFEAIVFYIPE-VDG-IGIVHRGYRGETPLGLPFS---------------------TLAGQCSGGKQVPGF 330 (496) T ss_pred CCCCCCCCCCEEEEEEECCC-CCC-CEEEECCCCCCCCCCCCHH---------------------HHHCCCCCCEEECCE T ss_conf 37788745642687884355-054-0037638778888887135---------------------440127887250242 Q ss_pred -----------EEEEEECCEEEEEECCEEEEEECC Q ss_conf -----------999520998999989989999714 Q gi|254780764|r 159 -----------TIRECSGEWCFGYNLDTEGWIKKQ 182 (194) Q Consensus 159 -----------~i~~~~~~W~kV~~~g~~GWV~k~ 182 (194) +.+..+|+|+||- |+.|+ T Consensus 331 ~Gisi~YmrSpKFlQaDGGw~Rvv------WlPkE 359 (496) T TIGR00316 331 VGISISYMRSPKFLQADGGWERVV------WLPKE 359 (496) T ss_pred ECCCEEECCCCCCEECCCCEEEEE------ECCCH T ss_conf 313211203775033588616878------56712 Done!