Query gi|254780443|ref|YP_003064856.1| hypothetical protein CLIBASIA_01640 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 141 No_of_seqs 100 out of 102 Neff 5.4 Searched_HMMs 39220 Date Sun May 29 19:07:00 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780443.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam06282 DUF1036 Protein of u 100.0 0 0 311.6 13.2 115 20-135 1-115 (115) 2 COG5480 Predicted integral mem 100.0 4.3E-43 0 272.9 9.5 119 18-137 28-146 (147) 3 PRK01742 tolB translocation pr 78.7 2.9 7.3E-05 21.8 3.7 66 1-75 3-70 (430) 4 PRK02889 tolB translocation pr 73.4 6.3 0.00016 19.8 4.3 117 1-127 4-126 (430) 5 PRK11528 hypothetical protein; 72.4 8 0.0002 19.2 6.1 81 1-100 1-81 (254) 6 pfam10697 DUF2502 Protein of u 71.7 2.1 5.2E-05 22.7 1.4 47 9-59 3-56 (96) 7 PRK04792 tolB translocation pr 68.3 7.6 0.00019 19.3 3.7 67 1-75 1-69 (450) 8 PRK11408 hypothetical protein; 51.8 20 0.0005 16.9 4.4 40 26-67 45-84 (219) 9 TIGR03431 PhnD phosphonate ABC 49.5 18 0.00045 17.2 2.9 23 1-24 1-23 (288) 10 KOG1113 consensus 48.0 20 0.00051 16.9 3.0 70 49-126 162-231 (368) 11 CHL00132 psaF photosystem I su 45.8 17 0.00044 17.2 2.4 33 1-33 1-37 (185) 12 pfam06650 DUF1162 Protein of u 45.3 25 0.00063 16.3 10.8 99 24-138 119-219 (268) 13 pfam10946 DUF2625 Protein of u 45.3 25 0.00063 16.3 4.3 40 26-67 42-81 (208) 14 PRK09926 putative chaperone pr 41.4 18 0.00047 17.1 2.0 79 4-83 5-120 (245) 15 PRK05137 tolB translocation pr 41.2 27 0.0007 16.1 2.9 64 7-75 8-73 (437) 16 PRK01029 tolB translocation pr 40.7 29 0.00075 15.9 3.8 40 1-40 1-44 (425) 17 smart00205 THN Thaumatin famil 38.7 26 0.00065 16.2 2.4 38 25-64 1-38 (218) 18 PRK03757 hypothetical protein; 38.5 32 0.00081 15.7 3.0 59 1-60 1-65 (191) 19 pfam11730 DUF3297 Protein of u 37.1 15 0.00037 17.7 0.9 16 47-62 37-52 (71) 20 TIGR02538 type_IV_pilB type IV 36.3 15 0.00038 17.6 0.9 38 23-64 462-507 (577) 21 PRK13874 conjugal transfer pro 34.8 29 0.00075 15.9 2.2 24 10-33 14-37 (233) 22 PRK12789 flgI flagellar basal 31.1 41 0.001 15.1 2.4 40 1-41 1-42 (367) 23 COG5664 Predicted secreted Zn- 29.3 43 0.0011 14.9 2.3 29 1-29 4-32 (201) 24 PHA02099 hypothetical protein 27.2 41 0.0011 15.0 1.9 31 55-85 27-57 (84) 25 cd01776 Rin1_RA Rin1_RA RIN1 26.6 28 0.00072 16.0 1.0 31 48-78 50-85 (87) 26 PRK13697 cytochrome c6; Provis 25.3 44 0.0011 14.9 1.7 19 5-24 8-26 (111) 27 TIGR01855 IMP_synth_hisH imida 24.1 43 0.0011 14.9 1.5 49 50-101 126-179 (211) 28 PRK04922 tolB translocation pr 22.5 61 0.0015 14.0 4.1 66 4-75 6-73 (439) 29 PRK10894 hypothetical protein; 21.9 54 0.0014 14.4 1.7 50 8-60 12-71 (184) 30 pfam07172 GRP Glycine rich pro 21.8 28 0.00073 16.0 0.2 25 12-36 15-41 (91) 31 COG4991 Uncharacterized protei 20.9 49 0.0013 14.6 1.3 29 1-29 10-38 (155) 32 COG0118 HisH Glutamine amidotr 20.1 65 0.0017 13.8 1.8 34 47-80 116-150 (204) No 1 >pfam06282 DUF1036 Protein of unknown function (DUF1036). This family consists of several hypothetical bacterial proteins of unknown function. Probab=100.00 E-value=0 Score=311.65 Aligned_cols=115 Identities=54% Similarity=1.131 Sum_probs=112.6 Q ss_pred CCCCCEEEECCCCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEECCCCCCCCEEEEEEEECCCCCCCCCCEEEEECCCC Q ss_conf 35472298318998489999997488857875468857887037405676674799999953887120575013650753 Q gi|254780443|r 20 VSFAGFRVCNGTKNLIGVAVGYPAVKGGWMTEGWWQIPGNTCETVVKGALHSRYYYLYAEGVSHSEHWAGNVQMCVGQDE 99 (141) Q Consensus 20 ~A~A~~~vCN~T~~~v~vAigy~~~~~~W~seGWw~i~pg~C~~vl~g~l~~r~yY~yAe~~~~~~~w~G~~~~Cv~~~~ 99 (141) ||+|||+|||+|+++|+|||||+++ ++|+|||||+|+||+|++||+++|+++|||+|||+.+++.+|+|+++|||++++ T Consensus 1 pA~A~l~~CN~T~~~v~vAig~~~~-~~w~s~GWw~i~pg~C~~~l~g~l~~r~yY~yAe~~~g~~~w~G~~~fCv~~~~ 79 (115) T pfam06282 1 PAHAGLRVCNDTQNVVGVAIGYKDG-EGWTSEGWWTIDPGKCETLIEGPLKSRYYYLYAEDADGGGRWGGDIQFCVREDE 79 (115) T ss_pred CCCCCCEEECCCCCEEEEEEEEECC-CCEEEEEEEEECCCCCEEECCCCCCCCEEEEEEEECCCCCEECCCEEEEECCCC T ss_conf 9867868730899837999999659-961870338808985406456876641799999956998510797757565873 Q ss_pred EEEECCCCHHHCCCCCCCCEEEEECCCCEEEEEEEC Q ss_conf 255253101227840028288760797517899707 Q gi|254780443|r 100 FNIVDIKNCYTRGYLRVGFTEYDTGQHENWTVQLTE 135 (141) Q Consensus 100 F~i~g~~dC~~RG~~~~~F~evDtg~~~~wTv~Lt~ 135 (141) |+|.|++||.+|||++++|+||||+++++||+|||| T Consensus 80 F~i~g~~dC~~RG~~~~~F~~vDt~~~~~wT~~Lt~ 115 (115) T pfam06282 80 FTIVGVEDCLARGYDRAGFFEVDTGEQKSWTVQLTE 115 (115) T ss_pred EEECCCCCHHHCCCCCCCCEEEECCCCCCCEEECCC T ss_conf 766284152245956138799853898604598359 No 2 >COG5480 Predicted integral membrane protein [Function unknown] Probab=100.00 E-value=4.3e-43 Score=272.89 Aligned_cols=119 Identities=48% Similarity=1.090 Sum_probs=114.3 Q ss_pred CCCCCCCEEEECCCCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEECCCCCCCCEEEEEEEECCCCCCCCCCEEEEECC Q ss_conf 68354722983189984899999974888578754688578870374056766747999999538871205750136507 Q gi|254780443|r 18 SNVSFAGFRVCNGTKNLIGVAVGYPAVKGGWMTEGWWQIPGNTCETVVKGALHSRYYYLYAEGVSHSEHWAGNVQMCVGQ 97 (141) Q Consensus 18 ~~~A~A~~~vCN~T~~~v~vAigy~~~~~~W~seGWw~i~pg~C~~vl~g~l~~r~yY~yAe~~~~~~~w~G~~~~Cv~~ 97 (141) ..+|||+||+||.|...|++||||+.. .+|+|||||.|+|++|.++++++|.+||||+|||+..++++|.|++.|||.+ T Consensus 28 a~sa~adfrvCN~t~~~v~vAiGyr~~-ngwvtEGWW~v~~~~Cv~~ie~~lt~ry~ylyA~d~~~~a~~~Gsi~mCv~~ 106 (147) T COG5480 28 APSAHADFRVCNYTQTLVGVAIGYRAK-NGWVTEGWWVVEASECVTLIEGELTSRYYYLYAEDAKGGARWAGSINMCVAE 106 (147) T ss_pred CCCCCCCEEECHHHHHHHHEEEEEECC-CCCEECCEEEECCCEEEEEECCCCEEEEEEEECCCHHCCCEECCCEEEEEEC T ss_conf 578210026500032122114423448-8864454499559615887401000147767715511075762746478860 Q ss_pred CCEEEECCCCHHHCCCCCCCCEEEEECCCCEEEEEEECCC Q ss_conf 5325525310122784002828876079751789970765 Q gi|254780443|r 98 DEFNIVDIKNCYTRGYLRVGFTEYDTGQHENWTVQLTEPA 137 (141) Q Consensus 98 ~~F~i~g~~dC~~RG~~~~~F~evDtg~~~~wTv~Lt~p~ 137 (141) ++|+|.|.+||.+||+++.||.|+||++|.+|+++||+.. T Consensus 107 ~rF~I~gv~Dc~arg~~a~gf~e~dt~~q~sw~v~ls~~p 146 (147) T COG5480 107 DRFKIFGVQDCFARGFQAAGFNEYDTGEQASWMVQLSDSP 146 (147) T ss_pred CEEEEECCHHHHHHHHHHCCCEEECCCCCCEEEEEECCCC T ss_conf 4568963278888887734761320477824999955889 No 3 >PRK01742 tolB translocation protein TolB; Provisional Probab=78.73 E-value=2.9 Score=21.80 Aligned_cols=66 Identities=12% Similarity=0.177 Sum_probs=44.2 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCEEEE--CCCCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEECCCCCCCCEEE Q ss_conf 9578999999999974368354722983--18998489999997488857875468857887037405676674799 Q gi|254780443|r 1 MLRSFLICFCFGAMIFCSNVSFAGFRVC--NGTKNLIGVAVGYPAVKGGWMTEGWWQIPGNTCETVVKGALHSRYYY 75 (141) Q Consensus 1 ~l~~~l~~~~~~~~~~~~~~A~A~~~vC--N~T~~~v~vAigy~~~~~~W~seGWw~i~pg~C~~vl~g~l~~r~yY 75 (141) |++.+++++++.+.++.+..|+|.|+|= -....++-+||.=-..++. .|.+...||..+|...=.| T Consensus 3 m~~~~~~~~~~l~~~~~~~~a~a~L~I~It~G~~~~~pIAi~pF~~~~~---------~~~~i~~vI~~DL~~SG~F 70 (430) T PRK01742 3 LLKRLVSVFAVLFAVISNAFAGDEVRIVIDEGVDGARPIAVVPFKWNGS---------APEDIAKIIAADLRNSGKF 70 (430) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCEEEECCCCCCC---------CHHHHHHHHHHHHHCCCCC T ss_conf 6999999999999998778505755999945767740459925367897---------6678999998777608472 No 4 >PRK02889 tolB translocation protein TolB; Provisional Probab=73.37 E-value=6.3 Score=19.80 Aligned_cols=117 Identities=15% Similarity=0.131 Sum_probs=58.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCE--EEECCCCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEECCCCCCCCEEEEEE Q ss_conf 9578999999999974368354722--98318998489999997488857875468857887037405676674799999 Q gi|254780443|r 1 MLRSFLICFCFGAMIFCSNVSFAGF--RVCNGTKNLIGVAVGYPAVKGGWMTEGWWQIPGNTCETVVKGALHSRYYYLYA 78 (141) Q Consensus 1 ~l~~~l~~~~~~~~~~~~~~A~A~~--~vCN~T~~~v~vAigy~~~~~~W~seGWw~i~pg~C~~vl~g~l~~r~yY~yA 78 (141) |++..+.+++..++++++.+|||.| +|--....++-+||.=-.++++ .|.+-..+|..+|...-.|- . T Consensus 4 ~~~l~~~~~~~~~l~~~~~~A~A~L~IeIt~g~~~~~pIAV~pF~~~~~---------~~~~i~~II~~DL~~SG~F~-~ 73 (430) T PRK02889 4 MTKLGLRALVASCLIAAGGAAHAQLNVEITGVGATQFPIATANFANEAG---------LPQQVTSIVRADLARSGKFS-N 73 (430) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEEEEEECCCCCEEEEEEECCCCCCC---------CHHHHHHHHHHHHHHCCCCE-E T ss_conf 9989999999999986210202789999935788707279906578887---------50679999998887288840-0 Q ss_pred EECCCCC-CCCCCEEE---EECCCCEEEECCCCHHHCCCCCCCCEEEEECCCC Q ss_conf 9538871-20575013---6507532552531012278400282887607975 Q gi|254780443|r 79 EGVSHSE-HWAGNVQM---CVGQDEFNIVDIKNCYTRGYLRVGFTEYDTGQHE 127 (141) Q Consensus 79 e~~~~~~-~w~G~~~~---Cv~~~~F~i~g~~dC~~RG~~~~~F~evDtg~~~ 127 (141) .+..... .-..+..| =.-..++-+.|.=.=..-|--.+.|+-+|+-.++ T Consensus 74 l~~~~~~~~~~~~i~f~~W~~lgad~lv~G~v~~~~~g~~~v~~~L~Dv~~~~ 126 (430) T PRK02889 74 VDAGGAPVPETASVDLGAWKAKGADAFVAGSVNREPNGQYEVRFRLYDTVKQQ 126 (430) T ss_pred CCCCCCCCCCCCCCCHHHHHHCCCCEEEEEEEEECCCCCEEEEEEEEECCCCC T ss_conf 58434789987886868988608989999999977898489999999746661 No 5 >PRK11528 hypothetical protein; Provisional Probab=72.36 E-value=8 Score=19.19 Aligned_cols=81 Identities=17% Similarity=0.304 Sum_probs=43.4 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEECCCCCCCCEEEEEEEE Q ss_conf 95789999999999743683547229831899848999999748885787546885788703740567667479999995 Q gi|254780443|r 1 MLRSFLICFCFGAMIFCSNVSFAGFRVCNGTKNLIGVAVGYPAVKGGWMTEGWWQIPGNTCETVVKGALHSRYYYLYAEG 80 (141) Q Consensus 1 ~l~~~l~~~~~~~~~~~~~~A~A~~~vCN~T~~~v~vAigy~~~~~~W~seGWw~i~pg~C~~vl~g~l~~r~yY~yAe~ 80 (141) |.+.+| ++.+.+.+..+.||||+++- .--.|.+.|.| |.++ .-..--...-++++-|+-.|. T Consensus 1 m~K~ll-~ls~~~~~~~~~~a~ae~~~-----gfa~is~nylD----Ws~~--------t~~~~g~~~~k~DF~YleleG 62 (254) T PRK11528 1 MRKSLL-ALSLLATLALAAPAQAEYKG-----GFANISLNYLD----WTSR--------TTEKSGDKSHKDDFGYLELEG 62 (254) T ss_pred CCHHHH-HHHHHHHHCCCCCCCHHHHC-----CEEEEECCEEE----CCCC--------CCCCCCCCCCCCCCEEEEEEC T ss_conf 941245-68999873156642045525-----53100113232----2688--------610367554346623588723 Q ss_pred CCCCCCCCCCEEEEECCCCE Q ss_conf 38871205750136507532 Q gi|254780443|r 81 VSHSEHWAGNVQMCVGQDEF 100 (141) Q Consensus 81 ~~~~~~w~G~~~~Cv~~~~F 100 (141) +.+--|+.=+.|---++.+ T Consensus 63 -Gagf~wGElYgF~dlEn~~ 81 (254) T PRK11528 63 -GAGFSWGELYGFFDLENPY 81 (254) T ss_pred -CCCCCHHHHEEEEEEECCC T ss_conf -5687611305786643567 No 6 >pfam10697 DUF2502 Protein of unknown function (DUF2502). Members of this family are all Gammaproteobacteria. The function is not known. Probab=71.71 E-value=2.1 Score=22.66 Aligned_cols=47 Identities=15% Similarity=0.336 Sum_probs=30.1 Q ss_pred HHHHHHHHCCCCCCCC-EEEECCCCCEEEEEEEEECC------CCCEEEEEEEEECCC Q ss_conf 9999997436835472-29831899848999999748------885787546885788 Q gi|254780443|r 9 FCFGAMIFCSNVSFAG-FRVCNGTKNLIGVAVGYPAV------KGGWMTEGWWQIPGN 59 (141) Q Consensus 9 ~~~~~~~~~~~~A~A~-~~vCN~T~~~v~vAigy~~~------~~~W~seGWw~i~pg 59 (141) +++++++++|.+|+|+ ++| ---|++-||-+|. ...|....||+-.-| T Consensus 3 L~~sl~~~~Pl~a~a~~I~l----~Pgv~l~iGdrD~rG~YWDG~~WRd~~wW~~h~g 56 (96) T pfam10697 3 LAASLLAVAPLAAQAGSITL----LPGVSLQIGDRDNRGNYWDGYDWRDRDWWQRHNR 56 (96) T ss_pred HHHHHHHHHHHHCCCCCEEE----CCCEEEEECCCCCCCCCCCCCCCCCCHHHHHCCC T ss_conf 78999866254416551686----4743787235567765566875468177885778 No 7 >PRK04792 tolB translocation protein TolB; Provisional Probab=68.33 E-value=7.6 Score=19.35 Aligned_cols=67 Identities=19% Similarity=0.289 Sum_probs=38.1 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCE--EEECCCCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEECCCCCCCCEEE Q ss_conf 9578999999999974368354722--98318998489999997488857875468857887037405676674799 Q gi|254780443|r 1 MLRSFLICFCFGAMIFCSNVSFAGF--RVCNGTKNLIGVAVGYPAVKGGWMTEGWWQIPGNTCETVVKGALHSRYYY 75 (141) Q Consensus 1 ~l~~~l~~~~~~~~~~~~~~A~A~~--~vCN~T~~~v~vAigy~~~~~~W~seGWw~i~pg~C~~vl~g~l~~r~yY 75 (141) ||+.+++.+++.+++| +..|+|.| +|-.....++-+||.=-. |..++ -.|.+...||..+|...=.| T Consensus 1 M~k~~~~~l~lll~~~-s~~A~A~L~IeIt~G~~~~~PIAV~pF~----~~g~~---~~~~~i~~II~~DL~rSG~F 69 (450) T PRK04792 1 MLKRLILGLFVLLLSF-SQVANAALELVITDGIDSARPIAIVPFK----WEGEG---KLPTDISAVIASDLQRSGKF 69 (450) T ss_pred CCHHHHHHHHHHHHHH-HHHHEEEEEEEEECCCCCCCCEEEECCC----CCCCC---CHHHHHHHHHHHHHHHCCCC T ss_conf 9389999999999986-5752024999995477774254993754----56874---32579999999887608785 No 8 >PRK11408 hypothetical protein; Provisional Probab=51.77 E-value=20 Score=16.92 Aligned_cols=40 Identities=28% Similarity=0.488 Sum_probs=31.3 Q ss_pred EEECCCCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEECCC Q ss_conf 983189984899999974888578754688578870374056 Q gi|254780443|r 26 RVCNGTKNLIGVAVGYPAVKGGWMTEGWWQIPGNTCETVVKG 67 (141) Q Consensus 26 ~vCN~T~~~v~vAigy~~~~~~W~seGWw~i~pg~C~~vl~g 67 (141) ++==.|.++++ ||.|-.+ +..+-.||-||--..|..+.++ T Consensus 45 ~lQVtTrS~lG-Aivy~tG-GiLID~GWlRilGSG~~~LpRg 84 (219) T PRK11408 45 GLQVSTRSPMG-AIVYETG-GILIDYGWLRILGSGNEKLPRG 84 (219) T ss_pred HHCCCCCCCCE-EEEECCC-CEEEECCEEEEECCCCCCCCCC T ss_conf 73444457540-3565158-5898788899965898567786 No 9 >TIGR03431 PhnD phosphonate ABC transporter, periplasmic phosphonate binding protein. Note that this model does not identify all phnD-subfamily genes with evident phosphonate context, but all sequences above the trusted context may be inferred to bind phosphonate compounds even in the absence of such context. Furthermore, there is ample evidence to suggest that many other members of the TIGR01098 subfamily have a different primary function. Probab=49.53 E-value=18 Score=17.18 Aligned_cols=23 Identities=26% Similarity=0.400 Sum_probs=14.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 957899999999997436835472 Q gi|254780443|r 1 MLRSFLICFCFGAMIFCSNVSFAG 24 (141) Q Consensus 1 ~l~~~l~~~~~~~~~~~~~~A~A~ 24 (141) |++++|+++++.+++. +.+|+|. T Consensus 1 m~~r~l~~~~~~~~~~-~~~a~a~ 23 (288) T TIGR03431 1 MLRRLILSLVAAFMLI-SSNAQAE 23 (288) T ss_pred CCHHHHHHHHHHHHHH-CCHHHHC T ss_conf 9088999999999997-4322430 No 10 >KOG1113 consensus Probab=47.96 E-value=20 Score=16.88 Aligned_cols=70 Identities=16% Similarity=0.131 Sum_probs=54.3 Q ss_pred EEEEEEEECCCCEEEECCCCCCCCEEEEEEEECCCCCCCCCCEEEEECCCCEEEECCCCHHHCCCCCCCCEEEEECCC Q ss_conf 875468857887037405676674799999953887120575013650753255253101227840028288760797 Q gi|254780443|r 49 MTEGWWQIPGNTCETVVKGALHSRYYYLYAEGVSHSEHWAGNVQMCVGQDEFNIVDIKNCYTRGYLRVGFTEYDTGQH 126 (141) Q Consensus 49 ~seGWw~i~pg~C~~vl~g~l~~r~yY~yAe~~~~~~~w~G~~~~Cv~~~~F~i~g~~dC~~RG~~~~~F~evDtg~~ 126 (141) ..---+-|+.|+|.....+. |...++.|+..+.=.-|.-.+..+|+....+|..=|+++..|++|=+++. T Consensus 162 eGd~fYvI~kGt~dVyv~~~--------~v~~~~~g~sFGElALmyn~PRaATv~a~t~~klWgldr~SFrrIi~~s~ 231 (368) T KOG1113 162 EGDNFYVIDKGTFDVYVNGT--------YVTTYSPGGSFGELALMYNPPRAATVVAKSLKKLWGLDRTSFRRIIMKSC 231 (368) T ss_pred CCCCEEEEECCEEEEEECCE--------EEEECCCCCCHHHHHHHHCCCCCCCEEECCCCCEEEEEECEEEEEEECCC T ss_conf 67717999535489997772--------76521798635564765189766522203666168985030678862010 No 11 >CHL00132 psaF photosystem I subunit III; Validated Probab=45.76 E-value=17 Score=17.24 Aligned_cols=33 Identities=21% Similarity=0.354 Sum_probs=18.3 Q ss_pred CHHHHHHH-HHHHHHHHCCCCCCC---CEEEECCCCC Q ss_conf 95789999-999999743683547---2298318998 Q gi|254780443|r 1 MLRSFLIC-FCFGAMIFCSNVSFA---GFRVCNGTKN 33 (141) Q Consensus 1 ~l~~~l~~-~~~~~~~~~~~~A~A---~~~vCN~T~~ 33 (141) |.|.+.+. +++..+.|.|.+|.| +|+=|.+... T Consensus 1 Mrr~~a~~l~~~l~~~~ap~~a~AdvagLtPCseS~a 37 (185) T CHL00132 1 MKKFNLIFLLAALLLTLNPNIALADVAGLTPCKESAA 37 (185) T ss_pred CHHHHHHHHHHHHHHHCCCCHHHCCCCCCCCCCCCHH T ss_conf 9069999999999993686221101138860756999 No 12 >pfam06650 DUF1162 Protein of unknown function (DUF1162). This family represents a conserved region within several hypothetical eukaryotic proteins. Family members might be vacuolar protein sorting related-proteins. Probab=45.35 E-value=25 Score=16.32 Aligned_cols=99 Identities=5% Similarity=0.048 Sum_probs=54.6 Q ss_pred CEEEECCCCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEECCCC-CCCCEEEEEEEECCCCCCCCCCEEEEECCCCEEE Q ss_conf 229831899848999999748885787546885788703740567-6674799999953887120575013650753255 Q gi|254780443|r 24 GFRVCNGTKNLIGVAVGYPAVKGGWMTEGWWQIPGNTCETVVKGA-LHSRYYYLYAEGVSHSEHWAGNVQMCVGQDEFNI 102 (141) Q Consensus 24 ~~~vCN~T~~~v~vAigy~~~~~~W~seGWw~i~pg~C~~vl~g~-l~~r~yY~yAe~~~~~~~w~G~~~~Cv~~~~F~i 102 (141) -|-|.|+++..+.+.=- + +.+|..|+||+|...-... ...+...+.- .+....|+|...+..- ..+.+ T Consensus 119 ryil~N~~~~~l~v~q~---~-----~~~~~~l~~~~~~pf~~~~~~~~k~l~vr~--~~~~~~WS~~F~i~~~-g~~~~ 187 (268) T pfam06650 119 RYILINKLSEDLEIREP---G-----SSKWISLKPGELIPLHWLRNTKEKQLCVRF--PGPNCKWSSPFNISDV-GTTYV 187 (268) T ss_pred EEEEEECCCCCEEEEEC---C-----CCCEEEECCCCEEECCCCCCCCCEEEEEEE--CCCCCCCCCCEEEEEC-CEEEE T ss_conf 18999579987899976---8-----888499789980530577888776999994--8988767599836364-44999 Q ss_pred ECCCCHHHCCCCCCCCEEEEEC-CCCEEEEEEECCCC Q ss_conf 2531012278400282887607-97517899707653 Q gi|254780443|r 103 VDIKNCYTRGYLRVGFTEYDTG-QHENWTVQLTEPAQ 138 (141) Q Consensus 103 ~g~~dC~~RG~~~~~F~evDtg-~~~~wTv~Lt~p~~ 138 (141) .=.. .-...+|.+||+- +..+..+.+++..+ T Consensus 188 ~l~~-----~~~~~~~i~vdI~~~~~t~~I~fs~~~~ 219 (268) T pfam06650 188 KLLD-----ENSRQGLLRVEIILEGATIFITFSDATD 219 (268) T ss_pred EECC-----CCCCEEEEEEEEEECCCEEEEEEECCCC T ss_conf 9835-----8983579999999558759999986888 No 13 >pfam10946 DUF2625 Protein of unknown function DUF2625. Some members in this family of proteins are annotated as ybfG however currently no function is known. Probab=45.29 E-value=25 Score=16.32 Aligned_cols=40 Identities=30% Similarity=0.501 Sum_probs=31.1 Q ss_pred EEECCCCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEECCC Q ss_conf 983189984899999974888578754688578870374056 Q gi|254780443|r 26 RVCNGTKNLIGVAVGYPAVKGGWMTEGWWQIPGNTCETVVKG 67 (141) Q Consensus 26 ~vCN~T~~~v~vAigy~~~~~~W~seGWw~i~pg~C~~vl~g 67 (141) ++==.|.++++ ||.|..+ +..+-.||-||--..|..+.++ T Consensus 42 ~lQVtTrS~lG-Avvy~tG-GiLID~GWlRilGSG~~~l~r~ 81 (208) T pfam10946 42 ALQVTTRSPLG-AVVYETG-GILIDGGWLRILGSGHERLPRG 81 (208) T ss_pred HHCCCCCCCCE-EEEECCC-CEEEECCEEEEECCCCCCCCCC T ss_conf 73544457540-3665158-5898788899966898667887 No 14 >PRK09926 putative chaperone protein EcpD; Provisional Probab=41.41 E-value=18 Score=17.07 Aligned_cols=79 Identities=16% Similarity=0.330 Sum_probs=41.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCCEEE------ECCCCCEEEEEEEEECCCCCEEEEEE-----------------------E Q ss_conf 899999999997436835472298------31899848999999748885787546-----------------------8 Q gi|254780443|r 4 SFLICFCFGAMIFCSNVSFAGFRV------CNGTKNLIGVAVGYPAVKGGWMTEGW-----------------------W 54 (141) Q Consensus 4 ~~l~~~~~~~~~~~~~~A~A~~~v------CN~T~~~v~vAigy~~~~~~W~seGW-----------------------w 54 (141) .++.++++.++++.++.|+|++.+ =|.-...+.+.|.=.+ +..+..+-| . T Consensus 5 ~~~~~l~~~~~~~~s~~a~Agi~l~~TRvIy~~~~~~~si~l~N~~-~~pyLvQsWvd~~~~~~~p~~~~~PFivtPPlf 83 (245) T PRK09926 5 KHTTALCFATCMAFSSSSIADIVISGTRIIYKSDQKDVNVRLENKG-NNPLLVQSWLDTGDDNAEPGSIKVPFTATPPVS 83 (245) T ss_pred HHHHHHHHHHHHHHHHHCEEEEEECCCEEEECCCCCEEEEEEEECC-CCCEEEEEEEECCCCCCCCCCCCCCEEECCCEE T ss_conf 7799999999999876403778978778998599947899999598-994899999977988888431357668638337 Q ss_pred EECCCCEEEE--C-CC--CCC-CC--EEEEEEEECCC Q ss_conf 8578870374--0-56--766-74--79999995388 Q gi|254780443|r 55 QIPGNTCETV--V-KG--ALH-SR--YYYLYAEGVSH 83 (141) Q Consensus 55 ~i~pg~C~~v--l-~g--~l~-~r--~yY~yAe~~~~ 83 (141) +++|++-.++ + .+ .|. .| .||+...+... T Consensus 84 Rlep~~~~~lRI~~~~~~~LP~DRESlf~lnv~~IPp 120 (245) T PRK09926 84 RIDPKRGQTIKLMYTASTALPKDRESVFWFNVLEVPP 120 (245) T ss_pred EECCCCCCEEEEEECCCCCCCCCEEEEEEEEEEECCC T ss_conf 8889986379999768988985504799999676578 No 15 >PRK05137 tolB translocation protein TolB; Provisional Probab=41.25 E-value=27 Score=16.06 Aligned_cols=64 Identities=9% Similarity=0.114 Sum_probs=33.3 Q ss_pred HHHHHHHHHHCCCCCCCCE--EEECCCCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEECCCCCCCCEEE Q ss_conf 9999999974368354722--98318998489999997488857875468857887037405676674799 Q gi|254780443|r 7 ICFCFGAMIFCSNVSFAGF--RVCNGTKNLIGVAVGYPAVKGGWMTEGWWQIPGNTCETVVKGALHSRYYY 75 (141) Q Consensus 7 ~~~~~~~~~~~~~~A~A~~--~vCN~T~~~v~vAigy~~~~~~W~seGWw~i~pg~C~~vl~g~l~~r~yY 75 (141) +++.+.++++++.+|+|.+ +|=--...++-+||.--..+ +..++ .-+.+...+|..+|...=+| T Consensus 8 ~~l~l~l~l~~~~~A~A~L~IeIt~g~~~~ipIAV~pF~~~--~~~~~---~l~~~i~~II~~DL~~SG~F 73 (437) T PRK05137 8 LALALALALLAAQPARALLRIDITKGNVEPLPIAIPDFLAD--TGGAD---ELGADIAQVIAADLKRSGLF 73 (437) T ss_pred HHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCEEEECCCCC--CCCCC---HHHHHHHHHHHHHHHHCCCC T ss_conf 99999999873121431399999467677335799366578--87641---35688999999778638895 No 16 >PRK01029 tolB translocation protein TolB; Provisional Probab=40.71 E-value=29 Score=15.90 Aligned_cols=40 Identities=20% Similarity=0.208 Sum_probs=18.8 Q ss_pred CHHHHH-HHHHHHHHHHCCCCCCCCEEEE---CCCCCEEEEEEE Q ss_conf 957899-9999999974368354722983---189984899999 Q gi|254780443|r 1 MLRSFL-ICFCFGAMIFCSNVSFAGFRVC---NGTKNLIGVAVG 40 (141) Q Consensus 1 ~l~~~l-~~~~~~~~~~~~~~A~A~~~vC---N~T~~~v~vAig 40 (141) ||++.. +.+++.++++.+...+++++|= +....+|.||.= T Consensus 1 ~~~~~~~~~~l~~~~~~~~~~~~~~l~v~v~~~~~~~Pi~va~~ 44 (425) T PRK01029 1 MLRSLCFLRFLICFLCLLPSLYAEDLEVHVRSESSLLPIEVSLL 44 (425) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCCCCCCEEEEEC T ss_conf 90166899999999998557636218999737873616799813 No 17 >smart00205 THN Thaumatin family. The thaumatin family gathers proteins related to plant pathogenesis. The thaumatin family includes very basic members with extracellular and vacuolar localization. Thaumatin itsel is a potent sweet-tasting protein. Several members of this family display significant in vitro activity of inhibiting hyphal growth or spore germination of various fungi probably by a membrane permeabilizing mechanism. Probab=38.67 E-value=26 Score=16.23 Aligned_cols=38 Identities=13% Similarity=0.108 Sum_probs=27.3 Q ss_pred EEEECCCCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEE Q ss_conf 2983189984899999974888578754688578870374 Q gi|254780443|r 25 FRVCNGTKNLIGVAVGYPAVKGGWMTEGWWQIPGNTCETV 64 (141) Q Consensus 25 ~~vCN~T~~~v~vAigy~~~~~~W~seGWw~i~pg~C~~v 64 (141) |+|-|+-++.||.+|.-. +... ...|=|+++||+-..+ T Consensus 1 Fti~N~C~~tVWP~~~~~-g~~~-~~~gGf~L~~g~s~~~ 38 (218) T smart00205 1 FEFVNNCPYTVWAAALPS-GKPQ-LSGGGFELNSGASWQL 38 (218) T ss_pred CEEEECCCCCCCEEECCC-CCCC-CCCCCEECCCCCCEEE T ss_conf 976848999855023688-9865-5899877189984588 No 18 >PRK03757 hypothetical protein; Provisional Probab=38.46 E-value=32 Score=15.69 Aligned_cols=59 Identities=12% Similarity=0.201 Sum_probs=27.3 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCC-CEEEECC-CCCEEEEEEEEEC---CCCC-EEEEEEEEECCCC Q ss_conf 95789999999999743683547-2298318-9984899999974---8885-7875468857887 Q gi|254780443|r 1 MLRSFLICFCFGAMIFCSNVSFA-GFRVCNG-TKNLIGVAVGYPA---VKGG-WMTEGWWQIPGNT 60 (141) Q Consensus 1 ~l~~~l~~~~~~~~~~~~~~A~A-~~~vCN~-T~~~v~vAigy~~---~~~~-W~seGWw~i~pg~ 60 (141) |.+.+|.++++. +++..++|+| .+++=-+ +-+.|.-.|.+-. ..|. =.-+|=..++|.. T Consensus 1 mkk~~l~~~l~a-~~~~~~~A~a~~y~iDp~~~Hs~v~F~v~H~G~S~~~G~F~~~~g~i~~D~~~ 65 (191) T PRK03757 1 MKKTLLGLTLGS-LLFSAGSAVAADYKIDKEGQHAFINFKISHLGYSWLYGRFNDFDGTFSFDEKN 65 (191) T ss_pred CCHHHHHHHHHH-HHHHCCCCCCCCEEECCCCCCEEEEEEEEEEEEEEEEEEEEEEEEEEEECCCC T ss_conf 904399999999-98630404575278889876148999998438888999998579999988888 No 19 >pfam11730 DUF3297 Protein of unknown function (DUF3297). This family is expressed in Proteobacteria and Actinobacteria. The function is not known. Probab=37.15 E-value=15 Score=17.68 Aligned_cols=16 Identities=19% Similarity=0.509 Sum_probs=12.2 Q ss_pred CEEEEEEEEECCCCEE Q ss_conf 5787546885788703 Q gi|254780443|r 47 GWMTEGWWQIPGNTCE 62 (141) Q Consensus 47 ~W~seGWw~i~pg~C~ 62 (141) ==+||||-+|+.|+-. T Consensus 37 YciSEGWvrv~agka~ 52 (71) T pfam11730 37 YCISEGWVRVAAGKAL 52 (71) T ss_pred EECCCCEEEEECCCCC T ss_conf 6424776994168653 No 20 >TIGR02538 type_IV_pilB type IV-A pilus assembly ATPase PilB; InterPro: IPR013374 This model describes a protein involved in type IV pilus biogenesis designated PilB in Pseudomonas aeruginosa but PilF in Neisseria gonorrhoeae; the more common usage, reflected here, is PilB. This protein is an ATPase involved in protein export for pilin assembly, and is closely related to GspE (IPR013369 from INTERPRO) of type II secretion systems (also referred to as the main terminal branch of the general secretion pathway). Note that type IV pilus systems are often divided into type IV-A and IV-B, with the latter group including bundle-forming pilus, mannose-sensitive hemagglutinin, etc. Members of this family are found in type IV-A systems.; GO: 0005524 ATP binding, 0008565 protein transporter activity, 0009297 pilus biogenesis. Probab=36.32 E-value=15 Score=17.63 Aligned_cols=38 Identities=24% Similarity=0.429 Sum_probs=16.2 Q ss_pred CCEEEECCCCCE-EEEE------EEEECCCCCEEEE-EEEEECCCCEEEE Q ss_conf 722983189984-8999------9997488857875-4688578870374 Q gi|254780443|r 23 AGFRVCNGTKNL-IGVA------VGYPAVKGGWMTE-GWWQIPGNTCETV 64 (141) Q Consensus 23 A~~~vCN~T~~~-v~vA------igy~~~~~~W~se-GWw~i~pg~C~~v 64 (141) |. ||||+...+ ..+. .|++. + . --. ||---.|-+|..= T Consensus 462 AR-RLCs~CK~~~~~~p~~~Ll~lGF~~-e-d-L~~Pg~~ly~pvGC~~C 507 (577) T TIGR02538 462 AR-RLCSHCKAPVEEVPEEALLELGFTE-E-D-LADPGLKLYGPVGCEEC 507 (577) T ss_pred HH-HCCCCCCCCCCCCCHHHHHHCCCCH-H-H-HCCCCCEECCCCCCCCC T ss_conf 52-4031368776646859998568886-8-9-26999386176261003 No 21 >PRK13874 conjugal transfer protein TrbJ; Provisional Probab=34.78 E-value=29 Score=15.88 Aligned_cols=24 Identities=13% Similarity=0.041 Sum_probs=14.7 Q ss_pred HHHHHHHCCCCCCCCEEEECCCCC Q ss_conf 999997436835472298318998 Q gi|254780443|r 10 CFGAMIFCSNVSFAGFRVCNGTKN 33 (141) Q Consensus 10 ~~~~~~~~~~~A~A~~~vCN~T~~ 33 (141) ++++....+.||+|.+.|---|++ T Consensus 14 ~~~~~~~~~~pa~A~~~V~DptN~ 37 (233) T PRK13874 14 ALALSPALATPAHAQWIVYDPTNY 37 (233) T ss_pred HHHHCCCCCCCCCCCCEEECCHHH T ss_conf 987423588873145024670189 No 22 >PRK12789 flgI flagellar basal body P-ring protein; Reviewed Probab=31.06 E-value=41 Score=15.05 Aligned_cols=40 Identities=18% Similarity=0.207 Sum_probs=22.3 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCC-CE-EEECCCCCEEEEEEEE Q ss_conf 95789999999999743683547-22-9831899848999999 Q gi|254780443|r 1 MLRSFLICFCFGAMIFCSNVSFA-GF-RVCNGTKNLIGVAVGY 41 (141) Q Consensus 1 ~l~~~l~~~~~~~~~~~~~~A~A-~~-~vCN~T~~~v~vAigy 41 (141) |++.+++.++++++++ +.+|+| .+ .+|+=-..+-.--||| T Consensus 1 m~~~~~~~~~~~l~~~-~~~~~a~RIkDi~~v~GvR~NqLiGY 42 (367) T PRK12789 1 MIRRLLLAVLLALLAG-PAAAAATRIKDIATLQGVRDNQLVGY 42 (367) T ss_pred CHHHHHHHHHHHHHHC-CCHHHHHHHHHHEEEEEEEECCEEEE T ss_conf 9089999999999855-30666665640200400244355677 No 23 >COG5664 Predicted secreted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones] Probab=29.27 E-value=43 Score=14.91 Aligned_cols=29 Identities=14% Similarity=0.374 Sum_probs=20.0 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCEEEEC Q ss_conf 95789999999999743683547229831 Q gi|254780443|r 1 MLRSFLICFCFGAMIFCSNVSFAGFRVCN 29 (141) Q Consensus 1 ~l~~~l~~~~~~~~~~~~~~A~A~~~vCN 29 (141) |++..+.++++.+.++.|..|+|+++-.- T Consensus 4 ~~~~~~~c~lla~~v~~p~~A~agwkave 32 (201) T COG5664 4 MRSLVVTCLLLASAVMLPAQASAGWKAVE 32 (201) T ss_pred HHHHHHHHHHHHHHHCCCCHHHCCCEEEE T ss_conf 79999999999987528400201663544 No 24 >PHA02099 hypothetical protein Probab=27.22 E-value=41 Score=15.01 Aligned_cols=31 Identities=13% Similarity=0.083 Sum_probs=23.6 Q ss_pred EECCCCEEEECCCCCCCCEEEEEEEECCCCC Q ss_conf 8578870374056766747999999538871 Q gi|254780443|r 55 QIPGNTCETVVKGALHSRYYYLYAEGVSHSE 85 (141) Q Consensus 55 ~i~pg~C~~vl~g~l~~r~yY~yAe~~~~~~ 85 (141) ++.||+|+.-+-.....--+-++|+.++... T Consensus 27 rv~pg~~emhilr~~~g~diifha~gy~p~l 57 (84) T PHA02099 27 RVGPGESEMHILRNFEGVDIVFHAEGYNPDL 57 (84) T ss_pred CCCCCCCEEEEEEECCCCCEEEECCCCCHHH T ss_conf 5589971688886057742899857799789 No 25 >cd01776 Rin1_RA Rin1_RA RIN1 is a RAS effector that binds with specificity and high affinity to activated RAS via its carboxy-terminal RA (RAS-associated) domain. RIN1 competes directly with RAF1 for RAS binding and is thought to divert signaling away from RAF and the MAPK pathway while also shunting RAS signals through alternate pathways. In addition, Rin1 and Rin2 are Rab5-binding proteins, binding preferentially to the GTP-bound form, that enhance the GDP-GTP exchange reaction on Rab5 that regulate the docking and fusion processes of endocytic vesicles. In addition to the RA domain, RIN1 and RIN2 have an SH2 (Src homology 2) domain, a proline-rich SH3 domain, and a Vps9 domain. Probab=26.56 E-value=28 Score=15.98 Aligned_cols=31 Identities=26% Similarity=0.640 Sum_probs=22.3 Q ss_pred EEEEEEEEECCCCEEEECCCCCC-----CCEEEEEE Q ss_conf 78754688578870374056766-----74799999 Q gi|254780443|r 48 WMTEGWWQIPGNTCETVVKGALH-----SRYYYLYA 78 (141) Q Consensus 48 W~seGWw~i~pg~C~~vl~g~l~-----~r~yY~yA 78 (141) .+-.+|..++|++|-.-|+..|- ..|||+|- T Consensus 50 ~vdg~~~qL~~d~~Pq~IKa~L~~~~~~~~f~FvYr 85 (87) T cd01776 50 FVEETWQQLAPDTYPQRIKAELHSRPQPNTFHFVYR 85 (87) T ss_pred EECCCEEECCCCCCHHHHHHHHHCCCCCCCEEEEEE T ss_conf 977807767951134565798843888871479875 No 26 >PRK13697 cytochrome c6; Provisional Probab=25.31 E-value=44 Score=14.88 Aligned_cols=19 Identities=21% Similarity=0.431 Sum_probs=8.8 Q ss_pred HHHHHHHHHHHHCCCCCCCC Q ss_conf 99999999997436835472 Q gi|254780443|r 5 FLICFCFGAMIFCSNVSFAG 24 (141) Q Consensus 5 ~l~~~~~~~~~~~~~~A~A~ 24 (141) +|+++++..+.| ++||.|. T Consensus 8 ~l~~~~~~~~~~-~~pA~Aa 26 (111) T PRK13697 8 VLLTLLLLTFAF-TSPAFAA 26 (111) T ss_pred HHHHHHHHHHHC-CCHHHHH T ss_conf 999999999980-3554441 No 27 >TIGR01855 IMP_synth_hisH imidazole glycerol phosphate synthase, glutamine amidotransferase subunit; InterPro: IPR010139 Imidazole glycerol phosphate synthetase (IGPS) is a key metabolic enzyme, which links amino acid and nucleotide biosynthesis: it catalyses the closure of the imidazole ring within histidine biosynthesis (fifth step), and provides the substrate for de novo purine biosynthesis. IGPS consists of two different subunits: HisH, a glutamine amidotransferase (glutaminase), and HisF, a synthase (cyclase). HisH functions to provide a source of nitrogen, which is required for the synthesis of histidine and purines. In the HisH glutaminase reaction, the hydrolysis of glutamine yields ammonia, which is then used by HisF in the subsequent synthase reaction. The X-ray structure of the HisH/HisF heterodimer of IGPS reveals a putative tunnel for the transfer of ammonia from HisH to HisF via a (beta-alpha)8 barrel fold within HisF that abuts HisH . Ammonia tunnels connect the glutaminase and synthase active sites. HisH belongs to a large group of enzymes found in diverse and fundamental anabolic pathways. These enzymes share a common domain, referred to as the type-I glutamine amidotransferase (GATase) domain, which can occur either as single polypeptides (as with HisH) or as domains of large multifunctional proteins.; GO: 0016763 transferase activity transferring pentosyl groups, 0000105 histidine biosynthetic process, 0005737 cytoplasm. Probab=24.14 E-value=43 Score=14.92 Aligned_cols=49 Identities=14% Similarity=0.233 Sum_probs=27.4 Q ss_pred EEEEEEECCCCEE-EECCC-CCCCCEEEEEEEECCCCCCCCC--CE-EEEECCCCEE Q ss_conf 7546885788703-74056-7667479999995388712057--50-1365075325 Q gi|254780443|r 50 TEGWWQIPGNTCE-TVVKG-ALHSRYYYLYAEGVSHSEHWAG--NV-QMCVGQDEFN 101 (141) Q Consensus 50 seGWw~i~pg~C~-~vl~g-~l~~r~yY~yAe~~~~~~~w~G--~~-~~Cv~~~~F~ 101 (141) --||=++.+-.-. .++.| +-.++|||+| ++... .+++ .. ..|--...|+ T Consensus 126 HmGWN~v~~~~~~C~ll~g~~~~~~~YFVH--SY~~~-~~~~S~~v~a~~~YG~~F~ 179 (211) T TIGR01855 126 HMGWNEVEPVKESCPLLKGIDKGAYFYFVH--SYYAV-CEEESEAVLAKADYGEKFT 179 (211) T ss_pred CCCCCEEEEHHHCCCHHCCCCCCCCEEEEE--EEEEE-CCCCHHHEEEEHHCCCCEE T ss_conf 565642312311551130776678155688--66776-1688115011200186304 No 28 >PRK04922 tolB translocation protein TolB; Provisional Probab=22.49 E-value=61 Score=14.04 Aligned_cols=66 Identities=15% Similarity=0.205 Sum_probs=36.2 Q ss_pred HHHHHHHHHHHHHCCCCCCCCE--EEECCCCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEECCCCCCCCEEE Q ss_conf 8999999999974368354722--98318998489999997488857875468857887037405676674799 Q gi|254780443|r 4 SFLICFCFGAMIFCSNVSFAGF--RVCNGTKNLIGVAVGYPAVKGGWMTEGWWQIPGNTCETVVKGALHSRYYY 75 (141) Q Consensus 4 ~~l~~~~~~~~~~~~~~A~A~~--~vCN~T~~~v~vAigy~~~~~~W~seGWw~i~pg~C~~vl~g~l~~r~yY 75 (141) +.|.++++.++++.+.+|+|.| +|--....++-+||.=-..++. . ...+.+...||..+|...=+| T Consensus 6 ~~~~~~~~~~~~~~a~~a~a~L~IeIt~G~~~~~pIAV~pF~~~~~---~---~~~~~~i~~II~~DL~~SG~F 73 (439) T PRK04922 6 RWLAALTALLLPLSALAQQQGLTIDIVGGSASATPIAVVPMPYQGS---G---TAPQTDVSAVVSADLDRSGQF 73 (439) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCEEEEEECCCCCCC---C---CCHHHHHHHHHHHHHHHCCCC T ss_conf 9999999999987788616836999935777705069937565787---6---550364999998657727274 No 29 >PRK10894 hypothetical protein; Provisional Probab=21.93 E-value=54 Score=14.35 Aligned_cols=50 Identities=16% Similarity=0.129 Sum_probs=26.2 Q ss_pred HHHHHHHHHCCCCCCCCEEEECCCCCEEEEEE---EEECCC-------CCEEEEEEEEECCCC Q ss_conf 99999997436835472298318998489999---997488-------857875468857887 Q gi|254780443|r 8 CFCFGAMIFCSNVSFAGFRVCNGTKNLIGVAV---GYPAVK-------GGWMTEGWWQIPGNT 60 (141) Q Consensus 8 ~~~~~~~~~~~~~A~A~~~vCN~T~~~v~vAi---gy~~~~-------~~W~seGWw~i~pg~ 60 (141) ++++.++++++.||+|- . .+..-||.+-- -+.+.+ .--.++|=..|.... T Consensus 12 l~l~~~ll~~S~~a~Al-~--sD~~qPI~I~AD~~~~d~~~~~avytGNVvitQGsl~I~ADk 71 (184) T PRK10894 12 LLLASSLLAASIPAFAV-T--GDTEQPIHIDSDQQSLDMQGNVVTFTGNVVVTQGTIKINADK 71 (184) T ss_pred HHHHHHHHHHCCHHHHC-C--CCCCCCEEEEECEEEEECCCCEEEEEEEEEEEEEEEEEEEEE T ss_conf 99999998716124315-5--344699899945289431599899992599996259999358 No 30 >pfam07172 GRP Glycine rich protein family. This family of proteins includes several glycine rich proteins as well as two nodulins 16 and 24. The family also contains proteins that are induced in response to various stresses. Probab=21.81 E-value=28 Score=15.97 Aligned_cols=25 Identities=16% Similarity=0.243 Sum_probs=12.0 Q ss_pred HHHHHCCCCCCCCE--EEECCCCCEEE Q ss_conf 99974368354722--98318998489 Q gi|254780443|r 12 GAMIFCSNVSFAGF--RVCNGTKNLIG 36 (141) Q Consensus 12 ~~~~~~~~~A~A~~--~vCN~T~~~v~ 36 (141) .+++.+++.+++++ +.-++..+.|. T Consensus 15 ~~lLIsSevaa~~ls~~~K~e~e~~v~ 41 (91) T pfam07172 15 AVLLISSEVAAADLSNTEKSESENEVQ 41 (91) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEE T ss_conf 999999999999971284555424021 No 31 >COG4991 Uncharacterized protein with a bacterial SH3 domain homologue [Function unknown] Probab=20.90 E-value=49 Score=14.56 Aligned_cols=29 Identities=10% Similarity=0.356 Sum_probs=18.4 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCEEEEC Q ss_conf 95789999999999743683547229831 Q gi|254780443|r 1 MLRSFLICFCFGAMIFCSNVSFAGFRVCN 29 (141) Q Consensus 1 ~l~~~l~~~~~~~~~~~~~~A~A~~~vCN 29 (141) |+..++.+.++.+.++.++.|.|..+--| T Consensus 10 ~~~~~~~A~a~~~~l~~~~~a~aa~~a~~ 38 (155) T COG4991 10 SMKTLMRASAFGLALLMPAAAYAAVSASA 38 (155) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 67789999999999874788888775213 No 32 >COG0118 HisH Glutamine amidotransferase [Amino acid transport and metabolism] Probab=20.09 E-value=65 Score=13.85 Aligned_cols=34 Identities=18% Similarity=0.250 Sum_probs=22.4 Q ss_pred CEEEEEEEEECCCCEEEECCCCCC-CCEEEEEEEE Q ss_conf 578754688578870374056766-7479999995 Q gi|254780443|r 47 GWMTEGWWQIPGNTCETVVKGALH-SRYYYLYAEG 80 (141) Q Consensus 47 ~W~seGWw~i~pg~C~~vl~g~l~-~r~yY~yAe~ 80 (141) .---.||=++.+.+...+..|-.. .++||+|.=. T Consensus 116 kvPHMGWN~l~~~~~~~l~~gi~~~~~~YFVHSY~ 150 (204) T COG0118 116 KVPHMGWNQVEFVRGHPLFKGIPDGAYFYFVHSYY 150 (204) T ss_pred CCCCCCCCEEECCCCCHHHCCCCCCCEEEEEEEEE T ss_conf 78866652231267874432899887799997876 Done!