Query gi|254780357|ref|YP_003064770.1| hypothetical protein CLIBASIA_01210 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 394 No_of_seqs 195 out of 286 Neff 6.2 Searched_HMMs 39220 Date Sun May 29 16:27:15 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780357.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK11539 hypothetical protein; 99.9 5.8E-20 1.5E-24 133.7 24.3 236 4-261 297-548 (754) 2 pfam03772 Competence Competenc 99.9 2.3E-20 5.9E-25 136.0 19.3 173 4-195 94-268 (270) 3 TIGR00361 ComEC_Rec2 DNA inter 98.9 8.7E-08 2.2E-12 63.1 13.2 121 4-138 243-365 (731) 4 COG0658 ComEC Predicted membra 98.2 2.2E-05 5.6E-10 49.2 9.7 107 4-137 308-414 (453) 5 PRK11539 hypothetical protein; 97.8 0.00013 3.4E-09 44.7 7.5 52 3-54 115-166 (754) 6 TIGR00360 ComEC_N-term ComEC/R 97.5 0.00075 1.9E-08 40.3 8.7 110 4-128 77-186 (187) 7 PRK09424 pntA NAD(P) transhydr 81.3 2.8 7.1E-05 19.6 4.3 46 318-366 239-286 (510) 8 PRK04949 putative sulfate tran 63.1 9.1 0.00023 16.6 8.0 17 110-126 90-106 (251) 9 PRK00029 hypothetical protein; 52.4 7.5 0.00019 17.1 1.4 55 215-269 103-157 (487) 10 pfam02696 UPF0061 Uncharacteri 52.1 8 0.00021 16.9 1.5 54 216-269 104-157 (487) 11 TIGR02882 QoxB cytochrome aa3 44.1 14 0.00035 15.6 1.7 25 137-161 376-400 (648) 12 COG0397 Uncharacterized conser 39.3 22 0.00057 14.3 3.0 56 214-269 102-157 (488) 13 PRK04654 sec-independent trans 37.3 24 0.00061 14.2 2.4 27 35-62 2-28 (214) 14 pfam01262 AlaDh_PNT_C Alanine 35.4 26 0.00066 14.0 5.1 58 315-375 72-132 (150) 15 cd05212 NAD_bind_m-THF_DH_Cycl 31.2 30 0.00078 13.6 3.8 44 320-372 64-107 (140) 16 COG3224 Uncharacterized protei 30.0 32 0.00082 13.5 3.2 30 103-132 146-175 (195) 17 pfam03213 Pox_P35 Poxvirus P35 29.5 22 0.00055 14.4 0.8 116 23-148 170-315 (325) 18 PRK10494 hypothetical protein; 29.2 33 0.00084 13.4 4.6 13 251-263 141-153 (259) 19 pfam02882 THF_DHG_CYH_C Tetrah 27.3 36 0.00091 13.2 3.5 38 320-366 72-109 (159) 20 COG2981 CysZ Uncharacterized p 27.0 36 0.00092 13.1 4.3 11 113-123 91-101 (250) 21 COG0686 Ald Alanine dehydrogen 26.2 37 0.00095 13.1 6.1 65 308-375 212-279 (371) 22 pfam09163 Form-deh_trans Forma 23.9 41 0.0011 12.8 2.9 33 189-221 4-36 (44) 23 TIGR00915 2A0602 RND transport 23.3 43 0.0011 12.7 2.7 15 191-205 535-549 (1058) 24 pfam02639 DUF188 Uncharacteriz 23.2 28 0.00071 13.8 0.4 32 320-351 44-76 (130) 25 KOG2923 consensus 21.5 47 0.0012 12.5 2.3 25 338-362 22-46 (67) 26 pfam02515 CoA_transf_3 CoA-tra 21.0 48 0.0012 12.5 2.9 10 226-235 122-131 (179) 27 pfam07420 DUF1509 Protein of u 20.5 49 0.0012 12.4 1.9 23 136-158 59-81 (377) 28 pfam03092 BT1 BT1 family. Memb 20.4 49 0.0013 12.4 8.6 35 115-149 318-352 (425) No 1 >PRK11539 hypothetical protein; Provisional Probab=99.89 E-value=5.8e-20 Score=133.68 Aligned_cols=236 Identities=12% Similarity=0.128 Sum_probs=141.6 Q ss_pred ECCCCCCHHHHHHHHHHHHCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 02668855589998888820485567152135879999999999999861264113577764211456788888889999 Q gi|254780357|r 4 ITKKYDPFPAETTIEGIARLSPPSGPAFPGLFDFNFSSYYKGISAIGYFYSIPKMIYSSESHNTWTWKKISSIKQSFLNE 83 (394) Q Consensus 4 ~~~~~~p~~~g~~i~~~v~L~~P~~p~~PGlFD~SF~asF~aVaaigff~~~p~~~~~~~~~~~~~~~~l~~Lr~~i~~~ 83 (394) ..|+..|+.........+++.+|.....+| ||+||.+|+++.+++..-..... .. .++...+.. . T Consensus 297 ~~r~~~~~~~l~~a~~~iLl~~P~~~~~~g-----F~LSf~Av~~li~~~~~~p~~~~--~~----~~~~~~~~~----~ 361 (754) T PRK11539 297 SGRQWSGWQVWLCCLAAILLSDPLAVLSQS-----LWLSALAVAALIFWYQWFPLPER--FL----PGWLRAVLR----L 361 (754) T ss_pred HCCCCCHHHHHHHHHHHHHHHCHHHHHCHH-----HHHHHHHHHHHHHHHHHCCCCHH--HH----HHHHHHHHH----H T ss_conf 386477899999999999995879996625-----79999999999999740433111--10----236899999----9 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC---CHHHHHHHHHHHHHHHHHHH Q ss_conf 999965555888999999860102327883888999999999999999999997516---41689999999999999999 Q gi|254780357|r 84 IRANIGTYGSASASIFLIKHFHCIPVYGLIANILAIPILSFVVIPAGLIAILLMILS---LDNIPFGVMAWGLDIIIHIA 160 (394) Q Consensus 84 I~a~ls~~~~~a~~p~~~~~F~~~~~~~~~aNl~avP~~~~~v~p~~~~~~~l~p~g---l~~~~l~~mg~gi~~il~vA 160 (394) + .++...++.++|+.+||||++|++|+++|+++||+++++++|+.++++++..++ +....|++....+.++.+.- T Consensus 362 l--~~q~~~~l~llPl~l~~F~~iSl~s~laNL~aVPlvs~vvvPl~ll~l~~~~~~~~~l~~~~~~l~~~~l~~~~~~l 439 (754) T PRK11539 362 L--HLQLGITLLLMPLQIVLFHGISLTSIPANLWAVPLVTFITVPLILAAMVLHLSGPFILEEGLWFLADRSLALLFWPL 439 (754) T ss_pred H--HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9--99999999999999999387527899999999999999999999999999987499999999999999999999998 Q ss_pred HHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCH-HHHHHHHHHHHHHHHHH-HCCCCCCCEE----EECCCCCEEEE Q ss_conf 997137876223557722799999999999971463-78987999999999985-2346777348----98699968799 Q gi|254780357|r 161 HRISVAHNEFCIGRIPETSFVTIVVEFLLMVFCKTN-IRHIGSIMIGITIIILF-TFPYSFSPDL----LISEKGNLVAL 234 (394) Q Consensus 161 ~~va~~~ga~~~~~~p~~~l~l~~~g~l~l~l~~~~-~R~~g~~~~~~~~~~l~-~~~~~~~Pdi----lIs~dG~lVav 234 (394) + .++..... .+.....+...+.+.+.+|+.. +|......+.+..+..+ .+....+++. +==+.|..+-+ T Consensus 440 ~---~l~~~~~~--~~~~~~~~~~~~~~~l~~~~~~~~r~~~~~~~~~~lll~~~~~~~~~~~~~~v~~LDVGQG~avlI 514 (754) T PRK11539 440 N---SLPEGWIN--IGERWQWLTFSPWLALIIWRFNWWRTYPAMCVALGLLMCLPLWQRPDPYEWRVDMLDVGQGLAMVI 514 (754) T ss_pred H---HHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEE T ss_conf 7---60133422--215889999999999999999999999999999999999997166788760899997278518999 Q ss_pred ECCCEEEEE-CC------CCCCHHHHHHHHHCCC Q ss_conf 748804663-47------7441379889875176 Q gi|254780357|r 235 VDKNTLISN-YS------NPPSFIFSQWKSALIT 261 (394) Q Consensus 235 ~~~~g~~~~-~~------r~~~F~~~~W~~~~g~ 261 (394) ++++...+- .+ ...+.+..-+++..|. T Consensus 515 ~~~~~~~L~DtG~~~~~~~~g~~vi~P~L~~~Gi 548 (754) T PRK11539 515 ERNGKAILYDTGNAWPEGDSAQQVIIPWLRWHNL 548 (754) T ss_pred EECCEEEEEECCCCCCCCCCHHHHHHHHHHHCCC T ss_conf 9899789995799888887237788999996799 No 2 >pfam03772 Competence Competence protein. Members of this family are integral membrane proteins with 6 predicted transmembrane helices. Some members of this family have been shown to be essential for bacterial competence in uptake of extracellular DNA. These proteins may transport DNA across the cell membrane. These proteins contain a highly conserved motif in the amino terminal transmembrane region that has two histidines that may form a metal binding site. Probab=99.88 E-value=2.3e-20 Score=135.98 Aligned_cols=173 Identities=18% Similarity=0.288 Sum_probs=135.7 Q ss_pred ECCCCCCHHHHHHHHHHHHCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 02668855589998888820485567152135879999999999999861264113577764211456788888889999 Q gi|254780357|r 4 ITKKYDPFPAETTIEGIARLSPPSGPAFPGLFDFNFSSYYKGISAIGYFYSIPKMIYSSESHNTWTWKKISSIKQSFLNE 83 (394) Q Consensus 4 ~~~~~~p~~~g~~i~~~v~L~~P~~p~~PGlFD~SF~asF~aVaaigff~~~p~~~~~~~~~~~~~~~~l~~Lr~~i~~~ 83 (394) ..||.++.........+.++..|.....+| ||+||.+|.++.++++..+..... ++.....+ T Consensus 94 ~~r~~~~~~~l~~a~~~~Ll~~P~~~~~~g-----FqLSf~av~~i~~~~~~~~~~~~~--------~~~~~~~~----- 155 (270) T pfam03772 94 LGRRISPLDSLALAALLLLLIDPLALLSAG-----FQLSFLAVAGLLLLAPRLQKRLSR--------LSPRLLAL----- 155 (270) T ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHCHH-----HHHHHHHHHHHHHHHHHHHHHHHH--------CCHHHHHH----- T ss_conf 689998799999999999997477994879-----999999999999998999888764--------00579999----- Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99996555588899999986010232788388899999999999999999999751-64168999999999999999999 Q gi|254780357|r 84 IRANIGTYGSASASIFLIKHFHCIPVYGLIANILAIPILSFVVIPAGLIAILLMIL-SLDNIPFGVMAWGLDIIIHIAHR 162 (394) Q Consensus 84 I~a~ls~~~~~a~~p~~~~~F~~~~~~~~~aNl~avP~~~~~v~p~~~~~~~l~p~-gl~~~~l~~mg~gi~~il~vA~~ 162 (394) ...++..++++|+|+.+||||++|++|+++|++++|+++++++|+.+++++..++ ++..+.++..++.++++.+++++ T Consensus 156 -~~~~s~~a~l~~~Pl~~~~F~~~s~~s~l~Nll~vPl~~~~~~p~~l~~~~~~~~~~l~~~~~~~~~~~l~~~~~~~~~ 234 (270) T pfam03772 156 -LFLVSLAAQLATLPLLLYHFGQFSLVSLLANLLAVPLVSFVVLPLALLALLLSLFPPLAALLLWLADWLLELLLALLEW 234 (270) T ss_pred -HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf -9999999999999999999387568999999999999999999999999999888889999999999999999999999 Q ss_pred HHCCCCCC-CCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 71378762-2355772279999999999997146 Q gi|254780357|r 163 ISVAHNEF-CIGRIPETSFVTIVVEFLLMVFCKT 195 (394) Q Consensus 163 va~~~ga~-~~~~~p~~~l~l~~~g~l~l~l~~~ 195 (394) ++++|++. ..+..+.+...++.+..+.+++.+. T Consensus 235 ~~~lp~~~i~~~~~~~~~~~~~~~~l~~~~~~~~ 268 (270) T pfam03772 235 LASLPGASLWVGRPSLWLLLLYYLLLLLLLLLLL 268 (270) T ss_pred HHCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 9729983764789879999999999999999995 No 3 >TIGR00361 ComEC_Rec2 DNA internalization-related competence protein ComEC/Rec2; InterPro: IPR004797 This is the DNA internalisation-related competence protein ComEC/Rec2 family. Apparent orthologs are found in 5 species so far (Haemophilus influenzae, Escherichia coli, Bacillus subtilis, Neisseria gonorrhoeae, Streptococcus pneumoniae), of which all but E. coli are model systems for the study of competence for natural transformation. This protein is a predicted multiple membrane-spanning protein likely to be involved in DNA internalisation.; GO: 0030420 establishment of competence for transformation, 0016021 integral to membrane. Probab=98.89 E-value=8.7e-08 Score=63.08 Aligned_cols=121 Identities=19% Similarity=0.198 Sum_probs=90.1 Q ss_pred ECCCCCCHHHHHHHHHHHHCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC--CCCCHHHHHHHHHHHHHH Q ss_conf 026688555899988888204855671521358799999999999998612641135777--642114567888888899 Q gi|254780357|r 4 ITKKYDPFPAETTIEGIARLSPPSGPAFPGLFDFNFSSYYKGISAIGYFYSIPKMIYSSE--SHNTWTWKKISSIKQSFL 81 (394) Q Consensus 4 ~~~~~~p~~~g~~i~~~v~L~~P~~p~~PGlFD~SF~asF~aVaaigff~~~p~~~~~~~--~~~~~~~~~l~~Lr~~i~ 81 (394) ..+++.|+..-....+..++.+|-.+..-+ |+++++.++...++|..-......- .+..-..+++..+ T Consensus 243 ~~~~~~~~~~~~~~~~~~l~~~p~~~~~~~-----~~l~~~~~~~l~~~~~~~~~~~~~~~~~p~~p~~~~~~~l----- 312 (731) T TIGR00361 243 VGRRWSGWDAWCLCLGALLLFDPLHLLSAG-----FWLSCAAVGALLLWYSWFPCPLWQLGNRPFSPKLRWVFSL----- 312 (731) T ss_pred HHCCCCCHHHHHHHHHHHHHHCCHHHHHHH-----HHHHHHHHHHHHHHHHHHCCHHHHCCCCCCCHHHHHHHHH----- T ss_conf 420034315788999999861313565444-----5667788888888765410112104787752478999999----- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999655558889999998601023278838889999999999999999999975 Q gi|254780357|r 82 NEIRANIGTYGSASASIFLIKHFHCIPVYGLIANILAIPILSFVVIPAGLIAILLMI 138 (394) Q Consensus 82 ~~I~a~ls~~~~~a~~p~~~~~F~~~~~~~~~aNl~avP~~~~~v~p~~~~~~~l~p 138 (394) ..++....+...|+.+||||.++..+.++|++++|+.+|+++|+.+.++++.- T Consensus 313 ----~~~~~~~~l~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~p~~~~~~~~~~ 365 (731) T TIGR00361 313 ----FHLQLGLLLGLLPLQLFLFHGLSLTSFLANLLAVPLYTFLLVPLILAAVLLLG 365 (731) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf ----99999999876566666651002567888888878899999889999998752 No 4 >COG0658 ComEC Predicted membrane metal-binding protein [General function prediction only] Probab=98.18 E-value=2.2e-05 Score=49.19 Aligned_cols=107 Identities=18% Similarity=0.258 Sum_probs=73.2 Q ss_pred ECCCCCCHHHHHHHHHHHHCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 02668855589998888820485567152135879999999999999861264113577764211456788888889999 Q gi|254780357|r 4 ITKKYDPFPAETTIEGIARLSPPSGPAFPGLFDFNFSSYYKGISAIGYFYSIPKMIYSSESHNTWTWKKISSIKQSFLNE 83 (394) Q Consensus 4 ~~~~~~p~~~g~~i~~~v~L~~P~~p~~PGlFD~SF~asF~aVaaigff~~~p~~~~~~~~~~~~~~~~l~~Lr~~i~~~ 83 (394) .++++.++.........+.+.+|.....+| |++||-+++..+. ++...+.... .+. .+ T Consensus 308 ~~~~~~~~~~l~~~~~~~l~~~p~~i~~~~-f~lSf~~~~~li~----~~~~~~~~~~-----~~~------~~------ 365 (453) T COG0658 308 SLRNLSSIFILLLSAIAILLLAPLAILSLG-FWLSFLAVAALIL----WSQLFRLLGL-----SWP------LP------ 365 (453) T ss_pred HCCCHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH----HHHHHHHCCC-----CCC------HH------ T ss_conf 700245799999999999995699999999-9999999999999----9998730124-----520------34------ Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999965555888999999860102327883888999999999999999999997 Q gi|254780357|r 84 IRANIGTYGSASASIFLIKHFHCIPVYGLIANILAIPILSFVVIPAGLIAILLM 137 (394) Q Consensus 84 I~a~ls~~~~~a~~p~~~~~F~~~~~~~~~aNl~avP~~~~~v~p~~~~~~~l~ 137 (394) ..+. ...++|+.++|||++++++++.|++..|+.+++.+|+.+...+.. T Consensus 366 --~~~~---~~~~~~~~~~~f~~~~~~~l~~~l~~~~~~~~i~~~~~~~~~~~~ 414 (453) T COG0658 366 --ITLA---LLATLPLLILQFGRLGLLGLLLNLIVSPLISLISLPGLLANFLLS 414 (453) T ss_pred --HHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf --6899---889889999999999999999999999999999999999999998 No 5 >PRK11539 hypothetical protein; Provisional Probab=97.77 E-value=0.00013 Score=44.66 Aligned_cols=52 Identities=13% Similarity=-0.029 Sum_probs=44.0 Q ss_pred EECCCCCCHHHHHHHHHHHHCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 3026688555899988888204855671521358799999999999998612 Q gi|254780357|r 3 SITKKYDPFPAETTIEGIARLSPPSGPAFPGLFDFNFSSYYKGISAIGYFYS 54 (394) Q Consensus 3 ~~~~~~~p~~~g~~i~~~v~L~~P~~p~~PGlFD~SF~asF~aVaaigff~~ 54 (394) ..+++.+|+..|+.....++|.+|.+..+||+||+..++....+.+.|+... T Consensus 115 ~~~~~~~~~~~Gq~w~~~~rlk~p~g~~NpggFD~~~~l~~~~i~~tg~v~~ 166 (754) T PRK11539 115 YGEYLPQAVCAGQRWAMKLKVRAVHGQLNEGGFDSQRYAIAQHQPLTGRFLQ 166 (754) T ss_pred EECCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCHHHHHHHCCCCEEEEEEE T ss_conf 8427888799888899999976887877998617899999778976999964 No 6 >TIGR00360 ComEC_N-term ComEC/Rec2-related protein; InterPro: IPR004477 This family is defined to identify a pair of paralogous 3' exoribonucleases in Escherichia coli, plus the set of proteins apparently orthologous to one or the other in other eubacteria. VacB was characterised originally as required for the expression of virulence genes, but is now recognized as the exoribonuclease RNase R (Rnr). Its paralog in Escherichia coli and Haemophilus influenzae is designated exoribonuclease II (Rnb). Both are involved in the degradation of mRNA, and consequently have strong pleiotropic effects that may be difficult to disentangle. Both these proteins share domain-level similarity (RNB, S1) with a considerable number of other proteins, and full-length similarity scoring below the trusted cut off to proteins associated with various phenotypes but uncertain biochemistry; it may be that these latter proteins are also 3' exoribonucleases.. Probab=97.55 E-value=0.00075 Score=40.27 Aligned_cols=110 Identities=18% Similarity=0.257 Sum_probs=70.5 Q ss_pred ECCCCCCHHHHHHHHHHHHCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 02668855589998888820485567152135879999999999999861264113577764211456788888889999 Q gi|254780357|r 4 ITKKYDPFPAETTIEGIARLSPPSGPAFPGLFDFNFSSYYKGISAIGYFYSIPKMIYSSESHNTWTWKKISSIKQSFLNE 83 (394) Q Consensus 4 ~~~~~~p~~~g~~i~~~v~L~~P~~p~~PGlFD~SF~asF~aVaaigff~~~p~~~~~~~~~~~~~~~~l~~Lr~~i~~~ 83 (394) ..++.++...-........+..|..+..-| |+.+|.+..++.+||..-... ...+. ...+...+... T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~g-----~~~~~~~~~~~~~w~~~~~~~-----~~~~~---~~~~~~~~~~~ 143 (187) T TIGR00360 77 SLPKLNLIGALLLSALLLLLVDPVALLSLG-----FWLSFLATFGLLLWYRVLLFP-----NFQWL---LRPLSGWIASL 143 (187) T ss_pred HHHCCCHHHHHHHHHHHHHHHCCHHHHHHH-----HHHHHHHHHHHHHHHHHHHHH-----HHHHH---HHHHHHHHHHH T ss_conf 620122047899999999984202466654-----578999999999999999743-----25553---12210378999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 999965555888999999860102327883888999999999999 Q gi|254780357|r 84 IRANIGTYGSASASIFLIKHFHCIPVYGLIANILAIPILSFVVIP 128 (394) Q Consensus 84 I~a~ls~~~~~a~~p~~~~~F~~~~~~~~~aNl~avP~~~~~v~p 128 (394) + .+.........|...++||+++..+.+.|++.+|+.++...| T Consensus 144 ~--~~~~~~~~~~~p~~~~~~~~~~~~~~~~n~~~~p~~~~~~~~ 186 (187) T TIGR00360 144 L--TLSLGLLLLSTPLLLYLFHGLSPLSVLANLLAVPLLSFLLLP 186 (187) T ss_pred H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9--999999999877888874002588999999999999997606 No 7 >PRK09424 pntA NAD(P) transhydrogenase subunit alpha; Provisional Probab=81.32 E-value=2.8 Score=19.59 Aligned_cols=46 Identities=26% Similarity=0.279 Sum_probs=36.5 Q ss_pred HHHHHHCCCCCEEEEECCCCCCCCCCCCCEEECHHHHH--HHCCEEEEECC Q ss_conf 67866176535099962676677578870886688998--50948999747 Q gi|254780357|r 318 KDMIRLSCQLSDILITTIRDINTKLCNVSLLITPEILQ--KQGSLEITIIP 366 (394) Q Consensus 318 ~~~~~~~C~~adivI~~~~~~~~~~~~~~lvid~~~L~--~~Ga~ai~l~~ 366 (394) .+.+.+.|..+||||++...|. .+++.+++++.++ +.||+-+.+.- T Consensus 239 ~~~~~e~~~~aDiVItTalIPG---~~AP~LIt~emV~~MKpGSVIVDlAa 286 (510) T PRK09424 239 MALFAEQAKEVDIIITTALIPG---KPAPKLITKEMVDSMKPGSVIVDLAA 286 (510) T ss_pred HHHHHHHHHHCCEEEEEEECCC---CCCCEECCHHHHHHCCCCCEEEEECC T ss_conf 9999999742477998654189---98997037999952699978999525 No 8 >PRK04949 putative sulfate transport protein CysZ; Validated Probab=63.06 E-value=9.1 Score=16.62 Aligned_cols=17 Identities=24% Similarity=0.505 Sum_probs=10.8 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 78838889999999999 Q gi|254780357|r 110 YGLIANILAIPILSFVV 126 (394) Q Consensus 110 ~~~~aNl~avP~~~~~v 126 (394) ++.++|+++-|..+.+. T Consensus 90 ft~i~~li~aPF~g~La 106 (251) T PRK04949 90 FSTLANWIAAPFNGLLA 106 (251) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999999727769999 No 9 >PRK00029 hypothetical protein; Validated Probab=52.37 E-value=7.5 Score=17.10 Aligned_cols=55 Identities=16% Similarity=-0.045 Sum_probs=44.0 Q ss_pred CCCCCCCEEEECCCCCEEEEECCCEEEEECCCCCCHHHHHHHHHCCCCCCCCCCC Q ss_conf 3467773489869996879974880466347744137988987517653333101 Q gi|254780357|r 215 FPYSFSPDLLISEKGNLVALVDKNTLISNYSNPPSFIFSQWKSALITPFHEAPHV 269 (394) Q Consensus 215 ~~~~~~PdilIs~dG~lVav~~~~g~~~~~~r~~~F~~~~W~~~~g~~~~~~~~~ 269 (394) .+...+=||-.-+.|+.---|.+|||+++++.-..|..+.-+.++|.|++.+-+. T Consensus 103 ~~~g~~~dlqlKGsG~TpySR~gDGra~Lrs~iREyl~sEam~~LgipTtRaLa~ 157 (487) T PRK00029 103 DADGRLYDLQLKGAGRTPYSRMGDGRAVLRSSIREYLASEAMHALGIPTTRALSL 157 (487) T ss_pred CCCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHEEE T ss_conf 6888678875136788986758886433547899999999998649974011024 No 10 >pfam02696 UPF0061 Uncharacterized ACR, YdiU/UPF0061 family. Probab=52.10 E-value=8 Score=16.92 Aligned_cols=54 Identities=13% Similarity=-0.121 Sum_probs=34.9 Q ss_pred CCCCCCEEEECCCCCEEEEECCCEEEEECCCCCCHHHHHHHHHCCCCCCCCCCC Q ss_conf 467773489869996879974880466347744137988987517653333101 Q gi|254780357|r 216 PYSFSPDLLISEKGNLVALVDKNTLISNYSNPPSFIFSQWKSALITPFHEAPHV 269 (394) Q Consensus 216 ~~~~~PdilIs~dG~lVav~~~~g~~~~~~r~~~F~~~~W~~~~g~~~~~~~~~ 269 (394) +..++=|+-.-+.|+.---|.+|||+++|+.-..|..+.-+.++|.|++.+-+. T Consensus 104 ~~g~~~dlqlKGsG~TPySR~gDGravLrs~iREyL~SEam~~LGIpTTRaLsl 157 (487) T pfam02696 104 VDGELFDWGLKGAGQTPYSRGGDGRAVLRSGIREVLASEALHRLGIPTTRALSL 157 (487) T ss_pred CCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEE T ss_conf 888678887346888996878886334446889999999998749986322589 No 11 >TIGR02882 QoxB cytochrome aa3 quinol oxidase, subunit I; InterPro: IPR014233 This entry represents subunit I of the aa3-type quinone oxidase, one of several terminal oxidases found in bacteria . This complex couples the oxidation of reduced quinones to the reduction of molecular oxygen to water, and the pumping of protons to form a proton gradient utilised for ATP production. aa3-type oxidases contain two heme a cofactors as well as copper atoms at the active site.. Probab=44.08 E-value=14 Score=15.58 Aligned_cols=25 Identities=28% Similarity=0.313 Sum_probs=10.6 Q ss_pred HHHCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 7516416899999999999999999 Q gi|254780357|r 137 MILSLDNIPFGVMAWGLDIIIHIAH 161 (394) Q Consensus 137 ~p~gl~~~~l~~mg~gi~~il~vA~ 161 (394) |.+.+.-+|..+.|==-..||++|. T Consensus 376 ML~SlAFIP~F~iGGVTGVMLa~As 400 (648) T TIGR02882 376 MLYSLAFIPNFVIGGVTGVMLAMAS 400 (648) T ss_pred HHHHHHHHHHHHHCHHHHHHHHHHH T ss_conf 8999988534452206899999975 No 12 >COG0397 Uncharacterized conserved protein [Function unknown] Probab=39.26 E-value=22 Score=14.34 Aligned_cols=56 Identities=14% Similarity=-0.051 Sum_probs=37.8 Q ss_pred HCCCCCCCEEEECCCCCEEEEECCCEEEEECCCCCCHHHHHHHHHCCCCCCCCCCC Q ss_conf 23467773489869996879974880466347744137988987517653333101 Q gi|254780357|r 214 TFPYSFSPDLLISEKGNLVALVDKNTLISNYSNPPSFIFSQWKSALITPFHEAPHV 269 (394) Q Consensus 214 ~~~~~~~PdilIs~dG~lVav~~~~g~~~~~~r~~~F~~~~W~~~~g~~~~~~~~~ 269 (394) ..+..++-|+-.-+.|+.=--|.+|||+++|+.-..|+.+.=+-+.|.+++.+-+. T Consensus 102 ~~~~G~~~diqlKGaG~TPySR~gDGRAvLrssiRE~l~SEAmh~LGIpTTRaL~l 157 (488) T COG0397 102 RGADGELFDIQLKGAGRTPYSRGGDGRAVLRSSIREYLASEALHALGIPTTRALSL 157 (488) T ss_pred ECCCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEE T ss_conf 25897078887136889984646784044415689999999999748997210356 No 13 >PRK04654 sec-independent translocase; Provisional Probab=37.34 E-value=24 Score=14.16 Aligned_cols=27 Identities=15% Similarity=0.208 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 5879999999999999861264113577 Q gi|254780357|r 35 FDFNFSSYYKGISAIGYFYSIPKMIYSS 62 (394) Q Consensus 35 FD~SF~asF~aVaaigff~~~p~~~~~~ 62 (394) ||++|+=.+ .|+.|+...-.|+++... T Consensus 2 FDIGF~ELl-LI~VIaLvVlGPERLP~a 28 (214) T PRK04654 2 FDIGVGELT-LIAVVALVVLGPERLPKA 28 (214) T ss_pred CCCCHHHHH-HHHHHHHHHCCHHHHHHH T ss_conf 651199999-999999972382463899 No 14 >pfam01262 AlaDh_PNT_C Alanine dehydrogenase/PNT, C-terminal domain. This family now also contains the lysine 2-oxoglutarate reductases. Probab=35.40 E-value=26 Score=13.98 Aligned_cols=58 Identities=16% Similarity=0.245 Sum_probs=41.1 Q ss_pred CCCHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEECHHHHH--HHCCEEEEEC-CCCCCCCCCC Q ss_conf 17967866176535099962676677578870886688998--5094899974-7765445552 Q gi|254780357|r 315 LKRKDMIRLSCQLSDILITTIRDINTKLCNVSLLITPEILQ--KQGSLEITII-PPSNDNDKVK 375 (394) Q Consensus 315 ~~~~~~~~~~C~~adivI~~~~~~~~~~~~~~lvid~~~L~--~~Ga~ai~l~-~~~~~~~~~r 375 (394) ..+.+.+.+....+|+||.+...+.. ..+.+++++.++ +.||+-+... ++|.-.|+.| T Consensus 72 ~~~~~~l~~~i~~aDvvIgavl~pg~---~aP~lIt~~mv~~MkpGsVIVDvaiDqGG~~Ets~ 132 (150) T pfam01262 72 FSNCEYLAEAIAEADLVIGTVLIPGA---RAPKLVTREMVKTMKPGSVIVDVAIDQGGCIETTR 132 (150) T ss_pred HCCHHHHHHHHHHCCEEEEEEECCCC---CCCCEECHHHHHHHCCCCEEEEEECCCCCCCCCCC T ss_conf 53799999997438799972031788---69922079999844799399996204899662577 No 15 >cd05212 NAD_bind_m-THF_DH_Cyclohyd_like NAD(P) binding domain of methylene-tetrahydrofolate dehydrogenase and methylene-tetrahydrofolate dehydrogenase/cyclohydrolase. NAD(P) binding domains of methylene-tetrahydrofolate dehydrogenase (m-THF DH) and m-THF DH/cyclohydrolase bifunctional enzymes (m-THF DH/cyclohydrolase). M-THF is a versatile carrier of activated one-carbon units. The major one-carbon folate donors are N-5 methyltetrahydrofolate, N5,N10-m-THF, and N10-formayltetrahydrofolate. The oxidation of metabolic intermediate m-THF to m-THF requires the enzyme m-THF DH. In addition, most DHs also have an associated cyclohydrolase activity which catalyzes its hydrolysis to N10-formyltetrahydrofolate. m-THF DH is typically found as part of a multifunctional protein in eukaryotes. NADP-dependent m-THF DH in mammals, birds and yeast are components of a trifunctional enzyme with DH, cyclohydrolase, and synthetase activities. Certain eukaryotic cells also contain homodimeric bifunctional Probab=31.18 E-value=30 Score=13.57 Aligned_cols=44 Identities=14% Similarity=0.211 Sum_probs=29.1 Q ss_pred HHHHCCCCCEEEEECCCCCCCCCCCCCEEECHHHHHHHCCEEEEECCCCCCCC Q ss_conf 86617653509996267667757887088668899850948999747765445 Q gi|254780357|r 320 MIRLSCQLSDILITTIRDINTKLCNVSLLITPEILQKQGSLEITIIPPSNDND 372 (394) Q Consensus 320 ~~~~~C~~adivI~~~~~~~~~~~~~~lvid~~~L~~~Ga~ai~l~~~~~~~~ 372 (394) .+.+.+.+|||+|+.. +..-.++.+-+++ |++-|..+-+....+ T Consensus 64 ~l~~~~~~ADIvI~a~--------G~p~~i~~~~vk~-GavvIDvGi~~~~~~ 107 (140) T cd05212 64 QLQSKVHDADVVVVGS--------PKPEKVPTEWIKP-GATVINCSPTKLSGD 107 (140) T ss_pred CHHHHHHHCCEEEECC--------CCCCCCCHHHCCC-CCEEEEECCCCCCCH T ss_conf 9899852349999816--------8767278977679-988998037756772 No 16 >COG3224 Uncharacterized protein conserved in bacteria [Function unknown] Probab=30.00 E-value=32 Score=13.45 Aligned_cols=30 Identities=27% Similarity=0.449 Sum_probs=23.9 Q ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 601023278838889999999999999999 Q gi|254780357|r 103 HFHCIPVYGLIANILAIPILSFVVIPAGLI 132 (394) Q Consensus 103 ~F~~~~~~~~~aNl~avP~~~~~v~p~~~~ 132 (394) .|--+....++.|++.+++++.++||..-. T Consensus 146 ~~~~l~~~~Li~t~~~v~LltYf~iP~vs~ 175 (195) T COG3224 146 GFLPLPTRVLIGTLCSVSLLTYFVIPLVSR 175 (195) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 877800999999999999887888888888 No 17 >pfam03213 Pox_P35 Poxvirus P35 protein. Probab=29.53 E-value=22 Score=14.43 Aligned_cols=116 Identities=16% Similarity=0.258 Sum_probs=58.8 Q ss_pred CCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH------------------------------HHHH Q ss_conf 0485567152135879999999999999861264113577764211------------------------------4567 Q gi|254780357|r 23 LSPPSGPAFPGLFDFNFSSYYKGISAIGYFYSIPKMIYSSESHNTW------------------------------TWKK 72 (394) Q Consensus 23 L~~P~~p~~PGlFD~SF~asF~aVaaigff~~~p~~~~~~~~~~~~------------------------------~~~~ 72 (394) --.|+--+.||+||+|-.||.--+.+..-.|.--..-....+.... ...| T Consensus 170 ~~~p~l~~YtGgyDvSLsAYIIr~~ta~kl~~~i~~~~Gvs~~l~~Ei~rlE~~l~inR~Vl~~a~~YV~hdy~l~~~~R 249 (325) T pfam03213 170 SGNPSLYSYTGGYDVSLSAYIIRVSTAKKLYAEIIKNGGVSSSLSFEIYRLENELGINRQVLNDASKYVTHDYRLVSEKR 249 (325) T ss_pred CCCCCEEEECCCCCEEEEEEEEEHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHCCHHHHHCCCCCEEECCHHHHHHHH T ss_conf 68964266337532267899997278899999998658966661211887776627302101587626743143368887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 8888888999999996555588899999986010232788388899999999999999999999751641689999 Q gi|254780357|r 73 ISSIKQSFLNEIRANIGTYGSASASIFLIKHFHCIPVYGLIANILAIPILSFVVIPAGLIAILLMILSLDNIPFGV 148 (394) Q Consensus 73 l~~Lr~~i~~~I~a~ls~~~~~a~~p~~~~~F~~~~~~~~~aNl~avP~~~~~v~p~~~~~~~l~p~gl~~~~l~~ 148 (394) +...+..+..++-.- ++.-=-.+|||-.-++++++ -+.-|-+++++++-..+ +|..|..-.=.+|. T Consensus 250 ~~e~r~~lw~rl~~W------ia~~~P~~~y~lt~PLfSfF-GlfDI~vigviiil~Ii---~L~IF~vnSKLLWf 315 (325) T pfam03213 250 VNEMRNSLWNRLGNW------IARRFPGAYYFLTTPLFSFF-GLFDINVIGVIIILFII---VLLIFDVNSKLLWF 315 (325) T ss_pred HHHHHHHHHHHHHHH------HHHHCCCHHHEECCHHHHHH-CHHHHHHHHHHHHHHHH---HHHHHCCCCHHHHH T ss_conf 642206699999999------98648753210004077762-31124788999999999---99986556148999 No 18 >PRK10494 hypothetical protein; Provisional Probab=29.24 E-value=33 Score=13.37 Aligned_cols=13 Identities=15% Similarity=0.071 Sum_probs=6.1 Q ss_pred HHHHHHHHCCCCC Q ss_conf 7988987517653 Q gi|254780357|r 251 IFSQWKSALITPF 263 (394) Q Consensus 251 ~~~~W~~~~g~~~ 263 (394) +..+.++..|.+. T Consensus 141 ~~~~~~~~lGv~~ 153 (259) T PRK10494 141 VGARVAQSLGVPR 153 (259) T ss_pred HHHHHHHHCCCCH T ss_conf 9999999839998 No 19 >pfam02882 THF_DHG_CYH_C Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain. Probab=27.33 E-value=36 Score=13.17 Aligned_cols=38 Identities=16% Similarity=0.229 Sum_probs=25.2 Q ss_pred HHHHCCCCCEEEEECCCCCCCCCCCCCEEECHHHHHHHCCEEEEECC Q ss_conf 86617653509996267667757887088668899850948999747 Q gi|254780357|r 320 MIRLSCQLSDILITTIRDINTKLCNVSLLITPEILQKQGSLEITIIP 366 (394) Q Consensus 320 ~~~~~C~~adivI~~~~~~~~~~~~~~lvid~~~L~~~Ga~ai~l~~ 366 (394) .+.+.|.+|||||+.. +.+-.++.+-++ .|++-|..+= T Consensus 72 nl~~~~~~ADIvI~A~--------G~p~~i~~~~ik-~gavvIDvGi 109 (159) T pfam02882 72 DLAEITREADIVVVAV--------GKPGLIKADWVK-PGAVVIDVGI 109 (159) T ss_pred CHHHHHHCCCEEEEEC--------CCCCCCCHHHCC-CCCEEEEECC T ss_conf 9789630034442315--------885505698858-9988998022 No 20 >COG2981 CysZ Uncharacterized protein involved in cysteine biosynthesis [Amino acid transport and metabolism] Probab=27.03 E-value=36 Score=13.14 Aligned_cols=11 Identities=36% Similarity=0.682 Sum_probs=4.1 Q ss_pred HHHHHHHHHHH Q ss_conf 38889999999 Q gi|254780357|r 113 IANILAIPILS 123 (394) Q Consensus 113 ~aNl~avP~~~ 123 (394) ++|+++-|.-+ T Consensus 91 v~~~IAapFng 101 (250) T COG2981 91 VANLIAAPFNG 101 (250) T ss_pred HHHHHHHHHHH T ss_conf 99999714526 No 21 >COG0686 Ald Alanine dehydrogenase [Amino acid transport and metabolism] Probab=26.22 E-value=37 Score=13.05 Aligned_cols=65 Identities=14% Similarity=0.202 Sum_probs=46.7 Q ss_pred CCEEEEECCCHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEECHHHHH--HHCCEEEEEC-CCCCCCCCCC Q ss_conf 865999717967866176535099962676677578870886688998--5094899974-7765445552 Q gi|254780357|r 308 SDVIVGVLKRKDMIRLSCQLSDILITTIRDINTKLCNVSLLITPEILQ--KQGSLEITII-PPSNDNDKVK 375 (394) Q Consensus 308 ~g~~va~~~~~~~~~~~C~~adivI~~~~~~~~~~~~~~lvid~~~L~--~~Ga~ai~l~-~~~~~~~~~r 375 (394) ++++..+..+++.+.+.+..+|++|-..-.+.+ .++.+++++-++ +.||+-+... ++|...|+.| T Consensus 212 ~~rv~~~~st~~~iee~v~~aDlvIgaVLIpga---kaPkLvt~e~vk~MkpGsVivDVAiDqGGc~Et~~ 279 (371) T COG0686 212 GGRVHTLYSTPSNIEEAVKKADLVIGAVLIPGA---KAPKLVTREMVKQMKPGSVIVDVAIDQGGCFETSH 279 (371) T ss_pred CCEEEEEECCHHHHHHHHHHCCEEEEEEEECCC---CCCEEHHHHHHHHCCCCCEEEEEEECCCCCEECCC T ss_conf 766699975899999874312679888884588---78601069999744798589999874888153045 No 22 >pfam09163 Form-deh_trans Formate dehydrogenase N, transmembrane. Members of this family are predominantly found in the beta subunit of formate dehydrogenase, and consist of a single transmembrane helix. They act as a transmembrane anchor, and allow for conduction of electrons within the protein. Probab=23.90 E-value=41 Score=12.80 Aligned_cols=33 Identities=18% Similarity=0.249 Sum_probs=22.4 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 999714637898799999999998523467773 Q gi|254780357|r 189 LMVFCKTNIRHIGSIMIGITIIILFTFPYSFSP 221 (394) Q Consensus 189 ~l~l~~~~~R~~g~~~~~~~~~~l~~~~~~~~P 221 (394) ..-+|++.++.+|.+.++..+++.+..-....| T Consensus 4 ~V~lWKg~~Kplg~~~~~~t~~~~~~Hyi~vGP 36 (44) T pfam09163 4 SVELWKGVLKPLGAAGMGATAAAGFLHYITVGP 36 (44) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 788772016899999999999999998973187 No 23 >TIGR00915 2A0602 RND transporter, hydrophobe/amphiphile efflux-1 (HAE1) family; InterPro: IPR004764 Hydrophobe/amphiphile efflux-1 HAE1 is involved in toxin production and resistance processes.; GO: 0005215 transporter activity, 0006810 transport, 0016021 integral to membrane. Probab=23.27 E-value=43 Score=12.72 Aligned_cols=15 Identities=13% Similarity=0.168 Sum_probs=7.6 Q ss_pred HHCCCHHHHHHHHHH Q ss_conf 971463789879999 Q gi|254780357|r 191 VFCKTNIRHIGSIMI 205 (394) Q Consensus 191 ~l~~~~~R~~g~~~~ 205 (394) .+.+.+.|++.+-.+ T Consensus 535 ~~l~~~~r~l~~Y~l 549 (1058) T TIGR00915 535 KILKRRLRYLLVYVL 549 (1058) T ss_pred HHHHCCCCHHHHHHH T ss_conf 986057504356777 No 24 >pfam02639 DUF188 Uncharacterized BCR, YaiI/YqxD family COG1671. Probab=23.24 E-value=28 Score=13.81 Aligned_cols=32 Identities=19% Similarity=0.182 Sum_probs=19.4 Q ss_pred HHHHCCCCCEEEEECCCC-CCCCCCCCCEEECH Q ss_conf 866176535099962676-67757887088668 Q gi|254780357|r 320 MIRLSCQLSDILITTIRD-INTKLCNVSLLITP 351 (394) Q Consensus 320 ~~~~~C~~adivI~~~~~-~~~~~~~~~lvid~ 351 (394) .+.+.|...|||||..-. -..|-..++.++++ T Consensus 44 ~I~~~~~~gDlVIT~Di~LAa~~l~Kga~vl~p 76 (130) T pfam02639 44 YIVERAEPGDLVITQDIPLAARVLEKGAYVLNP 76 (130) T ss_pred HHHHHCCCCCEEEECCHHHHHHHHHCCCEEECC T ss_conf 999837899999858889899999889889899 No 25 >KOG2923 consensus Probab=21.45 E-value=47 Score=12.51 Aligned_cols=25 Identities=20% Similarity=0.145 Sum_probs=20.4 Q ss_pred CCCCCCCCCEEECHHHHHHHCCEEE Q ss_conf 6775788708866889985094899 Q gi|254780357|r 338 INTKLCNVSLLITPEILQKQGSLEI 362 (394) Q Consensus 338 ~~~~~~~~~lvid~~~L~~~Ga~ai 362 (394) .-||+|++.-.|+.++|+..-.+|- T Consensus 22 ~yPCpCGDrf~It~edL~~ge~Va~ 46 (67) T KOG2923 22 YYPCPCGDRFQITLEDLENGEDVAR 46 (67) T ss_pred ECCCCCCCEEEECHHHHHCCCEEEC T ss_conf 8477788733653999867880631 No 26 >pfam02515 CoA_transf_3 CoA-transferase family III. CoA-transferases are found in organisms from all lines of descent. Most of these enzymes belong to two well-known enzyme families, but recent work on unusual biochemical pathways of anaerobic bacteria has revealed the existence of a third family of CoA-transferases. The members of this enzyme family differ in sequence and reaction mechanism from CoA-transferases of the other families. Currently known enzymes of the new family are a formyl-CoA: oxalate CoA-transferase, a succinyl-CoA: (R)-benzylsuccinate CoA-transferase, an (E)-cinnamoyl-CoA: (R)-phenyllactate CoA-transferase, and a butyrobetainyl-CoA: (R)-carnitine CoA-transferase. In addition, a large number of proteins of unknown or differently annotated function from Bacteria, Archaea and Eukarya apparently belong to this enzyme family. Properties and reaction mechanisms of the CoA-transferases of family III are described and compared to those of the previously known CoA-transferase Probab=21.01 E-value=48 Score=12.45 Aligned_cols=10 Identities=10% Similarity=0.405 Sum_probs=4.3 Q ss_pred CCCCCEEEEE Q ss_conf 6999687997 Q gi|254780357|r 226 SEKGNLVALV 235 (394) Q Consensus 226 s~dG~lVav~ 235 (394) ++.|..|-+- T Consensus 122 tG~G~~vdvS 131 (179) T pfam02515 122 TGRGQRIDVS 131 (179) T ss_pred CCCCEEEEEC T ss_conf 5997499974 No 27 >pfam07420 DUF1509 Protein of unknown function (DUF1509). This family consists of several uncharacterized viral proteins from the Marek's disease-like viruses. Members of this family are typically around 400 residues in length. The function of this family is unknown. Probab=20.49 E-value=49 Score=12.39 Aligned_cols=23 Identities=13% Similarity=-0.033 Sum_probs=18.1 Q ss_pred HHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 97516416899999999999999 Q gi|254780357|r 136 LMILSLDNIPFGVMAWGLDIIIH 158 (394) Q Consensus 136 l~p~gl~~~~l~~mg~gi~~il~ 158 (394) .++-...|++|++|+..|||--. T Consensus 59 i~~~~atWpFWq~M~kCLDWCCs 81 (377) T pfam07420 59 INVNHATWPFWQLMGKCLDWCCS 81 (377) T ss_pred CCCCCCCCHHHHHHHHHHHHHHC T ss_conf 02475423799999999998850 No 28 >pfam03092 BT1 BT1 family. Members of this family are transmembrane proteins. Several are Leishmania putative proteins that are thought to be pteridine transporters. One such protein, previously termed (and is still annotated as) ORFG, was shown to encode a biopterin transport protein using null mutants, thus being subsequently renamed BT1. The significant similarity of ORFG/BT1 to Trypanosoma brucei ESAG10 (a putative transmembrane protein and another member of this family) was previously noted. This family also contains five putative Arabidopsis thaliana proteins of unknown function. In addition, it also contains two predicted prokaryotic proteins (from the cyanobacteria Synechocystis and Synechococcus). Probab=20.36 E-value=49 Score=12.37 Aligned_cols=35 Identities=11% Similarity=0.168 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 88999999999999999999997516416899999 Q gi|254780357|r 115 NILAIPILSFVVIPAGLIAILLMILSLDNIPFGVM 149 (394) Q Consensus 115 Nl~avP~~~~~v~p~~~~~~~l~p~gl~~~~l~~m 149 (394) ..+.-.+-.+-.||..++..=+.|-|.|+-...+| T Consensus 318 ~il~~~i~~l~~mP~lvL~arLCP~G~E~T~yAlL 352 (425) T pfam03092 318 SIILEVLYMLKFMPFLVLLARLCPRGSESTVFALL 352 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 79999999999999999999878997089999999 Done!