Query gi|254780529|ref|YP_003064942.1| flagellar motor protein MotB [Candidatus Liberibacter asiaticus str. psy62] Match_columns 343 No_of_seqs 156 out of 4336 Neff 9.8 Searched_HMMs 39220 Date Sun May 29 22:58:34 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780529.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK05996 motB flagellar motor 100.0 0 0 488.4 28.6 336 5-341 2-430 (431) 2 PRK09041 motB flagellar motor 100.0 0 0 486.0 28.9 272 9-342 3-277 (317) 3 PRK06778 hypothetical protein; 100.0 0 0 461.1 27.3 259 13-341 2-260 (289) 4 PRK12799 motB flagellar motor 100.0 0 0 436.5 25.9 261 9-342 3-267 (424) 5 PRK08944 motB flagellar motor 100.0 0 0 429.9 26.4 287 20-342 3-294 (305) 6 PRK09038 flagellar motor prote 100.0 0 0 428.6 26.4 243 18-342 3-248 (285) 7 PRK07734 motB flagellar motor 100.0 0 0 412.4 25.6 246 18-341 4-254 (259) 8 PRK06925 flagellar motor prote 100.0 0 0 401.7 24.1 223 17-341 3-229 (230) 9 PRK08457 motB flagellar motor 100.0 0 0 398.8 21.9 223 19-341 3-229 (251) 10 PRK06667 motB flagellar motor 100.0 0 0 387.2 23.4 232 18-341 3-243 (254) 11 PRK06742 flagellar motor prote 100.0 0 0 378.4 22.6 218 25-340 4-225 (225) 12 COG1360 MotB Flagellar motor p 100.0 0 0 352.7 20.4 238 17-342 2-244 (244) 13 PRK09040 hypothetical protein; 100.0 4E-42 0 300.2 20.0 114 225-342 84-206 (215) 14 PRK07033 hypothetical protein; 100.0 9E-35 2.3E-39 251.2 20.9 129 211-343 297-428 (429) 15 PRK09039 hypothetical protein; 100.0 4.4E-35 1.1E-39 253.3 16.0 126 210-341 206-343 (343) 16 PRK08126 hypothetical protein; 100.0 8.3E-34 2.1E-38 244.7 19.7 126 211-340 300-428 (428) 17 TIGR03350 type_VI_ompA type VI 100.0 4.8E-31 1.2E-35 226.3 13.3 125 210-338 10-137 (137) 18 PRK10510 putative outer membra 100.0 8.2E-29 2.1E-33 211.4 14.3 120 213-339 95-217 (219) 19 PRK09967 putative outer membra 99.9 3.2E-27 8.3E-32 200.8 9.4 105 232-340 51-159 (160) 20 COG2885 OmpA Outer membrane pr 99.9 5.5E-26 1.4E-30 192.5 10.2 105 232-340 82-189 (190) 21 PRK10802 peptidoglycan-associa 99.9 1.1E-24 2.9E-29 183.8 9.8 99 235-337 71-172 (173) 22 TIGR02802 Pal_lipo peptidoglyc 99.9 7.9E-25 2E-29 184.8 8.0 100 235-338 2-104 (104) 23 PRK10808 outer membrane protei 99.9 1E-23 2.6E-28 177.4 10.9 105 233-341 224-342 (347) 24 cd07185 OmpA_C-like Peptidogly 99.9 1.6E-23 4E-28 176.2 9.7 100 235-338 4-106 (106) 25 PRK07034 hypothetical protein; 99.8 1.9E-20 4.8E-25 155.7 12.3 122 213-338 368-502 (565) 26 pfam00691 OmpA OmpA family. Th 99.8 6.9E-20 1.8E-24 151.9 9.6 93 236-332 1-97 (97) 27 pfam02472 ExbD Biopolymer tran 76.5 1.7 4.2E-05 22.6 1.9 20 32-51 8-27 (128) 28 COG3965 Predicted Co/Zn/Cd cat 69.5 1.8 4.7E-05 22.3 0.8 50 217-273 255-309 (314) 29 TIGR02814 pfaD_fam PfaD family 61.2 4.1 0.0001 20.0 1.3 41 237-277 145-201 (449) 30 PRK11409 antitoxin YefM; Provi 59.6 9.5 0.00024 17.6 2.9 50 8-70 20-69 (83) 31 PRK11024 colicin uptake protei 59.5 9 0.00023 17.7 2.8 43 250-305 90-133 (142) 32 COG5432 RAD18 RING-finger-cont 58.5 6.8 0.00017 18.5 2.0 23 236-258 280-303 (391) 33 pfam12273 RCR Chitin synthesis 57.5 9.7 0.00025 17.5 2.7 27 32-62 1-27 (124) 34 TIGR01055 parE_Gneg DNA topois 57.0 6.9 0.00018 18.5 1.8 73 265-341 526-609 (647) 35 pfam09476 Pilus_CpaD Pilus bio 56.5 13 0.00034 16.5 9.8 77 237-322 44-121 (200) 36 pfam07019 Rab5ip Rab5-interact 56.4 9.3 0.00024 17.6 2.4 29 26-55 52-80 (81) 37 pfam06898 YqfD Putative stage 55.6 9.7 0.00025 17.5 2.4 30 28-57 75-107 (383) 38 TIGR02471 sucr_syn_bact_C sucr 54.9 14 0.00036 16.4 4.2 66 237-314 117-186 (241) 39 cd01324 cbb3_Oxidase_CcoQ Cyto 52.9 15 0.00039 16.2 3.6 33 34-66 7-43 (48) 40 pfam03356 Pox_LP_H2 Viral late 50.9 10 0.00026 17.4 1.9 22 34-57 23-44 (189) 41 TIGR01059 gyrB DNA gyrase, B s 49.6 7.8 0.0002 18.1 1.1 25 240-264 355-380 (818) 42 PRK00159 putative septation in 47.0 19 0.00048 15.5 2.8 17 28-50 27-43 (87) 43 pfam00737 PsbH Photosystem II 45.7 18 0.00046 15.7 2.5 30 24-54 13-42 (52) 44 PRK11267 biopolymer transport 45.3 20 0.00051 15.4 2.9 43 251-306 86-129 (141) 45 TIGR03656 IsdC heme uptake pro 45.0 12 0.00031 16.8 1.6 15 43-57 5-19 (217) 46 COG0848 ExbD Biopolymer transp 43.9 20 0.00052 15.3 2.6 26 31-56 15-40 (137) 47 pfam11548 Receptor_IA-2 Protei 43.5 21 0.00054 15.2 4.1 69 245-318 16-90 (91) 48 pfam10624 TraS Plasmid conjuga 42.9 13 0.00032 16.7 1.4 28 41-70 27-60 (164) 49 pfam10959 DUF2761 Protein of u 40.3 5.3 0.00013 19.2 -0.9 10 266-275 16-25 (96) 50 pfam00543 P-II Nitrogen regula 40.2 24 0.00061 14.9 4.1 52 290-341 8-65 (102) 51 TIGR00946 2a69 Auxin Efflux Ca 39.0 25 0.00064 14.7 2.6 26 33-58 161-186 (608) 52 pfam03032 Brevenin Brevenin/es 35.6 24 0.00061 14.9 1.8 21 42-62 6-26 (46) 53 PRK10665 nitrogen regulatory p 35.5 28 0.00072 14.4 3.8 52 290-341 11-68 (112) 54 pfam00101 RuBisCO_small Ribulo 34.7 29 0.00074 14.3 4.7 51 248-300 13-66 (98) 55 pfam12000 DUF3495 Domain of un 34.7 29 0.00074 14.3 3.3 39 281-319 39-80 (172) 56 pfam11587 Prion_bPrPp Major pr 34.3 23 0.00059 15.0 1.6 10 44-53 10-19 (29) 57 cd06869 PX_UP2_fungi The phosp 30.4 28 0.00071 14.4 1.5 32 8-44 30-63 (119) 58 COG1735 Php Predicted metal-de 29.7 35 0.00089 13.8 7.8 64 236-317 138-201 (316) 59 PRK10369 heme lyase subunit Nr 29.6 35 0.00089 13.8 3.6 27 32-58 22-48 (552) 60 COG4646 DNA methylase [Transcr 29.6 25 0.00065 14.7 1.2 10 15-24 104-113 (637) 61 pfam01707 Peptidase_C9 Peptida 29.5 35 0.0009 13.7 2.3 94 237-336 78-182 (202) 62 KOG3385 consensus 29.2 34 0.00086 13.9 1.7 10 237-246 73-82 (118) 63 PRK13455 F0F1 ATP synthase sub 28.9 32 0.00082 14.0 1.6 26 26-52 16-41 (184) 64 TIGR00835 agcS amino acid carr 28.0 37 0.00095 13.6 5.3 55 19-73 112-174 (461) 65 KOG0605 consensus 28.0 22 0.00057 15.1 0.6 27 38-70 226-252 (550) 66 TIGR00944 2a6301s04 Multicompo 27.9 38 0.00096 13.6 2.2 29 31-59 66-106 (504) 67 PRK10858 nitrogen regulatory p 27.8 38 0.00096 13.6 4.0 52 290-341 11-68 (112) 68 cd03527 RuBisCO_small Ribulose 27.7 38 0.00096 13.5 4.8 72 257-340 22-98 (99) 69 PTZ00264 circumsporozoite-rela 27.7 35 0.00088 13.8 1.6 14 41-54 1-14 (162) 70 PRK02251 putative septation in 27.6 38 0.00097 13.5 2.5 20 25-50 22-41 (84) 71 PRK10053 hypothetical protein; 27.3 38 0.00098 13.5 1.8 16 41-56 5-20 (130) 72 pfam06151 Trehalose_recp Treha 26.9 39 0.00099 13.4 4.9 37 32-70 204-247 (414) 73 cd00649 catalase_peroxidase Ca 26.7 28 0.00071 14.4 1.0 12 32-44 128-139 (405) 74 TIGR02544 III_secr_YscJ type I 26.4 31 0.00079 14.1 1.2 11 236-246 95-105 (203) 75 PRK13165 cytochrome c-type bio 25.1 42 0.0011 13.2 2.2 25 32-57 9-33 (161) 76 PRK13289 nitric oxide dioxygen 24.9 42 0.0011 13.2 3.1 29 292-321 370-398 (399) 77 cd04742 NPD_FabD 2-Nitropropan 24.7 43 0.0011 13.2 1.8 36 241-276 144-195 (418) 78 COG1315 Uncharacterized conser 24.5 43 0.0011 13.2 1.7 19 35-54 101-119 (543) 79 TIGR02416 CO_dehy_Mo_lg carbon 24.3 29 0.00075 14.3 0.7 48 232-279 473-537 (775) 80 PRK05986 cob(I)yrinic acid a,c 24.3 33 0.00085 13.9 1.0 23 252-274 134-158 (190) 81 pfam07509 DUF1523 Protein of u 24.1 44 0.0011 13.1 1.7 37 235-271 91-140 (174) 82 cd05536 POLBc_B3 DNA polymeras 23.7 45 0.0011 13.1 1.8 32 243-275 133-164 (371) 83 TIGR02855 spore_yabG sporulati 23.4 45 0.0012 13.0 1.9 22 253-275 152-174 (292) 84 pfam09961 DUF2195 Uncharacteri 23.4 45 0.0012 13.0 3.4 53 281-339 64-116 (121) 85 TIGR02054 MerD mercuric resist 23.3 24 0.00062 14.8 0.1 22 295-317 20-41 (120) 86 TIGR02707 butyr_kinase butyrat 23.0 46 0.0012 13.0 3.2 25 291-315 48-72 (353) 87 TIGR00768 rimK_fam alpha-L-glu 22.7 47 0.0012 12.9 1.6 18 287-304 82-99 (321) 88 cd00530 PTE Phosphotriesterase 21.9 48 0.0012 12.8 5.0 65 239-320 125-189 (293) 89 pfam05545 FixQ Cbb3-type cytoc 21.4 49 0.0013 12.8 4.0 34 33-66 5-42 (49) 90 COG0711 AtpF F0F1-type ATP syn 21.4 49 0.0013 12.8 3.9 21 37-57 6-26 (161) 91 PRK12891 allantoate amidohydro 20.6 51 0.0013 12.7 4.5 12 300-311 344-355 (414) 92 COG1115 AlsT Na+/alanine sympo 20.6 51 0.0013 12.7 5.5 53 17-70 120-176 (452) 93 pfam03402 V1R Vomeronasal orga 20.4 52 0.0013 12.6 1.7 11 289-299 196-206 (265) 94 pfam03571 Peptidase_M49 Peptid 20.3 15 0.00039 16.1 -1.4 40 255-295 363-402 (550) 95 TIGR02644 Y_phosphoryl pyrimid 20.1 28 0.00073 14.4 -0.1 52 16-72 4-64 (425) 96 pfam02572 CobA_CobO_BtuR ATP:c 20.1 53 0.0013 12.6 1.6 23 252-274 116-140 (172) No 1 >PRK05996 motB flagellar motor protein MotB; Validated Probab=100.00 E-value=0 Score=488.42 Aligned_cols=336 Identities=30% Similarity=0.521 Sum_probs=220.5 Q ss_pred CCCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHC-CCCCCCC Q ss_conf 111337688689998216888878874318879999999999999999975089999999999853554320-3665555 Q gi|254780529|r 5 GDKNQENDQFIIIKKKVAIDDTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFNPFGKNL-MTASKGI 83 (343) Q Consensus 5 ~~~~~~~~~~~iikr~~~~~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~p~~~~~-~~~~~~~ 83 (343) ...||+|+|+|||||+++.++++||+|+|||+||||||+|||||+|||+++.+++++++.|++||||++... .+..+++ T Consensus 2 ~~~~~~~~eiIIIKR~~~~~~~gHHGGAWKVAYADFmTAMMAFFLvMWLlnaTteeqr~gIA~YFnP~~l~d~~~~~kG~ 81 (431) T PRK05996 2 EIEHEGKNEIIIVRRGGHDDHEGHHGGVWKIAYADFMTAMMAFFLVMWLINAANEETKAAVASYFNPVKLTDRKPSPKGL 81 (431) T ss_pred CCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 87768987428999614889999977323889988999999999999980599999999999873843212245643344 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCC--------C--------------CCCC-----------------------CCCCC-- Q ss_conf 55567677666665544455566--------7--------------6665-----------------------65667-- Q gi|254780529|r 84 FDEQNPPERSSQNKIDLSIDEHI--------T--------------KNSV-----------------------DLKIG-- 116 (343) Q Consensus 84 ~~~~~~~~~~~~~~~d~~~~~~~--------~--------------~~~~-----------------------~~~~~-- 116 (343) .+..................... . .++. ....+ T Consensus 82 ~dp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~lF~~Py~vl~~ia~~~~~~~~~s~~g~g~~~~~~~~ 161 (431) T PRK05996 82 KDPGEQKPGKSKFESDQRVEGSPAVVGDTTTRPAGDQTNYSEADLFRNPYAVLAEIAHEVGVLANVSAKGDGGAAQSGPA 161 (431) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 56433455544334444566764434444456544333310333204613555443012221112234566654445665 Q ss_pred -CCCCCCCCCCCCCCCCCCCCCCCCCCC---------CCCCCCCCCC-------------CCCCCCHHH----------- Q ss_conf -766554432112221112332111112---------2222334322-------------222220012----------- Q gi|254780529|r 117 -STNQKKQYQRLENNIFYLSNSQQSEHC---------KNSLTRDSEK-------------GKDLCKSTD----------- 162 (343) Q Consensus 117 -~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~-------------~~~~~~~~~----------- 162 (343) +......+...+++..+.....-.... .......... ......... T Consensus 162 ~G~~gG~~~rDPFdP~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (431) T PRK05996 162 TGADGGEAYRDPFDPDFWSKQVEVPLAGKALPPKEAGEQAFDEAATELAKGAGRKPSQLTTDAKAAPAAASSASLPPAPK 241 (431) T ss_pred CCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 45456633357656201110123333444456544445543211111122334565544443334443322334555555 Q ss_pred HHHCCC-----------CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEEEECCC Q ss_conf 321001-----------110012465213578878889999999999999975553321013114511453499997034 Q gi|254780529|r 163 LEKSIN-----------KEENYFLPPLSKEKILAMKRKKRLQDLAKKISSTLSGLVADNIVKGVLFETTRTGILISIIDQ 231 (343) Q Consensus 163 ~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~i~I~i~d~ 231 (343) ...... .......................++.+...+.+.+... ...+..++.|+.+++||+|+|+|+ T Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~l~~~-~~~l~~~v~v~~t~eGLrIqI~D~ 320 (431) T PRK05996 242 TKPQTAPLPQAKPAKAETQAELIADLKKAAAGEPAPQQGKAEKLQAAIAAALGGI-AGKLMPGVTVTPVEGGLLISLTDQ 320 (431) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHCCCEEEEECCCEEEEEEEEC T ss_conf 6554444532110244443310011444322223455789999999999987765-676336528986599279999858 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEE Q ss_conf 56745556875468356565443678986127833898731467667874106999999999999999986999657122 Q gi|254780529|r 232 RNTPMFDKSSSIPLPETIVVLQKIGEVLAHSTEVISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSEDRISK 311 (343) Q Consensus 232 ~~~~lF~~gsa~l~~~~~~~l~~ia~~l~~~~~~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~ 311 (343) .+..||..|||.++|.++.+|.+|+.+|...|++|.|+||||++||.+..|+||+||++||.+.++.|+..|+++.||.+ T Consensus 321 ~~~~MF~~GSA~p~p~~~~lL~~Ia~vL~~~pn~I~I~GHTDa~Pf~~~~y~NWeLSa~RA~aARr~L~~gGl~~~RI~r 400 (431) T PRK05996 321 VDYGMFAIGSAVPRAELVLAMEKIGQLLAEQPGAVIIRGHTDARPFKSATYDNWRLSTARAQMAYYMLVRGGLDEKRILR 400 (431) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHEEE T ss_conf 99988477977568899999999999996389817998756887777889984211189999999999985998546667 Q ss_pred EEECCCCCCCCCCCCCCCCCCEEEEEEECC Q ss_conf 563033488889885213375599999569 Q gi|254780529|r 312 ISGFAHHRLKIASDPMNSANRRIDILVEDR 341 (343) Q Consensus 312 v~G~g~~~P~~~n~t~~~~NRRVEi~i~~~ 341 (343) |+||+|++|+.++||..++||||||+|+.. T Consensus 401 V~G~ADr~p~~~~dP~a~~NRRIeILlr~~ 430 (431) T PRK05996 401 VEGYADRRPKLPNDPNAAANRRIEILIQAK 430 (431) T ss_pred EEEECCCCCCCCCCCCCCCCCEEEEEEECC T ss_conf 887135676888999886457489997058 No 2 >PRK09041 motB flagellar motor protein MotB; Validated Probab=100.00 E-value=0 Score=485.98 Aligned_cols=272 Identities=30% Similarity=0.510 Sum_probs=195.4 Q ss_pred CCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 37688689998216888878874318879999999999999999975089999999999853554320366555555567 Q gi|254780529|r 9 QENDQFIIIKKKVAIDDTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFNPFGKNLMTASKGIFDEQN 88 (343) Q Consensus 9 ~~~~~~~iikr~~~~~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~p~~~~~~~~~~~~~~~~~ 88 (343) ++|++||||||||+ ++++||+|+||||||||||||||||||||+||.+|++||++|++||+........+......... T Consensus 3 ~~~~~~IIiKr~k~-~~~~hHgGaWkVaYADFmTamMAFFvvmw~is~~d~~k~k~iA~yF~~p~~~~~~~g~~~~~~~~ 81 (317) T PRK09041 3 KENNRPIVVKRRKK-QGGGHHGGSWKIAYADFMTAMMAFFLVMWLLSSSTPEQLIGIAEYFRTPLKVALTGGDRSSNSPS 81 (317) T ss_pred CCCCCCEEEEECCC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 88999779996268-88899897048789999999999999999962799999999999874115665037766678887 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCC Q ss_conf 67766666554445556676665656677665544321122211123321111122222334322222220012321001 Q gi|254780529|r 89 PPERSSQNKIDLSIDEHITKNSVDLKIGSTNQKKQYQRLENNIFYLSNSQQSEHCKNSLTRDSEKGKDLCKSTDLEKSIN 168 (343) Q Consensus 89 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (343) +.+ .+..+...... ... ..... T Consensus 82 ~~~-----------------------~~g~~~~~~~~--------------------~~~-~~~~~-------------- 103 (317) T PRK09041 82 PIP-----------------------GGGDDYTQKDG--------------------EVN-KQTKP-------------- 103 (317) T ss_pred CCC-----------------------CCCCCCCCCCC--------------------CCC-CCCCC-------------- T ss_conf 667-----------------------99987655556--------------------657-66677-------------- Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHH Q ss_conf 11001246521357887888999999999999997555-33210131145114534999970345674555687546835 Q gi|254780529|r 169 KEENYFLPPLSKEKILAMKRKKRLQDLAKKISSTLSGL-VADNIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPE 247 (343) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~ 247 (343) ....................++.+...+...+... ....+..++.++.+++||+|+|.|+.+..||++|||+|+|. T Consensus 104 ---~~~~~~~~~~~~~~~~e~~~l~~l~~~l~~~i~~~~~l~~l~~~v~v~~t~~GL~I~I~D~~~~~mF~sGSA~l~p~ 180 (317) T PRK09041 104 ---TKSLNAPQAEELKERMEQERLEKLKQELDQAIESNPKLRQFKPQLLIDLTQEGLRIQIVDSQNRPMFATGSAEVEPY 180 (317) T ss_pred ---CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEECCCEEEEEEECCCCCCCCCCCCCCCHH T ss_conf ---66554201345788899999999999999998744667764114479980694599998667786767886645888 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCC--CCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCCC Q ss_conf 656544367898612783389873146766787--410699999999999999998699965712256303348888988 Q gi|254780529|r 248 TIVVLQKIGEVLAHSTEVISIRGHTDASPFRNI--ARDNWRLSLDRAYSAYQVLMKSGVSEDRISKISGFAHHRLKIASD 325 (343) Q Consensus 248 ~~~~l~~ia~~l~~~~~~i~IeGHTD~~p~~g~--~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~~n~ 325 (343) ++.+|.+|+.+|+..++.|.|+||||++||.+. +++||+||++||.+|+++|+..|++++||.+|+||||++|+.+++ T Consensus 181 ~~~lL~~Ia~vL~~~~n~I~I~GHTD~~P~~~~~~~~SNWeLSaaRA~aarr~L~~~Gl~~~Ri~~V~G~AD~~Pl~~~d 260 (317) T PRK09041 181 MRDILREIAPVLNDVPNRISLSGHTDATPYAGGEKGYSNWELSADRANASRRELVAGGMDEGKVLRVVGLASTMLLDKKD 260 (317) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHEEEEEEECCCCCCCCCC T ss_conf 99999999999972578569987558777567888899515589999999999998499976878999835677589999 Q ss_pred CCCCCCCEEEEEEECCC Q ss_conf 52133755999995699 Q gi|254780529|r 326 PMNSANRRIDILVEDRQ 342 (343) Q Consensus 326 t~~~~NRRVEi~i~~~~ 342 (343) |..++||||||+|+.++ T Consensus 261 P~~a~NRRI~I~vLt~~ 277 (317) T PRK09041 261 PLNPVNRRISIVVLNKK 277 (317) T ss_pred CCCCCCCCEEEEEECHH T ss_conf 87654685799997859 No 3 >PRK06778 hypothetical protein; Validated Probab=100.00 E-value=0 Score=461.07 Aligned_cols=259 Identities=28% Similarity=0.436 Sum_probs=192.0 Q ss_pred CCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 86899982168888788743188799999999999999999750899999999998535543203665555555676776 Q gi|254780529|r 13 QFIIIKKKVAIDDTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFNPFGKNLMTASKGIFDEQNPPER 92 (343) Q Consensus 13 ~~~iikr~~~~~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~p~~~~~~~~~~~~~~~~~~~~~ 92 (343) +.+|||||+++.+++||+|+||||||||||||||||||||+||.+|++||++|+.||+.... +...+.... T Consensus 2 ~~~Iikr~~k~~~~~hHgGaWkVaYADfmTaLMAFFvvmw~ls~~~~~k~~~ia~~~~~~~~---------~~~g~~~~~ 72 (289) T PRK06778 2 ATTIIKRANRKKHHVRHSGAWKVAFADFTLAMMALFMVLWIVNSVSKSERESIIAALHGQSI---------FNGGGLSPL 72 (289) T ss_pred CEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC---------CCCCCCCCC T ss_conf 61899985337888997840598999999999999999999757999999999998455644---------348987865 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCC Q ss_conf 66665544455566766656566776655443211222111233211111222223343222222200123210011100 Q gi|254780529|r 93 SSQNKIDLSIDEHITKNSVDLKIGSTNQKKQYQRLENNIFYLSNSQQSEHCKNSLTRDSEKGKDLCKSTDLEKSINKEEN 172 (343) Q Consensus 93 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (343) .... + ..+. ........ .+... . ..+. T Consensus 73 ~~~~-~-------~~~~--~~~~~~~~-------------------------------~~~~~-~--~~~~--------- 99 (289) T PRK06778 73 NKIS-P-------SHPP--KPATVAVP-------------------------------EETEK-K--ARDV--------- 99 (289) T ss_pred CCCC-C-------CCCC--CCCCCCCC-------------------------------CCCCC-C--HHHH--------- T ss_conf 6678-8-------8888--87656677-------------------------------66764-0--0105--------- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 12465213578878889999999999999975553321013114511453499997034567455568754683565654 Q gi|254780529|r 173 YFLPPLSKEKILAMKRKKRLQDLAKKISSTLSGLVADNIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVL 252 (343) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l 252 (343) .............++.++...+...... ..+..++.++.+++||+|+|.|..+..||++|||+|+|.++.+| T Consensus 100 -----~~~~~~~~~~~~~~l~~l~~~i~~~~~~---~~l~~~v~~~it~~GL~I~i~D~~~~~mF~~GSa~l~p~~~~lL 171 (289) T PRK06778 100 -----NEKTALLKKKSATELGELATSINTIARD---AHMEANLEMEIVPQGLRVLIKDDQNRNMFERGSAQIMPFFKRLL 171 (289) T ss_pred -----HHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHCEEEEECCCCEEEEEEECCCCCCCCCCCCCCCHHHHHHH T ss_conf -----6777888888899999999999999988---76642704898489408999837888782889875687899999 Q ss_pred HHHHHHHHHCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCCCCCCCCCC Q ss_conf 43678986127833898731467667874106999999999999999986999657122563033488889885213375 Q gi|254780529|r 253 QKIGEVLAHSTEVISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSEDRISKISGFAHHRLKIASDPMNSANR 332 (343) Q Consensus 253 ~~ia~~l~~~~~~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~~n~t~~~~NR 332 (343) ..|+.+|+..|+.|.|+||||++||++..|+|||||++||.+|+++|+..|++++||.+|+||||++|+.+++|..++|| T Consensus 172 ~~ia~vL~~~~n~I~I~GHTD~~Pi~~~~ySNWELSa~RA~avrr~L~~~Gl~~~ri~~V~G~Ad~~p~~~~dP~~~~NR 251 (289) T PRK06778 172 VELAPVFDSLDNKIIITGHTDAMAYKNNIYNNWNLSGDRALSARRVLEEAGMPEDKVMQVSAMADQMLLDAKNPQSAGNR 251 (289) T ss_pred HHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHEEEEEEECCCCCCCCCCCCCCCCC T ss_conf 99999986289538997405777777888996066899999999999985999768889998457777899998765468 Q ss_pred EEEEEEECC Q ss_conf 599999569 Q gi|254780529|r 333 RIDILVEDR 341 (343) Q Consensus 333 RVEi~i~~~ 341 (343) ||||+|+++ T Consensus 252 RI~I~vL~~ 260 (289) T PRK06778 252 RIEIMVLTK 260 (289) T ss_pred CEEEEEECH T ss_conf 579999786 No 4 >PRK12799 motB flagellar motor protein MotB; Reviewed Probab=100.00 E-value=0 Score=436.52 Aligned_cols=261 Identities=31% Similarity=0.515 Sum_probs=198.1 Q ss_pred CCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH-HHHHHCCCCCCCCCCCC Q ss_conf 37688689998216888878874318879999999999999999975089999999999853-55432036655555556 Q gi|254780529|r 9 QENDQFIIIKKKVAIDDTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFN-PFGKNLMTASKGIFDEQ 87 (343) Q Consensus 9 ~~~~~~~iikr~~~~~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~-p~~~~~~~~~~~~~~~~ 87 (343) |.|+.+||||||++.++++||||+|||+||||||+|||||+|||+|+.+++++++.|++||+ |.......+... .... T Consensus 3 ~~n~PiIIiKKrkkk~g~GHHGGAWKVAYADFvTAMMAFFLVLWLL~stt~eqr~gIAdYFrtPl~va~~~G~~s-~~s~ 81 (424) T PRK12799 3 HQNHPVILVKKRKTKHGAAHYGGSWKIAYADFMTAMMAFFLVMWLLAVSSPQELTQIAEYFRTPLKVALTSGDKS-SSST 81 (424) T ss_pred CCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCHHHHHCCCCCCC-CCCC T ss_conf 888886999887557898987604799999999999999999998548999999999998508035432578777-8877 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCC Q ss_conf 76776666655444555667666565667766554432112221112332111112222233432222222001232100 Q gi|254780529|r 88 NPPERSSQNKIDLSIDEHITKNSVDLKIGSTNQKKQYQRLENNIFYLSNSQQSEHCKNSLTRDSEKGKDLCKSTDLEKSI 167 (343) Q Consensus 88 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 167 (343) ++ ...++.+.. .. ..+... .. T Consensus 82 sp-----------------------IpgGG~dpt------~~--------------------~gev~~----------~~ 102 (424) T PRK12799 82 SP-----------------------IPGGGDDPT------QQ--------------------VGEVRK----------QI 102 (424) T ss_pred CC-----------------------CCCCCCCCC------CC--------------------CCCHHC----------CC T ss_conf 86-----------------------789998733------23--------------------321110----------23 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCH Q ss_conf 11100124652135788788899999999999999755-53321013114511453499997034567455568754683 Q gi|254780529|r 168 NKEENYFLPPLSKEKILAMKRKKRLQDLAKKISSTLSG-LVADNIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLP 246 (343) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~ 246 (343) ...........++.+...+.+.+.. .....+..++.++.+++||+|.|+|+.++.||+.||++|+| T Consensus 103 -------------~~~~s~~e~~rl~~L~e~Leq~Ie~~P~L~~lk~qLlIdiTpEGLRIQIvD~qnRPMF~~GSA~l~P 169 (424) T PRK12799 103 -------------NSEESRQEIHRLNKLREKLDQLIESDPRLKALRPHLLINMMDEGLRIQIIDSQNRPMFKMGSAQVEP 169 (424) T ss_pred -------------CHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCEEEEEECCCEEEEEEECCCCCCCCCCCCCCCH T ss_conf -------------3566788999999999999999972987998741746997089358999857999760127775788 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCC-C-CCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCC Q ss_conf 565654436789861278338987314676678-7-41069999999999999999869996571225630334888898 Q gi|254780529|r 247 ETIVVLQKIGEVLAHSTEVISIRGHTDASPFRN-I-ARDNWRLSLDRAYSAYQVLMKSGVSEDRISKISGFAHHRLKIAS 324 (343) Q Consensus 247 ~~~~~l~~ia~~l~~~~~~i~IeGHTD~~p~~g-~-~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~~n 324 (343) .++.+|..|+.+|.+.|++|.|+||||+.||.+ . .|+|||||+.||++.++.|+..|++.+||++|.|++++.|+..+ T Consensus 170 y~rdIL~aIApvLn~vPNkIsISGHTDA~PYa~G~~gYSNWELSADRANAARReLvaGGm~e~KvlRVvGmAst~~ld~~ 249 (424) T PRK12799 170 YMRDILRAIAPILNDIPNKLSLSGHTDDLPYARGERGYSNWELSADRANASRRELLAGGLDEGKILRVVGMASTMRLKEQ 249 (424) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCC T ss_conf 99999999999986589834886316885556899887656223666879999999768981136789851213414667 Q ss_pred CCCCCCCCEEEEEEECCC Q ss_conf 852133755999995699 Q gi|254780529|r 325 DPMNSANRRIDILVEDRQ 342 (343) Q Consensus 325 ~t~~~~NRRVEi~i~~~~ 342 (343) ++....||||.|++++++ T Consensus 250 ~p~~PvNRRISIiVLnK~ 267 (424) T PRK12799 250 ASDDPVNRRISILVLNKQ 267 (424) T ss_pred CCCCCCCCEEEEEEECHH T ss_conf 999854365789996668 No 5 >PRK08944 motB flagellar motor protein MotB; Reviewed Probab=100.00 E-value=0 Score=429.90 Aligned_cols=287 Identities=20% Similarity=0.295 Sum_probs=183.0 Q ss_pred ECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCC-CCCCCCCCCCCCCCCCC Q ss_conf 21688887887431887999999999999999997508999999999985355432036655-55555676776666655 Q gi|254780529|r 20 KVAIDDTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFNPFGKNLMTASK-GIFDEQNPPERSSQNKI 98 (343) Q Consensus 20 ~~~~~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~p~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 98 (343) ||+|+++|+|+|+||||||||||||||||||||+||.+|++||++|++||+..+........ ......+... T Consensus 3 ~~~~~c~p~g~paW~vaYADfmTlLmAFFvlL~~~S~~d~~k~~~ia~s~~~afg~~~~~~~~~~~~G~s~i~------- 75 (305) T PRK08944 3 EKKCKCPPPGLPAWLATFADLMSLLMCFFVLLLSFSEMDVLKFKQIAGSMKFAFGVQNKVEVKDIPKGTSVIA------- 75 (305) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC------- T ss_conf 5568997799950898999999999999999998623899999999999998635556566566788865334------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCC Q ss_conf 44455566766656566776655443211222111233211111222223343222222200123210011100124652 Q gi|254780529|r 99 DLSIDEHITKNSVDLKIGSTNQKKQYQRLENNIFYLSNSQQSEHCKNSLTRDSEKGKDLCKSTDLEKSINKEENYFLPPL 178 (343) Q Consensus 99 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (343) ....+....+.......+.................. ....... ............... T Consensus 76 -----~~~~p~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~--~~~~~~~~~~~~~~~ 133 (305) T PRK08944 76 -----QEFRPGRPEPTPIEVIMQQTNEMTQPTLEFQAGESD---------------SAGGTQQ--QRGKQRGGEASATAQ 133 (305) T ss_pred -----CCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCC---------------CCCCCCC--CCCCCCCCCCCHHHH T ss_conf -----456788988873112104565555553112355544---------------5442101--110001332100345 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 13578878889999999999999975553321013114511453499997034567455568754683565654436789 Q gi|254780529|r 179 SKEKILAMKRKKRLQDLAKKISSTLSGLVADNIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVLQKIGEV 258 (343) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~ia~~ 258 (343) .............++.+...+...+...+ ....+.++..+.+|+|+|.| ..||++|||.|+|.++.+|++|+.+ T Consensus 134 ~~~~~~~~~~~~~~~~~~~~l~~~l~~~i---~~~~v~v~~~~~~i~I~l~d---~~lF~~GSa~l~p~~~~lL~~ia~~ 207 (305) T PRK08944 134 ETAQAVSAAAQEQINELVKKIAQQLEKEI---EDGAIEIESLGQQIIIRIRE---KGSFPSGSAFLQPKFKPLVRKIGEL 207 (305) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HCCCEEEEECCCEEEEEECC---CCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 55566788899999999999999999753---13763799738879999568---7675788642487689999999999 Q ss_pred HHHCCCCEEEEEEECCCCCCCCC-CCHHHHHHHHHHHHHHHHHH-CCCCHHHEEEEEECCCCCCCCCCCCCC--CCCCEE Q ss_conf 86127833898731467667874-10699999999999999998-699965712256303348888988521--337559 Q gi|254780529|r 259 LAHSTEVISIRGHTDASPFRNIA-RDNWRLSLDRAYSAYQVLMK-SGVSEDRISKISGFAHHRLKIASDPMN--SANRRI 334 (343) Q Consensus 259 l~~~~~~i~IeGHTD~~p~~g~~-~~Nw~LS~~RA~aV~~~L~~-~Gv~~~ri~~v~G~g~~~P~~~n~t~~--~~NRRV 334 (343) |+..++.|.|+||||+.|+.+.. ++||+||++||.+|++||+. .|+++.|+ +|+||||++|+++|+|.+ ++|||| T Consensus 208 l~~~~n~I~I~GHTD~~p~~~~~~~sNWeLSaaRA~~v~r~L~~~~gi~~~rl-~v~G~ad~~Pl~~n~t~e~ra~NRRV 286 (305) T PRK08944 208 LKDVPGIITVSGHTDNMPISSELYRSNWDLSSARAVAVAHELLKVKGFDPSRL-KVVGMADTQPLVPNDSPENRARNRRV 286 (305) T ss_pred HHCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEE-EEEEECCCCCCCCCCCHHHHHCCCCE T ss_conf 97389627997415777665556787489999999999999998069894348-99986578748999780134118877 Q ss_pred EEEEECCC Q ss_conf 99995699 Q gi|254780529|r 335 DILVEDRQ 342 (343) Q Consensus 335 Ei~i~~~~ 342 (343) ||+|..++ T Consensus 287 eI~il~~~ 294 (305) T PRK08944 287 EISIEQGK 294 (305) T ss_pred EEEEECCC T ss_conf 99993798 No 6 >PRK09038 flagellar motor protein MotD; Reviewed Probab=100.00 E-value=0 Score=428.62 Aligned_cols=243 Identities=22% Similarity=0.367 Sum_probs=175.6 Q ss_pred EEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98216888878874318879999999999999999975089999999999853554320366555555567677666665 Q gi|254780529|r 18 KKKVAIDDTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFNPFGKNLMTASKGIFDEQNPPERSSQNK 97 (343) Q Consensus 18 kr~~~~~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 97 (343) ||||+++ +++|+++||+||||||||||||||||||||.+|+.||+++++||.-.+.......... ...... T Consensus 3 ~Rrk~~~-~~~~~~aW~vtyADlmTLLlaFFVlL~smS~~d~~K~~~~~~sl~~af~~~~~~~~~~-------~~~~~~- 73 (285) T PRK09038 3 RRRRHHE-EHENHERWLVSYADFITLLFAFFVVMYAISSVNEGKYKVLSEALTGAFNGPDRSLKPI-------PIGEEQ- 73 (285) T ss_pred CCCCCCC-CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCCCC-------CCCCCC- T ss_conf 7767887-8999861888999999999999999999725899999999999998735877666766-------677777- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCC Q ss_conf 54445556676665656677665544321122211123321111122222334322222220012321001110012465 Q gi|254780529|r 98 IDLSIDEHITKNSVDLKIGSTNQKKQYQRLENNIFYLSNSQQSEHCKNSLTRDSEKGKDLCKSTDLEKSINKEENYFLPP 177 (343) Q Consensus 98 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (343) ........ ...... T Consensus 74 ---------~~~~~~~~---------------------------------------------------------~~~~~~ 87 (285) T PRK09038 74 ---------PLSPEPDR---------------------------------------------------------PSVIKA 87 (285) T ss_pred ---------CCCCCCCC---------------------------------------------------------CCCCCC T ss_conf ---------77877776---------------------------------------------------------554555 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 21357887888999999999999997555332101311451145349999703456745556875468356565443678 Q gi|254780529|r 178 LSKEKILAMKRKKRLQDLAKKISSTLSGLVADNIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVLQKIGE 257 (343) Q Consensus 178 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~ia~ 257 (343) ..............+..+...+.+.+..... ...+.++..+.||+|+|.+ .+||++|||+|++.++.+|++|+. T Consensus 88 ~~~~~~~~~~~~~~l~~~~~~l~~~l~~~~~---~~~v~v~~~~~gl~I~l~~---~~lF~~GSA~L~~~~~~lL~~ia~ 161 (285) T PRK09038 88 GQPEQAQEARAQDQLKTIADDLRKALGPLID---SGQVTVRRNELWIEIEINS---SLLFGSGDALLSDEAFAILEKVAE 161 (285) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH---CCCEEEEECCCEEEEEECC---CCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 5431266788999999999999999986411---5835999739979999568---868688843249879999999999 Q ss_pred HHHHCCCCEEEEEEECCCCCCCCC-CCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCCCCCC--CCCCEE Q ss_conf 986127833898731467667874-10699999999999999998699965712256303348888988521--337559 Q gi|254780529|r 258 VLAHSTEVISIRGHTDASPFRNIA-RDNWRLSLDRAYSAYQVLMKSGVSEDRISKISGFAHHRLKIASDPMN--SANRRI 334 (343) Q Consensus 258 ~l~~~~~~i~IeGHTD~~p~~g~~-~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~~n~t~~--~~NRRV 334 (343) +|+..++.|.|+||||++||++.. ++||+||++||.+|++||+..|++++||. +.||||++|+++|+|.+ ++|||| T Consensus 162 ~L~~~~n~I~I~GHTD~~Pi~~~~~~sNWeLSaaRA~aV~r~L~~~Gi~p~Rl~-~~Gygd~~P~~~N~t~egRa~NRRV 240 (285) T PRK09038 162 VLKPFPNPIHVEGFTDNVPIATAQFPSNWELSAARAASVVRLLADDGVAPSRLA-AVGYGEFQPVADNDTAEGRARNRRV 240 (285) T ss_pred HHHHCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEE-EEEECCCCCCCCCCCCHHHHHCCCE T ss_conf 998438728999857887655566786489999999999999997599967879-9876578768999480025307876 Q ss_pred EEEEECCC Q ss_conf 99995699 Q gi|254780529|r 335 DILVEDRQ 342 (343) Q Consensus 335 Ei~i~~~~ 342 (343) ||+|.++. T Consensus 241 eIvILr~~ 248 (285) T PRK09038 241 VLVILRNL 248 (285) T ss_pred EEEEECCH T ss_conf 99996786 No 7 >PRK07734 motB flagellar motor protein MotB; Reviewed Probab=100.00 E-value=0 Score=412.43 Aligned_cols=246 Identities=23% Similarity=0.378 Sum_probs=178.4 Q ss_pred EEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98216888878874318879999999999999999975089999999999853554320366555555567677666665 Q gi|254780529|r 18 KKKVAIDDTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFNPFGKNLMTASKGIFDEQNPPERSSQNK 97 (343) Q Consensus 18 kr~~~~~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 97 (343) ||||+++|+.||+++||+|||||||||||||||||+||.+|.+||+++++||...+. +..+......+..+..... T Consensus 4 rr~kk~~~~~~~~~~WlvtyADlmTLLlaFFVlL~s~S~~d~~k~~~~a~s~~~~f~----~~~g~~~~~~~~~~~~~~~ 79 (259) T PRK07734 4 RKKKKKKHEEHVDESWLIPYADLLTLLLALFIVLFAMSSIDAAKFKQMAVAFNSVFT----GGTGNKEFLSDQKPNDEKE 79 (259) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHC----CCCCCCCCCCCCCCCCCCC T ss_conf 556778777899971487999999999999999999764899999999999998742----7877555677778776655 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCC Q ss_conf 54445556676665656677665544321122211123321111122222334322222220012321001110012465 Q gi|254780529|r 98 IDLSIDEHITKNSVDLKIGSTNQKKQYQRLENNIFYLSNSQQSEHCKNSLTRDSEKGKDLCKSTDLEKSINKEENYFLPP 177 (343) Q Consensus 98 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (343) .... ... ...... T Consensus 80 ------------------~~~~----------------------------~~~---------------------~~~~~~ 92 (259) T PRK07734 80 ------------------LSAS----------------------------SLE---------------------QEEAKK 92 (259) T ss_pred ------------------CCCC----------------------------CCC---------------------CCCCHH T ss_conf ------------------5667----------------------------743---------------------221000 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 21357887888999999999999997555332101311451145349999703456745556875468356565443678 Q gi|254780529|r 178 LSKEKILAMKRKKRLQDLAKKISSTLSGLVADNIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVLQKIGE 257 (343) Q Consensus 178 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~ia~ 257 (343) ............+.+..+...+.+.+... .+.+.+.++.++.||+|+|.+. +||++|||+|+|+++.+|.+++. T Consensus 93 ~~~~~~~~~~~~~~l~~~~~~l~~~l~~~---~l~~~v~v~~~~~Gl~I~l~~~---~lF~~GsA~l~~~~~~lL~~ia~ 166 (259) T PRK07734 93 QAEAREKKKKEMEELKAVQKKIDQYIKEK---QLSSSLQTKLTEEGLLVTILDN---ILFDSGKADVRLEDLPLAKEMSN 166 (259) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HCCCCEEEEECCCEEEEEECCC---CCCCCCCCCCCHHHHHHHHHHHH T ss_conf 11100345788999999999999999863---2036537997488379995377---66688987569779999999999 Q ss_pred HHHHCCC-CEEEEEEECCCCCCCCC-CCHHHHHHHHHHHHHHHHHH-CCCCHHHEEEEEECCCCCCCCCCCCCC--CCCC Q ss_conf 9861278-33898731467667874-10699999999999999998-699965712256303348888988521--3375 Q gi|254780529|r 258 VLAHSTE-VISIRGHTDASPFRNIA-RDNWRLSLDRAYSAYQVLMK-SGVSEDRISKISGFAHHRLKIASDPMN--SANR 332 (343) Q Consensus 258 ~l~~~~~-~i~IeGHTD~~p~~g~~-~~Nw~LS~~RA~aV~~~L~~-~Gv~~~ri~~v~G~g~~~P~~~n~t~~--~~NR 332 (343) +|...+. .|.|+||||+.|+.+.. .+||+||++||.+|++||+. .|+++.|+. |+||||++|+++|+|++ ++|| T Consensus 167 ~l~~~~~~~I~V~GHTD~~Pi~~~~~~sNWeLSaaRA~sV~~~L~~~~gi~~~rl~-v~G~gd~~Pi~~n~t~egra~NR 245 (259) T PRK07734 167 LLVSNPPRNITISGHTDNVPIANAQFASNWELSVMRAVNFMQVLLENKELDPEKFS-AKGYGEYKPIASNDTAEGRAKNR 245 (259) T ss_pred HHHHCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEE-EEEECCCCCCCCCCCCCHHHCCC T ss_conf 99857996389998557766667767644899999999999999981699956799-99855787489996920142178 Q ss_pred EEEEEEECC Q ss_conf 599999569 Q gi|254780529|r 333 RIDILVEDR 341 (343) Q Consensus 333 RVEi~i~~~ 341 (343) ||||+|+.. T Consensus 246 RVeI~il~~ 254 (259) T PRK07734 246 RVEVLILPL 254 (259) T ss_pred CEEEEEECC T ss_conf 679999576 No 8 >PRK06925 flagellar motor protein MotS; Reviewed Probab=100.00 E-value=0 Score=401.70 Aligned_cols=223 Identities=20% Similarity=0.394 Sum_probs=175.0 Q ss_pred EEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 99821688887887431887999999999999999997508999999999985355432036655555556767766666 Q gi|254780529|r 17 IKKKVAIDDTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFNPFGKNLMTASKGIFDEQNPPERSSQN 96 (343) Q Consensus 17 ikr~~~~~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~p~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (343) .||||+++++++|+|+||+||||||||||||||||||||.+|+.||+++++||+..+.. . T Consensus 3 ~~~kk~~~e~~~g~p~WlvtyaD~mTLLlafFvlL~s~S~~d~~k~~~~~~s~~~~~~g---------------~----- 62 (230) T PRK06925 3 LRKERFEREQSKGSPRWMVTFSDLITLILVFFILLFSMSQIDLQKFKAAVESFQDRGDG---------------E----- 62 (230) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC---------------C----- T ss_conf 65567888788999735879999999999999999998707999999999998515688---------------8----- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCC Q ss_conf 55444555667666565667766554432112221112332111112222233432222222001232100111001246 Q gi|254780529|r 97 KIDLSIDEHITKNSVDLKIGSTNQKKQYQRLENNIFYLSNSQQSEHCKNSLTRDSEKGKDLCKSTDLEKSINKEENYFLP 176 (343) Q Consensus 97 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (343) .+....... . T Consensus 63 ----------~~~~~~~~~-------------------------------------------~----------------- 72 (230) T PRK06925 63 ----------EPAELKIDA-------------------------------------------G----------------- 72 (230) T ss_pred ----------CCCCCCCCC-------------------------------------------C----------------- T ss_conf ----------987665566-------------------------------------------6----------------- Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 52135788788899999999999999755533210131145114534999970345674555687546835656544367 Q gi|254780529|r 177 PLSKEKILAMKRKKRLQDLAKKISSTLSGLVADNIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVLQKIG 256 (343) Q Consensus 177 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~ia 256 (343) ..........++.+...+...+.. ..+...+.+...+.|++|+|. +.+||++|||+|+++++.+|.++| T Consensus 73 -----~~~~~~~~~~~~~~~~~~~~~l~~---~~l~~~v~v~~~~~gv~i~l~---~~~lF~~Gsa~l~~~~~~~L~~ia 141 (230) T PRK06925 73 -----DSDEEKTEKEQDELLKKVNTYIEE---NELSSQITAKRDERGVVLVLQ---EAVLFDTGEADVLKNAEPLLHKIA 141 (230) T ss_pred -----CCHHHHHHHHHHHHHHHHHHHHHH---HHCCCCEEEEECCCEEEEEEC---CCCCCCCCCCCCCHHHHHHHHHHH T ss_conf -----411355689999999999999986---302466378876986899968---987878896534877999999999 Q ss_pred HHHHHCCCCEEEEEEECCCCCCCC-CCCHHHHHHHHHHHHHHHHHH-CCCCHHHEEEEEECCCCCCCCCCCCCC--CCCC Q ss_conf 898612783389873146766787-410699999999999999998-699965712256303348888988521--3375 Q gi|254780529|r 257 EVLAHSTEVISIRGHTDASPFRNI-ARDNWRLSLDRAYSAYQVLMK-SGVSEDRISKISGFAHHRLKIASDPMN--SANR 332 (343) Q Consensus 257 ~~l~~~~~~i~IeGHTD~~p~~g~-~~~Nw~LS~~RA~aV~~~L~~-~Gv~~~ri~~v~G~g~~~P~~~n~t~~--~~NR 332 (343) .+|+..++.|.|+||||++|+++. |.+||+||++||.+|++||+. .|++++|+. +.|||+++|+++|+|++ ++|| T Consensus 142 ~~l~~~~~~I~V~GHTD~~p~~~~~~~sNWeLS~aRA~~V~~~li~~~gi~~~rl~-~~Gygd~~P~~~n~t~e~ra~NR 220 (230) T PRK06925 142 VLLKTIPNDIQVEGHTDSRPISTYRYPSNWELSAARASSVIRYFTSKEKLDSSRFI-AVGYADTKPVKDNKTEEHMKENR 220 (230) T ss_pred HHHHCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHEE-EEEECCCCCCCCCCCCCHHHCCC T ss_conf 99973996389987448765555678853899999999999999983599846789-98765787489995832032188 Q ss_pred EEEEEEECC Q ss_conf 599999569 Q gi|254780529|r 333 RIDILVEDR 341 (343) Q Consensus 333 RVEi~i~~~ 341 (343) ||||+|.+. T Consensus 221 RVeI~i~~e 229 (230) T PRK06925 221 RVEIVIKKQ 229 (230) T ss_pred CEEEEEECC T ss_conf 679998458 No 9 >PRK08457 motB flagellar motor protein MotB; Reviewed Probab=100.00 E-value=0 Score=398.83 Aligned_cols=223 Identities=21% Similarity=0.429 Sum_probs=157.1 Q ss_pred EECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 82168888788743188799999999999999999750899999999998535543203665555555676776666655 Q gi|254780529|r 19 KKVAIDDTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFNPFGKNLMTASKGIFDEQNPPERSSQNKI 98 (343) Q Consensus 19 r~~~~~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~p~~~~~~~~~~~~~~~~~~~~~~~~~~~ 98 (343) |||++++|++| ++||+||||||||||||||||||||.+|.+||+++++||...+.... . .....+. T Consensus 3 kkkk~~e~~~g-~~WlvTYADlmTLLlaFFVlL~smS~~d~~k~~~~~~s~~~~f~~~~--~-----------~~~~~~~ 68 (251) T PRK08457 3 KKKKCPECPAG-EKWAVPYADFLSLLLALFIALYAISAVNKAKVEALKTEFIKIFDYAA--S-----------PTVKQPV 68 (251) T ss_pred CCCCCCCCCCC-CHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCC--C-----------CCCCCCC T ss_conf 87778779999-61787899999999999999999863899999999999998605788--8-----------7667765 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCC Q ss_conf 44455566766656566776655443211222111233211111222223343222222200123210011100124652 Q gi|254780529|r 99 DLSIDEHITKNSVDLKIGSTNQKKQYQRLENNIFYLSNSQQSEHCKNSLTRDSEKGKDLCKSTDLEKSINKEENYFLPPL 178 (343) Q Consensus 99 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (343) ... ... .... .... T Consensus 69 ~~~-----~~~-------------------------------------------~~~~------------------~~~~ 82 (251) T PRK08457 69 KTI-----KKD-------------------------------------------SGAE------------------EEEM 82 (251) T ss_pred CCC-----CCC-------------------------------------------CCCC------------------CCCH T ss_conf 557-----766-------------------------------------------6766------------------4202 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCC-HHHHHHHHHHHH Q ss_conf 1357887888999999999999997555332101311451145349999703456745556875468-356565443678 Q gi|254780529|r 179 SKEKILAMKRKKRLQDLAKKISSTLSGLVADNIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPL-PETIVVLQKIGE 257 (343) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~-~~~~~~l~~ia~ 257 (343) ..... .. ...... ... .....+.++..+.+++|+|. +.+||++|||+|+ +.+..+|..++. T Consensus 83 ~~~~~-----~~-~~~~~~--~~~-------~~~~~~~i~~~~~gi~i~l~---~~~lF~~GsA~l~~~~~~~~L~~ia~ 144 (251) T PRK08457 83 ASESS-----LK-ASQNTE--QKT-------IKREGAALDQSDNGVALKLP---SNLLFENGSAEIINQDMMLYLKRIAK 144 (251) T ss_pred HHHHH-----HH-HHHHHH--HHH-------HHHHHHHHHCCCCCEEEEEC---CCCCCCCCCCEECCCCHHHHHHHHHH T ss_conf 35554-----67-889999--999-------87530245403672699806---87545798801168039999999999 Q ss_pred HHHHCCC--CEEEEEEECCCCCCCC-CCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCCCCCCCCCCEE Q ss_conf 9861278--3389873146766787-410699999999999999998699965712256303348888988521337559 Q gi|254780529|r 258 VLAHSTE--VISIRGHTDASPFRNI-ARDNWRLSLDRAYSAYQVLMKSGVSEDRISKISGFAHHRLKIASDPMNSANRRI 334 (343) Q Consensus 258 ~l~~~~~--~i~IeGHTD~~p~~g~-~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~~n~t~~~~NRRV 334 (343) +|...|+ .|.|+||||++|+.+. |.+||+||++||.+|++||+..|+++.|+. +.||||++|+++| ..|++|||| T Consensus 145 il~~lp~~~~I~V~GHTD~~Pi~~~~f~SNWELSaaRA~~V~r~Li~~Gi~p~Rl~-a~Gygd~~Pi~~N-e~ra~NRRV 222 (251) T PRK08457 145 IIQKLPKRVKIEVRGYTDNSPLNKTRYKSSYELAANRAYNVMKVLIQYGINPNRLS-FSSYGSNNPIAPN-ENRLKNNRV 222 (251) T ss_pred HHHHCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEE-EEEECCCCCCCCC-CCHHHCCCE T ss_conf 99848997459987158999777788887399999999999999998599878978-9853278879899-765563957 Q ss_pred EEEEECC Q ss_conf 9999569 Q gi|254780529|r 335 DILVEDR 341 (343) Q Consensus 335 Ei~i~~~ 341 (343) ||+|..+ T Consensus 223 eI~i~~~ 229 (251) T PRK08457 223 EIFFKVD 229 (251) T ss_pred EEEEECC T ss_conf 9999428 No 10 >PRK06667 motB flagellar motor protein MotB; Validated Probab=100.00 E-value=0 Score=387.23 Aligned_cols=232 Identities=23% Similarity=0.340 Sum_probs=162.9 Q ss_pred EEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98216888878874318879999999999999999975089999999999853554320366555555567677666665 Q gi|254780529|r 18 KKKVAIDDTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFNPFGKNLMTASKGIFDEQNPPERSSQNK 97 (343) Q Consensus 18 kr~~~~~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 97 (343) ++||+|++|++|+|+||+|||||||||||||||||+||.+|..|++.++.+.... ..+......... T Consensus 3 m~kkK~~e~~~g~~~WlvtyADlmTLLlaFFVlL~smS~~d~~k~~~~~~s~~~~-------~~g~~~~~~~~~------ 69 (254) T PRK06667 3 MAKKKCPKCIQGAPEYMLTYGDMVTLLLCFFVMLFSLNDVDAKENQIKIMSASFT-------GTGFFKGGKTLS------ 69 (254) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCC-------CCCCCCCCCCCC------ T ss_conf 7777899999998618889999999999999999998448899999999976437-------766557886677------ Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCC Q ss_conf 54445556676665656677665544321122211123321111122222334322222220012321001110012465 Q gi|254780529|r 98 IDLSIDEHITKNSVDLKIGSTNQKKQYQRLENNIFYLSNSQQSEHCKNSLTRDSEKGKDLCKSTDLEKSINKEENYFLPP 177 (343) Q Consensus 98 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (343) .... ...... . . ..+. T Consensus 70 ----------~~~~-------------~~~~~~----------------~--------------------~-----~~~~ 85 (254) T PRK06667 70 ----------KGRL-------------SYLGNS----------------I--------------------M-----SLPS 85 (254) T ss_pred ----------CCCC-------------CCCCCC----------------C--------------------C-----CCCH T ss_conf ----------7653-------------334454----------------3--------------------3-----5731 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCC-HHHHHHHHHHH Q ss_conf 21357887888999999999999997555332101311451145349999703456745556875468-35656544367 Q gi|254780529|r 178 LSKEKILAMKRKKRLQDLAKKISSTLSGLVADNIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPL-PETIVVLQKIG 256 (343) Q Consensus 178 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~-~~~~~~l~~ia 256 (343) ......+..........+.. +.....+.+...++||+|+|.+ .+||++|||+|+ +.++.+|..++ T Consensus 86 --------~~~~~~~~~~~~~~~~~~~~---~~~~~~v~i~~~~~gl~i~l~~---~~lF~~gsa~l~~~~~~~~l~~l~ 151 (254) T PRK06667 86 --------TEKGKQLSQAKKNASELFKP---EIQSNKVAVTEDERGLVISLVS---DAFFYPGSADVKLEENRDLIQKLA 151 (254) T ss_pred --------HHHHHHHHHHHHHHHHHHHH---HHHCCCEEEEECCCEEEEEEEC---CCCCCCCCCCCCCHHHHHHHHHHH T ss_conf --------45678999999999999887---7643874899749979999606---856179973126474899999999 Q ss_pred HHHHHCCC---CEEEEEEECCCCCC--CCCCCHHHHHHHHHHHHHHHHHHCC-CCHHHEEEEEECCCCCCCCCCCCCC-- Q ss_conf 89861278---33898731467667--8741069999999999999999869-9965712256303348888988521-- Q gi|254780529|r 257 EVLAHSTE---VISIRGHTDASPFR--NIARDNWRLSLDRAYSAYQVLMKSG-VSEDRISKISGFAHHRLKIASDPMN-- 328 (343) Q Consensus 257 ~~l~~~~~---~i~IeGHTD~~p~~--g~~~~Nw~LS~~RA~aV~~~L~~~G-v~~~ri~~v~G~g~~~P~~~n~t~~-- 328 (343) .++...+. .|.|+||||+.|++ +.|.+||+||++||.+|++||+..| +++.|+ +|+||||++|+++|+|.+ T Consensus 152 ~il~~~~~~~~~I~VeGHTD~~pi~~~~~~~sNWeLSsaRA~~V~~~l~~~g~i~~~rl-~v~Gygd~~P~~~n~t~e~r 230 (254) T PRK06667 152 SFIGFLDLAGYNFRIEGHTDNVDVNPEGAWKSNWELSGARAVNMLEYILNYEDQSESWF-EVSGFGGSRPLATDDTPEGR 230 (254) T ss_pred HHHHHCCCCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEE-EEEEECCCCCCCCCCCHHHH T ss_conf 99984567574289994378776687888886288999999999999998179995459-99854588768999596688 Q ss_pred CCCCEEEEEEECC Q ss_conf 3375599999569 Q gi|254780529|r 329 SANRRIDILVEDR 341 (343) Q Consensus 329 ~~NRRVEi~i~~~ 341 (343) ++||||||+|... T Consensus 231 a~NRRVeI~Il~~ 243 (254) T PRK06667 231 AYNRRIDIVILTE 243 (254) T ss_pred HHCCCEEEEEECH T ss_conf 4089769999574 No 11 >PRK06742 flagellar motor protein MotS; Reviewed Probab=100.00 E-value=0 Score=378.37 Aligned_cols=218 Identities=22% Similarity=0.399 Sum_probs=164.8 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 88788743188799999999999999999750899999999998535543203665555555676776666655444555 Q gi|254780529|r 25 DTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFNPFGKNLMTASKGIFDEQNPPERSSQNKIDLSIDE 104 (343) Q Consensus 25 ~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~p~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 104 (343) .+++|+|+||+|||||||||||||||||+||..|..|+++++++|...... .. . T Consensus 4 ~p~~g~p~WlvtyaDlmTLLl~fFVlL~a~S~~d~~k~~~~~~~~~~~~~~-----------~~------~--------- 57 (225) T PRK06742 4 RPQKGSPRWMTTFTDLTMLLLTFFVLLVATSKQDAVKLSKMLEKFSDTEQV-----------DA------K--------- 57 (225) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCC-----------CC------C--------- T ss_conf 999998537889999999999999999998551599999999985200035-----------76------4--------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCHHHHH Q ss_conf 66766656566776655443211222111233211111222223343222222200123210011100124652135788 Q gi|254780529|r 105 HITKNSVDLKIGSTNQKKQYQRLENNIFYLSNSQQSEHCKNSLTRDSEKGKDLCKSTDLEKSINKEENYFLPPLSKEKIL 184 (343) Q Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 184 (343) ......+. .... .... T Consensus 58 -~~~~~~~~-------------------------------------------~~~~--------------------~~~~ 73 (225) T PRK06742 58 -VMENTIPD-------------------------------------------ISHE--------------------KNDE 73 (225) T ss_pred -CCCCCCCC-------------------------------------------CCCC--------------------CCHH T ss_conf -45678886-------------------------------------------4444--------------------4214 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 78889999999999999975553321013114511453499997034567455568754683565654436789861278 Q gi|254780529|r 185 AMKRKKRLQDLAKKISSTLSGLVADNIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVLQKIGEVLAHSTE 264 (343) Q Consensus 185 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~ia~~l~~~~~ 264 (343) .......++.+...+...+.. .....+.+...+.|++|+|.+. ++|++|||+|+|+++.+|++++.+|+..++ T Consensus 74 ~~~~~~~~~~l~~~l~~~~~~----~~~~~v~~~~~~~gl~I~l~~~---~lF~~Gsa~l~p~~~~~L~~la~~l~~~~~ 146 (225) T PRK06742 74 KMISKKRMDELYKKLKAYVDN----NGISQVNVYREDTGVSVVIVDN---LIFDTGDANVKPEAKEIISQLVGFFQSVPN 146 (225) T ss_pred HHHHHHHHHHHHHHHHHHHHH----CCCCCEEEEECCCEEEEEECCC---CCCCCCCCCCCHHHHHHHHHHHHHHHCCCC T ss_conf 455589999999999999973----7876438997287389996787---776898432396699999999999965997 Q ss_pred CEEEEEEECCCCCCCCC-CCHHHHHHHHHHHHHHHHHH-CCCCHHHEEEEEECCCCCCCCCCCCCC--CCCCEEEEEEEC Q ss_conf 33898731467667874-10699999999999999998-699965712256303348888988521--337559999956 Q gi|254780529|r 265 VISIRGHTDASPFRNIA-RDNWRLSLDRAYSAYQVLMK-SGVSEDRISKISGFAHHRLKIASDPMN--SANRRIDILVED 340 (343) Q Consensus 265 ~i~IeGHTD~~p~~g~~-~~Nw~LS~~RA~aV~~~L~~-~Gv~~~ri~~v~G~g~~~P~~~n~t~~--~~NRRVEi~i~~ 340 (343) .|.|+||||+.|+.+.. .+||+||++||.+|++||+. .|++++|+. +.|||+++|+++|+|++ ++||||||+|.. T Consensus 147 ~I~I~GHTD~~pi~~~~~~sNWeLSsaRA~~V~r~l~~~~gi~~~rl~-~~Gygd~~P~~~n~t~e~ra~NRRVeI~i~~ 225 (225) T PRK06742 147 PIVVEGHTDSRPIHNDKFPSNWELSSARAANMIHHLIEVYNVDDKRLA-AVGYADTKPVVPNDSPQNWEKNRRVVIYIKE 225 (225) T ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEE-EEEECCCCCCCCCCCHHHHHHCCCEEEEEEC T ss_conf 089999678876667788864999999999999999983699931489-9865577617999494468537966999839 No 12 >COG1360 MotB Flagellar motor protein [Cell motility and secretion] Probab=100.00 E-value=0 Score=352.68 Aligned_cols=238 Identities=32% Similarity=0.517 Sum_probs=171.8 Q ss_pred EEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCCCCC Q ss_conf 9982168888788743188799999999999999999750899999999998535543203665-555555676776666 Q gi|254780529|r 17 IKKKVAIDDTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFNPFGKNLMTAS-KGIFDEQNPPERSSQ 95 (343) Q Consensus 17 ikr~~~~~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~p~~~~~~~~~-~~~~~~~~~~~~~~~ 95 (343) +||+++.++ ||+++||++|||||||||||||+||++|.++..++++|+.||+..+....... .+... T Consensus 2 ~~r~k~~~~--~~~~~w~~~yad~~tl~~~ffi~~~~~s~~~~~~~~~ia~~~~~~~~~~~~~~~~~~~~---------- 69 (244) T COG1360 2 AKKKKEEEQ--HGGGAWKIAYADFMTLLLAFFIVLWAMSSISPEKFQQIAAYFRIAFSGAPGSASEGGKS---------- 69 (244) T ss_pred CCCCCCCCC--CCCCCCCEECCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCC---------- T ss_conf 722026678--88852012547578899999999986301578889999997303014565554445432---------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCC Q ss_conf 65544455566766656566776655443211222111233211111222223343222222200123210011100124 Q gi|254780529|r 96 NKIDLSIDEHITKNSVDLKIGSTNQKKQYQRLENNIFYLSNSQQSEHCKNSLTRDSEKGKDLCKSTDLEKSINKEENYFL 175 (343) Q Consensus 96 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 175 (343) ...... .......... T Consensus 70 -----------~~~~~~------------------------------------------~~~~~~~~~~----------- 85 (244) T COG1360 70 -----------MQQVKE------------------------------------------KELAEELNVG----------- 85 (244) T ss_pred -----------HHHCCC------------------------------------------CCCCCCCCHH----------- T ss_conf -----------001010------------------------------------------0122453467----------- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 65213578878889999999999999975553321013114511453499997034567455568754683565654436 Q gi|254780529|r 176 PPLSKEKILAMKRKKRLQDLAKKISSTLSGLVADNIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVLQKI 255 (343) Q Consensus 176 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~i 255 (343) ...........+.+....+...+.. ......+.++..+.+++|+|.++ .||++|||++.+.++.+|.++ T Consensus 86 -----~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~~~~gl~i~i~~~---~~F~~gsa~~~~~~~~~l~~i 154 (244) T COG1360 86 -----EAIEKKELSEKLGDLAKELESKPKD---IELEHQLGVDDVEEGLVISISDS---LMFASGSAVVQPEFRDLLLKI 154 (244) T ss_pred -----HHHHHHHHHHHHHHHHHHHHHHHHH---HHHHCCCEEEECCCCEEEEEECC---CCCCCCCCCCCHHHHHHHHHH T ss_conf -----7876522366679999999989888---77534655887488628996244---464677521276679999999 Q ss_pred HHHHHHCCC-CEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCC-CCHHHEEEEEECCCCCCCCCCCCC--CCCC Q ss_conf 789861278-338987314676678741069999999999999999869-996571225630334888898852--1337 Q gi|254780529|r 256 GEVLAHSTE-VISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSG-VSEDRISKISGFAHHRLKIASDPM--NSAN 331 (343) Q Consensus 256 a~~l~~~~~-~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~G-v~~~ri~~v~G~g~~~P~~~n~t~--~~~N 331 (343) |..|...++ .|.|+|||||+|+.+.+++||+||++||.+|+++|+..| +++.|+ .|+||||++|+++|+|+ +++| T Consensus 155 a~~l~~~~~~~I~I~GHTDn~p~~~~~~sNWeLS~aRA~~v~~~L~~~g~~~~~~~-~~~G~gd~~Pva~n~t~~~ra~N 233 (244) T COG1360 155 AKLLADIPNGNIRIEGHTDNVPIKGSFYSNWELSAARAQSVVRVLINGGLVEAKRL-SVVGYADTRPLADNDTAEGRAKN 233 (244) T ss_pred HHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEE-EEEECCCCCCCCCCCCHHHHHHC T ss_conf 99987267982899865899766778885088899999999999997489985668-99712547767889881347657 Q ss_pred CEEEEEEECCC Q ss_conf 55999995699 Q gi|254780529|r 332 RRIDILVEDRQ 342 (343) Q Consensus 332 RRVEi~i~~~~ 342 (343) |||||+|..+. T Consensus 234 RRVeI~i~~~~ 244 (244) T COG1360 234 RRVEILILTKK 244 (244) T ss_pred CEEEEEEECCC T ss_conf 83799983589 No 13 >PRK09040 hypothetical protein; Provisional Probab=100.00 E-value=4e-42 Score=300.18 Aligned_cols=114 Identities=27% Similarity=0.417 Sum_probs=99.9 Q ss_pred EEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC----CCEEEEEEECCCCCCC---CCCCHHHHHHHHHHHHHH Q ss_conf 999703456745556875468356565443678986127----8338987314676678---741069999999999999 Q gi|254780529|r 225 LISIIDQRNTPMFDKSSSIPLPETIVVLQKIGEVLAHST----EVISIRGHTDASPFRN---IARDNWRLSLDRAYSAYQ 297 (343) Q Consensus 225 ~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~ia~~l~~~~----~~i~IeGHTD~~p~~g---~~~~Nw~LS~~RA~aV~~ 297 (343) +|.| .+.+||++|||+|+|+++.+|++++..|+... +.|.|+|||||+|+.. .|.+||+||.+||.+|++ T Consensus 84 ~i~i---~~~vlF~~gsa~L~p~~~~~L~~ia~~L~~~~~~~~~~i~V~GhTD~~pi~~~~~~f~sNw~LS~~RA~~V~~ 160 (215) T PRK09040 84 RIGI---SGSVLFALNSDQLQPEGRELLKSLAGPLAAYLRARDEILMVSGFTDDQPVRAGNRRFADNWELSAQRALTVTR 160 (215) T ss_pred EEEC---CCCEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 7960---5643676896400832699999999999987447986599873489976767788889789999999999999 Q ss_pred HHHHCCCCHHHEEEEEECCCCCCCCCCCCCC--CCCCEEEEEEECCC Q ss_conf 9998699965712256303348888988521--33755999995699 Q gi|254780529|r 298 VLMKSGVSEDRISKISGFAHHRLKIASDPMN--SANRRIDILVEDRQ 342 (343) Q Consensus 298 ~L~~~Gv~~~ri~~v~G~g~~~P~~~n~t~~--~~NRRVEi~i~~~~ 342 (343) ||+..|++++||. +.|||+++|+++|+|+. ++||||||.+.-+. T Consensus 161 ~L~~~gv~~~rl~-~~g~G~~~P~~~n~t~~~ra~NRRVeI~~~p~~ 206 (215) T PRK09040 161 ALIDAGVPASSVF-AAAFGSEQPVSSNADEEGRAKNRRVEIAPVPRP 206 (215) T ss_pred HHHHCCCCHHHEE-EEEECCCCCCCCCCCHHHHHHCCCEEEEEECCC T ss_conf 9998699978958-988657687899949889987698789984389 No 14 >PRK07033 hypothetical protein; Provisional Probab=100.00 E-value=9e-35 Score=251.15 Aligned_cols=129 Identities=20% Similarity=0.298 Sum_probs=117.4 Q ss_pred HCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCC-CCHHHHHH Q ss_conf 01311451145349999703456745556875468356565443678986127833898731467667874-10699999 Q gi|254780529|r 211 IVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVLQKIGEVLAHSTEVISIRGHTDASPFRNIA-RDNWRLSL 289 (343) Q Consensus 211 ~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~ia~~l~~~~~~i~IeGHTD~~p~~g~~-~~Nw~LS~ 289 (343) ..+.+.|+...+.++|.| .+..||.+|||+|+++++.+|++||..|+..++.|.|+|||||+||++.. .|||+||+ T Consensus 297 ~~G~v~V~~~~~~~~v~l---~~~~lF~sGsa~l~~~~~~ll~~ia~~l~~~~g~i~V~GHTDn~Pi~s~~f~SNw~LS~ 373 (429) T PRK07033 297 AAGLVAVRDEADRSVVTL---RGDGLFASASTSVRDRYQPVLLRVADALNQVKGNVLVTGYSDNRPIRTARFPSNWHLSQ 373 (429) T ss_pred HCCCEEEEECCCEEEEEE---CCCCCCCCCCHHHCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCHHHHHH T ss_conf 748758997598269997---88888788661008668999999999984799808998838998887788897599999 Q ss_pred HHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCCCCCC--CCCCEEEEEEECCCC Q ss_conf 999999999998699965712256303348888988521--337559999956999 Q gi|254780529|r 290 DRAYSAYQVLMKSGVSEDRISKISGFAHHRLKIASDPMN--SANRRIDILVEDRQG 343 (343) Q Consensus 290 ~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~~n~t~~--~~NRRVEi~i~~~~~ 343 (343) +||.+|+++|...|++++|+. ++||||++|+++|+|.+ ++||||||+|.-..| T Consensus 374 aRA~~V~~~l~~~~~~~~R~~-a~G~gd~~Pia~N~t~~gRa~NRRVEI~ll~~p~ 428 (429) T PRK07033 374 ARAQAVRALLAARLGDPERVT-AEGRGDSDPVAPNDSPEGRARNRRVEITLMVAPG 428 (429) T ss_pred HHHHHHHHHHHHCCCCCCEEE-EEEECCCCCCCCCCCHHHHHHCCCEEEEEECCCC T ss_conf 999999999986499977379-9763788888989882347546817999963899 No 15 >PRK09039 hypothetical protein; Validated Probab=100.00 E-value=4.4e-35 Score=253.25 Aligned_cols=126 Identities=26% Similarity=0.390 Sum_probs=106.5 Q ss_pred HHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH----CC----CCEEEEEEECCCCCCC-- Q ss_conf 1013114511453499997034567455568754683565654436789861----27----8338987314676678-- Q gi|254780529|r 210 NIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVLQKIGEVLAH----ST----EVISIRGHTDASPFRN-- 279 (343) Q Consensus 210 ~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~ia~~l~~----~~----~~i~IeGHTD~~p~~g-- 279 (343) .+.....|++.++..+. .+.+||++|||+|+|+++..|++++.+|.+ .| +.|+|+||||++|+++ T Consensus 206 ~L~~~~~I~i~gdrfvf-----~SevLF~~gsa~l~~~gk~~l~~ia~~l~~i~~~Ip~~i~wilrVeGHTD~~pi~~~~ 280 (343) T PRK09039 206 ILGDREGIRIVGDRFVF-----QSEVLFPTGSAELNPEGQAEIAKLAAALIQLAKEIPPEINWVLRVDGHTDNVPLSGTG 280 (343) T ss_pred HHCCCCCCCCCCCEEEE-----CCCCCCCCCCCHHCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCC T ss_conf 96688774102554774-----0332316884010665899999999999998763750146137750457787888899 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCCCCCC--CCCCEEEEEEECC Q ss_conf 7410699999999999999998699965712256303348888988521--3375599999569 Q gi|254780529|r 280 IARDNWRLSLDRAYSAYQVLMKSGVSEDRISKISGFAHHRLKIASDPMN--SANRRIDILVEDR 341 (343) Q Consensus 280 ~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~~n~t~~--~~NRRVEi~i~~~ 341 (343) .|.|||+||++||.+|++||+..|+++.|+. +.|||+++|+++|+|.. ++||||||.+..| T Consensus 281 ~f~sNweLS~~RA~sV~~~L~~~gv~~~rl~-a~G~g~~~Pl~~~~t~e~ra~NRRIEiklt~r 343 (343) T PRK09039 281 RFRDNWELSSARAISVVKYLIALGVPANRLV-AAGFGEYQPLDPGDTPEARARNRRIELKLTER 343 (343) T ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCHHHEE-EECCCCCCCCCCCCCHHHHHHCCCEEEEECCC T ss_conf 9997599999999999999997699888936-52046678899995965797379557982789 No 16 >PRK08126 hypothetical protein; Provisional Probab=100.00 E-value=8.3e-34 Score=244.74 Aligned_cols=126 Identities=23% Similarity=0.324 Sum_probs=115.6 Q ss_pred HCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCC-CCCHHHHHH Q ss_conf 0131145114534999970345674555687546835656544367898612783389873146766787-410699999 Q gi|254780529|r 211 IVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVLQKIGEVLAHSTEVISIRGHTDASPFRNI-ARDNWRLSL 289 (343) Q Consensus 211 ~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~ia~~l~~~~~~i~IeGHTD~~p~~g~-~~~Nw~LS~ 289 (343) ..+.+.++...+.++|.| .+..||.+|||+|+|.+..+|++||..|...++.|.|+|||||+||.+. |.|||+||. T Consensus 300 ~~glv~V~~~~~~~~V~l---~~~~lF~sGsA~l~p~~~pll~rIa~~l~~~~G~I~V~GHTDn~Pi~s~~f~SNw~LS~ 376 (428) T PRK08126 300 AAGTLSVDEDARHSSVTF---RGDAMFAPGGATVNPAMGPLINKIAREIARVPGKVTVTGHTDNQPIRSAQFASNLVLSE 376 (428) T ss_pred HCCCEEEEECCCEEEEEE---CCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCHHHHHH T ss_conf 668648896398479997---68888787761106447999999999983799818998858998887888797499999 Q ss_pred HHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCCCCCC--CCCCEEEEEEEC Q ss_conf 999999999998699965712256303348888988521--337559999956 Q gi|254780529|r 290 DRAYSAYQVLMKSGVSEDRISKISGFAHHRLKIASDPMN--SANRRIDILVED 340 (343) Q Consensus 290 ~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~~n~t~~--~~NRRVEi~i~~ 340 (343) +||.+|+++|...|++++||. +.||||++|+++|+|.+ ++||||||+|.+ T Consensus 377 aRA~~V~~~L~~~~~~~~R~~-a~G~gd~~Pia~N~T~~gRa~NRRVEI~l~q 428 (428) T PRK08126 377 ERATQVAQMLQSAGVPASRLE-AVGKGDADPVGDNRTPQGRAQNRRVEITVAE 428 (428) T ss_pred HHHHHHHHHHHHCCCCCCEEE-EEEECCCCCCCCCCCHHHHHHCCCEEEEEEC T ss_conf 999999999985599976368-8721788999999895567535824789509 No 17 >TIGR03350 type_VI_ompA type VI secretion system OmpA/MotB family protein. The flagellar motor protein MotB and the Gram-negative bacterial outer membrane protein OmpA share a region of sequence homology. This model describes a domain found fused to type VI secretion system homologs of the type IV system protein DotU (see model TIGR03349), with OmpA/MotB homology. Probab=99.97 E-value=4.8e-31 Score=226.29 Aligned_cols=125 Identities=25% Similarity=0.354 Sum_probs=113.8 Q ss_pred HHCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCC-CCCCHHHHH Q ss_conf 1013114511453499997034567455568754683565654436789861278338987314676678-741069999 Q gi|254780529|r 210 NIVKGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVLQKIGEVLAHSTEVISIRGHTDASPFRN-IARDNWRLS 288 (343) Q Consensus 210 ~~~~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~ia~~l~~~~~~i~IeGHTD~~p~~g-~~~~Nw~LS 288 (343) ...+.+.|+..+++++|+| .+.+||.+|||.|+|+++.+|.+++.+|+..+..|.|+||||+.|+.. .|.+||+|| T Consensus 10 i~~g~v~V~~~~~~v~l~l---~~~~lF~~gsa~L~~~~~~~L~~va~~L~~~~~~I~I~GHTD~~~~~~~~~~~N~~LS 86 (137) T TIGR03350 10 IAAGLLTVDEDGDRSVVRL---RGDELFASGSAEVRADFEPLLDRIAKALAAVPGRITVVGHTDNVPIRTSRFPSNWHLS 86 (137) T ss_pred HHCCCEEEEEECCEEEEEE---CCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCHHHHHH T ss_conf 9769649999799999995---7987188996320987999999999999709985999998389877787751046799 Q ss_pred HHHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCCCCCC--CCCCEEEEEE Q ss_conf 9999999999998699965712256303348888988521--3375599999 Q gi|254780529|r 289 LDRAYSAYQVLMKSGVSEDRISKISGFAHHRLKIASDPMN--SANRRIDILV 338 (343) Q Consensus 289 ~~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~~n~t~~--~~NRRVEi~i 338 (343) ++||.+|++||+..|+++.||. +.|||+++|+++|+++. ++||||||+| T Consensus 87 ~~RA~aV~~~L~~~Gi~~~rl~-~~g~G~~~P~~~n~~~~~r~~NRRVEI~i 137 (137) T TIGR03350 87 EARAKAVADVLAQGGVPAGRVR-AEGRGDSEPIASNATAAGRAQNRRVEIDV 137 (137) T ss_pred HHHHHHHHHHHHHCCCCHHEEE-EEECCCCCCCCCCCCHHHHHHCCEEEEEC T ss_conf 9999999999997299724089-97012778679993977997549089979 No 18 >PRK10510 putative outer membrane lipoprotein; Provisional Probab=99.96 E-value=8.2e-29 Score=211.40 Aligned_cols=120 Identities=27% Similarity=0.379 Sum_probs=111.7 Q ss_pred CCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC-CEEEEEEECCCCCCCCCCCHHHHHHHH Q ss_conf 3114511453499997034567455568754683565654436789861278-338987314676678741069999999 Q gi|254780529|r 213 KGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVLQKIGEVLAHSTE-VISIRGHTDASPFRNIARDNWRLSLDR 291 (343) Q Consensus 213 ~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~ia~~l~~~~~-~i~IeGHTD~~p~~g~~~~Nw~LS~~R 291 (343) ..+.+....+.+++.+ ++.++|+++|++|+|+++..|+++|.+|+++|. .|.|+||||++ |+..||+.||.+| T Consensus 95 ~gv~v~r~gd~l~l~m---p~~I~F~~dsa~l~p~~~~~L~~vA~~L~~yp~~~V~I~GHTD~~---G~~~yN~~LS~~R 168 (219) T PRK10510 95 TGVSVTRSGDNIILNM---PNNVTFDSSSATLKPAGANTLTGVAMVLKEYPKTAVNVVGYTDST---GSHDLNMRLSQQR 168 (219) T ss_pred CCCEEEECCCEEEEEC---CCCCEEECCCHHCCHHHHHHHHHHHHHHHHCCCCEEEEEEEECCC---CCHHHHHHHHHHH T ss_conf 8928998687789967---888566188521296789999999999998899279999876896---7199999999999 Q ss_pred HHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCCCCCC--CCCCEEEEEEE Q ss_conf 9999999998699965712256303348888988521--33755999995 Q gi|254780529|r 292 AYSAYQVLMKSGVSEDRISKISGFAHHRLKIASDPMN--SANRRIDILVE 339 (343) Q Consensus 292 A~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~~n~t~~--~~NRRVEi~i~ 339 (343) |.+|+.||+..||+++||. +.|||+++|+++|+|+. ++||||||.|. T Consensus 169 A~aV~~yL~~~GV~~~Ri~-~~G~Ge~~Pia~n~t~~gra~NRRVEI~i~ 217 (219) T PRK10510 169 ADSVASALITQGVDASRIR-TQGLGPANPIASNSTAEGKAQNRRVEITLS 217 (219) T ss_pred HHHHHHHHHHCCCCHHHEE-EEEECCCCCCCCCCCHHHHHHCCCEEEEEE T ss_conf 9999999998599956489-998766577899939778985398899995 No 19 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=99.94 E-value=3.2e-27 Score=200.76 Aligned_cols=105 Identities=21% Similarity=0.289 Sum_probs=96.5 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC-CEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHC-CCCHHHE Q ss_conf 567455568754683565654436789861278-33898731467667874106999999999999999986-9996571 Q gi|254780529|r 232 RNTPMFDKSSSIPLPETIVVLQKIGEVLAHSTE-VISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKS-GVSEDRI 309 (343) Q Consensus 232 ~~~~lF~~gsa~l~~~~~~~l~~ia~~l~~~~~-~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~-Gv~~~ri 309 (343) .+.+||+++|+.|+|+++..|+++|.+|+.+|. .|.|+||||++ |...|||.||.+||.+|++||+.. |++..|| T Consensus 51 ~d~V~Fd~d~~~L~~~s~~~L~~~A~~L~~~p~~~v~I~GHTD~~---G~~~yN~~LS~rRA~aV~~~l~~~g~i~~~ri 127 (160) T PRK09967 51 SDAILFAKNDYKLLPESQQQIQTMAAKLASTGLTHARMDGHTDNY---GEDSYNEGLSLKRANVVADAWAMGGQIPRSNL 127 (160) T ss_pred CCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCC---CCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 983792699501199899999999999997899629998747998---99899999999999999999985378544358 Q ss_pred EEEEECCCCCCCCCCCCCC--CCCCEEEEEEEC Q ss_conf 2256303348888988521--337559999956 Q gi|254780529|r 310 SKISGFAHHRLKIASDPMN--SANRRIDILVED 340 (343) Q Consensus 310 ~~v~G~g~~~P~~~n~t~~--~~NRRVEi~i~~ 340 (343) . +.|||+.+|+++|+|++ ++||||||+|.- T Consensus 128 ~-~~g~Ge~~Pv~~n~t~~gra~NRRVeivit~ 159 (160) T PRK09967 128 T-TQGLGKKYPIASNKTAQGRAENRRVAVVITT 159 (160) T ss_pred E-EEECCCCCCCCCCCCHHHHHHCCCEEEEEEC T ss_conf 9-9964656738899698999876998999839 No 20 >COG2885 OmpA Outer membrane protein and related peptidoglycan-associated (lipo)proteins [Cell envelope biogenesis, outer membrane] Probab=99.93 E-value=5.5e-26 Score=192.53 Aligned_cols=105 Identities=29% Similarity=0.408 Sum_probs=97.7 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC-CEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEE Q ss_conf 567455568754683565654436789861278-3389873146766787410699999999999999998699965712 Q gi|254780529|r 232 RNTPMFDKSSSIPLPETIVVLQKIGEVLAHSTE-VISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSEDRIS 310 (343) Q Consensus 232 ~~~~lF~~gsa~l~~~~~~~l~~ia~~l~~~~~-~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~ 310 (343) ...++|+++|+.|+|.++.+|+.++..|+++|. .|.|+||||++ |+..||..||.+||.+|+.||+..|+.++||. T Consensus 82 ~~~v~F~~ds~~l~p~~~~~L~~~a~~L~~~p~~~i~V~GHTD~~---Gs~~yN~~LS~rRA~aV~~~L~~~Gv~~~~i~ 158 (190) T COG2885 82 PNDVLFDFDSSVLKPKAQATLDELAKYLKKNPITRILVEGHTDST---GSDEYNQALSERRAEAVADYLVSQGVVADRIS 158 (190) T ss_pred CCEEEECCCCHHHCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCC---CCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 540683588344398999999999999987899779997557887---98678768889999999999998499701178 Q ss_pred EEEECCCCCCCCCCCCCC--CCCCEEEEEEEC Q ss_conf 256303348888988521--337559999956 Q gi|254780529|r 311 KISGFAHHRLKIASDPMN--SANRRIDILVED 340 (343) Q Consensus 311 ~v~G~g~~~P~~~n~t~~--~~NRRVEi~i~~ 340 (343) +.|||+.+|+++|+|+. ++||||||.|.. T Consensus 159 -~~G~G~~~Pia~n~t~~gra~NRRVei~i~~ 189 (190) T COG2885 159 -TVGYGEEKPIASNATEEGRAKNRRVEIKISP 189 (190) T ss_pred -EEECCCCCCCCCCCCCHHCCCCCCEEEEEEC T ss_conf -9861645777889780010346747898627 No 21 >PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional Probab=99.92 E-value=1.1e-24 Score=183.75 Aligned_cols=99 Identities=21% Similarity=0.267 Sum_probs=93.8 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCC-CEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEE Q ss_conf 455568754683565654436789861278-3389873146766787410699999999999999998699965712256 Q gi|254780529|r 235 PMFDKSSSIPLPETIVVLQKIGEVLAHSTE-VISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSEDRISKIS 313 (343) Q Consensus 235 ~lF~~gsa~l~~~~~~~l~~ia~~l~~~~~-~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~ 313 (343) ++|++++++|+++++.+|+.++.+|+.+|. .|.|+||||++ |+..||..||.+||.+|+.||+..||+++||.. . T Consensus 71 vyFdfd~~~l~~~~~~~L~~~a~~L~~~p~~~v~i~GhtD~~---Gt~eYN~~Ls~rRA~aV~~yL~~~GV~~~ri~~-v 146 (173) T PRK10802 71 VYFDLDKYDIRSDFAQMLDAHANFLRSNPSYKVTVEGHADER---GTPEYNISLGERRANAVKMYLQGKGVSADQISI-V 146 (173) T ss_pred EEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCC---CCHHHHHHHHHHHHHHHHHHHHHCCCCHHHEEE-E T ss_conf 991588521699899999999999998899179998746998---849999999999999999999984999899689-8 Q ss_pred ECCCCCCCCCCCCCC--CCCCEEEEE Q ss_conf 303348888988521--337559999 Q gi|254780529|r 314 GFAHHRLKIASDPMN--SANRRIDIL 337 (343) Q Consensus 314 G~g~~~P~~~n~t~~--~~NRRVEi~ 337 (343) |||+.+|+..|.++. ++||||||+ T Consensus 147 syGee~P~~~~~~e~~~~~NRR~~~v 172 (173) T PRK10802 147 SYGKEKPAVLGHDEAAYAKNRRAVLV 172 (173) T ss_pred EECCCCCCCCCCCHHHHHHCCCEEEE T ss_conf 76665768989398899876977886 No 22 >TIGR02802 Pal_lipo peptidoglycan-associated lipoprotein; InterPro: IPR014169 Members in this entry represent the peptidoglycan-associated lipoprotein (Pal or OprL) protein of the Tol-Pal system. The system appears to be involved both in the maintenance of outer membrane integrity and in the import of certain organic molecules as nutrients. Members of this family contain a hydrodrophobic lipoprotein signal sequence, a conserved N-terminal cleavage and modification site, a poorly conserved low-complexity region, together comprising about 65 amino acids, and a well-conserved C-terminal domain. The entry represents only the conserved C-terminal domain.. Probab=99.91 E-value=7.9e-25 Score=184.83 Aligned_cols=100 Identities=25% Similarity=0.299 Sum_probs=94.3 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCC-CEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEE Q ss_conf 455568754683565654436789861278-3389873146766787410699999999999999998699965712256 Q gi|254780529|r 235 PMFDKSSSIPLPETIVVLQKIGEVLAHSTE-VISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSEDRISKIS 313 (343) Q Consensus 235 ~lF~~gsa~l~~~~~~~l~~ia~~l~~~~~-~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~ 313 (343) ++|+++|++|+++++.+|+.=|.+|+.+|+ +|.||||||-+ |+..||+-|-++||.||+.||+..||.++||..| T Consensus 2 vyF~~D~~~l~~e~~~~l~~hA~~L~~~p~~~V~iEGH~DER---GTrEYNlALGeRRA~AV~~yL~~~GV~~~q~~~v- 77 (104) T TIGR02802 2 VYFDFDSSDLKSEAQAILDAHAQYLKKNPSVRVTIEGHTDER---GTREYNLALGERRANAVKDYLQAKGVSASQIETV- 77 (104) T ss_pred EECCCCHHHCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCC---CCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEE- T ss_conf 432358143177999999998999986799378886307888---8877777788999999999999738961100463- Q ss_pred ECCCCCCCCCCCCC--CCCCCEEEEEE Q ss_conf 30334888898852--13375599999 Q gi|254780529|r 314 GFAHHRLKIASDPM--NSANRRIDILV 338 (343) Q Consensus 314 G~g~~~P~~~n~t~--~~~NRRVEi~i 338 (343) .||+.+|+++..++ +++|||++|++ T Consensus 78 SYGkErP~~~G~~E~~wakNRR~~~vy 104 (104) T TIGR02802 78 SYGKERPAALGHDEAAWAKNRRAVLVY 104 (104) T ss_pred ECCCCCCCCCCCCHHHHHHCCCCCCCC T ss_conf 026656877898857775405310049 No 23 >PRK10808 outer membrane protein A; Reviewed Probab=99.91 E-value=1e-23 Score=177.37 Aligned_cols=105 Identities=24% Similarity=0.291 Sum_probs=94.4 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHC--CC-CEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHE Q ss_conf 674555687546835656544367898612--78-338987314676678741069999999999999999869996571 Q gi|254780529|r 233 NTPMFDKSSSIPLPETIVVLQKIGEVLAHS--TE-VISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSEDRI 309 (343) Q Consensus 233 ~~~lF~~gsa~l~~~~~~~l~~ia~~l~~~--~~-~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri 309 (343) ..++|+++|++|+|++...|+.++..|+.. +. .|.|.||||++ |+..||..||.+||.+|++||+..||+++|| T Consensus 224 s~v~F~fdss~l~p~~~~~L~~~~~~l~~~~~~~~~i~v~GhtD~~---G~~~yN~~LS~~RA~~V~~~l~~~Gi~~~~i 300 (347) T PRK10808 224 SDVLFNFNKATLKPEGQQALDQLYSQLSNLDPKDGSVVVLGYTDRI---GSDAYNQALSERRAQSVVDYLISKGIPADKI 300 (347) T ss_pred CCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCC---CCHHHHHHHHHHHHHHHHHHHHHCCCCHHHE T ss_conf 4225515702058668999999999997458776518999836998---8878879999999999999999869984518 Q ss_pred EEEEECCCCCCCCCCCCCCCC-----------CCEEEEEEECC Q ss_conf 225630334888898852133-----------75599999569 Q gi|254780529|r 310 SKISGFAHHRLKIASDPMNSA-----------NRRIDILVEDR 341 (343) Q Consensus 310 ~~v~G~g~~~P~~~n~t~~~~-----------NRRVEi~i~~~ 341 (343) . +.|||+++||++|.+..+| ||||||.|..- T Consensus 301 ~-~~g~Ge~~pva~n~~~~~~~~~~li~c~a~nRRVei~v~g~ 342 (347) T PRK10808 301 S-ARGMGESNPVTGNTCDNVKPRAALIDCLAPDRRVEIEVKGI 342 (347) T ss_pred E-EEEECCCCCCCCCCCCCCCCCCCHHHCCCCCCCEEEEEEEE T ss_conf 9-99864668888897702244300010247887389999879 No 24 >cd07185 OmpA_C-like Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA. OmpA-like domains (named after the C-terminal domain of Escherichia coli OmpA protein) have been shown to non-covalently associate with peptidoglycan, a network of glycan chains composed of disaccharides, which are crosslinked via short peptide bridges. Well-studied members of this family include the Escherichia coli outer membrane protein OmpA, the Escherichia coli lipoprotein PAL, Neisseria meningitdis RmpM, which interact with the outer membrane, as well as the Escherichia coli motor protein MotB, and the Vibrio flagellar motor proteins PomB and MotY, which interact with the inner membrane. Probab=99.90 E-value=1.6e-23 Score=176.20 Aligned_cols=100 Identities=33% Similarity=0.467 Sum_probs=93.9 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCC-CEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEE Q ss_conf 455568754683565654436789861278-3389873146766787410699999999999999998699965712256 Q gi|254780529|r 235 PMFDKSSSIPLPETIVVLQKIGEVLAHSTE-VISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSEDRISKIS 313 (343) Q Consensus 235 ~lF~~gsa~l~~~~~~~l~~ia~~l~~~~~-~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~ 313 (343) ++|++||+.|++.+...|+.++..|+..|. .|.|.||||+. |...+||.||.+||.+|++||+..|++++||. +. T Consensus 4 v~F~~~s~~l~~~~~~~L~~ia~~l~~~~~~~v~v~GhtD~~---G~~~~N~~LS~~RA~~V~~~L~~~Gi~~~ri~-~~ 79 (106) T cd07185 4 IYFDFGSAELTPEAKPLLDKLAEVLKKNPDAKIRIEGHTDSR---GSDAYNQELSERRAEAVADYLVSKGVDASRIT-AV 79 (106) T ss_pred EEECCCCCEECHHHHHHHHHHHHHHHHCCCCEEEEEEEECCC---CCHHHHHHHHHHHHHHHHHHHHHCCCCHHHEE-EE T ss_conf 996899776198899999999999985899379999984899---99899899999999999999998499989989-99 Q ss_pred ECCCCCCCCCCCCC--CCCCCEEEEEE Q ss_conf 30334888898852--13375599999 Q gi|254780529|r 314 GFAHHRLKIASDPM--NSANRRIDILV 338 (343) Q Consensus 314 G~g~~~P~~~n~t~--~~~NRRVEi~i 338 (343) |||+++|+.+|++. +++||||||+| T Consensus 80 g~G~~~p~~~~~~~~~~~~NRRVei~~ 106 (106) T cd07185 80 GYGESRPIASNDTEEGRAKNRRVEIVI 106 (106) T ss_pred EECCCCCCCCCCCHHHHHHCCCEEEEC T ss_conf 975557689990988998559889969 No 25 >PRK07034 hypothetical protein; Provisional Probab=99.85 E-value=1.9e-20 Score=155.66 Aligned_cols=122 Identities=24% Similarity=0.289 Sum_probs=103.8 Q ss_pred CCEEEECCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCC-CHHHHHHHH Q ss_conf 3114511453499997034567455568754683565654436789861278338987314676678741-069999999 Q gi|254780529|r 213 KGVLFETTRTGILISIIDQRNTPMFDKSSSIPLPETIVVLQKIGEVLAHSTEVISIRGHTDASPFRNIAR-DNWRLSLDR 291 (343) Q Consensus 213 ~~~~v~~~~~~i~I~i~d~~~~~lF~~gsa~l~~~~~~~l~~ia~~l~~~~~~i~IeGHTD~~p~~g~~~-~Nw~LS~~R 291 (343) +.+++...+++-++.+ .+.+.|++|+|.|.+++...+.+++..+...|+.|.|.||||++||+.+.+ +||.||++| T Consensus 368 Gwltvrk~P~gwllif---~sDGaF~~g~A~l~~~f~~nieRLG~A~ApwpGdleViGHTD~rPiRss~fp~N~~LSeAR 444 (565) T PRK07034 368 GWLTAYKHPQGWLLVF---KSDGAFDVGKANVRADFMHNIERLGLAFAPWPGDLEVIGHTDSRPIRTSEFPDNQALSEAR 444 (565) T ss_pred HHEEEEECCCCCEEEE---ECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCCHHHHHHHH T ss_conf 0054020797418998---2487644341203698898899986763688886577514788764378898407678888 Q ss_pred HHHHHHHHHHCCCCH---------HH-EEEEEECCCCCCCCCCCCCCC--CCCEEEEEE Q ss_conf 999999999869996---------57-122563033488889885213--375599999 Q gi|254780529|r 292 AYSAYQVLMKSGVSE---------DR-ISKISGFAHHRLKIASDPMNS--ANRRIDILV 338 (343) Q Consensus 292 A~aV~~~L~~~Gv~~---------~r-i~~v~G~g~~~P~~~n~t~~~--~NRRVEi~i 338 (343) |..|++.|.+.-++. .| |. .+|.||++|+++|+|+++ |||||+|+- T Consensus 445 A~~VAD~lR~s~~~~~~~~~~~~~QR~i~-~~GrGd~~Pldt~~T~~~~~RNRRVDiLW 502 (565) T PRK07034 445 ARNVADELRKTALPGGARAPENAVQRNIE-YSGRGDAQPIDTAKTAAAYERNRRVDVLW 502 (565) T ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHHHEE-EECCCCCCCCCCCCHHHHHHHCCCCEEEE T ss_conf 87899999985343544571266652000-00478888665400577775265301566 No 26 >pfam00691 OmpA OmpA family. The Pfam entry also includes MotB and related proteins which are not included in the Prosite family. Probab=99.82 E-value=6.9e-20 Score=151.86 Aligned_cols=93 Identities=32% Similarity=0.504 Sum_probs=85.0 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCC--CEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEE Q ss_conf 55568754683565654436789861278--3389873146766787410699999999999999998699965712256 Q gi|254780529|r 236 MFDKSSSIPLPETIVVLQKIGEVLAHSTE--VISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSEDRISKIS 313 (343) Q Consensus 236 lF~~gsa~l~~~~~~~l~~ia~~l~~~~~--~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~ 313 (343) +|++||+.|++.++..|++++.+|+..+. .|.|+||||+. |...+||.||.+||.+|++||+..||+++||. +. T Consensus 1 ~F~~~s~~L~~~~~~~L~~~a~~l~~~~~~~~v~I~GhtD~~---g~~~~N~~LS~~RA~~V~~~L~~~Gv~~~ri~-~~ 76 (97) T pfam00691 1 LFDPGSAELTAEARETLDRLAEVLKAPELKIAIKIEGHTDSR---GSAKYNWELSARRAQAVANYLVNHGIPPSRIS-VE 76 (97) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCC---CCHHHHHHHHHHHHHHHHHHHHHCCCCHHHEE-EE T ss_conf 999981213989999999999999718997289999998999---98889999999999999999998599978989-98 Q ss_pred ECCCCCCCCCCCCCC--CCCC Q ss_conf 303348888988521--3375 Q gi|254780529|r 314 GFAHHRLKIASDPMN--SANR 332 (343) Q Consensus 314 G~g~~~P~~~n~t~~--~~NR 332 (343) |||+++|+.+|++.. ++|| T Consensus 77 g~G~~~Pi~~~~~~~~r~~NR 97 (97) T pfam00691 77 GYGESQPLASNDSDEGRAKNR 97 (97) T ss_pred EECCCCCCCCCCCHHHHHHCC T ss_conf 875768889992999997475 No 27 >pfam02472 ExbD Biopolymer transport protein ExbD/TolR. This group of proteins are membrane bound transport proteins essential for ferric ion uptake in bacteria. The Pfam family consists of ExbD, and TolR which are involved in TonB-dependent transport of various receptor bound substrates including colicins. Probab=76.51 E-value=1.7 Score=22.60 Aligned_cols=20 Identities=25% Similarity=0.350 Sum_probs=16.8 Q ss_pred CHHHHHHHHHHHHHHHHHHH Q ss_conf 31887999999999999999 Q gi|254780529|r 32 SWKIVYADFMTVLMAFFLVM 51 (343) Q Consensus 32 ~W~vtyAD~~TlLmaFFvll 51 (343) .=+.++-|+|.+|+.||++- T Consensus 8 initPliDv~flLLiFFmvt 27 (128) T pfam02472 8 INLTPLIDVVFLLLIFFMVT 27 (128) T ss_pred CCCCCHHHHHHHHHHHHHHC T ss_conf 46532899999999999860 No 28 >COG3965 Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism] Probab=69.49 E-value=1.8 Score=22.31 Aligned_cols=50 Identities=18% Similarity=0.185 Sum_probs=22.4 Q ss_pred EECCCCEEEEEECCCCCCCCCCC--CC-CCCCHHHHHHHHHHHHHHHHCCC--CEEEEEEEC Q ss_conf 51145349999703456745556--87-54683565654436789861278--338987314 Q gi|254780529|r 217 FETTRTGILISIIDQRNTPMFDK--SS-SIPLPETIVVLQKIGEVLAHSTE--VISIRGHTD 273 (343) Q Consensus 217 v~~~~~~i~I~i~d~~~~~lF~~--gs-a~l~~~~~~~l~~ia~~l~~~~~--~i~IeGHTD 273 (343) +-..++++-|+|. |-. .+ +.--.+...+=+++...|...+. .+.|.=-+| T Consensus 255 vArVGr~l~IEi~-------fiip~~~~ar~Ved~d~Irdei~~slg~~g~~rwltvsfT~D 309 (314) T COG3965 255 VARVGRGLFIEIH-------FIIPRESDARNVEDWDDIRDEIGQSLGSLGYERWLTVSFTRD 309 (314) T ss_pred HHHHCCCEEEEEE-------EEECCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEEECC T ss_conf 9984662289999-------982776687463368999999988863478674389998533 No 29 >TIGR02814 pfaD_fam PfaD family protein; InterPro: IPR014179 The protein PfaD is part of a four-gene locus, similar to polyketide biosynthesis systems, which is responsible for omega-3 polyunsaturated fatty acid biosynthesis in several high pressure and/or cold-adapted bacteria. Several other members of the entry are found in loci presumed to act in polyketide biosyntheses per se.. Probab=61.25 E-value=4.1 Score=19.98 Aligned_cols=41 Identities=29% Similarity=0.503 Sum_probs=29.1 Q ss_pred CCCCCCCCCHHHHHHHHHH----------HHHHHHCC--CCEEEE----EEECCCCC Q ss_conf 5568754683565654436----------78986127--833898----73146766 Q gi|254780529|r 237 FDKSSSIPLPETIVVLQKI----------GEVLAHST--EVISIR----GHTDASPF 277 (343) Q Consensus 237 F~~gsa~l~~~~~~~l~~i----------a~~l~~~~--~~i~Ie----GHTD~~p~ 277 (343) =+.-+.-|.|.-..+|+++ |++...+| +-|.|| ||||+.|. T Consensus 145 pEVA~~Fm~PAP~~~L~~L~a~G~it~eeAeLA~~vPvAdDi~vEADSGGHTD~r~~ 201 (449) T TIGR02814 145 PEVAEAFMSPAPAAILQKLLAEGRITEEEAELAAEVPVADDICVEADSGGHTDNRPL 201 (449) T ss_pred HHHHHHHCCCCHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCEEEECCCCCCCCCHHH T ss_conf 799986215234899999853387687899998417730360375268339762288 No 30 >PRK11409 antitoxin YefM; Provisional Probab=59.57 E-value=9.5 Score=17.55 Aligned_cols=50 Identities=8% Similarity=0.180 Sum_probs=33.0 Q ss_pred CCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 337688689998216888878874318879999999999999999975089999999999853 Q gi|254780529|r 8 NQENDQFIIIKKKVAIDDTPNNLGSWKIVYADFMTVLMAFFLVMWIINATDDDTKKAIEQYFN 70 (343) Q Consensus 8 ~~~~~~~~iikr~~~~~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~ 70 (343) --.+++||||.|++. .++=|++..|+-.++=+..+| + ++.-.+.+..|+. T Consensus 20 V~~d~e~viItR~~~-------~~vVl~S~~ey~sl~ET~yLl----~--sp~Na~rL~~si~ 69 (83) T PRK11409 20 AVEDHAPILITRQNG-------EACVLMSLEEYNSLEETAYLL----R--SPANARRLMDSID 69 (83) T ss_pred HHCCCCEEEEEECCC-------CCEEEEEHHHHHHHHHHHHHH----C--CHHHHHHHHHHHH T ss_conf 965687289994699-------977997799998899999997----2--9899999999999 No 31 >PRK11024 colicin uptake protein TolR; Provisional Probab=59.54 E-value=9 Score=17.71 Aligned_cols=43 Identities=23% Similarity=0.335 Sum_probs=27.6 Q ss_pred HHHHHHHHHHHHCCC-CEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 654436789861278-33898731467667874106999999999999999986999 Q gi|254780529|r 250 VVLQKIGEVLAHSTE-VISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVS 305 (343) Q Consensus 250 ~~l~~ia~~l~~~~~-~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~ 305 (343) .++..+...+...+. .|.|.|+.+- +...-..|.+.+.+.|+. T Consensus 90 ~l~~~~~~~~~~~~~~~v~i~aDk~v-------------~y~~Vv~vmd~~k~aGi~ 133 (142) T PRK11024 90 QVVAEAKSRFKANPKTVFLIGGAKDV-------------PYDEIIKALNLLHSAGVK 133 (142) T ss_pred HHHHHHHHHHHCCCCCEEEEECCCCC-------------CHHHHHHHHHHHHHCCCC T ss_conf 99999999974199835999688888-------------888999999999985998 No 32 >COG5432 RAD18 RING-finger-containing E3 ubiquitin ligase [Signal transduction mechanisms] Probab=58.53 E-value=6.8 Score=18.49 Aligned_cols=23 Identities=9% Similarity=0.171 Sum_probs=13.1 Q ss_pred CCCCCCCCCCHHHHHHH-HHHHHH Q ss_conf 55568754683565654-436789 Q gi|254780529|r 236 MFDKSSSIPLPETIVVL-QKIGEV 258 (343) Q Consensus 236 lF~~gsa~l~~~~~~~l-~~ia~~ 258 (343) |+.++=..+.|..+.-| +++..+ T Consensus 280 L~NSN~D~~qPV~~~~L~RQLi~W 303 (391) T COG5432 280 LYNSNLDQKQPVSKRNLIRQLIDW 303 (391) T ss_pred HHCCCCCCCCCCCHHHHHHHHHHH T ss_conf 440676534776689999998667 No 33 >pfam12273 RCR Chitin synthesis regulation, resistance to Congo red. RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5. Probab=57.48 E-value=9.7 Score=17.48 Aligned_cols=27 Identities=15% Similarity=0.645 Sum_probs=13.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 3188799999999999999999750899999 Q gi|254780529|r 32 SWKIVYADFMTVLMAFFLVMWIINATDDDTK 62 (343) Q Consensus 32 ~W~vtyAD~~TlLmaFFvll~s~s~~d~~k~ 62 (343) +| |.||=++-.+ |+++++|+-++...+ T Consensus 1 RW-v~fav~i~~~---~i~~f~~~~~n~RRr 27 (124) T pfam12273 1 RW-VLFAIFIIAL---LILFFLTARINRRRR 27 (124) T ss_pred CE-EEHHHHHHHH---HHHHHHHHHHHHHHH T ss_conf 91-1089999999---999999998739998 No 34 >TIGR01055 parE_Gneg DNA topoisomerase IV, B subunit; InterPro: IPR005737 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . This entry represents subunit B (parE) of topoisomerase IV from Gram negative bacteria. Topoisomerase IV primarily decatenates DNA and relaxes positive supercoils, which is important in bacteria, where the circular chromosome becomes catenated, or linked, during replication . Topoisomerase IV consists of two polypeptide subunits, parC (subunit A), which is homologous to gyrA of topoisomerase II, and parE (subunit B), which is homologous to gyrB of topoisomerase II. More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0005694 chromosome. Probab=56.95 E-value=6.9 Score=18.45 Aligned_cols=73 Identities=19% Similarity=0.343 Sum_probs=40.1 Q ss_pred CEEEEEEEC-CCC--CC----CCCCCHHHHHHHHHHHHHHHHHHC--CCCHHHEEEEEECCCCCCCCCCC-CCCCCCCE- Q ss_conf 338987314-676--67----874106999999999999999986--99965712256303348888988-52133755- Q gi|254780529|r 265 VISIRGHTD-ASP--FR----NIARDNWRLSLDRAYSAYQVLMKS--GVSEDRISKISGFAHHRLKIASD-PMNSANRR- 333 (343) Q Consensus 265 ~i~IeGHTD-~~p--~~----g~~~~Nw~LS~~RA~aV~~~L~~~--Gv~~~ri~~v~G~g~~~P~~~n~-t~~~~NRR- 333 (343) .+.=+||-= ..| |+ ..-|| -|...-=..|.+-|.+. |-+. +-+..|.|+-.|.-=-+ |.....|| T Consensus 526 ~LV~~GH~Y~A~PPLYR~~~~~~vyY--A~dEeEK~~~~~~L~~~sK~K~~--V~RFKGLGEM~P~QL~ETT~~P~~R~L 601 (647) T TIGR01055 526 KLVEEGHVYVAKPPLYRIDLSKEVYY--ALDEEEKEKLLEKLKKKSKGKPN--VQRFKGLGEMNPAQLRETTMDPNTRRL 601 (647) T ss_pred HHHHCCCEEEECCCCEEEECCCEEEE--ECCHHHHHHHHHHHHHHCCCCCC--CCCCCCCCCCCCCHHHCCCCCCCCCEE T ss_conf 88753907871787202106870688--52846689999999763068854--411025456782012103588876137 Q ss_pred EEEEEECC Q ss_conf 99999569 Q gi|254780529|r 334 IDILVEDR 341 (343) Q Consensus 334 VEi~i~~~ 341 (343) |.++++.. T Consensus 602 ~~~T~~~~ 609 (647) T TIGR01055 602 VQLTLDDV 609 (647) T ss_pred EEEEEECC T ss_conf 88763125 No 35 >pfam09476 Pilus_CpaD Pilus biogenesis CpaD protein (pilus_cpaD). Proteins in this entry consist of a pilus biogenesis protein, CpaD, from Caulobacter, and homologues in other bacteria, including three in the root nodule bacterium Bradyrhizobium japonicum. The molecular function of the homologues is not known. Probab=56.52 E-value=13 Score=16.53 Aligned_cols=77 Identities=12% Similarity=0.089 Sum_probs=54.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHC-CCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEEC Q ss_conf 55687546835656544367898612-78338987314676678741069999999999999999869996571225630 Q gi|254780529|r 237 FDKSSSIPLPETIVVLQKIGEVLAHS-TEVISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSEDRISKISGF 315 (343) Q Consensus 237 F~~gsa~l~~~~~~~l~~ia~~l~~~-~~~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~G~ 315 (343) -..|...|++.-+..|..++.-.... -+.|.|+=...+- |=.-+.+=+..|+..|...||+..+|. +.+| T Consensus 44 v~~~~~~Lt~~qr~~l~~f~~~~~~~g~g~i~I~~Psgs~--------n~~aa~~~~~~ir~~l~~~Gv~~~~I~-~~~y 114 (200) T pfam09476 44 VGPGRGGLTASQRARLAGFAARYGRRGAGPLVIDVPSGSP--------NARAASALSAEVRALLAAAGVPPSNIS-VRAY 114 (200) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCC--------CHHHHHHHHHHHHHHHHHCCCCHHHEE-ECCC T ss_conf 2688898899999999999999845688618997589986--------357899999999999998599843155-4225 Q ss_pred CCCCCCC Q ss_conf 3348888 Q gi|254780529|r 316 AHHRLKI 322 (343) Q Consensus 316 g~~~P~~ 322 (343) ...-|.. T Consensus 115 ~~~~~~~ 121 (200) T pfam09476 115 DAAGARA 121 (200) T ss_pred CCCCCCC T ss_conf 7778788 No 36 >pfam07019 Rab5ip Rab5-interacting protein (Rab5ip). This family consists of several Rab5-interacting protein (RIP5 or Rab5ip ) sequences. The ras-related GTPase rab5 is rate-limiting for homotypic early endosome fusion. Rab5ip represents a novel rab5 interacting protein that may function on endocytic vesicles as a receptor for rab5-GDP and participate in the activation of rab5. Probab=56.43 E-value=9.3 Score=17.59 Aligned_cols=29 Identities=21% Similarity=0.513 Sum_probs=21.6 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 878874318879999999999999999975 Q gi|254780529|r 26 TPNNLGSWKIVYADFMTVLMAFFLVMWIIN 55 (343) Q Consensus 26 ~~~~~~~W~vtyAD~~TlLmaFFvll~s~s 55 (343) +..-.|.|-+....+++.++. |||.|.+. T Consensus 52 ~~~f~~~~~l~~eg~~~~l~~-Fvl~Wtl~ 80 (81) T pfam07019 52 EEYFGGRWELFTEGLLTGLAT-FVLTWTLF 80 (81) T ss_pred HHHCCCHHHHHHHHHHHHHHH-HHHHHHHC T ss_conf 887298989989338768999-99999830 No 37 >pfam06898 YqfD Putative stage IV sporulation protein YqfD. This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis is known to be essential for efficient sporulation although its exact function is unknown. Probab=55.64 E-value=9.7 Score=17.48 Aligned_cols=30 Identities=7% Similarity=0.207 Sum_probs=22.6 Q ss_pred CCCCCHHHHHH---HHHHHHHHHHHHHHHHHCC Q ss_conf 88743188799---9999999999999997508 Q gi|254780529|r 28 NNLGSWKIVYA---DFMTVLMAFFLVMWIINAT 57 (343) Q Consensus 28 ~~~~~W~vtyA---D~~TlLmaFFvll~s~s~~ 57 (343) +|.|=|+--|= =|+..+++||++||.+|.+ T Consensus 75 ~GlpF~~~r~~kR~~~~~G~~~f~~~l~~lS~f 107 (383) T pfam06898 75 KGLPFLFFRLRKRRGIVLGILLFVILLYFLSNF 107 (383) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHE T ss_conf 471789999888799999999999999978406 No 38 >TIGR02471 sucr_syn_bact_C sucrose phosphate synthase, sucrose phosphatase-like domain; InterPro: IPR012821 Sucrose phosphate synthase (SPS) and sucrose phosphate phosphatase (SPP) are the last two enzymes of sucrose biosynthesis. In cyanobacteria and plants, the C-terminal region of most or all versions of SPS has a domain homologous to the known SPP. This domain may serve a binding or regulatory rather than catalytic function. Sequences in this family are bacterial C-terminal regions found in all but two of the putative bacterial sucrose phosphate synthases described by IPR012822 from INTERPRO.. Probab=54.91 E-value=14 Score=16.36 Aligned_cols=66 Identities=20% Similarity=0.295 Sum_probs=48.7 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHCCCCE----EEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEE Q ss_conf 556875468356565443678986127833----8987314676678741069999999999999999869996571225 Q gi|254780529|r 237 FDKSSSIPLPETIVVLQKIGEVLAHSTEVI----SIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSEDRISKI 312 (343) Q Consensus 237 F~~gsa~l~~~~~~~l~~ia~~l~~~~~~i----~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v 312 (343) |. =|+.++|+..+.|..|-..|+...-.+ ..+-+-|=+|.+++. ..|||..=.+.|+|.++++ | T Consensus 117 FK-~SY~~~p~~~p~l~~~~~~Lr~~~~~~~~~ls~~~FLDv~P~RAS~----------~~A~Ryls~~wGlpLe~~L-v 184 (241) T TIGR02471 117 FK-ISYLLDPEDEPSLKQIRQRLRQQELAVKVILSLGAFLDVLPLRASK----------GLALRYLSYRWGLPLEQLL-V 184 (241) T ss_pred EE-EEEEECCCCCCCHHHHHHHHHHCCCEEEEEEECCCCCCEEECCCCC----------HHHHHHHHHHHCCCHHHEE-E T ss_conf 13-5665376778787899999861587477986046745434203770----------4899999998279745814-5 Q ss_pred EE Q ss_conf 63 Q gi|254780529|r 313 SG 314 (343) Q Consensus 313 ~G 314 (343) .| T Consensus 185 ag 186 (241) T TIGR02471 185 AG 186 (241) T ss_pred EE T ss_conf 40 No 39 >cd01324 cbb3_Oxidase_CcoQ Cytochrome cbb oxidase CcoQ. Cytochrome cbb3 oxidase, the terminal oxidase in the respiratory chains of proteobacteria, is a multi-chain transmembrane protein located in the cell membrane. Like other cytochrome oxidases, it catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. Found exclusively in proteobacteria, cbb3 is believed to be a modern enzyme that has evolved independently to perform a specialized function in microaerobic energy metabolism. The cbb3 operon contains four genes (ccoNOQP or fixNOQP), with ccoN coding for subunit I. Instead of a CuA-containing subunit II analogous to other cytochrome oxidases, cbb3 utilizes subunits ccoO and ccoP, which contain one and two hemes, respectively, to transfer electrons to the binuclear center. ccoQ, the fourth subunit, is a single transmembrane helix protein. It has been shown to protect the core complex from proteolytic degradation by serine proteases. See cd00919, cd01322 Probab=52.94 E-value=15 Score=16.15 Aligned_cols=33 Identities=24% Similarity=0.317 Sum_probs=18.3 Q ss_pred HHHHHH--HHHHHHHHHH--HHHHHHCCCHHHHHHHH Q ss_conf 887999--9999999999--99997508999999999 Q gi|254780529|r 34 KIVYAD--FMTVLMAFFL--VMWIINATDDDTKKAIE 66 (343) Q Consensus 34 ~vtyAD--~~TlLmaFFv--ll~s~s~~d~~k~~~i~ 66 (343) |..||| .+.+|++||+ ++|++.--...++..-+ T Consensus 7 lr~~AdsWgLl~~~~~Figvv~wa~~Pg~k~~~deaA 43 (48) T cd01324 7 LRGLADSWGLLYLALFFLGVVVWAFRPGRKKAFDEAA 43 (48) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH T ss_conf 9999999999999999999987505887623578875 No 40 >pfam03356 Pox_LP_H2 Viral late protein H2. All Members of this family show similarity to the vaccinia virus late protein H2. This protein is often referred# to by its gene name H2R. Members from this family all belong to the viral taxon Poxviridae. Probab=50.90 E-value=10 Score=17.39 Aligned_cols=22 Identities=32% Similarity=0.640 Sum_probs=17.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 887999999999999999997508 Q gi|254780529|r 34 KIVYADFMTVLMAFFLVMWIINAT 57 (343) Q Consensus 34 ~vtyAD~~TlLmaFFvll~s~s~~ 57 (343) -+.||--+|+. |||+++++|.. T Consensus 23 ~i~~a~~stl~--ff~l~L~iS~l 44 (189) T pfam03356 23 SIRYAKTSTLI--FFVLILLISVL 44 (189) T ss_pred HHHHHHHHHHH--HHHHHHHHHHH T ss_conf 65454023289--99999999999 No 41 >TIGR01059 gyrB DNA gyrase, B subunit; InterPro: IPR011557 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . Topoisomerase II (called gyrase in bacteria) primarily introduces negative supercoils into DNA. In bacteria, topoisomerase II consists of two polypeptide subunits, gyrA and gyrB, which form a heterotetramer: (BA)2. In most eukaryotes, topoisomerase II consists of a single polypeptide, where the N- and C-terminal regions correspond to gyrB and gyrA, respectively. This entry represents the B subunit (gyrB) as found predominantly in bacteria, but does not include the topoisomerase II enzymes composed of a single polypeptide, as are found in most eukaryotes. GyrB has two functional domains: an N-terminal ATPase and a C-terminal responsible for subunit interactions, the latter differing between subunit B and single polypeptide topoisomerase II . More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0005694 chromosome. Probab=49.64 E-value=7.8 Score=18.11 Aligned_cols=25 Identities=12% Similarity=0.109 Sum_probs=8.9 Q ss_pred CCCCCCHHHHHHH-HHHHHHHHHCCC Q ss_conf 8754683565654-436789861278 Q gi|254780529|r 240 SSSIPLPETIVVL-QKIGEVLAHSTE 264 (343) Q Consensus 240 gsa~l~~~~~~~l-~~ia~~l~~~~~ 264 (343) |+++.++--..++ ..+..+|-.+|. T Consensus 355 GNsEVr~~V~~~v~e~l~~f~eeNP~ 380 (818) T TIGR01059 355 GNSEVRSIVESLVYEKLTEFFEENPT 380 (818) T ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCH T ss_conf 46416777888879999999752906 No 42 >PRK00159 putative septation inhibitor protein; Reviewed Probab=46.99 E-value=19 Score=15.55 Aligned_cols=17 Identities=18% Similarity=0.735 Sum_probs=9.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 88743188799999999999999 Q gi|254780529|r 28 NNLGSWKIVYADFMTVLMAFFLV 50 (343) Q Consensus 28 ~~~~~W~vtyAD~~TlLmaFFvl 50 (343) .+.|.| |+ .+|++||++ T Consensus 27 ~~sp~W---~~---p~m~~lmli 43 (87) T PRK00159 27 GPSSVW---YV---VLMLGLMLI 43 (87) T ss_pred CCCCCC---HH---HHHHHHHHH T ss_conf 999850---99---999999999 No 43 >pfam00737 PsbH Photosystem II 10 kDa phosphoprotein. This protein is phosphorylated in a light dependent reaction. Probab=45.75 E-value=18 Score=15.66 Aligned_cols=30 Identities=13% Similarity=0.190 Sum_probs=14.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8887887431887999999999999999997 Q gi|254780529|r 24 DDTPNNLGSWKIVYADFMTVLMAFFLVMWII 54 (343) Q Consensus 24 ~~~~~~~~~W~vtyAD~~TlLmaFFvll~s~ 54 (343) .+-+.-+|.| -|..=|.-.|..|+|.+..+ T Consensus 13 seyGKvapGW-GTtp~Mgv~m~lf~vFl~ii 42 (52) T pfam00737 13 SEYGKVAPGW-GTTPVMGVFMALFAVFLLII 42 (52) T ss_pred CCCCCCCCCC-CCCHHHHHHHHHHHHHHHHH T ss_conf 4347426886-67589999999999999999 No 44 >PRK11267 biopolymer transport protein ExbD; Provisional Probab=45.31 E-value=20 Score=15.38 Aligned_cols=43 Identities=9% Similarity=0.242 Sum_probs=28.2 Q ss_pred HHHHHHHHHHHCCC-CEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCH Q ss_conf 54436789861278-338987314676678741069999999999999999869996 Q gi|254780529|r 251 VLQKIGEVLAHSTE-VISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSE 306 (343) Q Consensus 251 ~l~~ia~~l~~~~~-~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~ 306 (343) +...+....+..|. .|.|.|..+ .+..+-..|.+.+.+.|+.. T Consensus 86 L~~~l~~~~~~~~~~~v~i~aD~~-------------~~y~~vv~Vmd~~~~aG~~k 129 (141) T PRK11267 86 MITALDALTEGKKDTTIFFRADKT-------------VDYETLMKVMDTLHQAGYLK 129 (141) T ss_pred HHHHHHHHHHCCCCCEEEEECCCC-------------CCHHHHHHHHHHHHHCCCCE T ss_conf 999999998328996299975899-------------86889999999999869986 No 45 >TIGR03656 IsdC heme uptake protein IsdC. Isd proteins are iron-regulated surface proteins found in Bacillus, Staphylococcus and Listeria species and are responsible for heme scavenging from hemoproteins. The IsdC protein consists of an N-terminal hydrophobic signal sequence, a central NEAT (NEAr Transporter, pfam05031) domain which confers the ability to bind heme and a C-terminal SrtB processing signal which targets the protein to the cell wall. IsdC is believed to make a direct contact with, and transfer heme to, the heme-binding component (IsdE) of an ABC transporter in the cytoplasmic membrane, and to receive heme from other NEAT-containing heme-binding proteins also localized in the cell wall. Probab=45.02 E-value=12 Score=16.81 Aligned_cols=15 Identities=20% Similarity=0.388 Sum_probs=11.8 Q ss_pred HHHHHHHHHHHHHCC Q ss_conf 999999999997508 Q gi|254780529|r 43 VLMAFFLVMWIINAT 57 (343) Q Consensus 43 lLmaFFvll~s~s~~ 57 (343) |+++||++|++|..+ T Consensus 5 ~~~~~f~~~~~f~~~ 19 (217) T TIGR03656 5 LVFAFFTTILAFIIL 19 (217) T ss_pred EHHHHHHHHHHHHHC T ss_conf 038999999998650 No 46 >COG0848 ExbD Biopolymer transport protein [Intracellular trafficking and secretion] Probab=43.90 E-value=20 Score=15.33 Aligned_cols=26 Identities=23% Similarity=0.160 Sum_probs=20.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 43188799999999999999999750 Q gi|254780529|r 31 GSWKIVYADFMTVLMAFFLVMWIINA 56 (343) Q Consensus 31 ~~W~vtyAD~~TlLmaFFvll~s~s~ 56 (343) .-=.+.|-|-|-.|+.||++--.|.. T Consensus 15 eINvtPlIDVmLVLLiiFmvtap~~t 40 (137) T COG0848 15 EINVTPLIDVMLVLLIIFMVTAPFIT 40 (137) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 66655379999999999998511311 No 47 >pfam11548 Receptor_IA-2 Protein-tyrosine phosphatase receptor IA-2. IA-2 is a protein-tyrosine phosphatase receptor that upon exocytosis, the cytoplasmic domain is cleaved and moves to the nucleus where it enhances transcription of the insulin gene. The mature exodomain of IA-2 participates in adhesion to the extracellular matrix and is self-proteolyzed in vitro by reactive oxygen species which may be a new shedding mechanism. Probab=43.53 E-value=21 Score=15.20 Aligned_cols=69 Identities=23% Similarity=0.283 Sum_probs=45.4 Q ss_pred CHHHHHHHHHHHHHHHHCCC---CEEEEEEECCCCCCCCCCCHHHHH--HHHHHHHHHHHHH-CCCCHHHEEEEEECCCC Q ss_conf 83565654436789861278---338987314676678741069999--9999999999998-69996571225630334 Q gi|254780529|r 245 LPETIVVLQKIGEVLAHSTE---VISIRGHTDASPFRNIARDNWRLS--LDRAYSAYQVLMK-SGVSEDRISKISGFAHH 318 (343) Q Consensus 245 ~~~~~~~l~~ia~~l~~~~~---~i~IeGHTD~~p~~g~~~~Nw~LS--~~RA~aV~~~L~~-~Gv~~~ri~~v~G~g~~ 318 (343) ..++.++|..+|..|.=... .|.|.|+.=+=.++ +..-||..+ ++.|.+....|.+ .|+ ||+. +|.|+. T Consensus 16 ~~~G~~Ll~~lA~ll~l~~s~F~~i~V~gpaVTFrV~-~N~qN~t~adVa~~a~~~K~~Le~~tGl---~Ilq-tGvG~k 90 (91) T pfam11548 16 LDEGVRLMEILAERVHLPSSSFANISVVGPAVTFRVR-PNEQNVSTADVAKAAVDNKDQLEKQTGL---RILQ-SGVGQR 90 (91) T ss_pred HHHHHHHHHHHHHHHCCCCCCEEEEEEECCCEEEEEC-CCCCCCCHHHHHHHHHHHHHHHHHHHCE---EEEE-ECCCCC T ss_conf 8899999999999957971123678985781799964-6713587999999999868888774182---9864-054679 No 48 >pfam10624 TraS Plasmid conjugative transfer entry exclusion protein TraS. Entry exclusion (Eex) is a process which prevents redundant transfer of DNA between donor cells. TraS is a protein involved in Eex. It blocks redundant conjugative DNA synthesis and transport between donor cells, and it is suggested that TraS interferes with a signalling pathway that is required to trigger DNA transfer. TraS on the recipient cell is known to form an interaction with TraG on the donor cell. Probab=42.94 E-value=13 Score=16.71 Aligned_cols=28 Identities=21% Similarity=0.497 Sum_probs=19.5 Q ss_pred HHHHHHHHHHH------HHHHCCCHHHHHHHHHHHH Q ss_conf 99999999999------9975089999999999853 Q gi|254780529|r 41 MTVLMAFFLVM------WIINATDDDTKKAIEQYFN 70 (343) Q Consensus 41 ~TlLmaFFvll------~s~s~~d~~k~~~i~~~f~ 70 (343) -|..|-||+|| |+|+.+ .|+..|++.|+ T Consensus 27 et~f~l~fi~lwqglfiwlf~qi--rkk~~vsdefk 60 (164) T pfam10624 27 ETGFILCFILLWQGLFIWLFIQI--RKKNHISDEFK 60 (164) T ss_pred CHHHHHHHHHHHHHHHHHHHHHH--HHHCCCCHHHC T ss_conf 14578999999978999999999--87237513321 No 49 >pfam10959 DUF2761 Protein of unknown function (DUF2761). Members in this family of proteins are annotated as KleF however no function is known. Probab=40.29 E-value=5.3 Score=19.24 Aligned_cols=10 Identities=20% Similarity=0.454 Sum_probs=4.0 Q ss_pred EEEEEEECCC Q ss_conf 3898731467 Q gi|254780529|r 266 ISIRGHTDAS 275 (343) Q Consensus 266 i~IeGHTD~~ 275 (343) |.|.-+.|+. T Consensus 16 VLVr~YAdSd 25 (96) T pfam10959 16 VLVRDYADSD 25 (96) T ss_pred EEEHHHCCCC T ss_conf 8754223443 No 50 >pfam00543 P-II Nitrogen regulatory protein P-II. P-II modulates the activity of glutamine synthetase. Probab=40.17 E-value=24 Score=14.86 Aligned_cols=52 Identities=17% Similarity=0.274 Sum_probs=33.6 Q ss_pred HHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCC--CCCC----CCCCEEEEEEECC Q ss_conf 99999999999869996571225630334888898--8521----3375599999569 Q gi|254780529|r 290 DRAYSAYQVLMKSGVSEDRISKISGFAHHRLKIAS--DPMN----SANRRIDILVEDR 341 (343) Q Consensus 290 ~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~~n--~t~~----~~NRRVEi~i~~~ 341 (343) .|...|.+.|.+.|++.=-+..|.|||..+....- .+.. -.-++|||++.+. T Consensus 8 ~kl~~v~~AL~~~G~~g~Tv~~v~G~G~q~g~~~~~~g~~~~~~~~pK~~ieivv~d~ 65 (102) T pfam00543 8 EKLDEVKEALEKAGVPGMTVTEVKGRGRQKGHTEIYRGAEYYVEFLPKVKIEIVVPDE 65 (102) T ss_pred HHHHHHHHHHHHCCCCCEEEEEEEEECCCCCCCCEECCCEEEECCCCEEEEEEEECHH T ss_conf 9999999999977998589998289767777441541623571314215999998738 No 51 >TIGR00946 2a69 Auxin Efflux Carrier; InterPro: IPR014024 This entry is mostly composed of known or predicted PIN proteins from plants, though some homologous prokaryotic proteins are also included. The PIN proteins are components of auxin efflux systems from plants. These carriers are saturable, auxin-specific, and localized to the basal ends of auxin transport-competent cells , . Plants typically posses several of these proteins, each displaying a unique tissue-specific expression pattern. They are expressed in almost all plant tissues including vascular tissues and roots, and influence many processes including the establishment of embryonic polarity, plant growth, apical hook formation in seedlings and the photo- and gravitrophic responses. These plant proteins are typically 600-700 amino acyl residues long and exhibit 8-12 transmembrane segments.. Probab=39.00 E-value=25 Score=14.74 Aligned_cols=26 Identities=19% Similarity=0.356 Sum_probs=20.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 18879999999999999999975089 Q gi|254780529|r 33 WKIVYADFMTVLMAFFLVMWIINATD 58 (343) Q Consensus 33 W~vtyAD~~TlLmaFFvll~s~s~~d 58 (343) -.+.|-|.++.||.++|.++..+... T Consensus 161 ~~~~~~~~g~~l~~~~~~~~~~~~~g 186 (608) T TIGR00946 161 IAALFIDTGAVLMTIALGLFLVSEDG 186 (608) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 75657773257789999899863103 No 52 >pfam03032 Brevenin Brevenin/esculentin/gaegurin/rugosin family. This family contains a number of defence peptides secreted from the skin of amphibians, including the opiate-like dermorphins and deltorphins, and the antimicrobial dermoseptins and temporins. Probab=35.59 E-value=24 Score=14.88 Aligned_cols=21 Identities=14% Similarity=0.496 Sum_probs=16.0 Q ss_pred HHHHHHHHHHHHHHCCCHHHH Q ss_conf 999999999999750899999 Q gi|254780529|r 42 TVLMAFFLVMWIINATDDDTK 62 (343) Q Consensus 42 TlLmaFFvll~s~s~~d~~k~ 62 (343) +||+.||+=+.|+|.-.+++. T Consensus 6 Slll~fFLG~islSlCe~er~ 26 (46) T pfam03032 6 SLLLVLFLGLVSLSLCEEEKR 26 (46) T ss_pred HHHHHHHHHHHHHHHHHHHHC T ss_conf 389999986514765244305 No 53 >PRK10665 nitrogen regulatory protein P-II 2; Provisional Probab=35.46 E-value=28 Score=14.38 Aligned_cols=52 Identities=13% Similarity=0.266 Sum_probs=31.8 Q ss_pred HHHHHHHHHHHHCCCCHHHEEEEEECCCCCCC---CCCCCCC---CCCCEEEEEEECC Q ss_conf 99999999999869996571225630334888---8988521---3375599999569 Q gi|254780529|r 290 DRAYSAYQVLMKSGVSEDRISKISGFAHHRLK---IASDPMN---SANRRIDILVEDR 341 (343) Q Consensus 290 ~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~---~~n~t~~---~~NRRVEi~i~~~ 341 (343) .|-..|...|...|++.=-+..|.|||..+.. +...... -.-+||||++.+. T Consensus 11 ~kl~~vk~aL~~~G~~g~Tv~~V~G~G~q~g~~~~~rg~~~~~~~~pKv~ieivv~D~ 68 (112) T PRK10665 11 FKLEDVREALSSIGIQGLTVTEVKGFGRQKGHAELYRGAEYSVNFLPKVKIDVAIADD 68 (112) T ss_pred HHHHHHHHHHHHCCCCEEEEEECEEECCCCCCCCCCCCCCCCCCCCCEEEEEEEECHH T ss_conf 9999999999978997399984868756887553314651021521226999999728 No 54 >pfam00101 RuBisCO_small Ribulose bisphosphate carboxylase, small chain. Probab=34.72 E-value=29 Score=14.30 Aligned_cols=51 Identities=18% Similarity=0.196 Sum_probs=18.8 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCHHHH---HHHHHHHHHHHHH Q ss_conf 6565443678986127833898731467667874106999---9999999999999 Q gi|254780529|r 248 TIVVLQKIGEVLAHSTEVISIRGHTDASPFRNIARDNWRL---SLDRAYSAYQVLM 300 (343) Q Consensus 248 ~~~~l~~ia~~l~~~~~~i~IeGHTD~~p~~g~~~~Nw~L---S~~RA~aV~~~L~ 300 (343) ..++.++|--+| .....+.|| |||.+.+.+.+=.=|.| ...++..|...|. T Consensus 13 ~~~i~~QV~yll-~qG~~i~iE-~ad~~~~r~~~W~mW~lP~f~~~d~~~Vl~ele 66 (98) T pfam00101 13 DEQIAKQIRYLL-SQGWAPCLE-FADDRRFRTRYWTMWKLPMFGCTDAAQVLAELE 66 (98) T ss_pred HHHHHHHHHHHH-HCCCEEEEE-ECCCCCCEECEECCCCCCCCCCCCHHHHHHHHH T ss_conf 999999999999-889987798-657888430230256777666899999999999 No 55 >pfam12000 DUF3495 Domain of unknown function (DUF3495). This presumed domain is functionally uncharacterized. This domain is found in bacteria. This domain is about 170 amino acids in length. This domain is found associated with pfam00534. This domain has a single completely conserved residue G that may be functionally important. Probab=34.70 E-value=29 Score=14.30 Aligned_cols=39 Identities=13% Similarity=0.160 Sum_probs=22.4 Q ss_pred CCCHHHHHHHHHHHHHHHHH---HCCCCHHHEEEEEECCCCC Q ss_conf 41069999999999999999---8699965712256303348 Q gi|254780529|r 281 ARDNWRLSLDRAYSAYQVLM---KSGVSEDRISKISGFAHHR 319 (343) Q Consensus 281 ~~~Nw~LS~~RA~aV~~~L~---~~Gv~~~ri~~v~G~g~~~ 319 (343) +..+||....|+.+|++.+. +.|..++-|..=.|.|++- T Consensus 39 ~~~~~E~~~~rg~av~~aa~~L~~~Gf~PDvIi~H~GWGe~L 80 (172) T pfam12000 39 YLRDFEAAVIRGQAVARAARQLRAQGFRPDVIVAHPGWGETL 80 (172) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCHH T ss_conf 320599999999999999999997499999898758764010 No 56 >pfam11587 Prion_bPrPp Major prion protein bPrPp - N terminal. This family represents the N-terminal domain (1-30) of the bovine prion protein (bPrPp). The proteins structure consists of mainly alpha helices. BPrPp forms a stable helix which inserts in a transmembrane location in the bilayer, with the N -terminal (1-30) functioning as a cell-penetrating peptide. Probab=34.34 E-value=23 Score=14.96 Aligned_cols=10 Identities=20% Similarity=0.946 Sum_probs=8.3 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254780529|r 44 LMAFFLVMWI 53 (343) Q Consensus 44 LmaFFvll~s 53 (343) ++++||.+|| T Consensus 10 ilvLfvatws 19 (29) T pfam11587 10 ILVLFVATWS 19 (29) T ss_pred HHHHHHHHHH T ss_conf 9999999886 No 57 >cd06869 PX_UP2_fungi The phosphoinositide binding Phox Homology domain of uncharacterized fungal proteins. The PX domain is a phosphoinositide (PI) binding module involved in targeting proteins to PI-enriched membranes. Members in this subfamily are uncharacterized fungal proteins containing a PX domain. PX domain harboring proteins have been implicated in highly diverse functions such as cell signaling, vesicular trafficking, protein sorting, lipid modification, cell polarity and division, activation of T and B cells, and cell survival. In addition to protein-lipid interaction, the PX domain may also be involved in protein-protein interaction. Probab=30.35 E-value=28 Score=14.41 Aligned_cols=32 Identities=19% Similarity=0.413 Sum_probs=18.9 Q ss_pred CCCCCCCEEEEEECCCCCCCCCCCCHHHH--HHHHHHHH Q ss_conf 33768868999821688887887431887--99999999 Q gi|254780529|r 8 NQENDQFIIIKKKVAIDDTPNNLGSWKIV--YADFMTVL 44 (343) Q Consensus 8 ~~~~~~~~iikr~~~~~~~~~~~~~W~vt--yAD~~TlL 44 (343) .|.+.+. |||-++...+ .+.|+|+ |.||.+|- T Consensus 30 ~~~h~ef-iIrv~r~~~~----~~~~~V~RRYsdF~~Lh 63 (119) T cd06869 30 SKHHYEF-IIRVRREGEE----YRTIYVARRYSDFKKLH 63 (119) T ss_pred CCEEEEE-EEEEEECCCC----CCCEEEECCHHHHHHHH T ss_conf 4300789-9999636989----98789956579999999 No 58 >COG1735 Php Predicted metal-dependent hydrolase with the TIM-barrel fold [General function prediction only] Probab=29.65 E-value=35 Score=13.76 Aligned_cols=64 Identities=19% Similarity=0.180 Sum_probs=49.4 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEEC Q ss_conf 55568754683565654436789861278338987314676678741069999999999999999869996571225630 Q gi|254780529|r 236 MFDKSSSIPLPETIVVLQKIGEVLAHSTEVISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSEDRISKISGF 315 (343) Q Consensus 236 lF~~gsa~l~~~~~~~l~~ia~~l~~~~~~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~G~ 315 (343) =|-.+|..++|.-++.|+..|..-++. .+-|.-||- +.. .|..+.+.|...|++++|++ + |+ T Consensus 138 k~~~~~~~iTp~Eek~lrAaA~A~~~T--g~Pi~tHt~-------------~gt-~g~eq~~il~~egvdl~~v~-i-gH 199 (316) T COG1735 138 KEAGGSPAITPLEEKSLRAAARAHKET--GAPISTHTP-------------AGT-MGLEQLRILAEEGVDLRKVS-I-GH 199 (316) T ss_pred EECCCCCCCCHHHHHHHHHHHHHHHHC--CCCEEEECC-------------CHH-HHHHHHHHHHHCCCCHHHEE-E-EC T ss_conf 642576667989999999999986411--897177344-------------003-01999999997299866736-7-32 Q ss_pred CC Q ss_conf 33 Q gi|254780529|r 316 AH 317 (343) Q Consensus 316 g~ 317 (343) -+ T Consensus 200 ~d 201 (316) T COG1735 200 MD 201 (316) T ss_pred CC T ss_conf 58 No 59 >PRK10369 heme lyase subunit NrfE; Provisional Probab=29.65 E-value=35 Score=13.76 Aligned_cols=27 Identities=22% Similarity=0.341 Sum_probs=22.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 318879999999999999999975089 Q gi|254780529|r 32 SWKIVYADFMTVLMAFFLVMWIINATD 58 (343) Q Consensus 32 ~W~vtyAD~~TlLmaFFvll~s~s~~d 58 (343) ++..+|+-|.-+++||++|+|+|-.-| T Consensus 22 a~~~~~~~f~~~~~a~~~L~~af~~~d 48 (552) T PRK10369 22 TCIGILAQFALLLLAFGVLTYCFLISD 48 (552) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 889999999999999999999998017 No 60 >COG4646 DNA methylase [Transcription / DNA replication, recombination, and repair] Probab=29.56 E-value=25 Score=14.68 Aligned_cols=10 Identities=30% Similarity=0.378 Sum_probs=3.6 Q ss_pred EEEEEECCCC Q ss_conf 8999821688 Q gi|254780529|r 15 IIIKKKVAID 24 (343) Q Consensus 15 ~iikr~~~~~ 24 (343) |-+|+.+..+ T Consensus 104 i~vrkg~~~~ 113 (637) T COG4646 104 ITVRKGGASE 113 (637) T ss_pred EEEECCCCCC T ss_conf 9981477655 No 61 >pfam01707 Peptidase_C9 Peptidase family C9. Probab=29.53 E-value=35 Score=13.75 Aligned_cols=94 Identities=15% Similarity=0.145 Sum_probs=45.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHC---------CC-CEEE-EEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 55687546835656544367898612---------78-3389-8731467667874106999999999999999986999 Q gi|254780529|r 237 FDKSSSIPLPETIVVLQKIGEVLAHS---------TE-VISI-RGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVS 305 (343) Q Consensus 237 F~~gsa~l~~~~~~~l~~ia~~l~~~---------~~-~i~I-eGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~ 305 (343) |..++|- ...-.|+.++.-+... |. .+.+ .-|=||.| |...|.+..-.+|-.+-+.=....-+. T Consensus 78 FkeDka~---S~e~aLn~iCtr~fG~DLdSGLFSap~vpl~y~n~HWDNsP--g~~mYG~n~~va~~l~rryP~~~~a~~ 152 (202) T pfam01707 78 FKEDKAY---SPEVALNEICTKFFGVDLDSGLFSAPSVPLTYHNNHWDNSP--GGRMYGFNKAVARQLERRYPFLLGAGK 152 (202) T ss_pred HHCCCCC---CHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEECCCCCCCC--CCCCCCCCHHHHHHHHHHCCHHHHHHC T ss_conf 6136666---88999999999984643576543688664376536556899--865357599999999886735443221 Q ss_pred HHHEEEEEECCCCCCCCCCCCCCCCCCEEEE Q ss_conf 6571225630334888898852133755999 Q gi|254780529|r 306 EDRISKISGFAHHRLKIASDPMNSANRRIDI 336 (343) Q Consensus 306 ~~ri~~v~G~g~~~P~~~n~t~~~~NRRVEi 336 (343) ..|..-+ -.|-.+|..+..--=.-|||+-= T Consensus 153 tG~q~d~-~t~~~~~~~p~~N~VP~NRrLPH 182 (202) T pfam01707 153 TGRQIDV-TTGRVQVPSARINLVPLNRRLPH 182 (202) T ss_pred CCCEEEC-CCCCCCCCCCCCCEEECCCCCCC T ss_conf 5838862-56842258865433123456871 No 62 >KOG3385 consensus Probab=29.24 E-value=34 Score=13.88 Aligned_cols=10 Identities=30% Similarity=0.365 Sum_probs=5.4 Q ss_pred CCCCCCCCCH Q ss_conf 5568754683 Q gi|254780529|r 237 FDKSSSIPLP 246 (343) Q Consensus 237 F~~gsa~l~~ 246 (343) |++-+.-|+- T Consensus 73 fdsts~~L~g 82 (118) T KOG3385 73 FDSTSGFLSG 82 (118) T ss_pred HHHHHHHHHH T ss_conf 2566788887 No 63 >PRK13455 F0F1 ATP synthase subunit B; Provisional Probab=28.93 E-value=32 Score=14.01 Aligned_cols=26 Identities=23% Similarity=0.361 Sum_probs=14.5 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 878874318879999999999999999 Q gi|254780529|r 26 TPNNLGSWKIVYADFMTVLMAFFLVMW 52 (343) Q Consensus 26 ~~~~~~~W~vtyAD~~TlLmaFFvll~ 52 (343) -.++||-||.+=.+|- .+.+|+++++ T Consensus 16 ~~~~~~~~~~~d~~FW-v~IsFvif~~ 41 (184) T PRK13455 16 LAAGGPFFSLSNTDFI-VTLAFLLFIG 41 (184) T ss_pred HHCCCCCCCCCCCHHH-HHHHHHHHHH T ss_conf 7647998888895499-9999999999 No 64 >TIGR00835 agcS amino acid carrier protein; InterPro: IPR001463 Sodium symporters can be divided by sequence and functional similarity into various groups. One such group is the sodium/alanine symporter family, the members of which transport alanine in association with sodium ions. These transporters are believed to possess 8 transmembrane (TM) helices , , forming a channel or pore through the cytoplasmic membrane, the interior face being hydrophilic to allow the passage of alanine molecules and sodium ions . This family is restricted to the bacteria and archaea, examples are the alanine carrier protein from the Thermophilic bacterium PS-3; the D-alanine/glycine permease from Alteromonas haloplanktis; and the hypothetical protein yaaJ from Escherichia coli.; GO: 0005283 sodium:amino acid symporter activity, 0006814 sodium ion transport, 0016020 membrane. Probab=28.01 E-value=37 Score=13.57 Aligned_cols=55 Identities=15% Similarity=0.138 Sum_probs=38.4 Q ss_pred EECCCCCCCCCCCCHHHHH------HHHHHHHHHHH-HHHHHHHCC-CHHHHHHHHHHHHHHH Q ss_conf 8216888878874318879------99999999999-999997508-9999999999853554 Q gi|254780529|r 19 KKVAIDDTPNNLGSWKIVY------ADFMTVLMAFF-LVMWIINAT-DDDTKKAIEQYFNPFG 73 (343) Q Consensus 19 r~~~~~~~~~~~~~W~vty------AD~~TlLmaFF-vll~s~s~~-d~~k~~~i~~~f~p~~ 73 (343) |+++.+.--+|+|+|=+-= |--|..++|.| +|+.+|... +-.+.-.|++.|...+ T Consensus 112 r~~d~~g~~~GGPmYYi~~GL~~rGaG~la~~FA~~~~~i~~fG~~fn~~Q~N~Ia~~l~~~f 174 (461) T TIGR00835 112 RERDADGVFRGGPMYYIKKGLGMRGAGWLAVLFAVFSVLIASFGIGFNAVQANSIASALSNAF 174 (461) T ss_pred HEECCCCCEECCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 300589860066189999651899750799999999999999998741788989999998621 No 65 >KOG0605 consensus Probab=27.97 E-value=22 Score=15.08 Aligned_cols=27 Identities=30% Similarity=0.239 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 999999999999999975089999999999853 Q gi|254780529|r 38 ADFMTVLMAFFLVMWIINATDDDTKKAIEQYFN 70 (343) Q Consensus 38 AD~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~ 70 (343) .|||||||--=+| .+...+.-|++.+. T Consensus 226 GD~mTLL~~~~~L------~e~~arfYiaE~vl 252 (550) T KOG0605 226 GDMMTLLMRKDTL------TEDWARFYIAETVL 252 (550) T ss_pred CCHHHHHHHCCCC------CHHHHHHHHHHHHH T ss_conf 4399998735867------44789999999999 No 66 >TIGR00944 2a6301s04 Multicomponent K+:H+antiporter; InterPro: IPR004775 A group of uncharacterised proteins some of which are similar to various chains of the NADH-ubiquinone/plastoquinone (complex I).; GO: 0015386 potassium:hydrogen antiporter activity, 0006812 cation transport, 0016021 integral to membrane. Probab=27.88 E-value=38 Score=13.56 Aligned_cols=29 Identities=24% Similarity=0.639 Sum_probs=14.4 Q ss_pred CCHHHHHH-----H-HHHHHHH------HHHHHHHHHCCCH Q ss_conf 43188799-----9-9999999------9999999750899 Q gi|254780529|r 31 GSWKIVYA-----D-FMTVLMA------FFLVMWIINATDD 59 (343) Q Consensus 31 ~~W~vtyA-----D-~~TlLma------FFvll~s~s~~d~ 59 (343) |.|++.|. | |..||+| .-|-+|||...-. T Consensus 66 G~WkAPygIVlvlD~fsaLl~~~t~ii~l~v~lYS~~~~g~ 106 (504) T TIGR00944 66 GNWKAPYGIVLVLDRFSALLVLTTSIIALLVQLYSYARVGT 106 (504) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 87106179999999999999999999999999862565222 No 67 >PRK10858 nitrogen regulatory protein P-II 1; Provisional Probab=27.83 E-value=38 Score=13.55 Aligned_cols=52 Identities=15% Similarity=0.289 Sum_probs=30.5 Q ss_pred HHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCC---CCCCCC---CCCCEEEEEEECC Q ss_conf 999999999998699965712256303348888---988521---3375599999569 Q gi|254780529|r 290 DRAYSAYQVLMKSGVSEDRISKISGFAHHRLKI---ASDPMN---SANRRIDILVEDR 341 (343) Q Consensus 290 ~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~---~n~t~~---~~NRRVEi~i~~~ 341 (343) .|-..|.+.|.+.|++.=-+..|.|||-.+... ...... -.-+||||++.+. T Consensus 11 ~kl~~vk~aL~~~G~~g~Tv~~V~G~G~q~g~~~~~rg~~~~~~~~pK~~ieivv~d~ 68 (112) T PRK10858 11 FKLDDVREALAEVGITGMTVTEVKGFGRQKGHTELYRGAEYMVDFLPKVKIEIVVPDD 68 (112) T ss_pred HHHHHHHHHHHHCCCCEEEEEECEEECCCCCCCCCCCCCCEEECCCCEEEEEEEECHH T ss_conf 9999999999978997399995778745888554313652041223105999996788 No 68 >cd03527 RuBisCO_small Ribulose bisphosphate carboxylase/oxygenase (Rubisco), small subunit. Rubisco is a bifunctional enzyme catalyzes the initial steps of two opposing metabolic pathways: photosynthetic carbon fixation and the competing process of photorespiration. Rubisco Form I, present in plants and green algae, is composed of eight large and eight small subunits. The nearly identical small subunits are encoded by a family of nuclear genes. After translation, the small subunits are translocated across the chloroplast membrane, where an N-terminal signal peptide is cleaved off. While the large subunits contain the catalytic activities, it has been shown that the small subunits are important for catalysis by enhancing the catalytic rate through inducing conformational changes in the large subunits. Probab=27.74 E-value=38 Score=13.54 Aligned_cols=72 Identities=10% Similarity=0.251 Sum_probs=36.0 Q ss_pred HHHHHCCCCEEEEEEECCCCCCCCCCCHHHHH---HHHHHHHHHHHHH--CCCCHHHEEEEEECCCCCCCCCCCCCCCCC Q ss_conf 89861278338987314676678741069999---9999999999998--699965712256303348888988521337 Q gi|254780529|r 257 EVLAHSTEVISIRGHTDASPFRNIARDNWRLS---LDRAYSAYQVLMK--SGVSEDRISKISGFAHHRLKIASDPMNSAN 331 (343) Q Consensus 257 ~~l~~~~~~i~IeGHTD~~p~~g~~~~Nw~LS---~~RA~aV~~~L~~--~Gv~~~ri~~v~G~g~~~P~~~n~t~~~~N 331 (343) ..+-.....+.|| |||...+.+.|=.-|.|- ..-+.+|+..|.. .-.+. ..+++.||-..+ +- T Consensus 22 ~y~l~qGw~p~iE-~~d~~~~~~~yW~mWkLPmF~~~d~~~Vl~Eie~C~k~~p~-~yVRligfD~~~----------q~ 89 (99) T cd03527 22 DYIISNGWAPCLE-FTEPEHYDNRYWTMWKLPMFGCTDPAQVLREIEACRKAYPD-HYVRVVGFDNYK----------QS 89 (99) T ss_pred HHHHHCCCEEEEE-EECCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCC-CEEEEEEEECCC----------EE T ss_conf 9999689987899-84157666773334467467898999999999999998998-669999980875----------08 Q ss_pred CEEEEEEEC Q ss_conf 559999956 Q gi|254780529|r 332 RRIDILVED 340 (343) Q Consensus 332 RRVEi~i~~ 340 (343) +.++++|.+ T Consensus 90 q~~sfIV~R 98 (99) T cd03527 90 QCMSFIVHR 98 (99) T ss_pred EEEEEEEEC T ss_conf 999999768 No 69 >PTZ00264 circumsporozoite-related antigen; Provisional Probab=27.74 E-value=35 Score=13.80 Aligned_cols=14 Identities=43% Similarity=0.852 Sum_probs=10.4 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999999997 Q gi|254780529|r 41 MTVLMAFFLVMWII 54 (343) Q Consensus 41 ~TlLmaFFvll~s~ 54 (343) |-+|-+||++||.. T Consensus 1 MKiL~a~FL~LFca 14 (162) T PTZ00264 1 MKILAAFFLALFCA 14 (162) T ss_pred CHHHHHHHHHHHHH T ss_conf 90899999999999 No 70 >PRK02251 putative septation inhibitor protein; Reviewed Probab=27.64 E-value=38 Score=13.53 Aligned_cols=20 Identities=15% Similarity=0.529 Sum_probs=11.0 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 88788743188799999999999999 Q gi|254780529|r 25 DTPNNLGSWKIVYADFMTVLMAFFLV 50 (343) Q Consensus 25 ~~~~~~~~W~vtyAD~~TlLmaFFvl 50 (343) .....+|.| | ..+|++||++ T Consensus 22 ~~~~~~P~W---~---~p~m~~lm~i 41 (84) T PRK02251 22 KGTKVNPRW---F---APVMVALMII 41 (84) T ss_pred CCCCCCCCC---H---HHHHHHHHHH T ss_conf 667899840---8---9999999999 No 71 >PRK10053 hypothetical protein; Provisional Probab=27.28 E-value=38 Score=13.49 Aligned_cols=16 Identities=13% Similarity=0.391 Sum_probs=13.0 Q ss_pred HHHHHHHHHHHHHHHC Q ss_conf 9999999999999750 Q gi|254780529|r 41 MTVLMAFFLVMWIINA 56 (343) Q Consensus 41 ~TlLmaFFvll~s~s~ 56 (343) +..|+|||++.++++. T Consensus 5 ~~~~~~~~~~~~a~a~ 20 (130) T PRK10053 5 AIALASFLVMPYALAD 20 (130) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 8889998852065575 No 72 >pfam06151 Trehalose_recp Trehalose receptor. In Drosophila, taste is perceived by gustatory neurons located in sensilla distributed on several different appendages throughout the body of the animal. This family represents the taste receptor sensitive to trehalose. Probab=26.92 E-value=39 Score=13.45 Aligned_cols=37 Identities=16% Similarity=0.416 Sum_probs=31.0 Q ss_pred CHHHHHHHHHHHHHHH-------HHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 3188799999999999-------99999975089999999999853 Q gi|254780529|r 32 SWKIVYADFMTVLMAF-------FLVMWIINATDDDTKKAIEQYFN 70 (343) Q Consensus 32 ~W~vtyAD~~TlLmaF-------Fvll~s~s~~d~~k~~~i~~~f~ 70 (343) .|++-|.-+++..|+| |+.|.|++-. .+|+|+...+. T Consensus 204 ~~~~~~~~~in~~~TF~Wnf~DlFIiliS~gLa--~RF~Qin~rI~ 247 (414) T pfam06151 204 AWLAPLGEWVNVASTFVWNFMDIFIMMISIGLA--ARFQQLNRRIR 247 (414) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH T ss_conf 999999999999986341798899999999999--99999999999 No 73 >cd00649 catalase_peroxidase Catalase-peroxidases. This is a subgroup of heme-dependent peroxidases of the plant superfamily that share a heme prosthetic group and catalyze a multistep oxidative reaction involving hydrogen peroxide as the electron acceptor. Catalase-peroxidases can exhibit both catalase and broad-spectrum peroxidase activities depending on the steady-state concentration of hydrogen peroxide. These enzymes are found in many archaeal and bacterial organisms, where they neutralize potentially lethal hydrogen peroxide molecules generated during photosynthesis or stationary phase. Along with related intracellular fungal and plant peroxidases, catalase-peroxidases belong class I of the plant peroxidase superfamily. Unlike the eukaryotic enzymes, they are typically comprised of two homologous domains that probably arose via a single gene duplication event. The heme binding motif is present only in the N-terminal domain; the function of the C-terminal domain is not clear. Probab=26.74 E-value=28 Score=14.42 Aligned_cols=12 Identities=25% Similarity=0.429 Sum_probs=9.2 Q ss_pred CHHHHHHHHHHHH Q ss_conf 3188799999999 Q gi|254780529|r 32 SWKIVYADFMTVL 44 (343) Q Consensus 32 ~W~vtyAD~~TlL 44 (343) .| +||||||+|- T Consensus 128 ~~-iSwaDL~iLa 139 (405) T cd00649 128 QK-ISWADLIILA 139 (405) T ss_pred CC-CCHHHHHHHH T ss_conf 88-7799999999 No 74 >TIGR02544 III_secr_YscJ type III secretion apparatus lipoprotein, YscJ/HrcJ family; InterPro: IPR003282 Secretion of virulence factors in Gram-negative bacteria involves transportation of the protein across two membranes to reach the cell exterior. There have been four secretion systems described in animal enteropathogens such as Salmonella and Yersinia, with further sequence similarities in plant pathogens like Ralstonia and Erwinia. The type III secretion system is of great interest as it is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. The protein subunits of the system are very similar to those of bacterial flagellar biosynthesis . However, while the latter forms a ring structure to allow secretion of flagellin and is an integral part of the flagellum itself, type III subunits in the outer membrane translocate secreted proteins through a channel-like structure. One of the outer membrane protein subunit families, termed "K" here for nomenclature purposes, aids in the structural assembly of the invasion complex . It is also described as a lipoprotein. Members of this family include the Salmonella PrgK and SsaJ genes, MxiJ from Shigella, YscJ from Yersinia, and from the plant enteropathogens NolT (Rhizobium) and HrcJ Erwinia). The flagellar M-ring protein FliF also shares a low level of similarity, presumably due to evolution of the type III secretion system from the flagellar biosynthetic pathway.; GO: 0009306 protein secretion. Probab=26.44 E-value=31 Score=14.10 Aligned_cols=11 Identities=27% Similarity=0.610 Sum_probs=4.4 Q ss_pred CCCCCCCCCCH Q ss_conf 55568754683 Q gi|254780529|r 236 MFDKSSSIPLP 246 (343) Q Consensus 236 lF~~gsa~l~~ 246 (343) +|+.++=.-+| T Consensus 95 ~Fp~~gLvsSP 105 (203) T TIGR02544 95 LFPKDGLVSSP 105 (203) T ss_pred HCCCCCEECCH T ss_conf 51357703486 No 75 >PRK13165 cytochrome c-type biogenesis protein CcmE; Reviewed Probab=25.07 E-value=42 Score=13.23 Aligned_cols=25 Identities=12% Similarity=0.339 Sum_probs=16.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 31887999999999999999997508 Q gi|254780529|r 32 SWKIVYADFMTVLMAFFLVMWIINAT 57 (343) Q Consensus 32 ~W~vtyAD~~TlLmaFFvll~s~s~~ 57 (343) .|++.+. ++.+-.+.|++|++|... T Consensus 9 l~~i~~~-~~~l~~a~~Lil~al~~n 33 (161) T PRK13165 9 LWLACAV-LAGLALTIGLVLYALRSN 33 (161) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHC T ss_conf 9999999-999999999999998717 No 76 >PRK13289 nitric oxide dioxygenase; Provisional Probab=24.90 E-value=42 Score=13.21 Aligned_cols=29 Identities=24% Similarity=0.293 Sum_probs=24.1 Q ss_pred HHHHHHHHHHCCCCHHHEEEEEECCCCCCC Q ss_conf 999999999869996571225630334888 Q gi|254780529|r 292 AYSAYQVLMKSGVSEDRISKISGFAHHRLK 321 (343) Q Consensus 292 A~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~ 321 (343) -.+|+..|...||+++||. .+-||....+ T Consensus 370 m~av~~~L~~~GVp~~~Ih-~E~FGP~~~l 398 (399) T PRK13289 370 MQFVAKQLLDLGVPEERIH-YEFFGPAKVL 398 (399) T ss_pred HHHHHHHHHHCCCCHHHEE-EECCCCCHHC T ss_conf 9999999997799789978-8335880411 No 77 >cd04742 NPD_FabD 2-Nitropropane dioxygenase (NPD)-like domain, associated with the (acyl-carrier-protein) S-malonyltransferase FabD. NPD is part of the nitroalkaneoxidizing enzyme family, that catalyzes oxidative denitrification of nitroalkanes to their corresponding carbonyl compounds and nitrites. NDPs are members of the NAD(P)H-dependent flavin oxidoreductase family that reduce a range of alternative electron acceptors. Most use FAD/FMN as a cofactor and NAD(P)H as electron donor. Some contain 4Fe-4S cluster to transfer electron from FAD to FMN. Probab=24.71 E-value=43 Score=13.18 Aligned_cols=36 Identities=28% Similarity=0.509 Sum_probs=19.6 Q ss_pred CCCCCHHHHHHHHHH----------HHHHHHCC--CCEEEE----EEECCCC Q ss_conf 754683565654436----------78986127--833898----7314676 Q gi|254780529|r 241 SSIPLPETIVVLQKI----------GEVLAHST--EVISIR----GHTDASP 276 (343) Q Consensus 241 sa~l~~~~~~~l~~i----------a~~l~~~~--~~i~Ie----GHTD~~p 276 (343) ..-|.|.-..+|+++ |......| +.|.|| ||||++| T Consensus 144 ~~Fl~PaP~~il~~L~~~G~iT~eqA~lA~~vPmAdDItvEaDSGGHTD~~~ 195 (418) T cd04742 144 EAFMSPAPERILKKLLAEGKITEEQAELARRVPVADDITVEADSGGHTDNRP 195 (418) T ss_pred HHHCCCCCHHHHHHHHHCCCCCHHHHHHHHCCCCCCCEEEECCCCCCCCCCC T ss_conf 9870899899999999869988999998827985323677315787777864 No 78 >COG1315 Uncharacterized conserved protein [Function unknown] Probab=24.46 E-value=43 Score=13.15 Aligned_cols=19 Identities=21% Similarity=0.240 Sum_probs=10.3 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 87999999999999999997 Q gi|254780529|r 35 IVYADFMTVLMAFFLVMWII 54 (343) Q Consensus 35 vtyAD~~TlLmaFFvll~s~ 54 (343) +|=.++...|.+-=|+ |.+ T Consensus 101 vt~~eiL~~l~a~gVv-~GI 119 (543) T COG1315 101 VTKDEILNALKASGVV-YGI 119 (543) T ss_pred CCHHHHHHHHHHCCCE-EEH T ss_conf 5279999999860711-203 No 79 >TIGR02416 CO_dehy_Mo_lg carbon-monoxide dehydrogenase, large subunit; InterPro: IPR012780 Carbon monoxide dehydrogenases catalyse the reversible oxidation of carbon monoxide to carbon dioxide as shown below , :CO + H(2)O + A == CO(2) + AH(2) A variety of electron acceptors can be used by these enzymes including ferredoxin, methyl viologen and benzyl viologen. Under anaerobic conditions, carbon monoxide is utilised by a number of methanogenic archaea and acetogenic, sulphate-reducing or phototrophic bacteria. Under aerobic conditions carbon monoxide can be used as sole carbon and energy source by the carboxidotrophic bacteria, a taxonomically diverse group of obligate or facultative chemolithoautotrophic species. The anaerobic dehydrogenases are mostly nickel-containing iron-sulphur enzymes, while the The aerobic enzymes categorised so far are molybdenum-containing iron-sulphur proteins. This entry represents the large subunit of the molybdenum-containing carbon monoxide dehydrogenase found in aerobic bacteria. This enzyme is a dimer of heterotrimers, where each heterotrimer consists of a small iron-sulphur subunit, a medium FAD-containing flavoprotein subunit and a large molybdopterin-containign subunit which contains the active site , . The large subunit is divided into two domains. The N-terminal domain is composed of mainly of a five-and four-stranded mixed beta-sheet and interacts with the small and medium subunits. The C-terminal domain contains a region which interacts with its counterpart in the overall dimer, providing the main dimer contact, and a beta-sheet surrounded by alpha helices. The active site located in this subunit contains a novel dinuclear heterometal [CuSMo(=O)OH] cluster.; GO: 0005507 copper ion binding, 0018492 carbon-monoxide dehydrogenase (acceptor) activity, 0030151 molybdenum ion binding. Probab=24.33 E-value=29 Score=14.26 Aligned_cols=48 Identities=25% Similarity=0.304 Sum_probs=31.6 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHH----------------HHHHHHHCCCCEE-EEEEECCCCCCC Q ss_conf 56745556875468356565443----------------6789861278338-987314676678 Q gi|254780529|r 232 RNTPMFDKSSSIPLPETIVVLQK----------------IGEVLAHSTEVIS-IRGHTDASPFRN 279 (343) Q Consensus 232 ~~~~lF~~gsa~l~~~~~~~l~~----------------ia~~l~~~~~~i~-IeGHTD~~p~~g 279 (343) .+..||++..=.+.|.+..+.+- ||..|.=.++.|. ++|-||..||+- T Consensus 473 LGvgM~DscE~RiHPTG~~i~R~g~~sQGQGH~TTyAQIiA~ELGIP~e~I~v~~GdTd~aPyGL 537 (775) T TIGR02416 473 LGVGMFDSCEIRIHPTGSAIARLGTKSQGQGHETTYAQIIATELGIPAEDIDVEEGDTDTAPYGL 537 (775) T ss_pred EECCCCCCCEEEECCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCCCCCC T ss_conf 20100154425677788378986302369864304666655552788011067207878888566 No 80 >PRK05986 cob(I)yrinic acid a,c-diamide adenosyltransferase; Validated Probab=24.30 E-value=33 Score=13.89 Aligned_cols=23 Identities=22% Similarity=0.339 Sum_probs=13.5 Q ss_pred HHHHHHHHHHCCC--CEEEEEEECC Q ss_conf 4436789861278--3389873146 Q gi|254780529|r 252 LQKIGEVLAHSTE--VISIRGHTDA 274 (343) Q Consensus 252 l~~ia~~l~~~~~--~i~IeGHTD~ 274 (343) .+.+-..|...|. .|.+.|+.-+ T Consensus 134 ~eevi~~L~~rp~~~evVLTGR~~p 158 (190) T PRK05986 134 LEEVLEALNNRPGMQHVVITGRGAP 158 (190) T ss_pred HHHHHHHHHHCCCCCEEEEECCCCC T ss_conf 9999999982899876999799999 No 81 >pfam07509 DUF1523 Protein of unknown function (DUF1523). Probab=24.08 E-value=44 Score=13.11 Aligned_cols=37 Identities=16% Similarity=0.091 Sum_probs=26.0 Q ss_pred CCCCCCCCCCCHHHHHHHHH-------------HHHHHHHCCCCEEEEEE Q ss_conf 45556875468356565443-------------67898612783389873 Q gi|254780529|r 235 PMFDKSSSIPLPETIVVLQK-------------IGEVLAHSTEVISIRGH 271 (343) Q Consensus 235 ~lF~~gsa~l~~~~~~~l~~-------------ia~~l~~~~~~i~IeGH 271 (343) .+|..+||.|+.++..+..+ --+.|.-.|+.|.|+-- T Consensus 91 pYFKFdSanlqAeA~~lst~~~~~wV~I~hYGWR~~~ls~FPNAvsi~~v 140 (174) T pfam07509 91 PYFKFDSADLQAEAQDLSTADDPQWVEITYYGWRINMFSIFPNAVSIKPV 140 (174) T ss_pred CCEEECHHHHHHHHHHHHCCCCCCEEEEEEEEHHHHHHHHCCCCEEEEEC T ss_conf 62555405789998874225798479999733114788758873278875 No 82 >cd05536 POLBc_B3 DNA polymerase type-B B3 subfamily catalytic domain. Archaeal proteins that are involved in DNA replication are similar to those from eukaryotes. Some members of the archaea also possess multiple family B DNA polymerases (B1, B2 and B3). So far there is no specific function(s) has been assigned for different members of the archaea type B DNA polymerases. Phylogenetic analyses of eubacterial, archaeal, and eukaryotic family B DNA polymerases are support independent gene duplications during the evolution of archaeal and eukaryotic family B DNA polymerases. Structural comparison of the thermostable DNA polymerase type B to its mesostable homolog suggests several adaptations to high temperature such as shorter loops, disulfide bridges, and increasing electrostatic interaction at subdomain interfaces. Probab=23.67 E-value=45 Score=13.06 Aligned_cols=32 Identities=13% Similarity=-0.037 Sum_probs=13.8 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 468356565443678986127833898731467 Q gi|254780529|r 243 IPLPETIVVLQKIGEVLAHSTEVISIRGHTDAS 275 (343) Q Consensus 243 ~l~~~~~~~l~~ia~~l~~~~~~i~IeGHTD~~ 275 (343) .++.-++..|......+.+.... .|-|.|||+ T Consensus 133 ~it~~Gr~~l~~~~~~~e~~g~~-VIyGDTDSv 164 (371) T cd05536 133 AVTAWGREYIKTTIKIAEEKGFK-VIYGDTDSL 164 (371) T ss_pred HHHHHHHHHHHHHHHHHHHCCCE-EEEECCCEE T ss_conf 99999999999999999975998-999626526 No 83 >TIGR02855 spore_yabG sporulation peptidase YabG; InterPro: IPR008764 Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This is a group of peptidases belong to MEROPS peptidase family U57 (clan U-). The type example is the YabG protein of Bacillus subtilis. This is a protease involved in the synthesis and maturation of the spore coat proteins SpoIVA and YrbA of Bacillus subtilis .. Probab=23.45 E-value=45 Score=13.03 Aligned_cols=22 Identities=27% Similarity=0.601 Sum_probs=16.6 Q ss_pred HHHHHHHHHC-CCCEEEEEEECCC Q ss_conf 4367898612-7833898731467 Q gi|254780529|r 253 QKIGEVLAHS-TEVISIRGHTDAS 275 (343) Q Consensus 253 ~~ia~~l~~~-~~~i~IeGHTD~~ 275 (343) .+|..+|..+ |+.|.|.|| |+. T Consensus 152 e~v~~L~~~~~PDIlViTGH-Da~ 174 (292) T TIGR02855 152 EKVLDLIEEVRPDILVITGH-DAY 174 (292) T ss_pred HHHHHHHHHHCCCEEEEECC-CCE T ss_conf 88999997309978999466-630 No 84 >pfam09961 DUF2195 Uncharacterized protein conserved in bacteria (DUF2195). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=23.41 E-value=45 Score=13.02 Aligned_cols=53 Identities=13% Similarity=0.048 Sum_probs=36.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCCCCCCCCCCEEEEEEE Q ss_conf 41069999999999999999869996571225630334888898852133755999995 Q gi|254780529|r 281 ARDNWRLSLDRAYSAYQVLMKSGVSEDRISKISGFAHHRLKIASDPMNSANRRIDILVE 339 (343) Q Consensus 281 ~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~~P~~~n~t~~~~NRRVEi~i~ 339 (343) ..-++.+|..++..++.+|.+.-++-.+- . +. +-|+++ |+.-..||+|.+.|. T Consensus 64 AL~~Y~s~v~~~~g~~~~LQ~G~~~~~~~-~--~r--tl~LAs-d~~l~~dr~v~l~lg 116 (121) T pfam09961 64 ALVRYQSLVEFARGSRLVLQEGLFNLAKS-Q--GR--TLPLAT-DAALVFDGRLALRLG 116 (121) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHC-C--CE--EEEEEC-CHHHHCCCEEEEEEE T ss_conf 99999999887277899999760665415-8--64--799753-744513870799983 No 85 >TIGR02054 MerD mercuric resistence transcriptional repressor protein MerD; InterPro: IPR011797 This entry represents a transcriptional repressor protein of the MerR family whose expression is regulated by the mercury-sensitive transcriptional activator, MerR. MerD has been shown to repress the transcription of the mer operon .; GO: 0003677 DNA binding, 0045892 negative regulation of transcription DNA-dependent, 0046689 response to mercury ion. Probab=23.32 E-value=24 Score=14.84 Aligned_cols=22 Identities=18% Similarity=0.289 Sum_probs=12.2 Q ss_pred HHHHHHHCCCCHHHEEEEEECCC Q ss_conf 99999986999657122563033 Q gi|254780529|r 295 AYQVLMKSGVSEDRISKISGFAH 317 (343) Q Consensus 295 V~~~L~~~Gv~~~ri~~v~G~g~ 317 (343) |++||.++=+-|-|-+. +|||= T Consensus 20 vRDYlLRGLLhPvrrT~-sGYGi 41 (120) T TIGR02054 20 VRDYLLRGLLHPVRRTE-SGYGI 41 (120) T ss_pred EHHHHHHHHCCCCCCCC-CCCCH T ss_conf 12135540046323058-99853 No 86 >TIGR02707 butyr_kinase butyrate kinase; InterPro: IPR011245 This group represents bacterial butyrate kinase, an enzyme that facilitates the formation of butyryl-CoA by phosphorylating butyrate in the presence of ATP to form butyryl phosphate . The final steps in butyrate synthesis by anaerobic bacteria can occur via butyrate kinase and phosphotransbutyrylase or via butyryl-CoA:acetate CoA-transferase, the latter providing the dominant route for butyrate formation in human colonic bacteria .; GO: 0005524 ATP binding, 0047761 butyrate kinase activity, 0016310 phosphorylation, 0005737 cytoplasm. Probab=23.00 E-value=46 Score=12.97 Aligned_cols=25 Identities=20% Similarity=0.384 Sum_probs=8.9 Q ss_pred HHHHHHHHHHHCCCCHHHEEEEEEC Q ss_conf 9999999999869996571225630 Q gi|254780529|r 291 RAYSAYQVLMKSGVSEDRISKISGF 315 (343) Q Consensus 291 RA~aV~~~L~~~Gv~~~ri~~v~G~ 315 (343) |-..++++|..+|++=.++.+|.|. T Consensus 48 R~~~i~~~L~e~Gi~~~~l~AvVgR 72 (353) T TIGR02707 48 RKQVILEVLEEKGINISKLDAVVGR 72 (353) T ss_pred HHHHHHHHHHHCCCCHHHEEEEEEC T ss_conf 9999999987408871243157972 No 87 >TIGR00768 rimK_fam alpha-L-glutamate ligases, RimK family; InterPro: IPR004666 Escherichia coli RimK adds additional Glu residues to the native Glu-Glu C-terminus of ribosomal protein S6. Mutation of the Glu-Glu terminus to Lys-Glu blocked addition. S6 has the C-terminal sequence Glu-Glu in few species, suggesting the homolog of rimK may have a function other than S6 modification in those species. However, most species having a member of this protein subfamily do not have an S6 homolog ending in Glu-Glu.; GO: 0006464 protein modification process. Probab=22.66 E-value=47 Score=12.93 Aligned_cols=18 Identities=28% Similarity=0.287 Sum_probs=7.9 Q ss_pred HHHHHHHHHHHHHHHCCC Q ss_conf 999999999999998699 Q gi|254780529|r 287 LSLDRAYSAYQVLMKSGV 304 (343) Q Consensus 287 LS~~RA~aV~~~L~~~Gv 304 (343) +|+.|+.+|.++|...|+ T Consensus 82 v~~~~~~~~~r~lE~~G~ 99 (321) T TIGR00768 82 VSSFRGLAVLRALESLGV 99 (321) T ss_pred CCHHHHHHHHHHHHHCCC T ss_conf 110268899999985896 No 88 >cd00530 PTE Phosphotriesterase (PTE) catalyzes the hydrolysis of organophosphate nerve agents, including the chemical warfare agents VX, soman, and sarin as well as the insecticide paraoxon. PTE exists as a homodimer with one active site per monomer. The active site is located next to a binuclear metal center, at the C-terminal end of a TIM alpha- beta barrel motif. The native enzyme contains two zinc ions at the active site however these can be replaced with other metals such as cobalt, cadmium, nickel or manganese and the enzyme remains active. Probab=21.92 E-value=48 Score=12.83 Aligned_cols=65 Identities=22% Similarity=0.289 Sum_probs=43.1 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCC Q ss_conf 68754683565654436789861278338987314676678741069999999999999999869996571225630334 Q gi|254780529|r 239 KSSSIPLPETIVVLQKIGEVLAHSTEVISIRGHTDASPFRNIARDNWRLSLDRAYSAYQVLMKSGVSEDRISKISGFAHH 318 (343) Q Consensus 239 ~gsa~l~~~~~~~l~~ia~~l~~~~~~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV~~~L~~~Gv~~~ri~~v~G~g~~ 318 (343) .++..+++.-+++|+..|.+-+... +-|.=||.. + ..-+..+.++|.+.|++++|+. + |+-|. T Consensus 125 ~~~~~it~~E~k~lrAaA~A~~~TG--~pI~~Ht~~----g---------~~~~~e~~~il~~~G~d~~rvv-i-gH~D~ 187 (293) T cd00530 125 GGSPAITPLEEKVLRAAARAQKETG--VPISTHTQA----G---------LTMGLEQLRILEEEGVDPSKVV-I-GHLDR 187 (293) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHC--CEEEECCCC----C---------CCCCHHHHHHHHHCCCCHHHEE-E-ECCCC T ss_conf 4689998899999999999999879--828982798----8---------6530999999998598922578-9-63689 Q ss_pred CC Q ss_conf 88 Q gi|254780529|r 319 RL 320 (343) Q Consensus 319 ~P 320 (343) .| T Consensus 188 ~~ 189 (293) T cd00530 188 ND 189 (293) T ss_pred CC T ss_conf 89 No 89 >pfam05545 FixQ Cbb3-type cytochrome oxidase component FixQ. This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon. Probab=21.39 E-value=49 Score=12.76 Aligned_cols=34 Identities=18% Similarity=0.368 Sum_probs=19.3 Q ss_pred HHHHHHH-HHHHHH-HHHH--HHHHHHCCCHHHHHHHH Q ss_conf 1887999-999999-9999--99997508999999999 Q gi|254780529|r 33 WKIVYAD-FMTVLM-AFFL--VMWIINATDDDTKKAIE 66 (343) Q Consensus 33 W~vtyAD-~~TlLm-aFFv--ll~s~s~~d~~k~~~i~ 66 (343) .+..|++ +.|++| +||+ ++|+++--....++..+ T Consensus 5 ~l~~~a~~~~~l~~~~~Figiv~wa~~p~~k~~fdeaa 42 (49) T pfam05545 5 TLRGFARSWGTLVLFVFFIGIVFWAYSPRRKKRFDEAA 42 (49) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 99999999999999999999999872877504579986 No 90 >COG0711 AtpF F0F1-type ATP synthase, subunit b [Energy production and conversion] Probab=21.38 E-value=49 Score=12.76 Aligned_cols=21 Identities=19% Similarity=0.638 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHHHHHHHHHCC Q ss_conf 999999999999999997508 Q gi|254780529|r 37 YADFMTVLMAFFLVMWIINAT 57 (343) Q Consensus 37 yAD~~TlLmaFFvll~s~s~~ 57 (343) +.+|...+++|++|+|.+..+ T Consensus 6 ~~~~~~~~i~F~ill~ll~~~ 26 (161) T COG0711 6 DTNILWQLIAFVILLWLLKKF 26 (161) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 247999999999999999999 No 91 >PRK12891 allantoate amidohydrolase; Reviewed Probab=20.59 E-value=51 Score=12.65 Aligned_cols=12 Identities=25% Similarity=0.185 Sum_probs=6.4 Q ss_pred HHCCCCHHHEEE Q ss_conf 986999657122 Q gi|254780529|r 300 MKSGVSEDRISK 311 (343) Q Consensus 300 ~~~Gv~~~ri~~ 311 (343) .+.|+..-+|.+ T Consensus 344 ~~~g~~~~~m~S 355 (414) T PRK12891 344 RALGLSHMDIVS 355 (414) T ss_pred HHCCCCCCEECC T ss_conf 973999743065 No 92 >COG1115 AlsT Na+/alanine symporter [Amino acid transport and metabolism] Probab=20.59 E-value=51 Score=12.65 Aligned_cols=53 Identities=13% Similarity=0.134 Sum_probs=35.3 Q ss_pred EEEECCCCCCCCCCCCHHHHHH---HHHHHHHHHHHHHHHHH-CCCHHHHHHHHHHHH Q ss_conf 9982168888788743188799---99999999999999975-089999999999853 Q gi|254780529|r 17 IKKKVAIDDTPNNLGSWKIVYA---DFMTVLMAFFLVMWIIN-ATDDDTKKAIEQYFN 70 (343) Q Consensus 17 ikr~~~~~~~~~~~~~W~vtyA---D~~TlLmaFFvll~s~s-~~d~~k~~~i~~~f~ 70 (343) ..|+|..+..-.|+|+|-+..+ -.+..++| |.++.+|. ..+--+--.|+++++ T Consensus 120 ~Yr~kd~~G~~~GGP~yYi~kGl~~r~l~v~FA-~~li~afg~i~n~vQ~NsIa~a~~ 176 (452) T COG1115 120 KYRVKDKDGEYRGGPAYYIEKGLGMRWLAVLFA-FALIAAFGFIGNGVQSNSIASALA 176 (452) T ss_pred HEEEECCCCCCCCCHHHHHHHHCCCCHHHHHHH-HHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 126867999873772999986538738999999-999999973002675889999999 No 93 >pfam03402 V1R Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family. Probab=20.39 E-value=52 Score=12.62 Aligned_cols=11 Identities=27% Similarity=0.235 Sum_probs=4.5 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999999 Q gi|254780529|r 289 LDRAYSAYQVL 299 (343) Q Consensus 289 ~~RA~aV~~~L 299 (343) +.||+-..-.| T Consensus 196 E~RATqtILlL 206 (265) T pfam03402 196 EQRATRSILLL 206 (265) T ss_pred HHHHHHHHHHH T ss_conf 89998898999 No 94 >pfam03571 Peptidase_M49 Peptidase family M49. Probab=20.29 E-value=15 Score=16.13 Aligned_cols=40 Identities=30% Similarity=0.404 Sum_probs=24.5 Q ss_pred HHHHHHHCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 67898612783389873146766787410699999999999 Q gi|254780529|r 255 IGEVLAHSTEVISIRGHTDASPFRNIARDNWRLSLDRAYSA 295 (343) Q Consensus 255 ia~~l~~~~~~i~IeGHTD~~p~~g~~~~Nw~LS~~RA~aV 295 (343) ++-.|...+..+.|-||+|+.-...-.+.|| |+..||--+ T Consensus 363 VglYL~~~~~vL~iFg~~~~~ea~dv~Y~~~-L~m~r~GL~ 402 (550) T pfam03571 363 VALYLSTNKEVLEIFGFTDKEEADDVVYVNW-LQMIRAGLL 402 (550) T ss_pred HHHHHHCCHHHHHHHCCCCHHHHHHHHHHHH-HHHHHHHHH T ss_conf 7877744698998828997567888899999-999999763 No 95 >TIGR02644 Y_phosphoryl pyrimidine-nucleoside phosphorylase; InterPro: IPR000053 Two highly similar activities are represented in this group: thymidine phosphorylase (TP, gene deoA, 2.4.2.4 from EC) and pyrimidine-nucleoside phosphorylase (PyNP, gene pdp, 2.4.2.2 from EC). Both are dimeric enzymes that function in the salvage pathway to catalyse the reversible phosphorolysis of pyrimidine nucleosides to the free base and sugar moieties. In the case of thymidine phosphorylase, thymidine (and to a lesser extent, 2'-deoxyuridine) is lysed to produce thymine (or uracil) and 2'-deoxyribose-1-phosphate. Pyrimidine-nucleoside phosphorylase performs the analogous reaction on thymidine (to produce the same products) and uridine (to produce uracil and ribose-1-phosphate). PyNP is typically the only pyrimidine nucleoside phosphorylase encoded by Gram positive bacteria, while eukaryotes and proteobacteria encode two: TP, and the unrelated uridine phosphorylase. In humans, TP was originally characterised as platelet-derived endothelial cell growth factor and gliostatin . Structurally, the enzymes are homodimers, each composed of a rigid all alpha-helix lobe and a mixed alpha-helix/beta-sheet lobe, which are connected by a flexible hinge , . Prior to substrate binding, the lobes are separated by a large cleft. A functional active site and subsequent catalysis occurs upon closing of the cleft. The active site, composed of a phosphate binding site and a (deoxy)ribonucleotide binding site within the cleft region, is highly conserved between the two enzymes of this group. Active site residues (Escherichia coli DeoA numbering) include the phosphate binding Lys84 and Ser86 (close to a glycine-rich loop), Ser113, and Thr123, and the pyrimidine nucleoside-binding Arg171, Ser186, and Lys190. Sequence comparison between the active site residues for both enzymes reveals only one difference , which has been proposed to partially mediate substrate specificity. In TP, position 111 is a methionine, while the analogous position in PyNP is lysine. It should be noted that the uncharacterised archaeal members of this family differ in a number of respects from either of the characterised activities. The residue at position 108 is lysine, indicating the activity might be PyNP-like (though the determinants of substrate specificity have not been fully elucidated). Position 171 is glutamate (negative charge side chain) rather than arginine (positive charge side chain). In addition, a large loop that may "lock in" the substrates within the active site is much smaller than in the characterised members. It is not clear what effect these and other differences have on activity and specificity.; GO: 0006206 pyrimidine base metabolic process. Probab=20.07 E-value=28 Score=14.36 Aligned_cols=52 Identities=21% Similarity=0.285 Sum_probs=32.9 Q ss_pred EEEEECCCCCCCCCCCC-HHHH-HHH-------HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 99982168888788743-1887-999-------999999999999997508999999999985355 Q gi|254780529|r 16 IIKKKVAIDDTPNNLGS-WKIV-YAD-------FMTVLMAFFLVMWIINATDDDTKKAIEQYFNPF 72 (343) Q Consensus 16 iikr~~~~~~~~~~~~~-W~vt-yAD-------~~TlLmaFFvll~s~s~~d~~k~~~i~~~f~p~ 72 (343) ||++||+.+.=...-=. |+|. |.+ |..||||-| |+-+|.++...+...|-.. T Consensus 4 iI~kKr~g~~L~~eEI~Ef~I~gy~~g~i~DYQ~sAl~MAI~-----F~GM~~~Et~~LT~AM~~S 64 (425) T TIGR02644 4 IIEKKRDGKELSAEEINEFFINGYTNGEIPDYQMSALLMAIY-----FNGMTDEETAYLTKAMIDS 64 (425) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH-----HCCCCHHHHHHHHHHHHHH T ss_conf 378740578769888989988633279666589999999998-----5589988999999999973 No 96 >pfam02572 CobA_CobO_BtuR ATP:corrinoid adenosyltransferase BtuR/CobO/CobP. This family consists of the BtuR, CobO, CobP proteins all of which are Cob(I)alamin adenosyltransferase, EC:2.5.1.17, involved in cobalamin (vitamin B12) biosynthesis. These enzymes catalyse the adenosylation reaction: ATP + cob(I)alamin + H2O <= phosphate + diphosphate + adenosylcobalamin. Probab=20.06 E-value=53 Score=12.58 Aligned_cols=23 Identities=22% Similarity=0.406 Sum_probs=14.4 Q ss_pred HHHHHHHHHHCCC--CEEEEEEECC Q ss_conf 4436789861278--3389873146 Q gi|254780529|r 252 LQKIGEVLAHSTE--VISIRGHTDA 274 (343) Q Consensus 252 l~~ia~~l~~~~~--~i~IeGHTD~ 274 (343) .+.+-..|+..|. .|.+.|+.-. T Consensus 116 ~~~v~~~l~~rp~~~evVlTGr~~p 140 (172) T pfam02572 116 LEEVLELLRNRPEGQHVVLTGRGAP 140 (172) T ss_pred HHHHHHHHHHCCCCCEEEEECCCCC T ss_conf 9999999982899877999899999 Done!