Query gi|254780772|ref|YP_003065185.1| surface antigen (D15) [Candidatus Liberibacter asiaticus str. psy62] Match_columns 781 No_of_seqs 308 out of 3546 Neff 8.7 Searched_HMMs 39220 Date Sun May 29 19:14:11 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780772.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK11067 outer membrane protei 100.0 0 0 1248.3 67.2 756 23-781 1-801 (801) 2 TIGR03303 OM_YaeT outer membra 100.0 0 0 1187.4 66.0 729 46-781 1-741 (741) 3 COG4775 Outer membrane protein 100.0 0 0 1123.3 60.5 752 25-781 3-766 (766) 4 COG0729 Outer membrane protein 100.0 0 0 443.1 27.4 506 240-781 77-594 (594) 5 pfam01103 Bac_surface_Ag Surfa 100.0 0 0 410.9 31.9 309 466-781 1-317 (317) 6 TIGR03303 OM_YaeT outer membra 100.0 3.4E-44 0 306.6 42.1 308 43-364 66-397 (741) 7 TIGR00992 3a0901s03IAP75 chlor 100.0 1.4E-45 0 316.0 19.6 455 295-781 236-768 (768) 8 PRK11067 outer membrane protei 100.0 8.9E-43 0 297.3 32.3 312 46-374 92-430 (801) 9 KOG2602 consensus 100.0 5.6E-45 0 311.6 15.7 406 366-781 37-457 (457) 10 COG4775 Outer membrane protein 100.0 4.1E-37 1.1E-41 260.2 38.9 307 44-364 90-419 (766) 11 COG2831 FhaC Hemolysin activat 100.0 1.6E-38 4.1E-43 269.4 30.7 463 277-781 67-554 (554) 12 pfam03865 ShlB Haemolysin secr 99.9 8.2E-23 2.1E-27 166.6 27.8 342 386-765 10-384 (404) 13 COG2831 FhaC Hemolysin activat 99.4 8.5E-10 2.2E-14 81.4 22.6 159 188-368 67-226 (554) 14 COG0729 Outer membrane protein 99.3 2.8E-11 7.1E-16 91.1 11.7 212 145-368 64-281 (594) 15 pfam07244 Surf_Ag_VNR Surface 99.1 7.6E-10 1.9E-14 81.7 9.2 77 114-191 1-78 (78) 16 pfam07244 Surf_Ag_VNR Surface 98.8 3.6E-09 9.3E-14 77.3 3.1 77 284-362 2-78 (78) 17 TIGR00992 3a0901s03IAP75 chlor 98.7 5.1E-08 1.3E-12 69.8 7.6 220 47-283 145-388 (768) 18 PRK05529 cell division protein 98.5 7.7E-06 2E-10 55.5 14.2 120 2-121 15-137 (255) 19 pfam08479 POTRA_2 POTRA domain 98.4 3.9E-06 9.9E-11 57.4 9.4 75 115-191 2-76 (76) 20 pfam06898 YqfD Putative stage 98.2 6.4E-05 1.6E-09 49.5 12.2 95 10-113 76-177 (383) 21 COG1589 FtsQ Cell division sep 98.0 0.00025 6.3E-09 45.6 11.9 78 46-123 60-138 (269) 22 pfam08479 POTRA_2 POTRA domain 97.7 1.8E-05 4.5E-10 53.1 2.2 75 195-280 2-76 (76) 23 pfam08478 POTRA_1 POTRA domain 97.2 0.0013 3.3E-08 40.9 6.6 66 48-113 1-67 (67) 24 TIGR02876 spore_yqfD sporulati 97.1 0.011 2.8E-07 34.8 10.2 103 10-119 73-185 (406) 25 PRK10049 pgaA outer membrane p 94.0 0.37 9.5E-06 24.8 9.0 37 60-96 81-117 (818) 26 PRK10775 cell division protein 92.4 0.65 1.6E-05 23.3 14.7 140 43-186 53-203 (274) 27 PRK05529 cell division protein 87.7 0.24 6.2E-06 26.1 1.3 71 116-195 63-133 (255) 28 TIGR03509 OMP_MtrB_PioB decahe 87.0 1.7 4.4E-05 20.5 8.3 36 516-551 345-380 (649) 29 pfam08478 POTRA_1 POTRA domain 85.8 0.23 5.8E-06 26.2 0.3 29 116-144 1-29 (67) 30 pfam11854 DUF3374 Protein of u 84.6 2.2 5.7E-05 19.7 7.3 32 518-549 374-405 (675) 31 PRK09408 ompX outer membrane p 82.9 2.6 6.7E-05 19.3 7.4 43 738-781 129-171 (171) 32 PRK04922 tolB translocation pr 80.6 3.1 8E-05 18.8 8.6 66 25-97 5-76 (439) 33 PRK01742 tolB translocation pr 78.0 3.8 9.6E-05 18.3 9.7 44 50-98 28-74 (430) 34 PRK00178 tolB translocation pr 72.9 5.1 0.00013 17.4 7.5 67 24-98 3-73 (433) 35 TIGR01000 bacteriocin_acc bact 72.3 1.6 4.2E-05 20.6 1.1 52 25-81 25-76 (476) 36 PRK04792 tolB translocation pr 70.7 5.7 0.00015 17.1 10.8 45 50-97 25-72 (450) 37 PRK02889 tolB translocation pr 68.7 6.3 0.00016 16.8 7.9 43 50-97 29-74 (430) 38 PRK12788 flgH flagellar basal 64.2 3.3 8.4E-05 18.6 1.2 22 93-114 65-86 (231) 39 PRK04043 tolB translocation pr 61.9 7.3 0.00019 16.4 2.7 19 78-96 47-66 (419) 40 PRK09818 putative kinase inhib 58.8 8.1 0.00021 16.1 2.4 20 476-495 55-74 (183) 41 pfam06316 Ail_Lom Enterobacter 56.5 10 0.00027 15.4 7.8 43 738-781 157-199 (199) 42 PRK13513 putative outer membra 54.4 11 0.00029 15.2 4.3 42 15-58 4-45 (660) 43 PRK03629 tolB translocation pr 53.1 12 0.0003 15.0 4.4 45 50-98 24-72 (430) 44 PRK00249 flgH flagellar basal 52.6 5 0.00013 17.4 0.5 21 93-113 66-86 (231) 45 PRK10641 btuB vitamin B12/coba 43.6 16 0.0004 14.2 1.9 34 24-57 1-34 (615) 46 pfam10079 DUF2317 Uncharacteri 41.1 18 0.00046 13.8 4.8 28 64-91 39-66 (541) 47 pfam10082 DUF2320 Uncharacteri 39.7 19 0.00048 13.7 8.5 22 526-547 223-244 (380) 48 TIGR02636 galM_Leloir galactos 39.4 12 0.00031 14.9 0.9 85 586-672 160-247 (346) 49 pfam09743 DUF2042 Uncharacteri 39.4 18 0.00045 13.9 1.6 26 310-335 105-130 (272) 50 pfam08339 RTX_C RTX C-terminal 39.4 19 0.00049 13.6 5.3 12 97-108 10-21 (145) 51 pfam06935 DUF1284 Protein of u 39.0 19 0.00049 13.6 2.7 19 382-400 58-76 (103) 52 TIGR01994 SUF_scaf_2 SUF syste 36.9 21 0.00053 13.4 4.7 22 180-203 38-59 (139) 53 PRK10716 long-chain fatty acid 34.2 23 0.00058 13.1 20.4 64 686-761 341-404 (437) 54 TIGR02851 spore_V_T stage V sp 33.1 13 0.00034 14.7 0.2 15 182-196 21-35 (184) 55 TIGR02997 Sig70-cyanoRpoD RNA 32.2 13 0.00034 14.7 0.1 13 349-361 182-194 (336) 56 TIGR00407 proA gamma-glutamyl 30.3 21 0.00054 13.3 0.9 26 63-90 53-78 (415) 57 COG2063 FlgH Flagellar basal b 29.6 27 0.0007 12.6 1.9 19 94-112 66-84 (230) 58 pfam07437 YfaZ YfaZ precursor. 28.4 29 0.00073 12.5 3.9 42 739-781 132-180 (180) 59 PRK07429 phosphoribulokinase; 28.1 29 0.00074 12.5 4.6 71 199-279 108-187 (331) 60 PRK12396 5-methylribose kinase 28.0 29 0.00074 12.5 4.2 37 154-190 13-50 (409) 61 pfam10437 Lip_prot_lig_C Bacte 27.8 29 0.00075 12.4 7.2 12 310-321 52-63 (85) 62 TIGR02582 cas_TM1809 CRISPR-as 26.0 24 0.0006 13.0 0.5 10 657-667 234-243 (248) 63 TIGR03519 Bac_Flav_fam_1 Bacte 25.7 32 0.00081 12.2 11.0 39 740-781 252-292 (292) 64 pfam10566 Glyco_hydro_97 Glyco 25.3 32 0.00083 12.1 2.5 26 694-719 583-608 (643) 65 COG1386 scpB Chromosome segreg 25.0 33 0.00084 12.1 1.1 49 117-170 14-62 (184) 66 TIGR01645 half-pint poly-U bin 24.9 29 0.00073 12.5 0.8 20 4-23 71-91 (791) 67 pfam05158 RNA_pol_Rpc34 RNA po 24.8 33 0.00084 12.1 6.5 151 57-255 24-188 (313) 68 PRK05799 coproporphyrinogen II 24.7 33 0.00085 12.1 8.4 11 203-213 116-126 (374) 69 PRK13484 putative iron-regulat 23.0 36 0.00091 11.9 2.3 31 29-59 11-41 (682) 70 pfam09609 Cas_GSU0054 CRISPR-a 22.7 36 0.00093 11.8 1.1 22 739-760 488-510 (519) 71 TIGR00174 miaA tRNA delta(2)-i 22.1 23 0.00059 13.1 -0.2 22 231-252 67-88 (307) 72 KOG1391 consensus 21.8 38 0.00096 11.7 4.6 32 160-193 192-223 (396) 73 PRK00286 xseA exodeoxyribonucl 21.5 38 0.00098 11.7 7.0 50 74-124 4-62 (443) 74 TIGR01780 SSADH succinic semia 21.4 38 0.00098 11.7 1.6 189 128-338 57-265 (454) 75 COG1570 XseA Exonuclease VII, 21.0 39 0.001 11.6 8.3 51 74-125 4-63 (440) 76 PRK05137 tolB translocation pr 20.8 40 0.001 11.6 9.0 21 78-98 56-77 (437) No 1 >PRK11067 outer membrane protein assembly factor YaeT; Provisional Probab=100.00 E-value=0 Score=1248.33 Aligned_cols=756 Identities=26% Similarity=0.448 Sum_probs=685.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEE Q ss_conf 99999999999999987511465268589996650079889999748789882688999999999996798003899986 Q gi|254780772|r 23 FQMGFIILFYAIFGLSAVYGSNTSIVRRIEIRGATNVGKEVILSRIPVVVGQSISDADLDHAVKNIYAMGYFSNVKIKIV 102 (781) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~I~i~G~~~i~~~~i~~~~~~~~G~~~~~~~i~~~i~~l~~~g~F~~v~~~~~ 102 (781) |+|+.+++.+++|.+..++++++++|++|+|+||++|++++|++++++++|++++++.++++||+||+||||+||+|..+ T Consensus 1 ~~~k~~~~~~~l~~~~~~~~a~~~~I~~I~v~G~~ri~~~ti~~~l~~~~G~~~~~~~i~~~ik~L~~tg~F~dV~v~~~ 80 (801) T PRK11067 1 MAMKKLLIASLLFSSATVYGAEGFVVKDIHFEGLQRVAVGAALLSMPVRVGDTVNDEDISNTIRALFATGNFEDVRVLRD 80 (801) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCEEEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEE T ss_conf 97589999999999999875587299799998973769889996469899992798999999999985899524899971 Q ss_pred CCEEEEEEEECCEEEEEEEECCCCCCHHHHHHHH---HHCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCC Q ss_conf 9989999993653657999565436988998873---0057863799999999999999999729700179987643389 Q gi|254780772|r 103 DSVLIIDLIERKIINHLFFSGNNNLKDDQLKMIV---RSRSAAAYDEDTVNADVHNIKQAYASIGYLNVMVKVQHHSISP 179 (781) Q Consensus 103 ~~~l~~~v~e~p~i~~i~~~Gn~~~~~~~l~~~~---~~~~g~~~~~~~~~~~~~~i~~~y~~~Gy~~a~V~~~~~~~~~ 179 (781) +.+|+|.|+|+|+|++|.|+||+++++++|++.+ ++++|++|++.++++++++|+++|+++||++|+|+++++++++ T Consensus 81 g~~L~~~V~E~P~I~~i~~~Gn~~i~~~~l~~~l~~~~lk~g~~~~~~~l~~~~~~i~~~Y~~~G~~~a~V~~~~~~~~~ 160 (801) T PRK11067 81 GDTLLVQVKERPTIASITFSGNKAVKDDMLKQNLEASGVRVGEALDRTTLSNIEKGLEDFYYSVGKYSASVKAVVTPLPR 160 (801) T ss_pred CCEEEEEEEECCEEEEEEEECCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCEECCEEEEEEEECCC T ss_conf 99999999957738999998874378899998775436465754699999999999999999679212199899987899 Q ss_pred CCEEEEEEEECCCCEEEEEEEEECCCCCCHHHHHHHHHCCCC-CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 842588986036632467787504431101222211100122-1124456664310233334578888887542322100 Q gi|254780772|r 180 TTLNITYVIEEGVKAKINSIRFVGNKNYSHARLERVISIRTS-GYFSFGKTDVYSKERMSFDEEAIRAFYHDRGYAAVKV 258 (781) Q Consensus 180 ~~v~i~~~I~EG~~~~I~~I~f~Gn~~f~~~~L~~~~~~k~~-~~~~~~~~~~~~~~~l~~D~~~l~~~Y~~~GY~d~~V 258 (781) |+|+|+|.|+||++++|++|+|+||++|++++|+++|++++. .||+|++.+.|++++|+.|+++|++||+++||+||+| T Consensus 161 n~v~l~f~I~EG~~~~I~~I~f~GN~~fs~~~L~~~~~~~~~~~~~~~~~~~~~~~~~l~~D~~~L~~fY~~~Gy~d~~V 240 (801) T PRK11067 161 NRVDLKLVFQEGVSAKIQQINIVGNHAFTTDELISRFQLRDEVPWWNVVGDRKYQKQKLAGDLETLRSYYLDRGYARFNI 240 (801) T ss_pred CEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCEEE T ss_conf 86999999950888999999998783247899998862376643357616885498999989999999998718662266 Q ss_pred C-CCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECCCCCCHHHHHCCCCCCCCCCCC Q ss_conf 1-231012455554101220222210000012333444557677775321169962112100000000000023445642 Q gi|254780772|r 259 S-SQVLFDKQKSGYVLIFQIDEGEIYTVGNISIQSTLQEIQKKTLLSLIRIRSGNLYNPQEIKESSEKISKYFFSGERPF 337 (781) Q Consensus 259 ~-~~~~~~~~~~~~~i~~~I~EG~~y~i~~i~i~g~~~~~~~~~l~~~~~~k~G~~~~~~~~~~~~~~i~~~y~~~GY~f 337 (781) . +++.+.++++.+.|+|.|+||+||++++|+|.||.. .+.++|.+++++++|++||+.+++++.++|.+.|.+.||+| T Consensus 241 ~~~~v~~~~dk~~~~I~~~I~EG~~Y~~g~i~~~gn~~-~~~~el~~~~~~~~G~~y~~~~i~~~~~~i~~~~~~~GY~f 319 (801) T PRK11067 241 DSTQVSLTPDKKGIYITVNITEGDQYKLSGVQVSGNLA-GHSAEIEKLTKIEPGELYNGTKVTKMEDDIKKLLGRYGYAY 319 (801) T ss_pred ECCEEEECCCCCEEEEEEEECCCCEEEEEEEEEECCCC-CCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 04425634776269999998689749872379967777-77899998754578863469999999999999998669721 Q ss_pred CCCCCCCCCCCCCCEEEEEEECCCCCCCEEECCCCCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 01123322343440466554123332201100333343200332100100100000001234445666776510123445 Q gi|254780772|r 338 VRVKTRINRDFAKRIVDIEYLIDQGSPLYVKRIEIEGNDQSYDSVIRRELELSEGDPINYSMIERAKRRIMATGYFSEVN 417 (781) Q Consensus 338 a~v~~~~~~d~~~~~vdi~~~I~eg~~~~i~~I~I~GN~~T~d~VIrrel~~~~Gd~~~~~~i~~s~~~L~~lg~F~~V~ 417 (781) |+|.|.++.+.++++|+|+|.|+||+++||++|+|.||++|+|+||||||++.|||+||..+|++|++||.+||||+.|+ T Consensus 320 a~V~~~~~~~~~~~~v~l~~~I~eg~~~yV~~I~I~GN~~T~D~VIRRel~~~eGd~~~~~~i~~S~~rL~~lg~F~~V~ 399 (801) T PRK11067 320 PRVQTQPEINDADKTVKLRVNVDAGNRFYVRKIRFEGNDTSKDSVLRREMRQMEGAWLGSDLVEQGKERLNRLGFFETVD 399 (801) T ss_pred CCCCCCEEECCCCCEEEEEEEECCCCEEEEEEEEEECCCCCHHHHHHHEECCCCCCCCCHHHHHHHHHHHHHCCCCCCCC T ss_conf 00464126747898899999986686279998674277061347510100036765338889999999998558742034 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCEEEEEEECCCCEEEEEEEECCC Q ss_conf 33254435654310233221122111111111223321100000111233323442200378741122111134541123 Q gi|254780772|r 418 ISQLPANDVSDYVILRVSVKQLSAGSVGIATNYEVDKGMGVEGHIDDNNFFGQGYRARLAAGFGRHAVQNYTFSVEDPYF 497 (781) Q Consensus 418 i~~~~~~~~~~~vdl~i~v~E~~tg~~~~g~gys~~~g~~~~~~~~~~Nl~G~G~~l~l~~~~~~~~~~~~~~s~~~P~~ 497 (781) +++.|+++.+++|||.+.|+|++||++++|+|||+.+|+.+.++++++||+|+|++++++++.+... +.++++|++||| T Consensus 400 ~~~~~~~~~~~~vdl~~~V~E~~tG~~~~g~Gyss~~g~~~~~~i~~~Nf~G~Gq~l~l~~~~s~~~-~~~~lsftePyf 478 (801) T PRK11067 400 TDTQRVPGSPDQVDVVYKVKERNTGSFNFGIGYGTESGVSFQAGVQQDNWLGTGNSVGINGTKNDYQ-TYAELSVTDPYF 478 (801) T ss_pred CCCCCCCCCCCEEEEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEECCCCCCCCEEEEEEEECCCE-EEEEEEECCCCC T ss_conf 3343589998738999998986788799998886278828999999667576772999999954326-999999625355 Q ss_pred CCCCCCCCCCCCCCCCCC-----CCCCCCCCCCEEEEEEECCCCEEEEECCCEEEEEEECCCCCCCCCHHHH-------- Q ss_conf 577754310011113333-----3343233320245542025430235404125776401467544201112-------- Q gi|254780772|r 498 LGSPISAGFDLQKTHLED-----GSLDINDESAAVRMIVPITESISTSFKYDLRFLQYGAISEKEKIPSIYT-------- 564 (781) Q Consensus 498 ~~~~~s~~~~~~~~~~~~-----~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------- 564 (781) ++++++++++++++..+. ..|+.++.|+++++++|+.++.++++++++.+.++.........+.... T Consensus 479 ~~~~~s~g~~~f~~~~~~~~~~~~~Y~~~~~G~~~~~g~pi~e~~~~~~g~~y~~~~l~~~~~~~~~~~~~~~~~~~~~~ 558 (801) T PRK11067 479 TVDGVSLGGRIFYNDFEADDADLSDYTNKSYGTDVTLGFPINEYNSLRAGLGYVHNKLSNMQPQVAMWRYLESMGENPST 558 (801) T ss_pred CCCCCEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEEEECCCCEEEEEEEEEEEEECCCCCCHHHHHHHHHCCCCCCCC T ss_conf 77885247888876613566553331358972688997761442589999889998634566203234454313656443 Q ss_pred HHHCCCCCEEEEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCCCCEEEEEEEEEEEEEEECCCEEEE--EEEEEEEEE Q ss_conf 210158502467884321554201100246515654202456555420366663004799935880999--999997898 Q gi|254780772|r 565 TLIEHGKFSSHSISQSIIYNTLDNPIVPRKGMLISSSYDYAGFGGDSQYHRIGSRASYFYLLSDDSDIV--GSLRFGYGC 642 (781) Q Consensus 565 ~~~~~~~~~~~~i~~~l~~dtrD~~~~Pt~G~~~~~~~~~a~~g~d~~f~k~~~~~~~y~~l~~~~~l~--~~~~~g~~~ 642 (781) ............++++|+||+||++++||+|+..++++++|+||||.+|+|+.+++++|+|+.++..++ ++.++|++. T Consensus 559 ~~~~~~~~~~~~~~~~~t~dt~d~~~~Pt~G~~~~ls~~~a~~Ggd~~f~K~~~~~~~Y~pl~~~~~~vl~~~~~~G~~~ 638 (801) T PRK11067 559 SDQDSFKTDDFTFNYGWTYNNLDRGYFPTAGNRVNLTGKVTIPGSDNEYYKVTLDTATYVPLDDDHKWVLLGRGRLGYGD 638 (801) T ss_pred CCCCCEEEEEEEEEEEEEEECCCCCCCCCCCCEEEEEEEECCCCCCCEEEEEEEEEEEEEEECCCCEEEEEEEEEEEEEE T ss_conf 23565034226899988996687773899875699886844567785279999999999991579559999999998740 Q ss_pred EECCCCCCCCHHEEEECCCCCCCCCCCCCCCCC-----------------CCCCCCCCCEEEEEEEEEEEEECCCCCCC- Q ss_conf 516997412301275078971402126861550-----------------37997222147899999999614565654- Q gi|254780772|r 643 VIPSNKNLQLFDQFSVSSNYYLRGFAYKGIGPR-----------------VDKKYAIGGKIYSSASAAVSFPMPLVPER- 704 (781) Q Consensus 643 ~~~~~~~lp~~erF~lGG~~svRGy~~~~iGP~-----------------~~~~~~~GG~~~~~~s~E~r~pl~~~~~~- 704 (781) .++ ++++|++||||+||.++||||++++|||| +..++++||+.++.+++|++||+|++++. T Consensus 639 ~~g-~~~vP~~erFylGG~~svRGf~~~~iGPr~~~~~~~~~~~~~~~~~~~~~d~iGG~~~~~~~~El~fP~p~~~~~~ 717 (801) T PRK11067 639 GLG-GKEMPFYENFYAGGSSTVRGFQSNTIGPKAVYYPGSGSNYDPDGALCKSDDAVGGNAMAVASLELITPTPFISDKY 717 (801) T ss_pred CCC-CCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCHHH T ss_conf 579-8867710008747988647837899498732356655455654555667776555489999999991378766011 Q ss_pred -CCEEEEEEEEEEEEECCCCCCCCCC----C--CCCCCCEEEEEEEEEEEECCCCCEEEEEEEECCCCCCCEEEEEEEEE Q ss_conf -5379999997677415666677544----4--56766424799999997557001589988722677876025899964 Q gi|254780772|r 705 -AGLRGAFFVDSATLYANHVALGADK----L--EGNDSFWRVSTGVEIMWNSPLGMMGVYYGIPLRHREGDKIQQFGFRI 777 (781) Q Consensus 705 -~~l~~~~F~D~G~v~~~~~~~~~~~----~--~~~~~~~r~s~G~Glr~~tP~Gpirld~a~pl~~~~~D~~~~f~Fsi 777 (781) ..+++++|+|+|+||+...+..... . ..++..+|+|+|+|+||.||+||||||||+||+++++|++++|||+| T Consensus 718 ~~~vr~~~F~D~Gnvw~~~~~~~~~~~~~~~~d~~d~~~lR~S~G~Girw~sPiGPLr~d~a~pl~~~~gD~~~~F~Fsi 797 (801) T PRK11067 718 ANSVRTSFFWDMGTVWDTNWDSSAYTGASGYPDYSDPSNIRMSAGIALQWMSPLGPLVFSYAQPFKKYDGDKAEQFQFNI 797 (801) T ss_pred CCEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEECCCCCCCCCEEEEEEEC T ss_conf 57389999998122366655544544555654668768657999799999877257899998668788997422899957 Q ss_pred EECC Q ss_conf 1059 Q gi|254780772|r 778 GNRM 781 (781) Q Consensus 778 G~~f 781 (781) ||+| T Consensus 798 G~~F 801 (801) T PRK11067 798 GKTW 801 (801) T ss_pred CCCC T ss_conf 5769 No 2 >TIGR03303 OM_YaeT outer membrane protein assembly complex, YaeT protein. Members of this protein family are the YaeT protein of the YaeT/YfiO complex for assembling proteins into the outer membrane of Gram-negative bacteria. This protein is similar in sequence and function to a non-essential paralog, YtfM, that is also in the Omp85 family. Members of this family typically have five tandem copies of the surface antigen variable number repeat (pfam07244), followed by an outer membrane beta-barrel domain (pfam01103), while the YtfM family typically has a single pfam07244 repeat. Probab=100.00 E-value=0 Score=1187.42 Aligned_cols=729 Identities=33% Similarity=0.581 Sum_probs=673.3 Q ss_pred EEEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEEEECCEEEEEEEECCC Q ss_conf 26858999665007988999974878988268899999999999679800389998699899999936536579995654 Q gi|254780772|r 46 SIVRRIEIRGATNVGKEVILSRIPVVVGQSISDADLDHAVKNIYAMGYFSNVKIKIVDSVLIIDLIERKIINHLFFSGNN 125 (781) Q Consensus 46 ~~i~~I~i~G~~~i~~~~i~~~~~~~~G~~~~~~~i~~~i~~l~~~g~F~~v~~~~~~~~l~~~v~e~p~i~~i~~~Gn~ 125 (781) ++|++|+|+||+++++++|++.+++++|++++++.++++||+||+||||+||+++.++++|+|+|+|+|+|++|.|+||+ T Consensus 1 ~~I~~I~v~G~~~i~~~~i~~~~~~~~G~~~~~~~i~~~ik~L~~tg~F~dv~v~~~~~~l~i~v~E~p~i~~i~~~G~~ 80 (741) T TIGR03303 1 FVVKDIRVEGLQRVSAGTVLNYLPVKVGDTISDEKIDEAIKALYATGYFEDVKIEREGGVLVIKVKERPIINSIVFSGNK 80 (741) T ss_pred CEEEEEEEECCEECCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEECCEEEEEEEECCC T ss_conf 98989999997165989998636999998459999999999999679900589997299999999967639999999985 Q ss_pred CCCHHHHHHHH-HHCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEEEEEEECCCCEEEEEEEEECC Q ss_conf 36988998873-00578637999999999999999997297001799876433898425889860366324677875044 Q gi|254780772|r 126 NLKDDQLKMIV-RSRSAAAYDEDTVNADVHNIKQAYASIGYLNVMVKVQHHSISPTTLNITYVIEEGVKAKINSIRFVGN 204 (781) Q Consensus 126 ~~~~~~l~~~~-~~~~g~~~~~~~~~~~~~~i~~~y~~~Gy~~a~V~~~~~~~~~~~v~i~~~I~EG~~~~I~~I~f~Gn 204 (781) ++++++|++++ .+++|+++++.++++++++|+++|+++||++|+|++++++.++++|+|+|+|+||++++|++|.|+|| T Consensus 81 ~~~~~~l~~~l~~lk~g~~~~~~~~~~~~~~I~~~Y~~~Gy~~~~V~~~~~~~~~~~v~v~f~I~Eg~~~~I~~I~f~Gn 160 (741) T TIGR03303 81 EIKKEQLKKALVGIKKGEIFDRAKLEKDEKALKEFYRSRGKYNAEVEAKVTPLPRNRVDLEFNIKEGKKAKIKKINFVGN 160 (741) T ss_pred CCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCEEEEEEEEECCCEEEEEEEEEECC T ss_conf 58989999887455789703999999999999999996695204999999988998799999995388589999999887 Q ss_pred CCCCHHHHHHHHHCCCCCCCCC-CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCCCCCCCCCCCEEEEECCCCC Q ss_conf 3110122221110012211244-566643102333345788888875423221001-23101245555410122022221 Q gi|254780772|r 205 KNYSHARLERVISIRTSGYFSF-GKTDVYSKERMSFDEEAIRAFYHDRGYAAVKVS-SQVLFDKQKSGYVLIFQIDEGEI 282 (781) Q Consensus 205 ~~f~~~~L~~~~~~k~~~~~~~-~~~~~~~~~~l~~D~~~l~~~Y~~~GY~d~~V~-~~~~~~~~~~~~~i~~~I~EG~~ 282 (781) +.|++++|+++|++++..||+| ++++.|++++++.|+++|++||+++||+||+|. +.+.++++++.+.|+|.|+||+| T Consensus 161 ~~fs~~~L~~~~~~k~~~~~~~~~~~~~~~~~~l~~D~~~l~~fY~~~Gy~d~~v~~~~v~~~~~~~~~~i~~~I~EG~~ 240 (741) T TIGR03303 161 KAFSDSELRDVFALKESNFLSWFTGDDKYSKQKLEADLELLRSFYLNRGYLDFKVESTQVSITPDKKGVYITYNIKEGEQ 240 (741) T ss_pred CCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCEEECCCCCEEEEEEEECCCCC T ss_conf 42676898877752664120023667833999998799999999976052037983220388378863999999867881 Q ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCC Q ss_conf 00000123334445576777753211699621121000000000000234456420112332234344046655412333 Q gi|254780772|r 283 YTVGNISIQSTLQEIQKKTLLSLIRIRSGNLYNPQEIKESSEKISKYFFSGERPFVRVKTRINRDFAKRIVDIEYLIDQG 362 (781) Q Consensus 283 y~i~~i~i~g~~~~~~~~~l~~~~~~k~G~~~~~~~~~~~~~~i~~~y~~~GY~fa~v~~~~~~d~~~~~vdi~~~I~eg 362 (781) |++++|+|+||.. .+.++|.+++.+++|++|++..++++.++|.+.|.++||+||+|.|+.+.+.++++|+|+|.|+|| T Consensus 241 Y~i~~i~~~Gn~~-~~~~~l~~~~~~~~g~~y~~~~i~~~~~~I~~~y~~~Gy~fa~v~~~~~~~~~~~~v~l~~~I~eg 319 (741) T TIGR03303 241 YKFGEVNLEGDLI-GPDEELKKLVKIKKGEVFNREKVTKIVKAIKDLLGDKGYAFANVNPRPQINDENKTVDLTFKVDPG 319 (741) T ss_pred EEEEEEEEECCCC-CCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEEECCCCEEEEEEEEECC T ss_conf 5667999725777-898999976544688651588999999999999985697438998889970578689899999528 Q ss_pred CCCEEECCCCCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 22011003333432003321001001000000012344456667765101234453325443565431023322112211 Q gi|254780772|r 363 SPLYVKRIEIEGNDQSYDSVIRRELELSEGDPINYSMIERAKRRIMATGYFSEVNISQLPANDVSDYVILRVSVKQLSAG 442 (781) Q Consensus 363 ~~~~i~~I~I~GN~~T~d~VIrrel~~~~Gd~~~~~~i~~s~~~L~~lg~F~~V~i~~~~~~~~~~~vdl~i~v~E~~tg 442 (781) +++||++|.|.||++|+|+||||||.+.|||+||+.++++|+++|++||||++|++.+.|.+ .+++|||.+.|+|++|| T Consensus 320 ~~~~i~~I~I~GN~~T~d~VIrRel~~~~Gd~f~~~~i~~s~~~L~~lg~F~~V~i~~~~~~-~~~~vdl~v~V~Ek~tg 398 (741) T TIGR03303 320 KRVYVRRINISGNTRTRDEVIRRELRQLEGDWYNLSKIKRSKRRLERLGYFEEVNIETVPVG-SPDQVDLNVNVKEQPTG 398 (741) T ss_pred CEEEEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCC-CCCEEEEEEEEEEECCC T ss_conf 71899999998874155375010002488653588999999999986688233101214689-98717899999980277 Q ss_pred CCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCEEEEEEECCCCEEEEEEEECCCCCCCCCCCCCCCCCCCC-----CCC Q ss_conf 111111122332110000011123332344220037874112211113454112357775431001111333-----333 Q gi|254780772|r 443 SVGIATNYEVDKGMGVEGHIDDNNFFGQGYRARLAAGFGRHAVQNYTFSVEDPYFLGSPISAGFDLQKTHLE-----DGS 517 (781) Q Consensus 443 ~~~~g~gys~~~g~~~~~~~~~~Nl~G~G~~l~l~~~~~~~~~~~~~~s~~~P~~~~~~~s~~~~~~~~~~~-----~~~ 517 (781) ++++|+||++.+|+.+.++++++||||+|++++++++.|... +.++++|++||+++++++++++++++..+ ... T Consensus 399 ~~~~g~Gyss~~g~~~~~~~~~~N~~G~G~~l~l~~~~s~~~-~~~~lsf~~P~~~~~~~sl~~~~~~~~~~~~~~~~~~ 477 (741) T TIGR03303 399 SISFGVGYGSSSGLIFNASISERNFLGTGNRLSLSANKSSLS-QSYSLSFTDPYFTDDGVSLGFSIYYSETDRNYKNFSD 477 (741) T ss_pred EEEEEEEECCCCCEEEEEEEEEECCCCCCCEEEEEEEECCCE-EEEEEEEECCCCCCCCCEEEEEEEEEECCCCCCCCCC T ss_conf 899996431468749999875431136660999999974451-9999999886025899507899999872354445466 Q ss_pred CCCCCCCCEEEEEEECCCCEEEEECCCEEEEEEECCCCCCCCCHHHHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCCEE Q ss_conf 43233320245542025430235404125776401467544201112210158502467884321554201100246515 Q gi|254780772|r 518 LDINDESAAVRMIVPITESISTSFKYDLRFLQYGAISEKEKIPSIYTTLIEHGKFSSHSISQSIIYNTLDNPIVPRKGML 597 (781) Q Consensus 518 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~dtrD~~~~Pt~G~~ 597 (781) |+..+.|+.+++++++.++++++++|++.+.++........ +.......+.+.+..++++++++||+||++++||+|++ T Consensus 478 y~~~~~G~~~~lg~~l~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~dtrD~~~~Pt~G~~ 556 (741) T TIGR03303 478 YKTKTYGGSINLGYPITEYLRVSLGYGYEQNKIKDSSDSDS-SYSKFIKEQGGKFIDSSLSYGWSYDTRDSGYFPTSGSI 556 (741) T ss_pred CCEEEEEEEEEEEEECCCCEEEEEEEEEEEEEEECCCCCCH-HHHHHHHHCCCCEEEEEEEEEEEEECCCCCCCCCCCEE T ss_conf 62178879999978825656999999999999871688743-56799986189627899999999963677778986408 Q ss_pred EEEEECCCCCCCCCEEEEEEEEEEEEEEECCCE--EEEEEEEEEEEEEECCCCCCCCHHEEEECCCCCCCCCCCCCCCCC Q ss_conf 654202456555420366663004799935880--999999997898516997412301275078971402126861550 Q gi|254780772|r 598 ISSSYDYAGFGGDSQYHRIGSRASYFYLLSDDS--DIVGSLRFGYGCVIPSNKNLQLFDQFSVSSNYYLRGFAYKGIGPR 675 (781) Q Consensus 598 ~~~~~~~a~~g~d~~f~k~~~~~~~y~~l~~~~--~l~~~~~~g~~~~~~~~~~lp~~erF~lGG~~svRGy~~~~iGP~ 675 (781) ++++.++|+||||.+|+|+.+++++|+|+.+.. ++++++++|+...++ +..+|++||||+||.++||||++++|||+ T Consensus 557 ~~~~~~~a~~ggd~~f~K~~~~~~~y~~l~~~~~~vl~~~~~~G~~~~~~-~~~~p~~erF~lGG~~slRGf~~~~iGP~ 635 (741) T TIGR03303 557 QRLSQEFAGPGGDLKYYKSTYDSEYYIPLSKEDDWVLSLKGRLGYGNGYG-GKDLPFYERFYAGGIGSVRGFESNSIGPR 635 (741) T ss_pred EEEEEEEECCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEEEEEEECCC-CCCCCHHHEEECCCCCCCCCCCCCCCCCC T ss_conf 99998984467651037899999999996667767999999998875378-88688777388589980468678992887 Q ss_pred C--CCCCCCCCEEEEEEEEEEEEECCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCC Q ss_conf 3--79972221478999999996145656545379999997677415666677544456766424799999997557001 Q gi|254780772|r 676 V--DKKYAIGGKIYSSASAAVSFPMPLVPERAGLRGAFFVDSATLYANHVALGADKLEGNDSFWRVSTGVEIMWNSPLGM 753 (781) Q Consensus 676 ~--~~~~~~GG~~~~~~s~E~r~pl~~~~~~~~l~~~~F~D~G~v~~~~~~~~~~~~~~~~~~~r~s~G~Glr~~tP~Gp 753 (781) + ..++++||+.++.+++|+|||+++++...++++++|+|+|+||+...+.... ......+|+|+|+||||.||+|| T Consensus 636 d~~~~~~~~GG~~~~~~~~E~r~Pl~~~~~~~~l~~~~F~D~Gnv~~~~~~~~~~--~~~~~~~r~s~G~Glr~~tPiGP 713 (741) T TIGR03303 636 DINDSGDSIGGNAYATGNVELIFPLPFLPEDNGLRGSVFFDAGNVWGTDQDKAGD--YSDDSSLRASVGVGLRWISPMGP 713 (741) T ss_pred CCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCEEEEEEEEEECCCCCCCCCCCCC--CCCCCCEEEEEEEEEEEECCCCC T ss_conf 4578887546689999999999425888866618999999815635777776666--47868646899999999877216 Q ss_pred EEEEEEEECCCCCCCEEEEEEEEEEECC Q ss_conf 5899887226778760258999641059 Q gi|254780772|r 754 MGVYYGIPLRHREGDKIQQFGFRIGNRM 781 (781) Q Consensus 754 irld~a~pl~~~~~D~~~~f~FsiG~~f 781 (781) |||||||||++.++|..++|||+|||+| T Consensus 714 lrld~a~pl~~~~~d~~~~f~F~iG~~F 741 (741) T TIGR03303 714 LRFSYAKPLKKKKGDKTQSFQFNIGTTF 741 (741) T ss_pred EEEEEEECCCCCCCCCEEEEEEECCCCC T ss_conf 8999997678888986118999585659 No 3 >COG4775 Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=1123.27 Aligned_cols=752 Identities=32% Similarity=0.580 Sum_probs=686.7 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECC Q ss_conf 99999999999998751146526858999665007988999974878988268899999999999679800389998699 Q gi|254780772|r 25 MGFIILFYAIFGLSAVYGSNTSIVRRIEIRGATNVGKEVILSRIPVVVGQSISDADLDHAVKNIYAMGYFSNVKIKIVDS 104 (781) Q Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~i~~I~i~G~~~i~~~~i~~~~~~~~G~~~~~~~i~~~i~~l~~~g~F~~v~~~~~~~ 104 (781) |+..+..++++..+.+.+++..+|++|+|+|+++++.++++..+++++|+.+++++++++||+||+||||+||++..+++ T Consensus 3 ~~~~~~~~~l~~~~~~~~a~~~vv~~I~v~G~~~~~~~~~~~~l~~~~G~~~s~~~i~~~ik~L~~tg~F~~v~~~~~~~ 82 (766) T COG4775 3 MKKLLSALLLFASASAVAAEAFVVKDIRVEGLQRVSSGTALSYLPVRVGDTVSDEDIDEAIKALYATGYFEDVRVEREGG 82 (766) T ss_pred HHHHHHHHHHHHHCCHHHCCCEEEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECCC T ss_conf 57889999987513111033627878999655424512366415735575136778899999997168976137887098 Q ss_pred EEEEEEEECCEEEEEEEECCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEE Q ss_conf 89999993653657999565436988998873005786379999999999999999972970017998764338984258 Q gi|254780772|r 105 VLIIDLIERKIINHLFFSGNNNLKDDQLKMIVRSRSAAAYDEDTVNADVHNIKQAYASIGYLNVMVKVQHHSISPTTLNI 184 (781) Q Consensus 105 ~l~~~v~e~p~i~~i~~~Gn~~~~~~~l~~~~~~~~g~~~~~~~~~~~~~~i~~~y~~~Gy~~a~V~~~~~~~~~~~v~i 184 (781) .|+++|+|+|.|++|.|+||+.+++++|++.+.+++|++|+...++++.++|++.|...||+.++|++++++.++++|+| T Consensus 83 ~Lvv~V~e~p~I~~v~~~Gn~~i~~~~L~~~l~~~~g~~~~~~~l~~~~~~l~e~Y~~~G~~~~~v~~~~~~~~~~~v~v 162 (766) T COG4775 83 TLVVTVKERPVINSVVFSGNKSIKDDALKKNLKIKPGEVLNRAKLEADVEALKEAYLSRGYYSVKVTYKVTPLPRNRVDV 162 (766) T ss_pred EEEEEEECCCCEEEEEEECCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCEECCEEEEEEEECCCCEEEE T ss_conf 79999962772878998687137758988753647787345789998999999988746830434679878779970799 Q ss_pred EEEEECCCCEEEEEEEEECCCCCCHHHHHHHHHCCCCCCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC-CCC Q ss_conf 89860366324677875044311012222111001221124-4566643102333345788888875423221001-231 Q gi|254780772|r 185 TYVIEEGVKAKINSIRFVGNKNYSHARLERVISIRTSGYFS-FGKTDVYSKERMSFDEEAIRAFYHDRGYAAVKVS-SQV 262 (781) Q Consensus 185 ~~~I~EG~~~~I~~I~f~Gn~~f~~~~L~~~~~~k~~~~~~-~~~~~~~~~~~l~~D~~~l~~~Y~~~GY~d~~V~-~~~ 262 (781) .|.|+||++.+|+.|.|+||++|++++|+++|.+++..+|. ++..+.|++++++.|+++|++||+++||+|++|. +++ T Consensus 163 ~f~i~eG~~~~i~~I~~~Gn~~~s~~~l~~~~~~k~~~~~~~~~~~~~y~~~~l~~D~e~lr~~Y~n~Gy~d~~v~s~~~ 242 (766) T COG4775 163 VFVINEGPSAKIKQINFEGNTAFSDSDLESVVQTKESNIWGWLTRLDKYDPDKLEADLETLRSFYLNRGYADFRVSSTQV 242 (766) T ss_pred EEEECCCCCCCCCEEEEECCCCCCCHHHHHHEEECCCEEEEEEECCCCCCHHHHHCCHHHHHHHHHHCCEEEEEEEEEEE T ss_conf 99937997210014687055443601223331520220588871566127666306289999999757628999864130 Q ss_pred CCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECCCCCCHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 01245555410122022221000001233344455767777532116996211210000000000002344564201123 Q gi|254780772|r 263 LFDKQKSGYVLIFQIDEGEIYTVGNISIQSTLQEIQKKTLLSLIRIRSGNLYNPQEIKESSEKISKYFFSGERPFVRVKT 342 (781) Q Consensus 263 ~~~~~~~~~~i~~~I~EG~~y~i~~i~i~g~~~~~~~~~l~~~~~~k~G~~~~~~~~~~~~~~i~~~y~~~GY~fa~v~~ 342 (781) ..+++++.+.|++.|+||+||++++|.++|++. ...++|.+++++++|++|+..+++++.++|.+.|.+.||++++|.| T Consensus 243 ~~~~~k~~~~vt~~V~EG~~y~~~~i~i~g~~~-~~~~el~~~~~~~~g~~fn~~~i~~~~~~I~~~~~~~GY~~a~V~p 321 (766) T COG4775 243 SLDPDKKGVTVTYTVKEGPQYKFGSVLIEGNLA-GVSEELEKLLKVKPGKLFNRKKIEDDADKIKERYARYGYAFANVSP 321 (766) T ss_pred CCCCCCCCEEEEEEECCCCCEEEEEEEEECCCC-CCHHHHHHHHCCCCCCEECHHHHHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 345787637999997448833788899975766-7389987765458895276899888999999999865930368610 Q ss_pred CCCCCCCCCEEEEEEECCCCCCCEEECCCCCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 32234344046655412333220110033334320033210010010000000123444566677651012344533254 Q gi|254780772|r 343 RINRDFAKRIVDIEYLIDQGSPLYVKRIEIEGNDQSYDSVIRRELELSEGDPINYSMIERAKRRIMATGYFSEVNISQLP 422 (781) Q Consensus 343 ~~~~d~~~~~vdi~~~I~eg~~~~i~~I~I~GN~~T~d~VIrrel~~~~Gd~~~~~~i~~s~~~L~~lg~F~~V~i~~~~ 422 (781) ++..|.++++++|.|.|++|+++||++|.|.||.+|+|+||||||+++|||+||+.+++++++||++||||++|+++..| T Consensus 322 ~~~~n~~~~tv~v~~~V~~g~~~~V~~i~i~gn~rT~D~VIrRE~~~~eGd~fn~~~v~~~~~rL~~lgyF~~V~i~~~~ 401 (766) T COG4775 322 QPDANDENKTVDVVFRVDEGDRVYVRRIRIRGNTRTKDYVIRREMRLKEGDVFNRKLVQRGKRRLRRLGYFESVNIDTAP 401 (766) T ss_pred ECCCCCCCCEEEEEEEEECCCCEEEEEEEECCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECC T ss_conf 11338888589999999758833664544527873031776645532773223389999999999846883345788555 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCEEEEEEECCCCEEEEEEEECCCCCCCC Q ss_conf 43565431023322112211111111122332110000011123332344220037874112211113454112357775 Q gi|254780772|r 423 ANDVSDYVILRVSVKQLSAGSVGIATNYEVDKGMGVEGHIDDNNFFGQGYRARLAAGFGRHAVQNYTFSVEDPYFLGSPI 502 (781) Q Consensus 423 ~~~~~~~vdl~i~v~E~~tg~~~~g~gys~~~g~~~~~~~~~~Nl~G~G~~l~l~~~~~~~~~~~~~~s~~~P~~~~~~~ 502 (781) .++.+ +|||+++|+|++||++++|+||++.+|+.+.++++++||+|+|+.+++.++.+.. .+.++++|++|||.++++ T Consensus 402 ~~~~~-~vdvvv~VkE~~Tgsi~~G~Gy~s~~gl~~~~s~~e~N~~G~G~~~~~~~~~g~~-~~~~~l~ft~P~f~~~~~ 479 (766) T COG4775 402 GSGSD-QVDVVVDVKERSTGSINFGLGYGSDSGLSGFASLSERNFLGTGQSLSLNANLGDK-QTSYSLSFTDPYFLDDRV 479 (766) T ss_pred CCCCC-EEEEEEEEEECCCEEEEECCCCCCCCCEEEEEEEEEEECCCCCCEEEEEEEECCC-EEEEEEEEECCCCCCCCC T ss_conf 89997-0799999875672059864300688864899999983047656389999984341-689888875555468882 Q ss_pred CCCCCCCCCCC-----CCC--CCCCCCCCCEEEEEEECCCCEEEEECCCEEEEEEECCCCCCC-CCHHHHHHHCCCCCEE Q ss_conf 43100111133-----333--343233320245542025430235404125776401467544-2011122101585024 Q gi|254780772|r 503 SAGFDLQKTHL-----EDG--SLDINDESAAVRMIVPITESISTSFKYDLRFLQYGAISEKEK-IPSIYTTLIEHGKFSS 574 (781) Q Consensus 503 s~~~~~~~~~~-----~~~--~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 574 (781) +++++++++.+ +.. +|+..+.|+.+++++|++++.+++++|.+.+.+......... .........+.+.... T Consensus 480 slg~~~f~~~~~~~~~~~~~~~y~~~~~G~~~~lg~pi~e~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (766) T COG4775 480 SLGFNLFSNRYDTFDADTANDSYRVKTYGGGVSLGRPITENLSLGLGYGYEQNSLSLLNKSPPDEYASLGVKLQGGKSDL 559 (766) T ss_pred EEEEEEEEEEECCCCCCCCCCCCEEEECCCEEEECCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCEE T ss_conf 36778674221133223444551576104348856632677338999889997861366676654445221324576224 Q ss_pred EEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCCCCEEEEEEEEEEEEEEECCCE-EEEEEEEEEEEEEECCCCCCCCH Q ss_conf 67884321554201100246515654202456555420366663004799935880-99999999789851699741230 Q gi|254780772|r 575 HSISQSIIYNTLDNPIVPRKGMLISSSYDYAGFGGDSQYHRIGSRASYFYLLSDDS-DIVGSLRFGYGCVIPSNKNLQLF 653 (781) Q Consensus 575 ~~i~~~l~~dtrD~~~~Pt~G~~~~~~~~~a~~g~d~~f~k~~~~~~~y~~l~~~~-~l~~~~~~g~~~~~~~~~~lp~~ 653 (781) ++++++|+||+||++++||+|++.+...+++++|||.+|+|+++++++|+|+.+.. ++.++..+|++..++... +|++ T Consensus 560 ~~~s~~~tyD~rD~~~~Pt~G~~~~~~~e~~~~Ggd~~~~K~~~~~~~Y~~l~~~~~~l~~~~~~g~~~~~g~~~-~p~~ 638 (766) T COG4775 560 SSLSLGWTYDTRDNALFPTKGSYLSLTQEVAGLGGDIKYYKLELDGSKYYPLTDYIFTLSLRGEVGYGKGYGTKL-LPIY 638 (766) T ss_pred EEEEEEEEECCCCCCCCCCCCEEEEEEEEEECCCCCCEEEEEEEEEEEEEECCCCCEEEEEEEEEEEEECCCCCC-CCCC T ss_conf 799975888277776799887288434697235776217999998999998354307998777888760458735-4642 Q ss_pred HEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCEEEEEEEEEEEEECCCCCCCCC-CCCC Q ss_conf 127507897140212686155037997222147899999999614565654537999999767741566667754-4456 Q gi|254780772|r 654 DQFSVSSNYYLRGFAYKGIGPRVDKKYAIGGKIYSSASAAVSFPMPLVPERAGLRGAFFVDSATLYANHVALGAD-KLEG 732 (781) Q Consensus 654 erF~lGG~~svRGy~~~~iGP~~~~~~~~GG~~~~~~s~E~r~pl~~~~~~~~l~~~~F~D~G~v~~~~~~~~~~-~~~~ 732 (781) ++|++||.+|||||+.+.+||++..++++||+.++.+|+|++||+|..+.. ++++++|+|+|.+|+........ .... T Consensus 639 e~F~~GG~~svRGf~~~~igp~~~~~~~~GG~~~~~~S~El~fPl~~~~~~-~~r~~lF~D~G~v~~~~~~~~~~~~~~~ 717 (766) T COG4775 639 ERFYLGGSNSVRGFESGGLGPKDGYTDALGGTSYFVASAELRFPLPKVIGS-GVRGALFFDAGSVWNTGTDPSTVRNFYG 717 (766) T ss_pred CCEEECCCCCCCCEECCCCCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCC-CEEEEEEEECCCCCCCCCCCCCCCCCCC T ss_conf 325607975223644488487643455567628999989998106777776-5389999973512366766555557767 Q ss_pred CCCCEEEEEEEEEEEECCCCCEEEEEEEECCCCCCCEEEEEEEEEEECC Q ss_conf 7664247999999975570015899887226778760258999641059 Q gi|254780772|r 733 NDSFWRVSTGVEIMWNSPLGMMGVYYGIPLRHREGDKIQQFGFRIGNRM 781 (781) Q Consensus 733 ~~~~~r~s~G~Glr~~tP~Gpirld~a~pl~~~~~D~~~~f~FsiG~~f 781 (781) ....+|.|+|+||+|.+|+||||||||+|+.+.++|..++|||+||++| T Consensus 718 ~~~~~r~S~Gvgl~w~sP~gPlr~d~a~pi~~~~~D~~q~F~F~iG~~F 766 (766) T COG4775 718 SGSELRASAGVGLRWASPLGPLRFDYAFPIKKKEGDDTQRFQFGLGTRF 766 (766) T ss_pred CCCCEEEEEEEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCCC T ss_conf 6733367762257896687767998611233677875116999704529 No 4 >COG0729 Outer membrane protein [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=443.14 Aligned_cols=506 Identities=18% Similarity=0.267 Sum_probs=375.5 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCC--CCCCCCCCCHHHHHHH---HHCCCCCEE Q ss_conf 45788888875423221001231012455554101220222210000012--3334445576777753---211699621 Q gi|254780772|r 240 DEEAIRAFYHDRGYAAVKVSSQVLFDKQKSGYVLIFQIDEGEIYTVGNIS--IQSTLQEIQKKTLLSL---IRIRSGNLY 314 (781) Q Consensus 240 D~~~l~~~Y~~~GY~d~~V~~~~~~~~~~~~~~i~~~I~EG~~y~i~~i~--i~g~~~~~~~~~l~~~---~~~k~G~~~ 314 (781) +..+...+|...++++..| ...+....+..+|..|++-++..+. +.|. . ...+.+... ...++|..+ T Consensus 77 ~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~v~pg~~~~i~~~~~~~~G~-a-~~d~~~~~~~~~~~~~~G~~~ 148 (594) T COG0729 77 DGLRALGYLQPKGRFGGKV------SVTEKRGKLIAKVTPGEPTPIAVVIVVLPGP-A-FTDGDYRLLGDAAGPKEGEDL 148 (594) T ss_pred HHHHHHCEECCEEEECCCC------EEECCCCCEEEECCCCCCCCCCEEEEECCCC-C-CCCHHHHHHHHHHCCCCCCCC T ss_conf 6776420004503541431------0003455213323899986653256622462-0-037346554343121358735 Q ss_pred CCCCCCHHHHHCCCCCCCCCCCCCCC-CCCCCCCCCCCEEEEEEECCCCCCCEEECCCCCCCCCCCCCCCHHHCCCCCCC Q ss_conf 12100000000000023445642011-23322343440466554123332201100333343200332100100100000 Q gi|254780772|r 315 NPQEIKESSEKISKYFFSGERPFVRV-KTRINRDFAKRIVDIEYLIDQGSPLYVKRIEIEGNDQSYDSVIRRELELSEGD 393 (781) Q Consensus 315 ~~~~~~~~~~~i~~~y~~~GY~fa~v-~~~~~~d~~~~~vdi~~~I~eg~~~~i~~I~I~GN~~T~d~VIrrel~~~~Gd 393 (781) ++..+.+....+.....+.||+.+++ ..+..+|.+.++|++.+.++.|+++.++.|.++|-.+..+..+++-+++.+|+ T Consensus 149 ~~~~~~n~k~~i~~~~~~~gy~~a~~~~~r~~v~~~~~~a~~~~~~dsG~~y~~g~v~~~g~~r~~~~~~~~~~~~~~G~ 228 (594) T COG0729 149 NQGTYENAKSSIVRALLRKGYFLARFTKSRLVVDPATHTADVDLNYDSGRRYRFGPVTVEGSQRIDEEYLQNLVPFKYGL 228 (594) T ss_pred CCCHHHHHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCCEEEEEEEECCCCEEEECCEEECCCEECCHHHHHHCCCCCCCC T ss_conf 65211114678999998728300025403011162315688998633541697676797166035877876303666788 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCC Q ss_conf 0012344456667765101234453325443565-431023322112211111111122332110000011123332344 Q gi|254780772|r 394 PINYSMIERAKRRIMATGYFSEVNISQLPANDVS-DYVILRVSVKQLSAGSVGIATNYEVDKGMGVEGHIDDNNFFGQGY 472 (781) Q Consensus 394 ~~~~~~i~~s~~~L~~lg~F~~V~i~~~~~~~~~-~~vdl~i~v~E~~tg~~~~g~gys~~~g~~~~~~~~~~Nl~G~G~ 472 (781) +|..+++.+..+||.++++|++|.+++...+..+ ..+++.+.+.|++...+.+|+||+++.|+.+.+.|.++|+||+|+ T Consensus 229 ~Y~~~~l~~~~~rl~~~~~F~sv~v~~~~~~~~~~~~~pv~v~~~~rk~~~~~lG~gyst~~G~~~~~~w~~~~~~~~~~ 308 (594) T COG0729 229 PYDPEDLAELNQRLRQTGYFSSVVVQPADKQAGPDGLLPVKVSVSERKENTFELGVGYSTDVGAGLEAGWEKRNLFGRGH 308 (594) T ss_pred CCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCEECEEEEECCCCCCEEEEEEEEECCCCCCCEEEEEEECCCCCCE T ss_conf 67877999999987615850348981275555767603539885246553377646884263565168863422357650 Q ss_pred CCCEEEEEEECCCCEEEEEEEECCCCCCC-CCCCCCCCCCCCCCCCCCCCCCC--CEEEEEEECCCCEEEEECCCEEEEE Q ss_conf 22003787411221111345411235777-54310011113333334323332--0245542025430235404125776 Q gi|254780772|r 473 RARLAAGFGRHAVQNYTFSVEDPYFLGSP-ISAGFDLQKTHLEDGSLDINDES--AAVRMIVPITESISTSFKYDLRFLQ 549 (781) Q Consensus 473 ~l~l~~~~~~~~~~~~~~s~~~P~~~~~~-~s~~~~~~~~~~~~~~y~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (781) ++..+..+++.. |.+..+|..|.++..+ .-...+.+....+...-.....+ .+...++.....+...+..++.... T Consensus 309 ~l~~~~~is~~~-~~~~~~y~~p~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~l~~~~~~~~~~~~~~~~~~r~~~~~ 387 (594) T COG0729 309 SLRIESEISAPL-QTLEATYKAPLLFNPLGQYNPVSAGFDRLDLYDTESDAYTESLSASRGWDASLGWQRSLALRFSYDN 387 (594) T ss_pred EEEEEEEECCCH-HCCCEEEEEECCCCCCCCEECCCCCCCCHHHHCCCCCCCCCEEEEEEHHCCCCCCEEEEEEEEECCC T ss_conf 575412660001-0175379820135876512033223442011014432234313111000003672037888887255 Q ss_pred EECCCCCCCCCHHHHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCCEEEEEEECCC--CCCCCCEEEEEEEEEEEEEEEC Q ss_conf 40146754420111221015850246788432155420110024651565420245--6555420366663004799935 Q gi|254780772|r 550 YGAISEKEKIPSIYTTLIEHGKFSSHSISQSIIYNTLDNPIVPRKGMLISSSYDYA--GFGGDSQYHRIGSRASYFYLLS 627 (781) Q Consensus 550 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~dtrD~~~~Pt~G~~~~~~~~~a--~~g~d~~f~k~~~~~~~y~~l~ 627 (781) .... .......+....+.+++.||+ |+..+||.|+.+.+.++.+ .++|+..|..+.....|+.... T Consensus 388 -~~~~----------~~~~~~~~~~~g~~~~~~~~~-dd~~~Pt~G~~~~~~~~~s~~~~~s~~~~~~~~~~~~~~~~~~ 455 (594) T COG0729 388 -FRQG----------DISGKTSLLTPGVEFSYTRDT-DDGLFPTWGDRLDLTIGPSYEALGSDVDFRLVQARSGYLRTGG 455 (594) T ss_pred -CCCC----------CCCCCCEEEECCEEEEEEECC-CCCCCCCCCCEEEEEEEECCCCCCCCHHHHHHHCCCEEEECCC T ss_conf -4346----------545783577423478999716-8877887650455552001011457257765431350551376 Q ss_pred CCEEEEEEEEEEEEEEECCCCCCCCHHEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCE Q ss_conf 88099999999789851699741230127507897140212686155037997222147899999999614565654537 Q gi|254780772|r 628 DDSDIVGSLRFGYGCVIPSNKNLQLFDQFSVSSNYYLRGFAYKGIGPRVDKKYAIGGKIYSSASAAVSFPMPLVPERAGL 707 (781) Q Consensus 628 ~~~~l~~~~~~g~~~~~~~~~~lp~~erF~lGG~~svRGy~~~~iGP~~~~~~~~GG~~~~~~s~E~r~pl~~~~~~~~l 707 (781) ..+.++.+..+|+... ..-+.+|++.|||+||..|||||.+++|||++.++.++||..++..++|++|++. .++ T Consensus 456 ~~~~~~~R~~~G~il~-~~~~~vPps~rFfaGG~~svRGY~y~sI~p~~~~g~~~Gg~~l~~~s~Ey~~~~t-----~~~ 529 (594) T COG0729 456 ADHRLAGRGELGAILT-GDLDDVPPSLRFFAGGDRSVRGYGYQSIGPQDANGDLLGGRSLVTGSLEYQYLVT-----DPW 529 (594) T ss_pred CCEEEEEECCCCEEEE-CCCCCCCCCEEEECCCCCCEECCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEECC-----CCE T ss_conf 4248887223213850-7500189510464079741515203223676778973264689996279999634-----866 Q ss_pred EEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEECCCCCCCEEEEEEEEEEECC Q ss_conf 99999976774156666775444567664247999999975570015899887226778760258999641059 Q gi|254780772|r 708 RGAFFVDSATLYANHVALGADKLEGNDSFWRVSTGVEIMWNSPLGMMGVYYGIPLRHREGDKIQQFGFRIGNRM 781 (781) Q Consensus 708 ~~~~F~D~G~v~~~~~~~~~~~~~~~~~~~r~s~G~Glr~~tP~Gpirld~a~pl~~~~~D~~~~f~FsiG~~f 781 (781) ++++|+|+|.||++..... +.++.++|+|+||.||+||||||+|+||++.+ |+.-+||++|||.| T Consensus 530 ~~a~FvD~G~a~n~~~~~~--------~d~k~G~Gigvr~~SPvGPIr~DiA~pl~~~~-~~~~~~Yi~iG~~f 594 (594) T COG0729 530 GLAVFVDAGSAGNNFTRLF--------SDLKKGAGIGVRWDSPVGPIRFDIAVPLEDKD-DKKFQFYIGIGQEF 594 (594) T ss_pred EEEEEEECCCCCCCCCHHH--------HHHHCCCEEEEEECCCCCCEEEEEECCCCCCC-CCCEEEEEEECCCC T ss_conf 8999987265335886455--------44202523667654886875777432555567-88716999815669 No 5 >pfam01103 Bac_surface_Ag Surface antigen. This entry includes the following surface antigens; D15 antigen from H.influenzae, OMA87 from P.multocida, OMP85 from N.meningitidis and N.gonorrhoeae. The family also includes a number of eukaryotic proteins that are members of the UPF0140 family. There also appears to be a relationship to pfam03865 (personal obs: C Yeats). In eukaryotes, it appears that these proteins are not surface antigens; S. cerevisiae YNL026W (SAM50) is an essential component of the Sorting and Assembly Machinery (SAM) of the mitochondrial outer membrane. The protein was localized to the mitochondria. Probab=100.00 E-value=0 Score=410.88 Aligned_cols=309 Identities=29% Similarity=0.536 Sum_probs=264.1 Q ss_pred CCCCCCCCCCEEEEEEECCCCEEEEEEEECCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCEEEEEEECCCCEEEEEC Q ss_conf 3332344220037874112211113454112357775431001111333---3334323332024554202543023540 Q gi|254780772|r 466 NFFGQGYRARLAAGFGRHAVQNYTFSVEDPYFLGSPISAGFDLQKTHLE---DGSLDINDESAAVRMIVPITESISTSFK 542 (781) Q Consensus 466 Nl~G~G~~l~l~~~~~~~~~~~~~~s~~~P~~~~~~~s~~~~~~~~~~~---~~~y~~~~~~~~~~~~~~~~~~~~~~~~ 542 (781) ||+|+|++++++++.++. .+.++++|++||+++++++++++++.++.+ ...|+.++.++.+++++++.+...++++ T Consensus 1 N~~G~G~~l~~~~~~~~~-~~~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~g~~~~~~~~~~~~~~~~~~ 79 (317) T pfam01103 1 NLLGRGQSLSVSLSYSQY-EQSFSLSFTDPYFLGDGLSLGYSLFYNTYDEDEYDSYDTKSVGFSLSLGYPISENWSLSLG 79 (317) T ss_pred CCCCCCEEEEEEEEECCC-EEEEEEEEEECCCCCCCCEEEEEEEEEEEECCCCCCCEEEEEEEEEEEEEECCCCCEEEEE T ss_conf 968878799999996755-8999999983776999928999999999985767651688776899999992788447788 Q ss_pred CCEEEEEEECCCCCCCCCHHHHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCCEEEEEEECCCCC--CCCCEEEEEEEEE Q ss_conf 41257764014675442011122101585024678843215542011002465156542024565--5542036666300 Q gi|254780772|r 543 YDLRFLQYGAISEKEKIPSIYTTLIEHGKFSSHSISQSIIYNTLDNPIVPRKGMLISSSYDYAGF--GGDSQYHRIGSRA 620 (781) Q Consensus 543 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~dtrD~~~~Pt~G~~~~~~~~~a~~--g~d~~f~k~~~~~ 620 (781) +.++..++........ ......+.+....++++++++||+||++++|++|+++++..+++++ ++|.+|+|+.+++ T Consensus 80 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~d~rD~~~~Pt~G~~~~~~~~~~~~~~~~d~~f~k~~~~~ 156 (317) T pfam01103 80 LSYSRDKIRDEDGSDN---IRNYASQTGKSLLSSLSFGLSYDTRDNGLFPTRGYYLRFNLEFAGPILGGDVKFLKLTGDA 156 (317) T ss_pred EEEEEEEEEECCCCCC---CHHHHHCCCCEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEECCCCCCCCCEEEEEEEE T ss_conf 9999999982588642---0014422883699999999999656775577621489888898035577876337899999 Q ss_pred EEEEEECCCEEEEEEEEEEEEEE--ECCCCCCCCHHEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEC Q ss_conf 47999358809999999978985--1699741230127507897140212686155037997222147899999999614 Q gi|254780772|r 621 SYFYLLSDDSDIVGSLRFGYGCV--IPSNKNLQLFDQFSVSSNYYLRGFAYKGIGPRVDKKYAIGGKIYSSASAAVSFPM 698 (781) Q Consensus 621 ~~y~~l~~~~~l~~~~~~g~~~~--~~~~~~lp~~erF~lGG~~svRGy~~~~iGP~~~~~~~~GG~~~~~~s~E~r~pl 698 (781) ++|+||.+...++++.+++.+.. ....+.+|+++||++||.++||||++++|||++..++++||+.++.+++|+|+|+ T Consensus 157 ~~y~~l~~~~~~~l~~~~~~g~~~~~~~~~~ip~~~rF~lGG~~svRGf~~~~lgP~~~~~~~~GG~~~~~~~~E~r~pl 236 (317) T pfam01103 157 SWYYPLGEDHDFVLAARVGLGYLLGYGAKDKLPPSERFFAGGSNSVRGYGYGSIGPRDEDGDALGGRSYFTASAELRYPL 236 (317) T ss_pred EEEEECCCCCCEEEEEEEEEEEEECCCCCCCCCHHHEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEC T ss_conf 99999445873899998671187616888857966846138986435623376388766886368738999999999867 Q ss_pred CCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEECCCCCCCEEE-EEEEEE Q ss_conf 5656545379999997677415666677544456766424799999997557001589988722677876025-899964 Q gi|254780772|r 699 PLVPERAGLRGAFFVDSATLYANHVALGADKLEGNDSFWRVSTGVEIMWNSPLGMMGVYYGIPLRHREGDKIQ-QFGFRI 777 (781) Q Consensus 699 ~~~~~~~~l~~~~F~D~G~v~~~~~~~~~~~~~~~~~~~r~s~G~Glr~~tP~Gpirld~a~pl~~~~~D~~~-~f~Fsi 777 (781) +. ...+++++|+|+|+||+...+............+|+|+|+||||.||+||||||||+||+++++|+++ +|||+| T Consensus 237 ~~---~~~~~~~~F~D~G~~~~~~~~~~~~~~~~~~~~~~~s~G~Glr~~tP~gpi~ld~a~pl~~~~~d~~~~~f~F~i 313 (317) T pfam01103 237 PF---IPNLRGALFVDAGNVWNSGGDTPGGSDGVRSKNIRASVGVGLRWDSPFGPLRFDYAIPLKKPDGDRTEFRFYFGI 313 (317) T ss_pred CC---CCCEEEEEEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCEEEEEEC T ss_conf 88---896799999980175045677775444356787779999999997781879999998766777887220899984 Q ss_pred EECC Q ss_conf 1059 Q gi|254780772|r 778 GNRM 781 (781) Q Consensus 778 G~~f 781 (781) |++| T Consensus 314 G~~F 317 (317) T pfam01103 314 GTEF 317 (317) T ss_pred CCCC T ss_conf 0569 No 6 >TIGR03303 OM_YaeT outer membrane protein assembly complex, YaeT protein. Members of this protein family are the YaeT protein of the YaeT/YfiO complex for assembling proteins into the outer membrane of Gram-negative bacteria. This protein is similar in sequence and function to a non-essential paralog, YtfM, that is also in the Omp85 family. Members of this family typically have five tandem copies of the surface antigen variable number repeat (pfam07244), followed by an outer membrane beta-barrel domain (pfam01103), while the YtfM family typically has a single pfam07244 repeat. Probab=100.00 E-value=3.4e-44 Score=306.59 Aligned_cols=308 Identities=22% Similarity=0.384 Sum_probs=261.2 Q ss_pred CCCEEEEEEEEECCCCCCHHHHHHHC-CCCCCCCCCHHHHHHHHHHH---HH-CCCCC-EEEEEEE---CC--EEEEEEE Q ss_conf 46526858999665007988999974-87898826889999999999---96-79800-3899986---99--8999999 Q gi|254780772|r 43 SNTSIVRRIEIRGATNVGKEVILSRI-PVVVGQSISDADLDHAVKNI---YA-MGYFS-NVKIKIV---DS--VLIIDLI 111 (781) Q Consensus 43 ~~~~~i~~I~i~G~~~i~~~~i~~~~-~~~~G~~~~~~~i~~~i~~l---~~-~g~F~-~v~~~~~---~~--~l~~~v~ 111 (781) .+..+|.+|.++||+.++++.++..+ ++++|+.+++..+++++++| |+ -|||. .|+++.+ ++ .++|.|. T Consensus 66 ~E~p~i~~i~~~G~~~~~~~~l~~~l~~lk~g~~~~~~~~~~~~~~I~~~Y~~~Gy~~~~V~~~~~~~~~~~v~v~f~I~ 145 (741) T TIGR03303 66 KERPIINSIVFSGNKEIKKEQLKKALVGIKKGEIFDRAKLEKDEKALKEFYRSRGKYNAEVEAKVTPLPRNRVDLEFNIK 145 (741) T ss_pred EECCEEEEEEEECCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCEEEEEEEEE T ss_conf 96763999999998558989999887455789703999999999999999996695204999999988998799999995 Q ss_pred ECC--EEEEEEEECCCCCCHHHHHHHHHHCCC---------CCCCHHHHHHHHHHHHHHHHHCCCCEEEEEE-EEEE-CC Q ss_conf 365--365799956543698899887300578---------6379999999999999999972970017998-7643-38 Q gi|254780772|r 112 ERK--IINHLFFSGNNNLKDDQLKMIVRSRSA---------AAYDEDTVNADVHNIKQAYASIGYLNVMVKV-QHHS-IS 178 (781) Q Consensus 112 e~p--~i~~i~~~Gn~~~~~~~l~~~~~~~~g---------~~~~~~~~~~~~~~i~~~y~~~Gy~~a~V~~-~~~~-~~ 178 (781) |.+ .|.+|.|+||+.+++++|++.+..++. ..|++..++.|+++|+++|+++||++|+|.. +++- .+ T Consensus 146 Eg~~~~I~~I~f~Gn~~fs~~~L~~~~~~k~~~~~~~~~~~~~~~~~~l~~D~~~l~~fY~~~Gy~d~~v~~~~v~~~~~ 225 (741) T TIGR03303 146 EGKKAKIKKINFVGNKAFSDSELRDVFALKESNFLSWFTGDDKYSKQKLEADLELLRSFYLNRGYLDFKVESTQVSITPD 225 (741) T ss_pred CCCEEEEEEEEEECCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCEEECCC T ss_conf 38858999999988742676898877752664120023667833999998799999999976052037983220388378 Q ss_pred CCCEEEEEEEECCCCEEEEEEEEECCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 98425889860366324677875044311012222111001221124456664310233334578888887542322100 Q gi|254780772|r 179 PTTLNITYVIEEGVKAKINSIRFVGNKNYSHARLERVISIRTSGYFSFGKTDVYSKERMSFDEEAIRAFYHDRGYAAVKV 258 (781) Q Consensus 179 ~~~v~i~~~I~EG~~~~I~~I~f~Gn~~f~~~~L~~~~~~k~~~~~~~~~~~~~~~~~l~~D~~~l~~~Y~~~GY~d~~V 258 (781) ++.+.|+|.|+||++++|.+|.|+||..+++++|++++.+++ ++.|+++.++.++++|+++|.++||..++| T Consensus 226 ~~~~~i~~~I~EG~~Y~i~~i~~~Gn~~~~~~~l~~~~~~~~--------g~~y~~~~i~~~~~~I~~~y~~~Gy~fa~v 297 (741) T TIGR03303 226 KKGVYITYNIKEGEQYKFGEVNLEGDLIGPDEELKKLVKIKK--------GEVFNREKVTKIVKAIKDLLGDKGYAFANV 297 (741) T ss_pred CCEEEEEEEECCCCCEEEEEEEEECCCCCCHHHHHHHHCCCC--------CCCCCHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 863999999867881566799972577789899997654468--------865158899999999999998569743899 Q ss_pred CCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECCCCCCHHHHHCCCCCCCCCCCCC Q ss_conf 12310124555541012202222100000123334445576777753211699621121000000000000234456420 Q gi|254780772|r 259 SSQVLFDKQKSGYVLIFQIDEGEIYTVGNISIQSTLQEIQKKTLLSLIRIRSGNLYNPQEIKESSEKISKYFFSGERPFV 338 (781) Q Consensus 259 ~~~~~~~~~~~~~~i~~~I~EG~~y~i~~i~i~g~~~~~~~~~l~~~~~~k~G~~~~~~~~~~~~~~i~~~y~~~GY~fa 338 (781) .+....+.++..+.|+|.|+||++++|+.|.|.||.. ..+..+++.+.+++|++|+..++++...+|.+. || |. T Consensus 298 ~~~~~~~~~~~~v~l~~~I~eg~~~~i~~I~I~GN~~-T~d~VIrRel~~~~Gd~f~~~~i~~s~~~L~~l----g~-F~ 371 (741) T TIGR03303 298 NPRPQINDENKTVDLTFKVDPGKRVYVRRINISGNTR-TRDEVIRRELRQLEGDWYNLSKIKRSKRRLERL----GY-FE 371 (741) T ss_pred EEEEEEECCCCEEEEEEEEECCCEEEEEEEEEECCCC-CCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHC----CC-CE T ss_conf 8889970578689899999528718999999988741-553750100024886535889999999999866----88-23 Q ss_pred CCCCCCCCCCCCCEEEEEEECCCCCC Q ss_conf 11233223434404665541233322 Q gi|254780772|r 339 RVKTRINRDFAKRIVDIEYLIDQGSP 364 (781) Q Consensus 339 ~v~~~~~~d~~~~~vdi~~~I~eg~~ 364 (781) .|.+.+.+...+..|||.+.|.|.+. T Consensus 372 ~V~i~~~~~~~~~~vdl~v~V~Ek~t 397 (741) T TIGR03303 372 EVNIETVPVGSPDQVDLNVNVKEQPT 397 (741) T ss_pred EEEEEECCCCCCCEEEEEEEEEEECC T ss_conf 31012146899871789999998027 No 7 >TIGR00992 3a0901s03IAP75 chloroplast envelope protein translocase, IAP75 family; InterPro: IPR005689 Chloroplast function requires the import of nuclear encoded proteins from the cytoplasm across the chloroplast double membrane. This is accompished by two protein complexes, the Toc complex located at the outer membrane and the Tic complex loacted at the inner membrane . The Toc complex recognises specific proteins by a cleavable N-terminal sequence and is primarily responsible for translocation through the outer membrane, while the Tic complex translocates the protein through the inner membrane. This entry represents Toc750, a core component of the Toc complex. This protein is deeply embedded in the outer membrane and forms the voltage-dependent translocation channel . Toc75 itself appears to be capable of at least some discrimination between substrate and non-substrate proteins, with recognition based on based on both conformational and electrostatic interactions. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0009707 chloroplast outer membrane. Probab=100.00 E-value=1.4e-45 Score=316.01 Aligned_cols=455 Identities=17% Similarity=0.223 Sum_probs=334.2 Q ss_pred CCCCHHHHHHHHHCCCC-CEECCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCEEECCCC- Q ss_conf 45576777753211699-6211210000000000002344564201123322343440466554123332201100333- Q gi|254780772|r 295 QEIQKKTLLSLIRIRSG-NLYNPQEIKESSEKISKYFFSGERPFVRVKTRINRDFAKRIVDIEYLIDQGSPLYVKRIEI- 372 (781) Q Consensus 295 ~~~~~~~l~~~~~~k~G-~~~~~~~~~~~~~~i~~~y~~~GY~fa~v~~~~~~d~~~~~vdi~~~I~eg~~~~i~~I~I- 372 (781) ..++.+...+.+....+ ...+...|++.+++|+++|.+.||..|+|---++.+.+. .|++.|.||. |.+|.| T Consensus 236 ~lLP~~V~~E~~~~~~~q~~~~~R~LQ~~~~~v~~WY~~~GY~~AQVV~~g~~~~~G---~V~~~v~EG~---~~~l~~~ 309 (768) T TIGR00992 236 CLLPQEVVDEVFGMLYGQGKVSARLLQEIRDRVQEWYHNEGYALAQVVNFGNLNTDG---EVVLEVVEGD---ITNLQIQ 309 (768) T ss_pred CCCCHHHHHHHHHHHHCCCEECHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCC---EEEEEEECCC---EEEEEEE T ss_conf 548736788999875405510067777788888631100575077873467637765---1899996471---2167899 Q ss_pred ---------CCCCCCCCCCCHHHCC--CCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf ---------3432003321001001--00000001234445666776510123445332544356543102332211221 Q gi|254780772|r 373 ---------EGNDQSYDSVIRRELE--LSEGDPINYSMIERAKRRIMATGYFSEVNISQLPANDVSDYVILRVSVKQLSA 441 (781) Q Consensus 373 ---------~GN~~T~d~VIrrel~--~~~Gd~~~~~~i~~s~~~L~~lg~F~~V~i~~~~~~~~~~~vdl~i~v~E~~t 441 (781) +| +|++.||+|||+ |+||++||...-+...+|++.|++|++|.+.+.|....+..+.+.|+|+|..+ T Consensus 310 f~D~~~~~V~G--~T~~~vv~REl~kqL~~G~vFN~~~~~~~L~~~~SLgLF~~~~~~~~P~~~~~~~~IV~~~~~E~~~ 387 (768) T TIGR00992 310 FLDKLGNVVEG--NTRDPVVTRELPKQLKPGDVFNIEAGKTALQNINSLGLFSNIEVNPRPDEKNEGEIIVEIKLKELEQ 387 (768) T ss_pred EECCCCCEECC--CCCCCEEEECCHHHCCCCCEECHHHHHHHHHHHHHHHHHCCCEECCCCCCCCCCEEEEEEEEEECCC T ss_conf 97267874305--5466238523353257884441367888876543221000115546888888862799989985477 Q ss_pred CCCCCCCCC-----------CCCCCCEEEECCCCCCCCCCCCCCCEEEEEEEC----CCCEEEEEEEECCCCCC--CC-- Q ss_conf 111111112-----------233211000001112333234422003787411----22111134541123577--75-- Q gi|254780772|r 442 GSVGIATNY-----------EVDKGMGVEGHIDDNNFFGQGYRARLAAGFGRH----AVQNYTFSVEDPYFLGS--PI-- 502 (781) Q Consensus 442 g~~~~g~gy-----------s~~~g~~~~~~~~~~Nl~G~G~~l~l~~~~~~~----~~~~~~~s~~~P~~~~~--~~-- 502 (781) ++.+....| .+..++.+.+++.+|||-|..+++..+++.+.. ..-.+.++|++||+-+. |. T Consensus 388 ~saevsteWSI~~G~GG~P~~~S~~~~Gti~f~~RNL~G~N~~l~~~~~~~nflnPQ~~L~F~~~y~~Pyl~G~~nP~~~ 467 (768) T TIGR00992 388 KSAEVSTEWSIVPGRGGAPILASSQLGGTITFEERNLQGLNRSLGGSVTVSNFLNPQDDLLFKVEYTDPYLDGVDNPRNR 467 (768) T ss_pred CCEEEEEEEEEECCCCCCCCEECCCCCCEEEEEECCCCCCCCEEEEEEEECCCCCCCHHHEEEEEEECCCCCCCCCCCCC T ss_conf 63033100255148888641000267621677201567756147898887144573113114544326634787774541 Q ss_pred CCCCCCCCCCCC----------------CCCCCCCCCCCEEEEEEECCCCEEEEECCCEEEEE----EECCCCC--CCCC Q ss_conf 431001111333----------------33343233320245542025430235404125776----4014675--4420 Q gi|254780772|r 503 SAGFDLQKTHLE----------------DGSLDINDESAAVRMIVPITESISTSFKYDLRFLQ----YGAISEK--EKIP 560 (781) Q Consensus 503 s~~~~~~~~~~~----------------~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~--~~~~ 560 (781) ++..++|+++.- -.+....+.|..+.+..++.+...+..++.++..- ....... ...+ T Consensus 468 ~~~~~~F~~R~lS~VF~G~~~sirt~~~~~s~~V~R~G~~~~~~~~~~~~~~~~~~l~l~~~~~~~~~~~~~a~G~~~~~ 547 (768) T TIGR00992 468 TYKASLFNSRKLSLVFTGGPGSIRTVNEVPSIWVDRAGVKANITENFARQSKFTYGLVLEEIFTRDESRHINANGQRSLP 547 (768) T ss_pred EEEEEEEECCCEEEEEECCCCCEEEECCCCCCEEEECCCCEEEECCCHHHHCCCCCCEEECCCCCCHHHHHHHCCCCCCC T ss_conf 15662230451015674788522221478986588515202431020233047857255101240011233316852466 Q ss_pred H--------HHHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCCEEEEEEECCCCC-CCCC-EEEEEEEEEEEEEEECC-- Q ss_conf 1--------1122101585024678843215542011002465156542024565-5542-03666630047999358-- Q gi|254780772|r 561 S--------IYTTLIEHGKFSSHSISQSIIYNTLDNPIVPRKGMLISSSYDYAGF-GGDS-QYHRIGSRASYFYLLSD-- 628 (781) Q Consensus 561 ~--------~~~~~~~~~~~~~~~i~~~l~~dtrD~~~~Pt~G~~~~~~~~~a~~-g~d~-~f~k~~~~~~~y~~l~~-- 628 (781) + ....+.+.|--....+.+.++.|++++..-|+.|.+..+.++.+.+ |+.. =|.|+.+..++|.++.+ T Consensus 548 s~~~~G~~~~P~~lS~~GvD~l~~L~~~~~~DN~~~~~~~~~G~~~~F~~~Q~lgvG~~~~~~~R~~~~~~~fi~~~~~~ 627 (768) T TIGR00992 548 SGPLSGADGPPLTLSGTGVDRLAFLQANITRDNTKFVNGPVVGSRVRFQVDQGLGVGSGFIFFNRLQLTYTKFIQLNEVE 627 (768) T ss_pred CCCCCCCCCCCCEECCCCHHHHHHHHCCCCHHHHHHHCCCCCCCEEEEEEECCCCCCCCCHHHEEECCCEEEEEEEEEEE T ss_conf 66546778887100334312010010355301234202341121256643125455767501000101245666634430 Q ss_pred -----------CEEEEEEEEEEEEEEECCCCCCCCHHEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEE Q ss_conf -----------809999999978985169974123012750789714021268615503799722214789999999961 Q gi|254780772|r 629 -----------DSDIVGSLRFGYGCVIPSNKNLQLFDQFSVSSNYYLRGFAYKGIGPRVDKKYAIGGKIYSSASAAVSFP 697 (781) Q Consensus 629 -----------~~~l~~~~~~g~~~~~~~~~~lp~~erF~lGG~~svRGy~~~~iGP~~~~~~~~GG~~~~~~s~E~r~p 697 (781) ..+++++.+. |... .+||.+|-|.|||++|||||..+.||- |+..+...+|+|+| T Consensus 628 ~~GfGl~~~~~P~~~~l~~~~--G~~~---GDLP~Y~AF~LGG~~SVRGY~~GElG~---------aR~~~~~~AE~R~P 693 (768) T TIGR00992 628 LGGFGLAGKSPPPVLVLHGHY--GGVV---GDLPSYDAFILGGPYSVRGYNMGELGA---------ARNILEVTAEIRIP 693 (768) T ss_pred CCCCCCCCCCCCCEEEEECCC--CCEE---CCCCCCCCEEECCCCCCCCCCCCCCCH---------HHHHHHHCCCCCCC T ss_conf 466111136889889995232--4154---478872314406873004445544320---------23366530310144 Q ss_pred CCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCC-CCCCCEEEEEEEEEEEECCCCCEEEEEEEECCCCCCCEEEEEEEE Q ss_conf 4565654537999999767741566667754445-676642479999999755700158998872267787602589996 Q gi|254780772|r 698 MPLVPERAGLRGAFFVDSATLYANHVALGADKLE-GNDSFWRVSTGVEIMWNSPLGMMGVYYGIPLRHREGDKIQQFGFR 776 (781) Q Consensus 698 l~~~~~~~~l~~~~F~D~G~v~~~~~~~~~~~~~-~~~~~~r~s~G~Glr~~tP~Gpirld~a~pl~~~~~D~~~~f~Fs 776 (781) +. ....+++|+|.|+--++..+....... .....--.|+|+|+|..+|.+|+++|.|+ .++=+++..|. T Consensus 694 i~-----~~~G~~~Fv~~G~~LGS~~~V~G~P~~V~~~~G~G~~YG~Gv~l~~~~a~~~vD~Gf-----t~~G~~~~~F~ 763 (768) T TIGR00992 694 IK-----ATHGVYAFVEYGSDLGSSKDVKGNPTIVYRRPGQGSSYGAGVKLGLVRAEYAVDLGF-----TNEGTGAIFFR 763 (768) T ss_pred EE-----ECCEEEEEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCEEEEECCCCCEEEECCC-----CCCCCEEEEEC T ss_conf 11-----033047888415467867766888628863678862112203860431224662341-----56774378723 Q ss_pred EEECC Q ss_conf 41059 Q gi|254780772|r 777 IGNRM 781 (781) Q Consensus 777 iG~~f 781 (781) ||-|| T Consensus 764 ~GE~f 768 (768) T TIGR00992 764 FGERF 768 (768) T ss_pred CCCCC T ss_conf 57769 No 8 >PRK11067 outer membrane protein assembly factor YaeT; Provisional Probab=100.00 E-value=8.9e-43 Score=297.26 Aligned_cols=312 Identities=19% Similarity=0.296 Sum_probs=254.5 Q ss_pred EEEEEEEEECCCCCCHHHHHHHC---CCCCCCCCCHHHHHHH---HHHHHHCCCCC--EEEEEEE---CC--EEEEEEEE Q ss_conf 26858999665007988999974---8789882688999999---99999679800--3899986---99--89999993 Q gi|254780772|r 46 SIVRRIEIRGATNVGKEVILSRI---PVVVGQSISDADLDHA---VKNIYAMGYFS--NVKIKIV---DS--VLIIDLIE 112 (781) Q Consensus 46 ~~i~~I~i~G~~~i~~~~i~~~~---~~~~G~~~~~~~i~~~---i~~l~~~g~F~--~v~~~~~---~~--~l~~~v~e 112 (781) .+|++|.++||+.++++.+...+ ++++|+.+++..++++ |+++|...=+. .|++... ++ .|+|.|.| T Consensus 92 P~I~~i~~~Gn~~i~~~~l~~~l~~~~lk~g~~~~~~~l~~~~~~i~~~Y~~~G~~~a~V~~~~~~~~~n~v~l~f~I~E 171 (801) T PRK11067 92 PTIASITFSGNKAVKDDMLKQNLEASGVRVGEALDRTTLSNIEKGLEDFYYSVGKYSASVKAVVTPLPRNRVDLKLVFQE 171 (801) T ss_pred CEEEEEEEECCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCEECCEEEEEEEECCCCEEEEEEEEEE T ss_conf 73899999887437889999877543646575469999999999999999967921219989998789986999999950 Q ss_pred CCE--EEEEEEECCCCCCHHHHHHHHHHCC---------CCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEE-EEEEC-CC Q ss_conf 653--6579995654369889988730057---------86379999999999999999972970017998-76433-89 Q gi|254780772|r 113 RKI--INHLFFSGNNNLKDDQLKMIVRSRS---------AAAYDEDTVNADVHNIKQAYASIGYLNVMVKV-QHHSI-SP 179 (781) Q Consensus 113 ~p~--i~~i~~~Gn~~~~~~~l~~~~~~~~---------g~~~~~~~~~~~~~~i~~~y~~~Gy~~a~V~~-~~~~~-~~ 179 (781) .+. |.+|.|.||+.+++++|++.+.+++ ++.|++..++.|+++|+++|.++||.+++|.. ++.-. ++ T Consensus 172 G~~~~I~~I~f~GN~~fs~~~L~~~~~~~~~~~~~~~~~~~~~~~~~l~~D~~~L~~fY~~~Gy~d~~V~~~~v~~~~dk 251 (801) T PRK11067 172 GVSAKIQQINIVGNHAFTTDELISRFQLRDEVPWWNVVGDRKYQKQKLAGDLETLRSYYLDRGYARFNIDSTQVSLTPDK 251 (801) T ss_pred CCEEEEEEEEEECCCCCCHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEECCEEEECCCC T ss_conf 88899999999878324789999886237664335761688549899998999999999871866226604425634776 Q ss_pred CCEEEEEEEECCCCEEEEEEEEECCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 84258898603663246778750443110122221110012211244566643102333345788888875423221001 Q gi|254780772|r 180 TTLNITYVIEEGVKAKINSIRFVGNKNYSHARLERVISIRTSGYFSFGKTDVYSKERMSFDEEAIRAFYHDRGYAAVKVS 259 (781) Q Consensus 180 ~~v~i~~~I~EG~~~~I~~I~f~Gn~~f~~~~L~~~~~~k~~~~~~~~~~~~~~~~~l~~D~~~l~~~Y~~~GY~d~~V~ 259 (781) ..+.|+|.|+||+++++.+|.|+||..+++.+|++.+..++ ++.|++.+++.+.+.|+++|.++||..|+|. T Consensus 252 ~~~~I~~~I~EG~~Y~~g~i~~~gn~~~~~~el~~~~~~~~--------G~~y~~~~i~~~~~~i~~~~~~~GY~fa~V~ 323 (801) T PRK11067 252 KGIYITVNITEGDQYKLSGVQVSGNLAGHSAEIEKLTKIEP--------GELYNGTKVTKMEDDIKKLLGRYGYAYPRVQ 323 (801) T ss_pred CEEEEEEEECCCCEEEEEEEEEECCCCCCHHHHHHHHCCCC--------CCCCCHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 26999999868974987237996777777899998754578--------8634699999999999999986697210046 Q ss_pred CCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECCCCCCHHHHHCCCCCCCCCCCCCC Q ss_conf 23101245555410122022221000001233344455767777532116996211210000000000002344564201 Q gi|254780772|r 260 SQVLFDKQKSGYVLIFQIDEGEIYTVGNISIQSTLQEIQKKTLLSLIRIRSGNLYNPQEIKESSEKISKYFFSGERPFVR 339 (781) Q Consensus 260 ~~~~~~~~~~~~~i~~~I~EG~~y~i~~i~i~g~~~~~~~~~l~~~~~~k~G~~~~~~~~~~~~~~i~~~y~~~GY~fa~ 339 (781) +....+++.+.+.|+|.|+||++++|+.|.|.||.. ..+..+++.+.+.+|++|+..++++..++|.+. || |-. T Consensus 324 ~~~~~~~~~~~v~l~~~I~eg~~~yV~~I~I~GN~~-T~D~VIRRel~~~eGd~~~~~~i~~S~~rL~~l----g~-F~~ 397 (801) T PRK11067 324 TQPEINDADKTVKLRVNVDAGNRFYVRKIRFEGNDT-SKDSVLRREMRQMEGAWLGSDLVEQGKERLNRL----GF-FET 397 (801) T ss_pred CCEEECCCCCEEEEEEEECCCCEEEEEEEEEECCCC-CHHHHHHHEECCCCCCCCCHHHHHHHHHHHHHC----CC-CCC T ss_conf 412674789889999998668627999867427706-134751010003676533888999999999855----87-420 Q ss_pred CCCCCCC-CCCCCEEEEEEECCCCCCCEEECCCCCC Q ss_conf 1233223-4344046655412333220110033334 Q gi|254780772|r 340 VKTRINR-DFAKRIVDIEYLIDQGSPLYVKRIEIEG 374 (781) Q Consensus 340 v~~~~~~-d~~~~~vdi~~~I~eg~~~~i~~I~I~G 374 (781) |.+.+.+ ...+..|||.+.|.|.+. +.+.+-+ T Consensus 398 V~~~~~~~~~~~~~vdl~~~V~E~~t---G~~~~g~ 430 (801) T PRK11067 398 VDTDTQRVPGSPDQVDVVYKVKERNT---GSFNFGI 430 (801) T ss_pred CCCCCCCCCCCCCEEEEEEEEEECCC---CEEEEEE T ss_conf 34334358999873899999898678---8799998 No 9 >KOG2602 consensus Probab=100.00 E-value=5.6e-45 Score=311.63 Aligned_cols=406 Identities=22% Similarity=0.339 Sum_probs=312.6 Q ss_pred EEECCCCCCCCCCCCCCCHHHCCCC-CCCHH-H-HHHHHHHHHHHHHHHCCCCCCCCCCC---CCCCCCCCCCCCCCCCC Q ss_conf 1100333343200332100100100-00000-1-23444566677651012344533254---43565431023322112 Q gi|254780772|r 366 YVKRIEIEGNDQSYDSVIRRELELS-EGDPI-N-YSMIERAKRRIMATGYFSEVNISQLP---ANDVSDYVILRVSVKQL 439 (781) Q Consensus 366 ~i~~I~I~GN~~T~d~VIrrel~~~-~Gd~~-~-~~~i~~s~~~L~~lg~F~~V~i~~~~---~~~~~~~vdl~i~v~E~ 439 (781) .|..|+|.|+.+|+|.+|.+++..- ....+ + ......+..+|..+++|++|++.... .+..|+..+|.+.|.|. T Consensus 37 ~v~~V~v~Gl~~Tkdd~I~k~v~~~~ka~nl~el~~~s~~a~~nL~~l~iF~~v~~~iD~~~g~~~~p~~~~Vt~~v~E~ 116 (457) T KOG2602 37 RVDHVIVSGLERTKDDFIMKEVDLVFKAKNLQELLLASHEAASNLRALGIFDSVNILIDTKEGSDALPGGLVVTFLVTEP 116 (457) T ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCEEEEEEEEEC T ss_conf 13589851310153278999866666300599999878999988988452100116602455787789851689996525 Q ss_pred CCCCCCCCCCCCCCCC---CEEEECCCCCCCCCCCCCCCEEEEEEECCCCEEEEEEEECCCCCCCCCCCCCCCCC---CC Q ss_conf 2111111111223321---10000011123332344220037874112211113454112357775431001111---33 Q gi|254780772|r 440 SAGSVGIATNYEVDKG---MGVEGHIDDNNFFGQGYRARLAAGFGRHAVQNYTFSVEDPYFLGSPISAGFDLQKT---HL 513 (781) Q Consensus 440 ~tg~~~~g~gys~~~g---~~~~~~~~~~Nl~G~G~~l~l~~~~~~~~~~~~~~s~~~P~~~~~~~s~~~~~~~~---~~ 513 (781) +. +....| |+.| -.+..++..+|++|+|+.++.+.+++......+.++|..|.+-+....+....+.. .. T Consensus 117 kr--~tg~~G--T~~gn~~~s~~~~l~~~ni~G~ge~l~~~~sy~~~~~~~~~l~f~~P~~~~~~~~~~~~~~~~~~d~~ 192 (457) T KOG2602 117 KR--LTGSTG--TDVGNREGSVELNLKLPNILGRGENLSGQVSYGCTRSTDMGLSFYKPRFHGLKTPFSSFSIFRTQDWM 192 (457) T ss_pred CH--HEEECC--CCCCCCCCCCCHHCCCCCCCCCCEEEEEEEEEECCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHCCC T ss_conf 20--102103--11268776501010465445751441006887204675242578710357877640434434221036 Q ss_pred CCCCCCCCCCCCEEEEEEECCCCEEEEECCCEEEEEEECCCCCCCCCHHHHHHHCCCCCEEEEEEEEECCCCCCCCCCCC Q ss_conf 33334323332024554202543023540412577640146754420111221015850246788432155420110024 Q gi|254780772|r 514 EDGSLDINDESAAVRMIVPITESISTSFKYDLRFLQYGAISEKEKIPSIYTTLIEHGKFSSHSISQSIIYNTLDNPIVPR 593 (781) Q Consensus 514 ~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~dtrD~~~~Pt 593 (781) ....++....+..+.+..+....+.+.+.|..-+-+....+.. .......+.|....+++.+++.+|+||..+.|| T Consensus 193 ~~ss~d~~~~g~sL~l~~~~~~~l~H~l~~~~~WR~l~~~sq~----as~svR~qaG~~L~s~L~~tfv~D~Rd~~~~pt 268 (457) T KOG2602 193 KWSSLDETHRGASLELSAEDWFLLFHELKYEIAWRNLGDLSQG----ASFSVRRQAGHSLKSSLSYTFVFDKRDSSIIPT 268 (457) T ss_pred CCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHCCCCCC----CCHHHHHHHHHHHHHHHCCEEEEECCCCCCCCC T ss_conf 5642355665228987324455346799888888761243334----568889875577887410069996267755665 Q ss_pred CCEEEEEEECCCCCCCCCEEEEEEEEEEEEEEECCCEEEEEEEEEEEEEEECCCCCCCCHHEEEECCCCCCCCCCCCCCC Q ss_conf 65156542024565554203666630047999358809999999978985169974123012750789714021268615 Q gi|254780772|r 594 KGMLISSSYDYAGFGGDSQYHRIGSRASYFYLLSDDSDIVGSLRFGYGCVIPSNKNLQLFDQFSVSSNYYLRGFAYKGIG 673 (781) Q Consensus 594 ~G~~~~~~~~~a~~g~d~~f~k~~~~~~~y~~l~~~~~l~~~~~~g~~~~~~~~~~lp~~erF~lGG~~svRGy~~~~iG 673 (781) +|.++....++++++++..|.|.+++++.-.|+.-..++.++++.|+..+.++ ..+++.||||+||+.+||||..++|| T Consensus 269 rG~~~k~~~e~~g~~~~~sf~k~e~~lq~avpl~~~~~~s~~~~~G~l~~~~~-~~v~I~dRFylGGps~lRGFk~~giG 347 (457) T KOG2602 269 RGILLKSTSELAGLFGDVSFLKSELDLQKAVPLGFDATLSFSFSGGVLKPLGS-RPVSIADRFYLGGPSDLRGFKTRGIG 347 (457) T ss_pred CCCEEEEEHHHCCCCCCHHHHHHHHHHEECCCCCCCEEEEEEEEEEEEECCCC-CCCCHHHHHCCCCCCCCCCCCCCCCC T ss_conf 65314760432476554023223111101243134303676676206615898-76435465302896321331267758 Q ss_pred CCCCCCCCCCCEEEEEEEEEEEEECCCCCCC-CCEEEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEEEECC-C Q ss_conf 5037997222147899999999614565654-5379999997677415666677544456766424799999997557-0 Q gi|254780772|r 674 PRVDKKYAIGGKIYSSASAAVSFPMPLVPER-AGLRGAFFVDSATLYANHVALGADKLEGNDSFWRVSTGVEIMWNSP-L 751 (781) Q Consensus 674 P~~~~~~~~GG~~~~~~s~E~r~pl~~~~~~-~~l~~~~F~D~G~v~~~~~~~~~~~~~~~~~~~r~s~G~Glr~~tP-~ 751 (781) |++. ++++||++++.+.+.+-+|+|+.+.. ..+++.+|+.+|++-+-...............+|+|+|+||.+.+| + T Consensus 348 P~~~-~~~lGG~a~~a~~~~L~~~lP~~~~~~~~fr~h~F~naG~l~~l~s~~~~~~~~kl~e~~r~S~G~Glv~~~~~~ 426 (457) T KOG2602 348 PKDE-GDFLGGDAFVAAGAHLYFPLPFAKGFKSGFRGHVFANAGNLANLSSEAPKNTIPKLLETFRTSVGAGLVVPTPRA 426 (457) T ss_pred CCCC-CCCCCCCEEEEEEEEECCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCHH T ss_conf 7877-765477036675017402477654322451457774046512467654331035555400100023268752413 Q ss_pred CCEEEEEEEECCCCCCCEE-EEEEEEEEECC Q ss_conf 0158998872267787602-58999641059 Q gi|254780772|r 752 GMMGVYYGIPLRHREGDKI-QQFGFRIGNRM 781 (781) Q Consensus 752 Gpirld~a~pl~~~~~D~~-~~f~FsiG~~f 781 (781) .-+.|.|..|+.+.++|.. +.|||.+|..| T Consensus 427 aR~ELNy~~pl~~~e~D~~~~GfqfG~g~~F 457 (457) T KOG2602 427 ARLELNYCLPLKKQENDRIRKGFQFGAGIDF 457 (457) T ss_pred HCEEEEEEEEEEEECCCHHHCCEEECCCCCC T ss_conf 2045555777555036323241474131469 No 10 >COG4775 Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=4.1e-37 Score=260.17 Aligned_cols=307 Identities=21% Similarity=0.427 Sum_probs=259.5 Q ss_pred CCEEEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHH---HH-CCCCC-EEEEEEE----CC-EEEEEEEEC Q ss_conf 652685899966500798899997487898826889999999999---96-79800-3899986----99-899999936 Q gi|254780772|r 44 NTSIVRRIEIRGATNVGKEVILSRIPVVVGQSISDADLDHAVKNI---YA-MGYFS-NVKIKIV----DS-VLIIDLIER 113 (781) Q Consensus 44 ~~~~i~~I~i~G~~~i~~~~i~~~~~~~~G~~~~~~~i~~~i~~l---~~-~g~F~-~v~~~~~----~~-~l~~~v~e~ 113 (781) +..+|++|.++||+.++++.++..+.+++|+.++...+++++++| |. .|+|. .|+...+ +. .+.|.+.|. T Consensus 90 e~p~I~~v~~~Gn~~i~~~~L~~~l~~~~g~~~~~~~l~~~~~~l~e~Y~~~G~~~~~v~~~~~~~~~~~v~v~f~i~eG 169 (766) T COG4775 90 ERPVINSVVFSGNKSIKDDALKKNLKIKPGEVLNRAKLEADVEALKEAYLSRGYYSVKVTYKVTPLPRNRVDVVFVINEG 169 (766) T ss_pred CCCCEEEEEEECCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCEECCEEEEEEEECCCCEEEEEEEECCC T ss_conf 27728789986871377589887536477873457899989999999887468304346798787799707999993799 Q ss_pred C--EEEEEEEECCCCCCHHHHHHHHHHCCCC---------CCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEEC--CCC Q ss_conf 5--3657999565436988998873005786---------37999999999999999997297001799876433--898 Q gi|254780772|r 114 K--IINHLFFSGNNNLKDDQLKMIVRSRSAA---------AYDEDTVNADVHNIKQAYASIGYLNVMVKVQHHSI--SPT 180 (781) Q Consensus 114 p--~i~~i~~~Gn~~~~~~~l~~~~~~~~g~---------~~~~~~~~~~~~~i~~~y~~~Gy~~a~V~~~~~~~--~~~ 180 (781) | .|..|.|.||+++++.+|++.+.+++.. .|++.+++.+...|+.+|++.||.++.|....... +++ T Consensus 170 ~~~~i~~I~~~Gn~~~s~~~l~~~~~~k~~~~~~~~~~~~~y~~~~l~~D~e~lr~~Y~n~Gy~d~~v~s~~~~~~~~k~ 249 (766) T COG4775 170 PSAKIKQINFEGNTAFSDSDLESVVQTKESNIWGWLTRLDKYDPDKLEADLETLRSFYLNRGYADFRVSSTQVSLDPDKK 249 (766) T ss_pred CCCCCCEEEEECCCCCCCHHHHHHEEECCCEEEEEEECCCCCCHHHHHCCHHHHHHHHHHCCEEEEEEEEEEECCCCCCC T ss_conf 72100146870554436012233315202205888715661276663062899999997576289998641303457876 Q ss_pred CEEEEEEEECCCCEEEEEEEEECCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 42588986036632467787504431101222211100122112445666431023333457888888754232210012 Q gi|254780772|r 181 TLNITYVIEEGVKAKINSIRFVGNKNYSHARLERVISIRTSGYFSFGKTDVYSKERMSFDEEAIRAFYHDRGYAAVKVSS 260 (781) Q Consensus 181 ~v~i~~~I~EG~~~~I~~I~f~Gn~~f~~~~L~~~~~~k~~~~~~~~~~~~~~~~~l~~D~~~l~~~Y~~~GY~d~~V~~ 260 (781) .+.|++.|+||+++++.+|.++||..++..+|.+.+..++. ..|+.+.++.+.++|.++|.+.||..++|.+ T Consensus 250 ~~~vt~~V~EG~~y~~~~i~i~g~~~~~~~el~~~~~~~~g--------~~fn~~~i~~~~~~I~~~~~~~GY~~a~V~p 321 (766) T COG4775 250 GVTVTYTVKEGPQYKFGSVLIEGNLAGVSEELEKLLKVKPG--------KLFNRKKIEDDADKIKERYARYGYAFANVSP 321 (766) T ss_pred CEEEEEEECCCCCEEEEEEEEECCCCCCHHHHHHHHCCCCC--------CEECHHHHHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 37999997448833788899975766738998776545889--------5276899888999999999865930368610 Q ss_pred CCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECCCCCCHHHHHCCCCCCCCCCCCCCC Q ss_conf 31012455554101220222210000012333444557677775321169962112100000000000023445642011 Q gi|254780772|r 261 QVLFDKQKSGYVLIFQIDEGEIYTVGNISIQSTLQEIQKKTLLSLIRIRSGNLYNPQEIKESSEKISKYFFSGERPFVRV 340 (781) Q Consensus 261 ~~~~~~~~~~~~i~~~I~EG~~y~i~~i~i~g~~~~~~~~~l~~~~~~k~G~~~~~~~~~~~~~~i~~~y~~~GY~fa~v 340 (781) +...+.+...+.|.+.|.||++++|..|.|.||.. ..+..++..+++++|++||..+++.++.+|.+. || |.+| T Consensus 322 ~~~~n~~~~tv~v~~~V~~g~~~~V~~i~i~gn~r-T~D~VIrRE~~~~eGd~fn~~~v~~~~~rL~~l----gy-F~~V 395 (766) T COG4775 322 QPDANDENKTVDVVFRVDEGDRVYVRRIRIRGNTR-TKDYVIRREMRLKEGDVFNRKLVQRGKRRLRRL----GY-FESV 395 (766) T ss_pred ECCCCCCCCEEEEEEEEECCCCEEEEEEEECCCCC-CCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHC----CC-CEEE T ss_conf 11338888589999999758833664544527873-031776645532773223389999999999846----88-3345 Q ss_pred CCCCCCCCCCCEEEEEEECCCCCC Q ss_conf 233223434404665541233322 Q gi|254780772|r 341 KTRINRDFAKRIVDIEYLIDQGSP 364 (781) Q Consensus 341 ~~~~~~d~~~~~vdi~~~I~eg~~ 364 (781) ++...+......|||.+.|.|+.- T Consensus 396 ~i~~~~~~~~~~vdvvv~VkE~~T 419 (766) T COG4775 396 NIDTAPGSGSDQVDVVVDVKERST 419 (766) T ss_pred EEEECCCCCCCEEEEEEEEEECCC T ss_conf 788555899970799999875672 No 11 >COG2831 FhaC Hemolysin activation/secretion protein [Intracellular trafficking and secretion] Probab=100.00 E-value=1.6e-38 Score=269.36 Aligned_cols=463 Identities=16% Similarity=0.203 Sum_probs=325.0 Q ss_pred ECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 02222100000123334445576777753211699621121000000000000234456420112332234344046655 Q gi|254780772|r 277 IDEGEIYTVGNISIQSTLQEIQKKTLLSLIRIRSGNLYNPQEIKESSEKISKYFFSGERPFVRVKTRINRDFAKRIVDIE 356 (781) Q Consensus 277 I~EG~~y~i~~i~i~g~~~~~~~~~l~~~~~~k~G~~~~~~~~~~~~~~i~~~y~~~GY~fa~v~~~~~~d~~~~~vdi~ 356 (781) -.|.+++.|..|.++|+ ...+..+|.+++....|.......++....+|++.|.++||+.++|... .++.+++++ . T Consensus 67 ~~E~~~f~I~~i~l~g~-~~~~~~~l~~~~~~~~g~~l~~~~l~~l~~~lt~~y~~~GYvtsra~lp-~Q~i~~G~l--~ 142 (554) T COG2831 67 PDETPCFPINRIELEGN-TLLDALELQAILAPYLGRCLGLADLNQLASALTNLYIDRGYVTTRALLP-PQDLKSGTL--R 142 (554) T ss_pred CCCCCEEEEEEEEEECC-CCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECC-CCCCCCCEE--E T ss_conf 77885078878998177-5578589998667751886678999999999999998669478899726-874557879--9 Q ss_pred EECCCCCCCEEECCCCCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 41233322011003333432003321001001000000012344456667765101234453325443565431023322 Q gi|254780772|r 357 YLIDQGSPLYVKRIEIEGNDQSYDSVIRRELELSEGDPINYSMIERAKRRIMATGYFSEVNISQLPANDVSDYVILRVSV 436 (781) Q Consensus 357 ~~I~eg~~~~i~~I~I~GN~~T~d~VIrrel~~~~Gd~~~~~~i~~s~~~L~~lg~F~~V~i~~~~~~~~~~~vdl~i~v 436 (781) +.|.||. |++|.|.|+...++..++..++..+|+++|..+||+....|+++.--+ |.++..|.. .++..||.+++ T Consensus 143 l~VveG~---i~~i~i~g~~~~~~~~~~~~~p~~~g~~Ln~~~lEq~l~~l~~~P~~q-a~~~l~pg~-~~G~S~l~i~~ 217 (554) T COG2831 143 LQVVEGR---IEDIRITGDSDLRSVALRSLFPAHRGDPLNLRDLEQGLELLNRLPGVQ-ADAELVPGS-EPGESDLVIKV 217 (554) T ss_pred EEEEEEE---EEEEEECCCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCC-CEEEECCCC-CCCCCEEEEEE T ss_conf 9995218---565898278874146677547667899778899999999862274422-125677778-88841699984 Q ss_pred CCCCCCCCCCCCC-CCC--CCCCEEEECCCCCCCCCCCCCCCEEEEEEECCC--CEEEEEEEECCCCCCCCCCCCCCCCC Q ss_conf 1122111111111-223--321100000111233323442200378741122--11113454112357775431001111 Q gi|254780772|r 437 KQLSAGSVGIATN-YEV--DKGMGVEGHIDDNNFFGQGYRARLAAGFGRHAV--QNYTFSVEDPYFLGSPISAGFDLQKT 511 (781) Q Consensus 437 ~E~~tg~~~~g~g-ys~--~~g~~~~~~~~~~Nl~G~G~~l~l~~~~~~~~~--~~~~~s~~~P~~~~~~~s~~~~~~~~ 511 (781) ++.++.++.+++. |++ .....+.+++...|++|.|+.+++++..+.... .+|+++|+.|+ ....++++++.... T Consensus 218 ~~~~~~~~~~~~DN~G~~~tG~~r~~~~l~~dn~lglgD~l~~~~~~~~~~~~~~~~~~~Ys~P~-g~~~~~~~~s~~~y 296 (554) T COG2831 218 QQGKPWRVSASADNYGSKSTGRYRLGVGLSLDNPLGLGDQLSLSYSRSLDGQTTNNYSLSYSVPL-GYWTFSLGASYSEY 296 (554) T ss_pred EECCCEEEEEEECCCCCCCCCCEEEEEEEEECCCCCCCHHEEEECCCCCCCCCCCEEEEEEEEEE-CCCCEEEEEEECCE T ss_conf 43796789998637888666631567689726976521222310033677753224789999897-47535898501013 Q ss_pred CC------CCCCCCCCCCCCEEEEEEECCCCEEEEECC--CEEEEEEECCCCCCCCCHHHHHHHCCCCCEEEEEEEEEC- Q ss_conf 33------333343233320245542025430235404--125776401467544201112210158502467884321- Q gi|254780772|r 512 HL------EDGSLDINDESAAVRMIVPITESISTSFKY--DLRFLQYGAISEKEKIPSIYTTLIEHGKFSSHSISQSII- 582 (781) Q Consensus 512 ~~------~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~- 582 (781) +. ...++.-++...++.+.+++.++....+.. ++.+.+....-....... +........++.+.. T Consensus 297 ~~~~~~~~~~~~~~G~s~~~~~~l~~~l~R~~~~~~s~~~~~~~r~~~~~~~~~~~~~------q~r~~~~~~~gl~~~~ 370 (554) T COG2831 297 RQVLEGPFDVLDYQGKSQNLSLRLSHPLLRNRSSKLSLGLGLSHRRSENYLDDTEIEV------QRRRLTAVELGLSGRR 370 (554) T ss_pred EEECCCCCCEEEEECCEEEEEEEEEEEEEECCCCEEEEEEEEEEECCHHHHCCCCCCC------CCCCCCCEEECCEEEE T ss_conf 4402567731453011135657875578976761688888999840600103443333------2212330897126899 Q ss_pred ---CCCCCCCCCCCCCEEEEEEECCC---CCC-CCCEEEEEEEEEEEEEEECC-CEEEEEEEEEEEEEEECCCCCCCCHH Q ss_conf ---55420110024651565420245---655-54203666630047999358-80999999997898516997412301 Q gi|254780772|r 583 ---YNTLDNPIVPRKGMLISSSYDYA---GFG-GDSQYHRIGSRASYFYLLSD-DSDIVGSLRFGYGCVIPSNKNLQLFD 654 (781) Q Consensus 583 ---~dtrD~~~~Pt~G~~~~~~~~~a---~~g-~d~~f~k~~~~~~~y~~l~~-~~~l~~~~~~g~~~~~~~~~~lp~~e 654 (781) ..+.|..+.-.+|... ++.... ..+ .+.+|.++.+++.+.+|+.. .+.+..++.+++ +.+.++..| T Consensus 371 ~~~~~~~~~~l~~~~G~~~-~Ga~~~~~~~~~~~~~~f~~~~~~~~~~~~f~~~~~~~~~~~~~Q~-----s~~~L~s~e 444 (554) T COG2831 371 YLGGGTLDASLGYRRGLGA-LGASKDDEETNGEPTSRFKKLNGNASLLQPFGLGPFLYATQFNAQY-----SGDPLLSSE 444 (554) T ss_pred EECCCCCCCCCHHHCCCCC-CCCCCCCHHHCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEE-----CCCCCCCHH T ss_conf 7136510322354428620-1881676110477764307999851677641655279986666788-----899788378 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCC Q ss_conf 27507897140212686155037997222147899999999614565654537999999767741566667754445676 Q gi|254780772|r 655 QFSVSSNYYLRGFAYKGIGPRVDKKYAIGGKIYSSASAAVSFPMPLVPERAGLRGAFFVDSATLYANHVALGADKLEGND 734 (781) Q Consensus 655 rF~lGG~~svRGy~~~~iGP~~~~~~~~GG~~~~~~s~E~r~pl~~~~~~~~l~~~~F~D~G~v~~~~~~~~~~~~~~~~ 734 (781) +|.+||.++||||+.+++. |+..+.++.|+++|++.. .. +...+|+|+|.||....... . T Consensus 445 ~fsiGg~~sVRGf~~~~ls----------gD~G~~~snel~~~~~~~--~~-~~~y~f~D~G~v~~~~~~~~-------~ 504 (554) T COG2831 445 KFSIGGRYSVRGFDGGSLS----------GDRGWYLSNELRWPLPPG--GA-LQPYVFVDYGKVYNNSAEYL-------S 504 (554) T ss_pred HEECCCCCEECCCCCCCCC----------CCCEEEEEEEEEEECCCC--CC-EEEEEEEEEEEEECCCCCCC-------C T ss_conf 6531786523562688644----------341689976654303787--74-47988998678833455556-------7 Q ss_pred CCEEEEEEEEEEEECCC-CCEEEEEEEECCCCCCC--EEEEEEEEEEECC Q ss_conf 64247999999975570-01589988722677876--0258999641059 Q gi|254780772|r 735 SFWRVSTGVEIMWNSPL-GMMGVYYGIPLRHREGD--KIQQFGFRIGNRM 781 (781) Q Consensus 735 ~~~r~s~G~Glr~~tP~-Gpirld~a~pl~~~~~D--~~~~f~FsiG~~f 781 (781) ..-.+|+|+|+||.-+- .-.+|++|+||.++.+. ...++.|+++-+| T Consensus 505 ~~~l~g~~~Glr~~~~~~~~~~l~~g~pl~~p~~~~~~~~~~~~~~~~~f 554 (554) T COG2831 505 GETLAGAGLGLRGNLKDGFSYDLDLGRPLSKPAGFDNRRTVVGFSFSYSF 554 (554) T ss_pred CCEEEEEEEEEEEEECCCEEEEEEEEEECCCCCCCCCCCCEEEEEEEEEC T ss_conf 77166799767676067547899870063561236777630689999979 No 12 >pfam03865 ShlB Haemolysin secretion/activation protein ShlB/FhaC/HecB. This family represents a group of sequences that are related to ShlB from Serratia marcescens. ShlB is an outer membrane protein pore involved in the Type Vb or Two-partner secretion system where it is functions to secrete and activate the haemolysin ShlA. The activation of ShlA occurs during secretion when ShlB imposes a conformational change in the inactive haemolysin to form the active protein. Probab=99.94 E-value=8.2e-23 Score=166.57 Aligned_cols=342 Identities=15% Similarity=0.104 Sum_probs=207.2 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCEEEECC Q ss_conf 001000000012344456667765101234453325443565431023322112211111111---12233211000001 Q gi|254780772|r 386 ELELSEGDPINYSMIERAKRRIMATGYFSEVNISQLPANDVSDYVILRVSVKQLSAGSVGIAT---NYEVDKGMGVEGHI 462 (781) Q Consensus 386 el~~~~Gd~~~~~~i~~s~~~L~~lg~F~~V~i~~~~~~~~~~~vdl~i~v~E~~tg~~~~g~---gys~~~g~~~~~~~ 462 (781) -++..+|+++|..+||++.++|++|.- .+|++++.|.. .++.-||.+..++.++..+++++ |..+.....+.+++ T Consensus 10 ~FP~~~G~~LNlrdlEQgleqlnRl~s-~~a~i~I~Pg~-~~G~S~i~i~~~~~~~~~~s~~~DN~G~~~TGr~q~~~~l 87 (404) T pfam03865 10 LFPGIEGKPLKLAELDQGLDQANRLQS-NKTKLDILPGR-ELGGSVIKLRNQHAKPWHINAASDNYGQKNSGRWLARANA 87 (404) T ss_pred ECCCCCCCCCCHHHHHHHHHHHHCCCC-CCCEEEECCCC-CCCCEEEEEEEECCCCEEEEEEECCCCCCCCCCEEEEEEE T ss_conf 178988989737688889999706777-76159985799-9992089998601787799998769998143727878899 Q ss_pred CCCCCCCCCCCCCEEEEEEE-----CCCCEEEEEEEECCCCCCCCCCCCCCCCCCCC--------CCCCCCCCCCCEEEE Q ss_conf 11233323442200378741-----12211113454112357775431001111333--------333432333202455 Q gi|254780772|r 463 DDNNFFGQGYRARLAAGFGR-----HAVQNYTFSVEDPYFLGSPISAGFDLQKTHLE--------DGSLDINDESAAVRM 529 (781) Q Consensus 463 ~~~Nl~G~G~~l~l~~~~~~-----~~~~~~~~s~~~P~~~~~~~s~~~~~~~~~~~--------~~~y~~~~~~~~~~~ 529 (781) ...|+||.++.+++++.... ...+.++++|+.|+ + .+.+.+...++++. ...+.-++....+.+ T Consensus 88 ~~dnplgLnD~lsls~~~~~~~~~~~~~~~~s~~YSvP~--G-ywt~s~~~s~S~Y~~~~~~~~~~~~~~G~S~~~~l~~ 164 (404) T pfam03865 88 SLDSPFGLSDFVSLNANITLDNPAHRFNRAYTLLYSIPY--G-AFTFSGFASFSEYEFHQKLPHNTVKLHGDTQQVGIRG 164 (404) T ss_pred EECCCCCCCCEEEEECCCCCCCCCCCCCCEEEEEEEEEE--C-CEEEEEEEEEEEEEEEEECCCCCEEEECEEEEEEEEE T ss_conf 964866754058983243367877776437899999100--0-1344237751237888704785178433165896377 Q ss_pred EEECCCCE--EEEECCCEEEEEEECCCCCCCCCHHHHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCCEEEEEEECCCC- Q ss_conf 42025430--235404125776401467544201112210158502467884321554201100246515654202456- Q gi|254780772|r 530 IVPITESI--STSFKYDLRFLQYGAISEKEKIPSIYTTLIEHGKFSSHSISQSIIYNTLDNPIVPRKGMLISSSYDYAG- 606 (781) Q Consensus 530 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~dtrD~~~~Pt~G~~~~~~~~~a~- 606 (781) .+.+.++. ..++...+.+.+....-.. ...+........+.+++.+.. ..|.-....+++++.+. T Consensus 165 srvl~R~~~~k~sl~~~l~~k~~~nyi~d--------~~l~vqsr~lt~~~lGl~~~~----~l~~g~l~~~l~~~~Gl~ 232 (404) T pfam03865 165 DYAFSRDHDQIDSLNGQLTHKRIDNYFEQ--------IRLDLSSPKLTLAELGANHLQ----IIPNGVFSANLSVEKAMG 232 (404) T ss_pred EEEEECCCCCEEEEEEEEEEEECCCEECC--------EEEEECCCCEEEEEEEEEEEE----EECCCEEEEEEEHHHCCC T ss_conf 88788688843899999999650311777--------797602783678999620279----924858976772632667 Q ss_pred -C------------CCCCEEEEEEEEEEEEEEECC-CEEEEEEEEEEEEEEECCCCCCCCHHEEEECCCCCCCCCCCCCC Q ss_conf -5------------554203666630047999358-80999999997898516997412301275078971402126861 Q gi|254780772|r 607 -F------------GGDSQYHRIGSRASYFYLLSD-DSDIVGSLRFGYGCVIPSNKNLQLFDQFSVSSNYYLRGFAYKGI 672 (781) Q Consensus 607 -~------------g~d~~f~k~~~~~~~y~~l~~-~~~l~~~~~~g~~~~~~~~~~lp~~erF~lGG~~svRGy~~~~i 672 (781) + +-+.+|.|..+.+.++.|+.- ...+.+. ..+...+ +.+.|+.+|+|.+||.++||||+.+++ T Consensus 233 wfgA~~~~~~~~~~~~~~~f~k~~l~~~~~~~f~l~~~~~~~~--~~~~gQy-S~d~L~~sE~~sIGg~~sVRGF~e~sl 309 (404) T pfam03865 233 WFGAEEHPSIAHGNGLDSQFTKGKLFANLSQRFRLGDATFLFN--NLFLGQY-SHDPLPGVEWLSLTDKNAIRGFDQSTL 309 (404) T ss_pred CCCCCCCCHHHCCCCCCCEEEEEEEEEEEEECCEECCCCEEEE--EEEEEEE-CCCCCCCHHEECCCCCCEECCCCCCCE T ss_conf 6578778212037776524799999998860505556448989--9899650-899888044671448854776488807 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCC Q ss_conf 55037997222147899999999614565654537999999767741566667754445676642479999999755700 Q gi|254780772|r 673 GPRVDKKYAIGGKIYSSASAAVSFPMPLVPERAGLRGAFFVDSATLYANHVALGADKLEGNDSFWRVSTGVEIMWNSPLG 752 (781) Q Consensus 673 GP~~~~~~~~GG~~~~~~s~E~r~pl~~~~~~~~l~~~~F~D~G~v~~~~~~~~~~~~~~~~~~~r~s~G~Glr~~tP~G 752 (781) . |+.-++...|+.+|+++- ...+..-+++|+|.|.......+. ....-.|+|+|+||..-.. T Consensus 310 s----------GD~G~y~rNels~~~~~~--~~~l~Pyig~D~G~v~~~~~~~~~------~~~~g~~~g~~l~~~~~~~ 371 (404) T pfam03865 310 S----------GDNGGYLQNTLSRPFNLG--AFSITPRIGADIGQIKPHGDNKGW------QGGAGISSGANIQYQQALL 371 (404) T ss_pred E----------ECCEEEEEEEEECCCCCC--CEEEEEEEEEEEEEEEECCCCCCC------CCCCEEEEEEEEEEEEEEE T ss_conf 6----------455699999995324689--705889999775899716887555------4334055688999961089 Q ss_pred CEEEEEEEECCCC Q ss_conf 1589988722677 Q gi|254780772|r 753 MMGVYYGIPLRHR 765 (781) Q Consensus 753 pirld~a~pl~~~ 765 (781) .+.+.+|+||.++ T Consensus 372 ~~~~~~g~~l~~~ 384 (404) T pfam03865 372 DLEVAKGEILYHQ 384 (404) T ss_pred EEEEECCCHHCCC T ss_conf 8997716200173 No 13 >COG2831 FhaC Hemolysin activation/secretion protein [Intracellular trafficking and secretion] Probab=99.40 E-value=8.5e-10 Score=81.41 Aligned_cols=159 Identities=19% Similarity=0.257 Sum_probs=118.1 Q ss_pred EECCCCEEEEEEEEECCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC-C Q ss_conf 603663246778750443110122221110012211244566643102333345788888875423221001231012-4 Q gi|254780772|r 188 IEEGVKAKINSIRFVGNKNYSHARLERVISIRTSGYFSFGKTDVYSKERMSFDEEAIRAFYHDRGYAAVKVSSQVLFD-K 266 (781) Q Consensus 188 I~EG~~~~I~~I~f~Gn~~f~~~~L~~~~~~k~~~~~~~~~~~~~~~~~l~~D~~~l~~~Y~~~GY~d~~V~~~~~~~-~ 266 (781) -+|.+...|+.|.++||+.++..+|...+. ...+.+.....|..-..+|+++|.++||.-++| .+. . T Consensus 67 ~~E~~~f~I~~i~l~g~~~~~~~~l~~~~~--------~~~g~~l~~~~l~~l~~~lt~~y~~~GYvtsra----~lp~Q 134 (554) T COG2831 67 PDETPCFPINRIELEGNTLLDALELQAILA--------PYLGRCLGLADLNQLASALTNLYIDRGYVTTRA----LLPPQ 134 (554) T ss_pred CCCCCEEEEEEEEEECCCCCCHHHHHHHHH--------HHHCCCCCHHHHHHHHHHHHHHHHHCCCEEEEE----ECCCC T ss_conf 778850788789981775578589998667--------751886678999999999999998669478899----72687 Q ss_pred CCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 55554101220222210000012333444557677775321169962112100000000000023445642011233223 Q gi|254780772|r 267 QKSGYVLIFQIDEGEIYTVGNISIQSTLQEIQKKTLLSLIRIRSGNLYNPQEIKESSEKISKYFFSGERPFVRVKTRINR 346 (781) Q Consensus 267 ~~~~~~i~~~I~EG~~y~i~~i~i~g~~~~~~~~~l~~~~~~k~G~~~~~~~~~~~~~~i~~~y~~~GY~fa~v~~~~~~ 346 (781) +-+.-+|.+.|.|| ++++|.+.|+.. .....+...++.++|++.+..++++.++.++.....+ +...+.+ T Consensus 135 ~i~~G~l~l~VveG---~i~~i~i~g~~~-~~~~~~~~~~p~~~g~~Ln~~~lEq~l~~l~~~P~~q------a~~~l~p 204 (554) T COG2831 135 DLKSGTLRLQVVEG---RIEDIRITGDSD-LRSVALRSLFPAHRGDPLNLRDLEQGLELLNRLPGVQ------ADAELVP 204 (554) T ss_pred CCCCCEEEEEEEEE---EEEEEEECCCCC-CHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCC------CEEEECC T ss_conf 45578799999521---856589827887-4146677547667899778899999999862274422------1256777 Q ss_pred CCCCCEEEEEEECCCCCCCEEE Q ss_conf 4344046655412333220110 Q gi|254780772|r 347 DFAKRIVDIEYLIDQGSPLYVK 368 (781) Q Consensus 347 d~~~~~vdi~~~I~eg~~~~i~ 368 (781) ..+.+.-+|.+.+.+++++.+. T Consensus 205 g~~~G~S~l~i~~~~~~~~~~~ 226 (554) T COG2831 205 GSEPGESDLVIKVQQGKPWRVS 226 (554) T ss_pred CCCCCCCEEEEEEEECCCEEEE T ss_conf 7888841699984437967899 No 14 >COG0729 Outer membrane protein [Cell envelope biogenesis, outer membrane] Probab=99.33 E-value=2.8e-11 Score=91.13 Aligned_cols=212 Identities=12% Similarity=0.191 Sum_probs=149.2 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEE-CCCCCEEEEEEEECCCCEEEE--EEEEECCCCCCHHHHHHHHH-CCC Q ss_conf 99999999999999999729700179987643-389842588986036632467--78750443110122221110-012 Q gi|254780772|r 145 DEDTVNADVHNIKQAYASIGYLNVMVKVQHHS-ISPTTLNITYVIEEGVKAKIN--SIRFVGNKNYSHARLERVIS-IRT 220 (781) Q Consensus 145 ~~~~~~~~~~~i~~~y~~~Gy~~a~V~~~~~~-~~~~~v~i~~~I~EG~~~~I~--~I~f~Gn~~f~~~~L~~~~~-~k~ 220 (781) .+.......+......+..||+.+++.+++.. ..+..-.++..|..|++.+|. .+...| .++.+..++.... ..+ T Consensus 64 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~pg~~~~i~~~~~~~~G-~a~~d~~~~~~~~~~~~ 142 (594) T COG0729 64 DSRFRGLLDKARRDGLRALGYLQPKGRFGGKVSVTEKRGKLIAKVTPGEPTPIAVVIVVLPG-PAFTDGDYRLLGDAAGP 142 (594) T ss_pred CCCHHHHHHHCCCHHHHHHCEECCEEEECCCCEEECCCCCEEEECCCCCCCCCCEEEEECCC-CCCCCHHHHHHHHHHCC T ss_conf 31034565641016776420004503541431000345521332389998665325662246-20037346554343121 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC-CCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCH Q ss_conf 21124456664310233334578888887542322100-12310124555541012202222100000123334445576 Q gi|254780772|r 221 SGYFSFGKTDVYSKERMSFDEEAIRAFYHDRGYAAVKV-SSQVLFDKQKSGYVLIFQIDEGEIYTVGNISIQSTLQEIQK 299 (781) Q Consensus 221 ~~~~~~~~~~~~~~~~l~~D~~~l~~~Y~~~GY~d~~V-~~~~~~~~~~~~~~i~~~I~EG~~y~i~~i~i~g~~~~~~~ 299 (781) . .+..+++..++.-+.+|.+....+||++++. .+++..+++.+.+.+++.++-|++|.++.|+|+|. .-+++ T Consensus 143 ~------~G~~~~~~~~~n~k~~i~~~~~~~gy~~a~~~~~r~~v~~~~~~a~~~~~~dsG~~y~~g~v~~~g~-~r~~~ 215 (594) T COG0729 143 K------EGEDLNQGTYENAKSSIVRALLRKGYFLARFTKSRLVVDPATHTADVDLNYDSGRRYRFGPVTVEGS-QRIDE 215 (594) T ss_pred C------CCCCCCCCHHHHHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCCEEEEEEEECCCCEEEECCEEECCC-EECCH T ss_conf 3------5873565211114678999998728300025403011162315688998633541697676797166-03587 Q ss_pred HHHHHHHHCCCCCEECCCCCCHHHHHCCCCCCCCCCCCC-CCCCCCCCCCCCCEEEEEEECCCCCCCEEE Q ss_conf 777753211699621121000000000000234456420-112332234344046655412333220110 Q gi|254780772|r 300 KTLLSLIRIRSGNLYNPQEIKESSEKISKYFFSGERPFV-RVKTRINRDFAKRIVDIEYLIDQGSPLYVK 368 (781) Q Consensus 300 ~~l~~~~~~k~G~~~~~~~~~~~~~~i~~~y~~~GY~fa-~v~~~~~~d~~~~~vdi~~~I~eg~~~~i~ 368 (781) +.+...+++++|++|..+.+.+..++ +...+|... .|.+..++...+..+.+.+.+.|+++.+++ T Consensus 216 ~~~~~~~~~~~G~~Y~~~~l~~~~~r----l~~~~~F~sv~v~~~~~~~~~~~~~pv~v~~~~rk~~~~~ 281 (594) T COG0729 216 EYLQNLVPFKYGLPYDPEDLAELNQR----LRQTGYFSSVVVQPADKQAGPDGLLPVKVSVSERKENTFE 281 (594) T ss_pred HHHHHCCCCCCCCCCCHHHHHHHHHH----HHHCCCEEEEEEECCCCCCCCCCEECEEEEECCCCCCEEE T ss_conf 78763036667886787799999998----7615850348981275555767603539885246553377 No 15 >pfam07244 Surf_Ag_VNR Surface antigen variable number repeat. This family is found primarily in bacterial surface antigens, normally as variable number repeats at the N-terminus. The C-terminus of these proteins is normally represented by pfam01103. The alignment centres on a -GY- or -GF- motif. Some members of this family are found in the mitochondria. It is predicted to have a mixed alpha/beta secondary structure. Probab=99.09 E-value=7.6e-10 Score=81.73 Aligned_cols=77 Identities=23% Similarity=0.487 Sum_probs=69.7 Q ss_pred CEEEEEEEECCCC-CCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEEEEEEECC Q ss_conf 5365799956543-69889988730057863799999999999999999729700179987643389842588986036 Q gi|254780772|r 114 KIINHLFFSGNNN-LKDDQLKMIVRSRSAAAYDEDTVNADVHNIKQAYASIGYLNVMVKVQHHSISPTTLNITYVIEEG 191 (781) Q Consensus 114 p~i~~i~~~Gn~~-~~~~~l~~~~~~~~g~~~~~~~~~~~~~~i~~~y~~~Gy~~a~V~~~~~~~~~~~v~i~~~I~EG 191 (781) |.|++|.|+||.. ++++.|.+.+.+++|++|++..+++++++|.++|.+.||++++|++...+. ++.|+|+|.|+|| T Consensus 1 ~~i~~I~~~Gn~~~~~~~~l~~~~~~~~G~~~~~~~i~~~~~~l~~~y~~~Gy~~~~V~~~~~~~-~~~v~l~~~V~Eg 78 (78) T pfam07244 1 YKIKDINIEGNLKRTKDEVLRRELKLKEGDVFNREKLEEDIERLYDYYGALGYAFADVKVEPEPD-PKTVDLTFNVDEG 78 (78) T ss_pred CEEEEEEEEECCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEECC-CCEEEEEEEEECC T ss_conf 97979999948866899999998635887631699999799999999986797654433177138-9989999998269 No 16 >pfam07244 Surf_Ag_VNR Surface antigen variable number repeat. This family is found primarily in bacterial surface antigens, normally as variable number repeats at the N-terminus. The C-terminus of these proteins is normally represented by pfam01103. The alignment centres on a -GY- or -GF- motif. Some members of this family are found in the mitochondria. It is predicted to have a mixed alpha/beta secondary structure. Probab=98.78 E-value=3.6e-09 Score=77.26 Aligned_cols=77 Identities=22% Similarity=0.513 Sum_probs=40.5 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCC Q ss_conf 0000123334445576777753211699621121000000000000234456420112332234344046655412333 Q gi|254780772|r 284 TVGNISIQSTLQEIQKKTLLSLIRIRSGNLYNPQEIKESSEKISKYFFSGERPFVRVKTRINRDFAKRIVDIEYLIDQG 362 (781) Q Consensus 284 ~i~~i~i~g~~~~~~~~~l~~~~~~k~G~~~~~~~~~~~~~~i~~~y~~~GY~fa~v~~~~~~d~~~~~vdi~~~I~eg 362 (781) +|++|+|+||...+++++|.+.+++++|++|+...++++.++|.++|.+.||+|++|.+.+..+ ++.|+|+|.|+|| T Consensus 2 ~i~~I~~~Gn~~~~~~~~l~~~~~~~~G~~~~~~~i~~~~~~l~~~y~~~Gy~~~~V~~~~~~~--~~~v~l~~~V~Eg 78 (78) T pfam07244 2 KIKDINIEGNLKRTKDEVLRRELKLKEGDVFNREKLEEDIERLYDYYGALGYAFADVKVEPEPD--PKTVDLTFNVDEG 78 (78) T ss_pred EEEEEEEEECCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEECC--CCEEEEEEEEECC T ss_conf 7979999948866899999998635887631699999799999999986797654433177138--9989999998269 No 17 >TIGR00992 3a0901s03IAP75 chloroplast envelope protein translocase, IAP75 family; InterPro: IPR005689 Chloroplast function requires the import of nuclear encoded proteins from the cytoplasm across the chloroplast double membrane. This is accompished by two protein complexes, the Toc complex located at the outer membrane and the Tic complex loacted at the inner membrane . The Toc complex recognises specific proteins by a cleavable N-terminal sequence and is primarily responsible for translocation through the outer membrane, while the Tic complex translocates the protein through the inner membrane. This entry represents Toc750, a core component of the Toc complex. This protein is deeply embedded in the outer membrane and forms the voltage-dependent translocation channel . Toc75 itself appears to be capable of at least some discrimination between substrate and non-substrate proteins, with recognition based on based on both conformational and electrostatic interactions. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0009707 chloroplast outer membrane. Probab=98.73 E-value=5.1e-08 Score=69.76 Aligned_cols=220 Identities=15% Similarity=0.132 Sum_probs=150.2 Q ss_pred EEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECC----EEEEEEEECCEEEE-EEE Q ss_conf 6858999665007988999974878988268899999999999679800389998699----89999993653657-999 Q gi|254780772|r 47 IVRRIEIRGATNVGKEVILSRIPVVVGQSISDADLDHAVKNIYAMGYFSNVKIKIVDS----VLIIDLIERKIINH-LFF 121 (781) Q Consensus 47 ~i~~I~i~G~~~i~~~~i~~~~~~~~G~~~~~~~i~~~i~~l~~~g~F~~v~~~~~~~----~l~~~v~e~p~i~~-i~~ 121 (781) .++++++.|..-.-+...+......+|..=....+.+++...+++++|.+|.+-..+. ...+.++|+|.-.. .+. T Consensus 145 L~~E~~~~aT~g~~E~~~~~~~~~~~G~~~~~~~~~E~~~~~~~~~~~~NV~~~~~~~P~~~~~~~~~~~~~~f~~~l~~ 224 (768) T TIGR00992 145 LQKELEVLATCGMLEKVDLEGKTTQDGRLGVTISLAESVWAIADRFRCINVGLMPQSKPLEVDVDMEVKEKLEFTRRLEK 224 (768) T ss_pred HHHHHHHHCCCCCEEEEEECCEECCCCCEEEEEEEHHHHHHHHHCEEEEEEEECCCCCCCEECCCHHHCCCEEEEHHHHH T ss_conf 77666552478620222113400689824789874123675540401587775246754100466000123111000111 Q ss_pred E-------C-CCCCCHHHHHHHHHHCCC-CCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEEEEEEECCC Q ss_conf 5-------6-543698899887300578-637999999999999999997297001799876433898425889860366 Q gi|254780772|r 122 S-------G-NNNLKDDQLKMIVRSRSA-AAYDEDTVNADVHNIKQAYASIGYLNVMVKVQHHSISPTTLNITYVIEEGV 192 (781) Q Consensus 122 ~-------G-n~~~~~~~l~~~~~~~~g-~~~~~~~~~~~~~~i~~~y~~~Gy~~a~V~~~~~~~~~~~v~i~~~I~EG~ 192 (781) + | ...+..+-..+......+ ..++-..|++.+++|.+.|+++||--|+|--. -.+. .+-.|+..|.||. T Consensus 225 ~~~~~~~~a~~~lLP~~V~~E~~~~~~~q~~~~~R~LQ~~~~~v~~WY~~~GY~~AQVV~~-g~~~-~~G~V~~~v~EG~ 302 (768) T TIGR00992 225 RYKRRIAEARPCLLPQEVVDEVFGMLYGQGKVSARLLQEIRDRVQEWYHNEGYALAQVVNF-GNLN-TDGEVVLEVVEGD 302 (768) T ss_pred HHCCHHHCCCCCCCCHHHHHHHHHHHHCCCEECHHHHHHHHHHHHHHHHHCCCEEEEEECC-CCCC-CCCEEEEEEECCC T ss_conf 1011013056554873678899987540551006777778888863110057507787346-7637-7651899996471 Q ss_pred CEEEEEEEE--------ECCCCC--CHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 324677875--------044311--0122221110012211244566643102333345788888875423221001231 Q gi|254780772|r 193 KAKINSIRF--------VGNKNY--SHARLERVISIRTSGYFSFGKTDVYSKERMSFDEEAIRAFYHDRGYAAVKVSSQV 262 (781) Q Consensus 193 ~~~I~~I~f--------~Gn~~f--~~~~L~~~~~~k~~~~~~~~~~~~~~~~~l~~D~~~l~~~Y~~~GY~d~~V~~~~ 262 (781) -..|. |+| +||+.- =.++|++++. .++.|+.+.=..|+++|.++-+ |-|++|+..= T Consensus 303 ~~~l~-~~f~D~~~~~V~G~T~~~vv~REl~kqL~----------~G~vFN~~~~~~~L~~~~SLgL---F~~~~~~~~P 368 (768) T TIGR00992 303 ITNLQ-IQFLDKLGNVVEGNTRDPVVTRELPKQLK----------PGDVFNIEAGKTALQNINSLGL---FSNIEVNPRP 368 (768) T ss_pred EEEEE-EEEECCCCCEECCCCCCCEEEECCHHHCC----------CCCEECHHHHHHHHHHHHHHHH---HCCCEECCCC T ss_conf 21678-99972678743055466238523353257----------8844413678888765432210---0011554688 Q ss_pred CCCCCCCCCCEEEEECCCCCC Q ss_conf 012455554101220222210 Q gi|254780772|r 263 LFDKQKSGYVLIFQIDEGEIY 283 (781) Q Consensus 263 ~~~~~~~~~~i~~~I~EG~~y 283 (781) -..+...+.|.++|.|+++- T Consensus 369 -~~~~~~~~IV~~~~~E~~~~ 388 (768) T TIGR00992 369 -DEKNEGEIIVEIKLKELEQK 388 (768) T ss_pred -CCCCCCEEEEEEEEEECCCC T ss_conf -88888627999899854776 No 18 >PRK05529 cell division protein FtsQ; Provisional Probab=98.54 E-value=7.7e-06 Score=55.51 Aligned_cols=120 Identities=15% Similarity=0.215 Sum_probs=93.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHCCCCEEEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHH Q ss_conf 740378999999999998888999999999--999999875114652685899966500798899997487898826889 Q gi|254780772|r 2 HKSTEDFRRIKRLLEKYFPRSFQMGFIILF--YAIFGLSAVYGSNTSIVRRIEIRGATNVGKEVILSRIPVVVGQSISDA 79 (781) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~i~~I~i~G~~~i~~~~i~~~~~~~~G~~~~~~ 79 (781) ||+..+.++++|+.+++-.|.+...--+-. .++..+-.++...-.-|++|+|+|+++++.+.|...+....|+++-.- T Consensus 15 ~~~~~~~~~~~~~~~~~r~r~~~~~~~~g~v~~l~~~v~~~~fSpll~v~~I~V~G~~~~~~~~V~~a~~~~~G~pL~~v 94 (255) T PRK05529 15 RKERKDYERVRRFTTRIRRRFILLACAVGAVLTLLLFVMLSAYSPLLALRSIEVAGNMRVKPQDIVAALRDQFGKPLPLV 94 (255) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEECCCCCCHHHHHHHHCCCCCCEEEEE T ss_conf 88888899999999988888898999999999999999985417546988999989834799999998658789851388 Q ss_pred HHHHHHHHHHHCCCCCEEEEEEE-CCEEEEEEEECCEEEEEEE Q ss_conf 99999999996798003899986-9989999993653657999 Q gi|254780772|r 80 DLDHAVKNIYAMGYFSNVKIKIV-DSVLIIDLIERKIINHLFF 121 (781) Q Consensus 80 ~i~~~i~~l~~~g~F~~v~~~~~-~~~l~~~v~e~p~i~~i~~ 121 (781) +.++.-++|-+.-+-.++++... .+.|.|.|+|+--+.-+.- T Consensus 95 d~~~v~~~l~~lp~V~sa~V~r~~P~~l~I~V~ER~Pva~v~~ 137 (255) T PRK05529 95 DPETVRKKLAAFPLIRSYSVESKPPGTIVVRVVERVPLAFIQR 137 (255) T ss_pred CHHHHHHHHHCCCCEEEEEEEECCCCCEEEEEEEEEEEEEEEE T ss_conf 6999999874499855899996397926999995052799982 No 19 >pfam08479 POTRA_2 POTRA domain, ShlB-type. The POTRA domain (for polypeptide-transport-associated domain) is found towards the N-terminus of ShlB family proteins (pfam03865). ShlB is important in the secretion and activation of the haemolysin ShlA. It has been postulated that the POTRA domain has a chaperone-like function over ShlA; it may fold back into the C-terminal beta-barrel channel. Probab=98.37 E-value=3.9e-06 Score=57.43 Aligned_cols=75 Identities=19% Similarity=0.279 Sum_probs=65.6 Q ss_pred EEEEEEEECCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEEEEEEECC Q ss_conf 36579995654369889988730057863799999999999999999729700179987643389842588986036 Q gi|254780772|r 115 IINHLFFSGNNNLKDDQLKMIVRSRSAAAYDEDTVNADVHNIKQAYASIGYLNVMVKVQHHSISPTTLNITYVIEEG 191 (781) Q Consensus 115 ~i~~i~~~Gn~~~~~~~l~~~~~~~~g~~~~~~~~~~~~~~i~~~y~~~Gy~~a~V~~~~~~~~~~~v~i~~~I~EG 191 (781) .|++|.|+||+.+++++|.+.+.-..|..++...+++.+..|.++|.++||..+.|...-.+..++. |.+.|.|| T Consensus 2 ~v~~i~~~G~~~~~~~~l~~~~~~~~g~~l~~~~L~~~~~~i~~~Y~~~GY~~srv~lP~Q~i~~G~--l~i~VvEG 76 (76) T pfam08479 2 PIKGIELEGNTLLPPADLAALLQPYLGRCLGLADLNQLAKALTNAYIERGYVTSRVYLPPQDLSDGV--LRLTVLEG 76 (76) T ss_pred CEEEEEEECCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCE--EEEEEECC T ss_conf 0789999898257999999987875098369999999999999999988987999981785067988--99999688 No 20 >pfam06898 YqfD Putative stage IV sporulation protein YqfD. This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis is known to be essential for efficient sporulation although its exact function is unknown. Probab=98.19 E-value=6.4e-05 Score=49.47 Aligned_cols=95 Identities=18% Similarity=0.254 Sum_probs=68.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHCCCCEEEEEEEEECCCCCCHHHHHHHC---CCCCCCC---CCHHHHH Q ss_conf 99999999988889999999999999-998751146526858999665007988999974---8789882---6889999 Q gi|254780772|r 10 RIKRLLEKYFPRSFQMGFIILFYAIF-GLSAVYGSNTSIVRRIEIRGATNVGKEVILSRI---PVVVGQS---ISDADLD 82 (781) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~i~~I~i~G~~~i~~~~i~~~~---~~~~G~~---~~~~~i~ 82 (781) ++-+++.|..+|+.++.-+++|++++ .+|. .|=+|+|+||..++++.|+..+ ++++|.. ++...+. T Consensus 76 GlpF~~~r~~kR~~~~~G~~~f~~~l~~lS~-------fVW~I~i~G~~~~t~~~i~~~L~~~gi~~G~~k~~id~~~ie 148 (383) T pfam06898 76 GLPFLFFRLRKRRGIVLGILLFVILLYFLSN-------FIWSIDIKTAKNIAEHEIRQQLNEIGVKSGVKKFSLDVYKLE 148 (383) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------EEEEEEEECCCCCCHHHHHHHHHHCCCCCCEECCCCCHHHHH T ss_conf 7178999988879999999999999997840-------699999986998998999999998699778072778999999 Q ss_pred HHHHHHHHCCCCCEEEEEEECCEEEEEEEEC Q ss_conf 9999999679800389998699899999936 Q gi|254780772|r 83 HAVKNIYAMGYFSNVKIKIVDSVLIIDLIER 113 (781) Q Consensus 83 ~~i~~l~~~g~F~~v~~~~~~~~l~~~v~e~ 113 (781) +.+..- ..-++.|.++.+|..|.+.|+|+ T Consensus 149 ~~l~~~--~~~i~Wv~v~i~Gt~l~v~v~Ek 177 (383) T pfam06898 149 RELTDN--NNNIMWIGVRVRGTTLKVKVVEK 177 (383) T ss_pred HHHHHH--CCCEEEEEEEEEEEEEEEEEEEC T ss_conf 999864--89829999999836999999965 No 21 >COG1589 FtsQ Cell division septal protein [Cell envelope biogenesis, outer membrane] Probab=97.99 E-value=0.00025 Score=45.63 Aligned_cols=78 Identities=22% Similarity=0.318 Sum_probs=67.3 Q ss_pred EEEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEE-CCEEEEEEEECCEEEEEEEEC Q ss_conf 268589996650079889999748789882688999999999996798003899986-998999999365365799956 Q gi|254780772|r 46 SIVRRIEIRGATNVGKEVILSRIPVVVGQSISDADLDHAVKNIYAMGYFSNVKIKIV-DSVLIIDLIERKIINHLFFSG 123 (781) Q Consensus 46 ~~i~~I~i~G~~~i~~~~i~~~~~~~~G~~~~~~~i~~~i~~l~~~g~F~~v~~~~~-~~~l~~~v~e~p~i~~i~~~G 123 (781) ..|++|.|+||..+++++|+..+++..+..+=.-+++++-.+|-+.-+-++|+|... -+.+.|.|+|+..|.-....+ T Consensus 60 ~~i~~v~v~Gn~~~s~~~I~~~~~l~~~~~~~~ld~~~~~~~i~~~PwVk~a~V~r~~Pntv~I~v~Er~piA~w~~~~ 138 (269) T COG1589 60 FPIRKVSVSGNNQVSEEDILKALGLDGGTSFLTLDLNAIRENIEKLPWVKSAEVRRQFPNTLEIEVVEREPIAYWQRND 138 (269) T ss_pred CCCEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHHHCCCEEEEEEEEECCCCEEEEEEEEEEEEEEEECC T ss_conf 7520899967843699999998575689854887699999998629765799999838994799999850379997079 No 22 >pfam08479 POTRA_2 POTRA domain, ShlB-type. The POTRA domain (for polypeptide-transport-associated domain) is found towards the N-terminus of ShlB family proteins (pfam03865). ShlB is important in the secretion and activation of the haemolysin ShlA. It has been postulated that the POTRA domain has a chaperone-like function over ShlA; it may fold back into the C-terminal beta-barrel channel. Probab=97.71 E-value=1.8e-05 Score=53.12 Aligned_cols=75 Identities=23% Similarity=0.268 Sum_probs=42.4 Q ss_pred EEEEEEEECCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEE Q ss_conf 46778750443110122221110012211244566643102333345788888875423221001231012455554101 Q gi|254780772|r 195 KINSIRFVGNKNYSHARLERVISIRTSGYFSFGKTDVYSKERMSFDEEAIRAFYHDRGYAAVKVSSQVLFDKQKSGYVLI 274 (781) Q Consensus 195 ~I~~I~f~Gn~~f~~~~L~~~~~~k~~~~~~~~~~~~~~~~~l~~D~~~l~~~Y~~~GY~d~~V~~~~~~~~~~~~~~i~ 274 (781) .|++|.|+||+.|++.+|...+.--. +.....+.|..-.+.|.++|+++||.-+++.-+- .+-.+..|. T Consensus 2 ~v~~i~~~G~~~~~~~~l~~~~~~~~--------g~~l~~~~L~~~~~~i~~~Y~~~GY~~srv~lP~---Q~i~~G~l~ 70 (76) T pfam08479 2 PIKGIELEGNTLLPPADLAALLQPYL--------GRCLGLADLNQLAKALTNAYIERGYVTSRVYLPP---QDLSDGVLR 70 (76) T ss_pred CEEEEEEECCCCCCHHHHHHHHHHHC--------CCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCC---CCCCCCEEE T ss_conf 07899998982579999999878750--------9836999999999999999998898799998178---506798899 Q ss_pred EEECCC Q ss_conf 220222 Q gi|254780772|r 275 FQIDEG 280 (781) Q Consensus 275 ~~I~EG 280 (781) +.|.|| T Consensus 71 i~VvEG 76 (76) T pfam08479 71 LTVLEG 76 (76) T ss_pred EEEECC T ss_conf 999688 No 23 >pfam08478 POTRA_1 POTRA domain, FtsQ-type. FtsQ/DivIB bacterial division proteins (pfam03799) contain an N-terminal POTRA domain (for polypeptide-transport-associated domain). This is found in different types of proteins, usually associated with a transmembrane beta-barrel. FtsQ/DivIB may have chaperone-like roles, which has also been postulated for the POTRA domain in other contexts. Probab=97.21 E-value=0.0013 Score=40.90 Aligned_cols=66 Identities=21% Similarity=0.392 Sum_probs=58.1 Q ss_pred EEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEE-CCEEEEEEEEC Q ss_conf 8589996650079889999748789882688999999999996798003899986-99899999936 Q gi|254780772|r 48 VRRIEIRGATNVGKEVILSRIPVVVGQSISDADLDHAVKNIYAMGYFSNVKIKIV-DSVLIIDLIER 113 (781) Q Consensus 48 i~~I~i~G~~~i~~~~i~~~~~~~~G~~~~~~~i~~~i~~l~~~g~F~~v~~~~~-~~~l~~~v~e~ 113 (781) |++|+|.||+++++++|+..+++..|+++=.-+.++.-++|-+.-+-++|+|... .+.|.+.|.|+ T Consensus 1 i~~I~V~G~~~~~~~~i~~~~~~~~~~~~~~i~~~~i~~~l~~~p~I~~a~V~r~~P~~l~I~I~Er 67 (67) T pfam08478 1 IRQVEVSGNKHVSAEEIRKALGIQKGTSFFSVDLNAIEDRLEKLPWIKSASVRRQWPNTLEIRVVER 67 (67) T ss_pred CCEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHHCCCCEEEEEEEEECCCEEEEEEEEC T ss_conf 9299999983479999999767658992799989999998747887269999997799899999969 No 24 >TIGR02876 spore_yqfD sporulation protein YqfD; InterPro: IPR010690 This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis (P54469 from SWISSPROT) is known to be essential for efficient sporulation although its exact function is unknown .. Probab=97.08 E-value=0.011 Score=34.83 Aligned_cols=103 Identities=17% Similarity=0.233 Sum_probs=71.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHCCCCEEEEEEE--EECCCCCCHHHHH---HHCCCCCCCCCCHHH-HH Q ss_conf 9999999998888999999999-999999875114652685899--9665007988999---974878988268899-99 Q gi|254780772|r 10 RIKRLLEKYFPRSFQMGFIILF-YAIFGLSAVYGSNTSIVRRIE--IRGATNVGKEVIL---SRIPVVVGQSISDAD-LD 82 (781) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~I~--i~G~~~i~~~~i~---~~~~~~~G~~~~~~~-i~ 82 (781) ++.|+..|-.+|.+++.-++++ ++++++|.. |=+|+ |.|...+.+..|+ ..+++++|..-..-+ ++ T Consensus 73 G~pF~~~r~~kr~~~~~G~L~f~~~~~~lS~f-------iW~IdvrI~g~~~~~~~~i~~~L~~~G~k~Gv~k~~~Ds~~ 145 (406) T TIGR02876 73 GLPFLIKRLRKRPGILIGILLFLAIVYLLSTF-------IWKIDVRITGVKGETEYEIRKELKEMGIKPGVKKFSIDSVY 145 (406) T ss_pred CCHHHHHHHHHCCCHHHHHHHHHHHHHHHCCC-------EEEEEEEEECCCCCCHHHHHHHHHHCCCCCCCCCCCCCCHH T ss_conf 81478889885500766899999999986175-------58999997279997789999999855875683203688988 Q ss_pred HHHHHHHH-CCCCCEEEEEEECCEEEEEEEECCEE--EEE Q ss_conf 99999996-79800389998699899999936536--579 Q gi|254780772|r 83 HAVKNIYA-MGYFSNVKIKIVDSVLIIDLIERKII--NHL 119 (781) Q Consensus 83 ~~i~~l~~-~g~F~~v~~~~~~~~l~~~v~e~p~i--~~i 119 (781) ..-+.|.+ ..-..-|.++.+|+.|.|+|+|+-.+ ..+ T Consensus 146 ~~~~~L~~~~~ei~Wv~v~~~Gt~l~~kvvEk~~~ki~~~ 185 (406) T TIGR02876 146 KLERKLLDRVPEIMWVGVRVRGTKLVIKVVEKQEPKIKEV 185 (406) T ss_pred HHHHHHHHCCCCEEEEEEEEECCEEEEEEEECCCCCCCCC T ss_conf 9999998459948999999853099999982133324456 No 25 >PRK10049 pgaA outer membrane protein PgaA; Provisional Probab=93.97 E-value=0.37 Score=24.85 Aligned_cols=37 Identities=16% Similarity=0.129 Sum_probs=20.9 Q ss_pred CHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf 9889999748789882688999999999996798003 Q gi|254780772|r 60 GKEVILSRIPVVVGQSISDADLDHAVKNIYAMGYFSN 96 (781) Q Consensus 60 ~~~~i~~~~~~~~G~~~~~~~i~~~i~~l~~~g~F~~ 96 (781) +.+.|.-...-.+-..+....+....|+.-..+-|.. T Consensus 81 d~~v~~~~~~~~~~~~lp~~~l~~~Aka~Rn~kq~~~ 117 (818) T PRK10049 81 DQEVITVYERYQSHMQLPARGLAAAARAYRNLKQWQN 117 (818) T ss_pred CHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 7799999996188888997999999999863554677 No 26 >PRK10775 cell division protein FtsQ; Provisional Probab=92.36 E-value=0.65 Score=23.28 Aligned_cols=140 Identities=14% Similarity=0.161 Sum_probs=89.6 Q ss_pred CCCEEEEEEEEEC-CCCCCHHHHHHHC-CCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEE-CCEEEEEEEECCEEEEE Q ss_conf 4652685899966-5007988999974-8789882688999999999996798003899986-99899999936536579 Q gi|254780772|r 43 SNTSIVRRIEIRG-ATNVGKEVILSRI-PVVVGQSISDADLDHAVKNIYAMGYFSNVKIKIV-DSVLIIDLIERKIINHL 119 (781) Q Consensus 43 ~~~~~i~~I~i~G-~~~i~~~~i~~~~-~~~~G~~~~~~~i~~~i~~l~~~g~F~~v~~~~~-~~~l~~~v~e~p~i~~i 119 (781) ++..=|+.|.|.| .++++.+.|...+ .....-.|=..+++.+-+.|-++-+-.+|+|... .+.|.|.|+|.-.|..= T Consensus 53 ~~~lPis~i~v~G~~~~~~~~dir~ai~~~~~~G~Fft~Dl~~iq~~le~lPWV~~asVRR~WPd~L~V~v~E~~PvArW 132 (274) T PRK10775 53 AQRLPLSKLVLTGERHYTRNDDIRQAILALGAPGTFMTQDVNIIQQQIERLPWIKQVSVRKQWPDELKIHLVEYVPIARW 132 (274) T ss_pred CCCCCCEEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEECHHHHHHHHHHCCCEEEEEEEEECCCCEEEEEEEECCEEEE T ss_conf 54388538999030041799999999984478997575069999999984999337889984899609999985348998 Q ss_pred ------EEECCC-CCCHHHH-HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEEEE Q ss_conf ------995654-3698899-887300578637999999999999999997297001799876433898425889 Q gi|254780772|r 120 ------FFSGNN-NLKDDQL-KMIVRSRSAAAYDEDTVNADVHNIKQAYASIGYLNVMVKVQHHSISPTTLNITY 186 (781) Q Consensus 120 ------~~~Gn~-~~~~~~l-~~~~~~~~g~~~~~~~~~~~~~~i~~~y~~~Gy~~a~V~~~~~~~~~~~v~i~~ 186 (781) .-.|.- ....+.+ ...+..-.|..=++..+-+.-..+.+.+...|+--+.+... +.+.+.|++ T Consensus 133 n~~~lln~~G~vF~~p~~~~~~~~LP~L~GP~gs~~~Vl~~Y~~l~~~L~~~gl~I~~L~l~----~R~aW~l~L 203 (274) T PRK10775 133 NDQHMVDAEGNTFSVPSDRTGKQVLPMLYGPEGSENEVLQGYREMGQVLAKDKFTLKEAAMT----ARRSWQLTL 203 (274) T ss_pred CCCCCCCCCCEEEECCCCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEC----CCCEEEEEE T ss_conf 99872768980986670004556885578999879999999999999998779876899975----786189996 No 27 >PRK05529 cell division protein FtsQ; Provisional Probab=87.74 E-value=0.24 Score=26.06 Aligned_cols=71 Identities=14% Similarity=0.082 Sum_probs=41.9 Q ss_pred EEEEEEECCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEEEEEEECCCCEE Q ss_conf 65799956543698899887300578637999999999999999997297001799876433898425889860366324 Q gi|254780772|r 116 INHLFFSGNNNLKDDQLKMIVRSRSAAAYDEDTVNADVHNIKQAYASIGYLNVMVKVQHHSISPTTLNITYVIEEGVKAK 195 (781) Q Consensus 116 i~~i~~~Gn~~~~~~~l~~~~~~~~g~~~~~~~~~~~~~~i~~~y~~~Gy~~a~V~~~~~~~~~~~v~i~~~I~EG~~~~ 195 (781) |.+|+++||..++++++.+.+....|.++-.-...+..++|.+. -+--. .++...-++. |.+.|.|-.++- T Consensus 63 v~~I~V~G~~~~~~~~V~~a~~~~~G~pL~~vd~~~v~~~l~~l---p~V~s----a~V~r~~P~~--l~I~V~ER~Pva 133 (255) T PRK05529 63 LRSIEVAGNMRVKPQDIVAALRDQFGKPLPLVDPETVRKKLAAF---PLIRS----YSVESKPPGT--IVVRVVERVPLA 133 (255) T ss_pred EEEEEEECCCCCCHHHHHHHHCCCCCCEEEEECHHHHHHHHHCC---CCEEE----EEEEECCCCC--EEEEEEEEEEEE T ss_conf 88999989834799999998658789851388699999987449---98558----9999639792--699999505279 No 28 >TIGR03509 OMP_MtrB_PioB decaheme-associated outer membrane protein, MtrB/PioB family. Members of this protein family are integral proteins of the bacterial outer membrane, associated with multiheme c-type cytochromes involved in electron transfer. The MtrB protein of Shewanella oneidensis MR-1 (SO1776) has been shown to form a complex with 1:1:1 stochiometry with the small, periplasmic decaheme cytochrome MtrA and large, surface-exposed decaheme cytochrome MtrC. Probab=87.01 E-value=1.7 Score=20.51 Aligned_cols=36 Identities=17% Similarity=0.147 Sum_probs=19.7 Q ss_pred CCCCCCCCCCEEEEEEECCCCEEEEECCCEEEEEEE Q ss_conf 334323332024554202543023540412577640 Q gi|254780772|r 516 GSLDINDESAAVRMIVPITESISTSFKYDLRFLQYG 551 (781) Q Consensus 516 ~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (781) ..|+.++..+.+...|++.+..++..+|.++..... T Consensus 345 ~~~d~tk~~~~l~a~YR~~~~~~l~~gy~~~~~dRs 380 (649) T TIGR03509 345 RPYDKTKQKAELAADYRFTRSVKLTAGYDYKNDDRS 380 (649) T ss_pred CCCCCCCEECCEEEEEEECCCCEEECCEEEEECCCC T ss_conf 665500003203334875467277611207752222 No 29 >pfam08478 POTRA_1 POTRA domain, FtsQ-type. FtsQ/DivIB bacterial division proteins (pfam03799) contain an N-terminal POTRA domain (for polypeptide-transport-associated domain). This is found in different types of proteins, usually associated with a transmembrane beta-barrel. FtsQ/DivIB may have chaperone-like roles, which has also been postulated for the POTRA domain in other contexts. Probab=85.80 E-value=0.23 Score=26.23 Aligned_cols=29 Identities=14% Similarity=0.407 Sum_probs=19.2 Q ss_pred EEEEEEECCCCCCHHHHHHHHHHCCCCCC Q ss_conf 65799956543698899887300578637 Q gi|254780772|r 116 INHLFFSGNNNLKDDQLKMIVRSRSAAAY 144 (781) Q Consensus 116 i~~i~~~Gn~~~~~~~l~~~~~~~~g~~~ 144 (781) |.+|.++||+.++++++.+.+++..|..+ T Consensus 1 i~~I~V~G~~~~~~~~i~~~~~~~~~~~~ 29 (67) T pfam08478 1 IRQVEVSGNKHVSAEEIRKALGIQKGTSF 29 (67) T ss_pred CCEEEEECCCCCCHHHHHHHHCCCCCCCE T ss_conf 92999999834799999997676589927 No 30 >pfam11854 DUF3374 Protein of unknown function (DUF3374). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 665 to 712 amino acids in length. Probab=84.57 E-value=2.2 Score=19.74 Aligned_cols=32 Identities=13% Similarity=0.166 Sum_probs=13.2 Q ss_pred CCCCCCCCEEEEEEECCCCEEEEECCCEEEEE Q ss_conf 43233320245542025430235404125776 Q gi|254780772|r 518 LDINDESAAVRMIVPITESISTSFKYDLRFLQ 549 (781) Q Consensus 518 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (781) |+.++..+.+...|++....++..+|.++... T Consensus 374 ~d~tk~~~~l~a~YR~~~~~~l~agy~~~~~~ 405 (675) T pfam11854 374 YDRTRQKAKLAASYRLARGLKLDAGYDYKNDE 405 (675) T ss_pred CCCCCEECCCCCEEEECCCCEEEEEEEEEECC T ss_conf 64101021001128844674576116675050 No 31 >PRK09408 ompX outer membrane protein X; Provisional Probab=82.92 E-value=2.6 Score=19.30 Aligned_cols=43 Identities=14% Similarity=0.270 Sum_probs=31.0 Q ss_pred EEEEEEEEEEECCCCCEEEEEEEECCCCCCCEEEEEEEEEEECC Q ss_conf 47999999975570015899887226778760258999641059 Q gi|254780772|r 738 RVSTGVEIMWNSPLGMMGVYYGIPLRHREGDKIQQFGFRIGNRM 781 (781) Q Consensus 738 r~s~G~Glr~~tP~Gpirld~a~pl~~~~~D~~~~f~FsiG~~f 781 (781) ..++|+|+.++ |+--+-+|.+|-=.+-..-+...|-..+|-|| T Consensus 129 ~fayGAGvQ~N-P~~nvaid~gYE~S~~~~~~~~~f~vGvGYRF 171 (171) T PRK09408 129 GFSYGAGLQFN-PMENVALDFSYEQSRIRSVDVGTWIAGVGYRF 171 (171) T ss_pred CEEEEEEEEEC-CCCCEEEEEEEEECCCCCEEECCEEEEEEECC T ss_conf 14666547980-67787999876111458727685998114139 No 32 >PRK04922 tolB translocation protein TolB; Provisional Probab=80.65 E-value=3.1 Score=18.77 Aligned_cols=66 Identities=20% Similarity=0.380 Sum_probs=31.0 Q ss_pred HHHHHHHHHHH-HHHHHHCCCCEEEEEEEEEC-C-CCCCHHHHHHHCCCC-CCCC-CCHHHHHHHHH-HHHHCCCCCEE Q ss_conf 99999999999-99875114652685899966-5-007988999974878-9882-68899999999-99967980038 Q gi|254780772|r 25 MGFIILFYAIF-GLSAVYGSNTSIVRRIEIRG-A-TNVGKEVILSRIPVV-VGQS-ISDADLDHAVK-NIYAMGYFSNV 97 (781) Q Consensus 25 ~~~~~~~~~~~-~~~~~~~~~~~~i~~I~i~G-~-~~i~~~~i~~~~~~~-~G~~-~~~~~i~~~i~-~l~~~g~F~~v 97 (781) ++.+.++++++ ++.+.+++.+. +|+|.| . +.++-. | .++. .|+. .-..++.+.|. .|-.+|+|.-+ T Consensus 5 ~~~~~~~~~~~~~~~a~~a~a~L---~IeIt~G~~~~~pIA-V---~pF~~~~~~~~~~~~i~~II~~DL~~SG~F~~l 76 (439) T PRK04922 5 LRWLAALTALLLPLSALAQQQGL---TIDIVGGSASATPIA-V---VPMPYQGSGTAPQTDVSAVVSADLDRSGQFRTL 76 (439) T ss_pred HHHHHHHHHHHHHHHHHHHCCCE---EEEEECCCCCCEEEE-E---ECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCC T ss_conf 89999999999987788616836---999935777705069-9---375657876550364999998657727274415 No 33 >PRK01742 tolB translocation protein TolB; Provisional Probab=78.04 E-value=3.8 Score=18.25 Aligned_cols=44 Identities=20% Similarity=0.314 Sum_probs=24.3 Q ss_pred EEEEE-CCC-CCCHHHHHHHCCCCCCCCCCHHHHHHHHH-HHHHCCCCCEEE Q ss_conf 89996-650-07988999974878988268899999999-999679800389 Q gi|254780772|r 50 RIEIR-GAT-NVGKEVILSRIPVVVGQSISDADLDHAVK-NIYAMGYFSNVK 98 (781) Q Consensus 50 ~I~i~-G~~-~i~~~~i~~~~~~~~G~~~~~~~i~~~i~-~l~~~g~F~~v~ 98 (781) +|+|. |.. .++-. ..++.-.... +++|.+.|. .|-.+|+|+-+. T Consensus 28 ~I~It~G~~~~~pIA----i~pF~~~~~~-~~~i~~vI~~DL~~SG~F~~i~ 74 (430) T PRK01742 28 RIVIDEGVDGARPIA----VVPFKWNGSA-PEDIAKIIAADLRNSGKFNPIA 74 (430) T ss_pred EEEEECCCCCCCCEE----EECCCCCCCC-HHHHHHHHHHHHHCCCCCCCCC T ss_conf 999945767740459----9253678976-6789999987776084726259 No 34 >PRK00178 tolB translocation protein TolB; Provisional Probab=72.89 E-value=5.1 Score=17.39 Aligned_cols=67 Identities=18% Similarity=0.334 Sum_probs=33.9 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEC-C-CCCCHHHHHHHCCCC-CCCCCCHHHHHHHH-HHHHHCCCCCEEE Q ss_conf 99999999999999875114652685899966-5-007988999974878-98826889999999-9999679800389 Q gi|254780772|r 24 QMGFIILFYAIFGLSAVYGSNTSIVRRIEIRG-A-TNVGKEVILSRIPVV-VGQSISDADLDHAV-KNIYAMGYFSNVK 98 (781) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~i~~I~i~G-~-~~i~~~~i~~~~~~~-~G~~~~~~~i~~~i-~~l~~~g~F~~v~ 98 (781) .|+.++++++|++++..+.++ -.|+|.+ . ..++- +| .++. .|..--++++.+.| ..|-.+|+|+-+. T Consensus 3 ~l~~~~~l~~~l~~~~~a~A~----l~I~It~g~~~~ipI-AI---~pF~~~~~~~~~~~i~~iI~~DL~~sG~F~~l~ 73 (433) T PRK00178 3 KLKRIALLVLCLLAGGAAAAE----LNIEITSGSDRAIPI-AV---VPFGWQGGSVLPEDMAQIIGNDLRNSGYFEPIP 73 (433) T ss_pred CHHHHHHHHHHHHHHHHHCEE----EEEEEECCCCCEEEE-EE---CCCCCCCCCCHHHHHHHHHHHHHHCCCCCEECC T ss_conf 179999999999876762203----999993477760406-98---576666876224899999998886288855248 No 35 >TIGR01000 bacteriocin_acc bacteriocin secretion accessory protein; InterPro: IPR005696 Bacteriocins are antibacterial proteinaceous compounds produced by bacteria. In Gram-positive bacteria, they are divided into four classes. Within the class II, constituted by non-modified peptides produced mainly by lactic acid bacteria, bacteriocins of the subclass IIa, such as mesentericin Y105, are of particular interest. They are active against the foodborne pathogen Listeria monocytogenes and share a similar primary structure, with a conserved N-terminal motif (YGNGV). Subclass IIa bacteriocins induce membrane permeabilization of sensitive strains, but their target specificity and their molecular mode of action remain elusive. This family of proteins is involved in the secretion of such bacteriocins although the mechanism of this process is not well understood. ; GO: 0005215 transporter activity, 0006810 transport. Probab=72.26 E-value=1.6 Score=20.61 Aligned_cols=52 Identities=17% Similarity=0.324 Sum_probs=32.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHH Q ss_conf 999999999999987511465268589996650079889999748789882688999 Q gi|254780772|r 25 MGFIILFYAIFGLSAVYGSNTSIVRRIEIRGATNVGKEVILSRIPVVVGQSISDADL 81 (781) Q Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~i~~I~i~G~~~i~~~~i~~~~~~~~G~~~~~~~i 81 (781) +-.+++++|++++.+.+| =++|.|+|.-.|...-++..|..-...++...-| T Consensus 25 I~P~~lLLv~~~lFs~fA-----kkEivi~~~g~~~p~~~~~~iQst~n~~I~~N~L 76 (476) T TIGR01000 25 IVPIFLLLVFLVLFSLFA-----KKEIVIRSTGSIEPAKVIAKIQSTSNNAIKENYL 76 (476) T ss_pred HHHHHHHHHHHHHHHCCC-----EEEEEEEEEEEECCCCCCHHHCCCCCCHHHHHHH T ss_conf 999999999999972201-----0788984112661432012122884214565133 No 36 >PRK04792 tolB translocation protein TolB; Provisional Probab=70.66 E-value=5.7 Score=17.06 Aligned_cols=45 Identities=18% Similarity=0.330 Sum_probs=24.1 Q ss_pred EEEEEC-CCCCCHHHHHHHCCCC-CCCCCCHHHHHHHHH-HHHHCCCCCEE Q ss_conf 899966-5007988999974878-988268899999999-99967980038 Q gi|254780772|r 50 RIEIRG-ATNVGKEVILSRIPVV-VGQSISDADLDHAVK-NIYAMGYFSNV 97 (781) Q Consensus 50 ~I~i~G-~~~i~~~~i~~~~~~~-~G~~~~~~~i~~~i~-~l~~~g~F~~v 97 (781) .|+|.+ ....-. ....++. .|+.--+.+|.+.|. .|..+|+|+-+ T Consensus 25 ~IeIt~G~~~~~P---IAV~pF~~~g~~~~~~~i~~II~~DL~rSG~F~~l 72 (450) T PRK04792 25 ELVITDGIDSARP---IAIVPFKWEGEGKLPTDISAVIASDLQRSGKFSPV 72 (450) T ss_pred EEEEECCCCCCCC---EEEECCCCCCCCCHHHHHHHHHHHHHHHCCCCEEC T ss_conf 9999547777425---49937545687432579999999887608785525 No 37 >PRK02889 tolB translocation protein TolB; Provisional Probab=68.69 E-value=6.3 Score=16.80 Aligned_cols=43 Identities=26% Similarity=0.334 Sum_probs=24.2 Q ss_pred EEEEEC--CCCCCHHHHHHHCCCCCCCCCCHHHHHHHHH-HHHHCCCCCEE Q ss_conf 899966--5007988999974878988268899999999-99967980038 Q gi|254780772|r 50 RIEIRG--ATNVGKEVILSRIPVVVGQSISDADLDHAVK-NIYAMGYFSNV 97 (781) Q Consensus 50 ~I~i~G--~~~i~~~~i~~~~~~~~G~~~~~~~i~~~i~-~l~~~g~F~~v 97 (781) .|+|.+ ...++-. ..++. |+.--+.++.++|. .|-.+|+|+-+ T Consensus 29 ~IeIt~g~~~~~pIA----V~pF~-~~~~~~~~i~~II~~DL~~SG~F~~l 74 (430) T PRK02889 29 NVEITGVGATQFPIA----TANFA-NEAGLPQQVTSIVRADLARSGKFSNV 74 (430) T ss_pred EEEEECCCCCEEEEE----EECCC-CCCCCHHHHHHHHHHHHHHCCCCEEC T ss_conf 999935788707279----90657-88875067999999888728884005 No 38 >PRK12788 flgH flagellar basal body L-ring protein; Reviewed Probab=64.21 E-value=3.3 Score=18.64 Aligned_cols=22 Identities=5% Similarity=0.208 Sum_probs=12.3 Q ss_pred CCCEEEEEEECCEEEEEEEECC Q ss_conf 8003899986998999999365 Q gi|254780772|r 93 YFSNVKIKIVDSVLIIDLIERK 114 (781) Q Consensus 93 ~F~~v~~~~~~~~l~~~v~e~p 114 (781) +|.|-+....|++|+|.+.|+- T Consensus 65 LF~D~RA~~VGDIlTV~i~e~~ 86 (231) T PRK12788 65 LYRDPRASRTGDLLTVTISMND 86 (231) T ss_pred CCCCCCCCCCCCEEEEEEEECC T ss_conf 3366642337984999998724 No 39 >PRK04043 tolB translocation protein TolB; Provisional Probab=61.87 E-value=7.3 Score=16.38 Aligned_cols=19 Identities=5% Similarity=0.043 Sum_probs=13.7 Q ss_pred HHHHHHHHH-HHHHCCCCCE Q ss_conf 899999999-9996798003 Q gi|254780772|r 78 DADLDHAVK-NIYAMGYFSN 96 (781) Q Consensus 78 ~~~i~~~i~-~l~~~g~F~~ 96 (781) +.+|.+.|+ .|..+|+|+- T Consensus 47 ~~~i~~II~~DL~rSG~F~~ 66 (419) T PRK04043 47 KLKFFKILVNDLKVSSHFDV 66 (419) T ss_pred HHHHHHHHHHHHHCCCCCCC T ss_conf 68999999875341658512 No 40 >PRK09818 putative kinase inhibitor; Provisional Probab=58.77 E-value=8.1 Score=16.07 Aligned_cols=20 Identities=20% Similarity=0.325 Sum_probs=7.4 Q ss_pred EEEEEEECCCCEEEEEEEEC Q ss_conf 03787411221111345411 Q gi|254780772|r 476 LAAGFGRHAVQNYTFSVEDP 495 (781) Q Consensus 476 l~~~~~~~~~~~~~~s~~~P 495 (781) |+|+-....++++.+...+| T Consensus 55 L~W~~~P~gTkSfAl~~~DP 74 (183) T PRK09818 55 LTWSGAPEGTKSFAVTVYDP 74 (183) T ss_pred EEECCCCCCCEEEEEEEECC T ss_conf 25448999876889877679 No 41 >pfam06316 Ail_Lom Enterobacterial Ail/Lom protein. This family consists of several bacterial and phage Ail/Lom-like proteins. The Yersinia enterocolitica Ail protein is a known virulence factor. Proteins in this family are predicted to consist of eight transmembrane beta-sheets and four cell surface-exposed loops. It is thought that Ail directly promotes invasion and loop 2 contains an active site, perhaps a receptor-binding domain. The phage protein Lom is expressed during lysogeny, and encode host-cell envelope proteins. Lom is found in the bacterial outer membrane, and is homologous to virulence proteins of two other enterobacterial genera. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=56.50 E-value=10 Score=15.37 Aligned_cols=43 Identities=16% Similarity=0.232 Sum_probs=30.8 Q ss_pred EEEEEEEEEEECCCCCEEEEEEEECCCCCCCEEEEEEEEEEECC Q ss_conf 47999999975570015899887226778760258999641059 Q gi|254780772|r 738 RVSTGVEIMWNSPLGMMGVYYGIPLRHREGDKIQQFGFRIGNRM 781 (781) Q Consensus 738 r~s~G~Glr~~tP~Gpirld~a~pl~~~~~D~~~~f~FsiG~~f 781 (781) ..++|+|+.++ |+--+-+|.||-=.+...-+...|-..+|-|| T Consensus 157 sfAyGAGvQfN-P~~nvaiD~gYEgS~~~d~~~~gf~vGVGYRF 199 (199) T pfam06316 157 ALAWGAGVQFN-PTENIAVDLAYEASGRGDWRTDAFIVGTGYKF 199 (199) T ss_pred CEEEEEEEEEC-CCCCEEEEEEEEECCCCCCEECCEEEEEEECC T ss_conf 22442027971-57667999877630348705175899514409 No 42 >PRK13513 putative outer membrane receptor; Provisional Probab=54.42 E-value=11 Score=15.15 Aligned_cols=42 Identities=7% Similarity=0.173 Sum_probs=25.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCC Q ss_conf 99998888999999999999999875114652685899966500 Q gi|254780772|r 15 LEKYFPRSFQMGFIILFYAIFGLSAVYGSNTSIVRRIEIRGATN 58 (781) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~I~i~G~~~ 58 (781) .++.|+++.+.. +++.++++.++.++++.....+|.|.+.++ T Consensus 4 ~~~~~~~~~~~a--~~~~~~~~~~~~aa~~~~~le~ivVTAtr~ 45 (660) T PRK13513 4 VNQRFRKTHSAA--LVIAAVISSQGYAAEKTTPTDTMVVTASGF 45 (660) T ss_pred HHHHHHHHHHHH--HHHHHHHHHHHHHCCCCCCCCEEEEEECCC T ss_conf 555443799999--999998756886376788987299970487 No 43 >PRK03629 tolB translocation protein TolB; Provisional Probab=53.14 E-value=12 Score=15.02 Aligned_cols=45 Identities=24% Similarity=0.431 Sum_probs=25.6 Q ss_pred EEEEE-CCCC-CCHHHHHHHCCCC-CCCCCCHHHHHHHHHH-HHHCCCCCEEE Q ss_conf 89996-6500-7988999974878-9882688999999999-99679800389 Q gi|254780772|r 50 RIEIR-GATN-VGKEVILSRIPVV-VGQSISDADLDHAVKN-IYAMGYFSNVK 98 (781) Q Consensus 50 ~I~i~-G~~~-i~~~~i~~~~~~~-~G~~~~~~~i~~~i~~-l~~~g~F~~v~ 98 (781) .|+|. |... ++ ....++. .|+.-.+++|.++|.+ |-.+|+|+-+. T Consensus 24 ~I~It~G~~~~~P----IAV~pF~~~g~~~~~~~i~~II~~DL~rSG~F~~l~ 72 (430) T PRK03629 24 RIEITQGVDSARP----IGVVPFKWAGPGAAPEDIGGIVAADLRNSGKFNPLD 72 (430) T ss_pred EEEEECCCCCCCC----EEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCEECC T ss_conf 9999346677325----699286447888646789999987687577815039 No 44 >PRK00249 flgH flagellar basal body L-ring protein; Reviewed Probab=52.55 E-value=5 Score=17.44 Aligned_cols=21 Identities=24% Similarity=0.411 Sum_probs=8.6 Q ss_pred CCCEEEEEEECCEEEEEEEEC Q ss_conf 800389998699899999936 Q gi|254780772|r 93 YFSNVKIKIVDSVLIIDLIER 113 (781) Q Consensus 93 ~F~~v~~~~~~~~l~~~v~e~ 113 (781) +|.|-+....|++|+|.|.|+ T Consensus 66 Lf~D~RA~~VGDiiTV~i~E~ 86 (231) T PRK00249 66 LFKDRRARNVGDILTVVLQEN 86 (231) T ss_pred CCCCCCCCCCCCEEEEEEEEC T ss_conf 512344378999699999970 No 45 >PRK10641 btuB vitamin B12/cobalamin outer membrane transporter; Provisional Probab=43.57 E-value=16 Score=14.22 Aligned_cols=34 Identities=6% Similarity=-0.085 Sum_probs=20.2 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCC Q ss_conf 9999999999999987511465268589996650 Q gi|254780772|r 24 QMGFIILFYAIFGLSAVYGSNTSIVRRIEIRGAT 57 (781) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~i~~I~i~G~~ 57 (781) |||++.++.++.+.+..+.|+.....+|.|.+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~~~~devvVTAtr 34 (615) T PRK10641 1 MIKKASLLTALSVTAFSAWAQDTSPDTLVVTANR 34 (615) T ss_pred CCHHHHHHHHHHHHCCHHHHCCCCCCEEEEECCC T ss_conf 9448999999987513354335899719990348 No 46 >pfam10079 DUF2317 Uncharacterized protein conserved in bacteria (DUF2317). Members of this family of hypothetical bacterial proteins have no known function. Probab=41.10 E-value=18 Score=13.82 Aligned_cols=28 Identities=14% Similarity=0.116 Sum_probs=13.1 Q ss_pred HHHHCCCCCCCCCCHHHHHHHHHHHHHC Q ss_conf 9997487898826889999999999967 Q gi|254780772|r 64 ILSRIPVVVGQSISDADLDHAVKNIYAM 91 (781) Q Consensus 64 i~~~~~~~~G~~~~~~~i~~~i~~l~~~ 91 (781) -...+.......+++..+-+.+++-|+- T Consensus 39 f~~~~~~~~~~~~~R~~L~~~L~~q~~~ 66 (541) T pfam10079 39 FKKRLEELASRFFNREALVDALKAQNKR 66 (541) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHC T ss_conf 9999998645973199999999999855 No 47 >pfam10082 DUF2320 Uncharacterized protein conserved in bacteria (DUF2320). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=39.66 E-value=19 Score=13.68 Aligned_cols=22 Identities=14% Similarity=0.096 Sum_probs=9.0 Q ss_pred EEEEEEECCCCEEEEECCCEEE Q ss_conf 2455420254302354041257 Q gi|254780772|r 526 AVRMIVPITESISTSFKYDLRF 547 (781) Q Consensus 526 ~~~~~~~~~~~~~~~~~~~~~~ 547 (781) .+++.+.++..++..+..++.. T Consensus 223 ~~G~~~~~t~k~~g~~~~Gy~~ 244 (380) T pfam10082 223 RAGVSFDLTEKLRGEVKVGYLA 244 (380) T ss_pred EEEEEEEECCCEEEEEEEEEEE T ss_conf 8206998410079999998899 No 48 >TIGR02636 galM_Leloir galactose mutarotase; InterPro: IPR013458 Galactose mutarotases (D-galactose 1-epimerase) participate in the Leloir pathway for galactose/glucose interconversion. Genes encoding the proteins found in this entry are found clustered with genes encoding other enzymes of the Leloir pathway. These proteins belong to the aldose 1-epimerase family. However, the aldose 1-epimerase itself (5.1.3.3 from EC) has a relatively broad specificity and can utilise D-glucose, L-arabinose, D-xylose, D-galactose, maltose and lactose. The restricted genomic context of the genes suggests members should act primarily on D-galactose.; GO: 0004034 aldose 1-epimerase activity, 0006012 galactose metabolic process. Probab=39.43 E-value=12 Score=14.89 Aligned_cols=85 Identities=14% Similarity=0.162 Sum_probs=58.5 Q ss_pred CCCCCCCCCCEEEEEEECCCCCCCCCEEEEEEEEEEEEEEECCC-EEEEEEEEEEEEEEECCCCCCCCHHEEEECC--CC Q ss_conf 20110024651565420245655542036666300479993588-0999999997898516997412301275078--97 Q gi|254780772|r 586 LDNPIVPRKGMLISSSYDYAGFGGDSQYHRIGSRASYFYLLSDD-SDIVGSLRFGYGCVIPSNKNLQLFDQFSVSS--NY 662 (781) Q Consensus 586 rD~~~~Pt~G~~~~~~~~~a~~g~d~~f~k~~~~~~~y~~l~~~-~~l~~~~~~g~~~~~~~~~~lp~~erF~lGG--~~ 662 (781) -+-+++||.-.|++|.-+- .++|+.=+++.+.+..|.|+.+. ..-...+.-=-+..+.-.+.=.+.+.|-+.- -. T Consensus 160 k~Tp~NlTNHvYFNL~g~~--~~~d~~~H~L~lnAd~yLP~D~~tgIP~G~l~~V~~TsFDFr~~K~i~~~f~~~~~QqQ 237 (346) T TIGR02636 160 KATPVNLTNHVYFNLDGAE--AGSDVLSHELQLNADRYLPLDEETGIPLGELKSVDGTSFDFRKEKAIGQDFLADDDQQQ 237 (346) T ss_pred CCCCCCCCCCEEECCCCCC--CCCCHHHCEEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHCCHHHHH T ss_conf 6576588733011268772--56864265268864843012574346456520135788565562011254525834878 Q ss_pred CCCCCCCCCC Q ss_conf 1402126861 Q gi|254780772|r 663 YLRGFAYKGI 672 (781) Q Consensus 663 svRGy~~~~i 672 (781) .+.||+.--| T Consensus 238 ~~kGYDHaF~ 247 (346) T TIGR02636 238 LAKGYDHAFL 247 (346) T ss_pred HHCCCCCHHH T ss_conf 6406274451 No 49 >pfam09743 DUF2042 Uncharacterized conserved protein (DUF2042). This entry is the conserved N-terminal 300 residues of a group of proteins found from protozoa to Humans. The function is unknown. Probab=39.38 E-value=18 Score=13.89 Aligned_cols=26 Identities=15% Similarity=0.243 Sum_probs=12.0 Q ss_pred CCCEECCCCCCHHHHHCCCCCCCCCC Q ss_conf 99621121000000000000234456 Q gi|254780772|r 310 SGNLYNPQEIKESSEKISKYFFSGER 335 (781) Q Consensus 310 ~G~~~~~~~~~~~~~~i~~~y~~~GY 335 (781) .|+..+..-++...+.|++.+.+.|. T Consensus 105 ~geLit~~Yld~iaeeIne~Lqe~G~ 130 (272) T pfam09743 105 QGELITDDYLDRLAEEINEKLQECGQ 130 (272) T ss_pred CCEECCHHHHHHHHHHHHHHHHHCCC T ss_conf 57762458999999999999998295 No 50 >pfam08339 RTX_C RTX C-terminal domain. This family describes the C-terminal region of various bacterial haemolysins and leukotoxins, which belong to the RTX family of toxins. These are produced by various Gram negative bacteria, such as E. coli and Actinobacillus pleuropneumoniae. RTX toxins may interact with lipopolysaccharide (LPS) to functionally impair and eventually kill leukocytes. This region is found in association with the RTX N-terminal domain (pfam02382) and multiple hemolysin-type calcium-binding repeats (pfam00353). Probab=39.37 E-value=19 Score=13.65 Aligned_cols=12 Identities=42% Similarity=0.479 Sum_probs=4.0 Q ss_pred EEEEEECCEEEE Q ss_conf 899986998999 Q gi|254780772|r 97 VKIKIVDSVLII 108 (781) Q Consensus 97 v~~~~~~~~l~~ 108 (781) |.++..+..|++ T Consensus 10 l~fer~~~~Lii 21 (145) T pfam08339 10 LTFKRVGNDLII 21 (145) T ss_pred CEEEEECCCEEE T ss_conf 057882881699 No 51 >pfam06935 DUF1284 Protein of unknown function (DUF1284). This family consists of several hypothetical bacterial and archaeal proteins of around 130 residues in length. The function of this family is unknown, although it is thought that they may be iron-sulphur binding proteins. Probab=39.04 E-value=19 Score=13.62 Aligned_cols=19 Identities=11% Similarity=0.221 Sum_probs=7.7 Q ss_pred CCHHHCCCCCCCHHHHHHH Q ss_conf 1001001000000012344 Q gi|254780772|r 382 VIRRELELSEGDPINYSMI 400 (781) Q Consensus 382 VIrrel~~~~Gd~~~~~~i 400 (781) -+.+.+.+++|+.|....+ T Consensus 58 ~vl~~lgl~~g~~~~~~e~ 76 (103) T pfam06935 58 RVLELLGIKPGKVLTWDEA 76 (103) T ss_pred HHHHHCCCCCCCEECHHHH T ss_conf 9999849998878849999 No 52 >TIGR01994 SUF_scaf_2 SUF system FeS assembly protein, NifU family; InterPro: IPR011341 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This family represents SufU, NifU-like proteins that are part of the SUF system, one of the two different proteins proposed to act as a scaffold on which the Fe-S cluster is built and from which it is transferred.; GO: 0005506 iron ion binding, 0005515 protein binding, 0051536 iron-sulfur cluster binding, 0016226 iron-sulfur cluster assembly. Probab=36.93 E-value=21 Score=13.40 Aligned_cols=22 Identities=27% Similarity=0.428 Sum_probs=10.0 Q ss_pred CCEEEEEEEECCCCEEEEEEEEEC Q ss_conf 842588986036632467787504 Q gi|254780772|r 180 TTLNITYVIEEGVKAKINSIRFVG 203 (781) Q Consensus 180 ~~v~i~~~I~EG~~~~I~~I~f~G 203 (781) +.+.|.+.++.+. +|++|.|+| T Consensus 38 D~I~L~~k~~~~~--~i~D~~F~G 59 (139) T TIGR01994 38 DEITLTVKLEGDR--VIEDIAFEG 59 (139) T ss_pred CEEEEEEEECCCC--EEEEEEEEC T ss_conf 7378999874988--888888735 No 53 >PRK10716 long-chain fatty acid outer membrane transporter; Provisional Probab=34.25 E-value=23 Score=13.13 Aligned_cols=64 Identities=13% Similarity=0.189 Sum_probs=32.8 Q ss_pred EEEEEEEEEEEECCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEE Q ss_conf 7899999999614565654537999999767741566667754445676642479999999755700158998872 Q gi|254780772|r 686 IYSSASAAVSFPMPLVPERAGLRGAFFVDSATLYANHVALGADKLEGNDSFWRVSTGVEIMWNSPLGMMGVYYGIP 761 (781) Q Consensus 686 ~~~~~s~E~r~pl~~~~~~~~l~~~~F~D~G~v~~~~~~~~~~~~~~~~~~~r~s~G~Glr~~tP~Gpirld~a~p 761 (781) ..+..-+||++ .++..|++.+.+|---+-+.. ....++. ......|+|++..|.-. +.+|+||. T Consensus 341 w~~~~G~~Y~~-----~~~l~LRaG~aYD~sPv~~~~---~~~~iPd-~dR~~~S~G~~y~~~~~---~tvD~ay~ 404 (437) T PRK10716 341 YRIALGTTYYY-----DDNWTFRTGIAFDDSPVPAQN---RSISIPD-QDRFWLSAGTTYAFNKD---ASVDVGVS 404 (437) T ss_pred EEEEEEEEEEC-----CCCCEEEEEEEECCCCCCCCC---CCCCCCC-CCCEEEEEEEEEECCCC---CEEEEEEE T ss_conf 07997899987-----998189887787468888886---5620768-88859981268854889---57999889 No 54 >TIGR02851 spore_V_T stage V sporulation protein T; InterPro: IPR014213 Members of this protein entry include the stage V sporulation protein T (SpoVT), a protein of the sporulation/germination program in Bacillus subtilis and related species. The amino-terminal 50 amino acids are nearly perfectly conserved across all endospore-forming bacteria. SpoVT is a DNA-binding transcriptional regulator related to AbrB.. Probab=33.10 E-value=13 Score=14.70 Aligned_cols=15 Identities=33% Similarity=0.363 Sum_probs=9.0 Q ss_pred EEEEEEEECCCCEEE Q ss_conf 258898603663246 Q gi|254780772|r 182 LNITYVIEEGVKAKI 196 (781) Q Consensus 182 v~i~~~I~EG~~~~I 196 (781) ++=++.|.||.|..| T Consensus 21 IRRTLRIrEGDPLEI 35 (184) T TIGR02851 21 IRRTLRIREGDPLEI 35 (184) T ss_pred CCCCCCCCCCCCCEE T ss_conf 411010125997148 No 55 >TIGR02997 Sig70-cyanoRpoD RNA polymerase sigma factor, cyanobacterial RpoD-like family. Probab=32.18 E-value=13 Score=14.69 Aligned_cols=13 Identities=15% Similarity=0.314 Sum_probs=6.0 Q ss_pred CCCEEEEEEECCC Q ss_conf 4404665541233 Q gi|254780772|r 349 AKRIVDIEYLIDQ 361 (781) Q Consensus 349 ~~~~vdi~~~I~e 361 (781) ...||-+=+||.| T Consensus 182 qSRTIRLPiH~~E 194 (336) T TIGR02997 182 QSRTIRLPIHITE 194 (336) T ss_pred CCCCCCCCCCHHH T ss_conf 6883247410766 No 56 >TIGR00407 proA gamma-glutamyl phosphate reductase; InterPro: IPR000965 Gamma-glutamyl phosphate reductase (1.2.1.41 from EC) (GPR) is the enzyme that catalyzes the second step in the biosynthesis of proline from glutamate, the NADP-dependent reduction of L-glutamate 5-phosphate into L-glutamate 5-semialdehyde and phosphate. In bacteria (gene proA) and yeast (gene PRO2), GPR is a monofunctional protein, while in plants and mammals, it is a bifunctional enzyme (P5CS) that consists of two domains, an N-terminal glutamate 5-kinase domain (2.7.2.11 from EC) and a C-terminal GPR domain. ; GO: 0004350 glutamate-5-semialdehyde dehydrogenase activity, 0006561 proline biosynthetic process. Probab=30.32 E-value=21 Score=13.34 Aligned_cols=26 Identities=15% Similarity=0.289 Sum_probs=13.3 Q ss_pred HHHHHCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 9999748789882688999999999996 Q gi|254780772|r 63 VILSRIPVVVGQSISDADLDHAVKNIYA 90 (781) Q Consensus 63 ~i~~~~~~~~G~~~~~~~i~~~i~~l~~ 90 (781) .++.++.+.+++. -..+-++||.+-. T Consensus 53 ~l~dRL~L~~~dr--l~~i~~~V~~v~~ 78 (415) T TIGR00407 53 ALLDRLLLTEKDR--LKGIADGVKDVIE 78 (415) T ss_pred HHHHHHCCCCCHH--HHHHHHHHHHHHH T ss_conf 8998634780125--8999999999851 No 57 >COG2063 FlgH Flagellar basal body L-ring protein [Cell motility and secretion] Probab=29.61 E-value=27 Score=12.63 Aligned_cols=19 Identities=26% Similarity=0.399 Sum_probs=6.0 Q ss_pred CCEEEEEEECCEEEEEEEE Q ss_conf 0038999869989999993 Q gi|254780772|r 94 FSNVKIKIVDSVLIIDLIE 112 (781) Q Consensus 94 F~~v~~~~~~~~l~~~v~e 112 (781) |+|.+....|++|+|.+.| T Consensus 66 f~D~RA~~vGDilTVvi~E 84 (230) T COG2063 66 FEDRRASNVGDILTIVIQE 84 (230) T ss_pred HCCCCCCCCCCEEEEEEEE T ss_conf 2254325689879999985 No 58 >pfam07437 YfaZ YfaZ precursor. This family contains the precursor of the bacterial protein YfaZ (approximately 180 residues long). Many members of this family are hypothetical proteins. Probab=28.37 E-value=29 Score=12.49 Aligned_cols=42 Identities=14% Similarity=0.168 Sum_probs=24.9 Q ss_pred EEEEEEEEEECCCCCEEEEEEEECCC---CCC--CEE--EEEEEEEEECC Q ss_conf 79999999755700158998872267---787--602--58999641059 Q gi|254780772|r 739 VSTGVEIMWNSPLGMMGVYYGIPLRH---REG--DKI--QQFGFRIGNRM 781 (781) Q Consensus 739 ~s~G~Glr~~tP~Gpirld~a~pl~~---~~~--D~~--~~f~FsiG~~f 781 (781) .-+.+|+|| +|+-|+.++.||..-. .++ |.+ ..|+...|=+| T Consensus 132 ~ea~~gv~~-~v~~pl~v~aGYRyi~~~~~dg~~d~~ladG~yvG~~~sF 180 (180) T pfam07437 132 YEANSGVRY-NINRPMALYVGYRYINFEGKDGARDNTFADGAYIGGKFRF 180 (180) T ss_pred EEECCCEEE-EECCCCEEEEEEEEEEEECCCCCCCCEECCCCEEEEEECC T ss_conf 373461699-9737728997579998663679877524237468337509 No 59 >PRK07429 phosphoribulokinase; Provisional Probab=28.12 E-value=29 Score=12.47 Aligned_cols=71 Identities=13% Similarity=0.065 Sum_probs=37.5 Q ss_pred EEEECCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC---------CCCCCCCCC Q ss_conf 87504431101222211100122112445666431023333457888888754232210012---------310124555 Q gi|254780772|r 199 IRFVGNKNYSHARLERVISIRTSGYFSFGKTDVYSKERMSFDEEAIRAFYHDRGYAAVKVSS---------QVLFDKQKS 269 (781) Q Consensus 199 I~f~Gn~~f~~~~L~~~~~~k~~~~~~~~~~~~~~~~~l~~D~~~l~~~Y~~~GY~d~~V~~---------~~~~~~~~~ 269 (781) |.++|-..|.+.+|++.|..|-. . +.+.=-.-..+|.+=-..+||---+|-. ..-+.|.++ T Consensus 108 IIvEGLh~L~~~~lR~l~DlKIF-----V-----D~d~diR~~rRI~RDv~ERG~s~E~Vl~qi~~RkpD~~~yI~PQk~ 177 (331) T PRK07429 108 VVVEGLHPLYDERVRDLYDFKVY-----L-----DPPDEVKIAWKIKRDMAERGHTYEDVLAEIEKREPDFEAYIDPQRQ 177 (331) T ss_pred EEEECCHHCCCHHHHHHCCEEEE-----E-----CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCHHHHHHCCCCHH T ss_conf 99916121287989975493799-----6-----4878899999888778661899999999998511789976680411 Q ss_pred CCCEEEEECC Q ss_conf 5410122022 Q gi|254780772|r 270 GYVLIFQIDE 279 (781) Q Consensus 270 ~~~i~~~I~E 279 (781) .++|++.+-. T Consensus 178 ~ADiVI~~~p 187 (331) T PRK07429 178 YADVVIQVLP 187 (331) T ss_pred CCCEEEEECC T ss_conf 2728999627 No 60 >PRK12396 5-methylribose kinase; Reviewed Probab=28.04 E-value=29 Score=12.46 Aligned_cols=37 Identities=11% Similarity=0.087 Sum_probs=27.8 Q ss_pred HHHHHHHHHC-CCCEEEEEEEEEECCCCCEEEEEEEEC Q ss_conf 9999999972-970017998764338984258898603 Q gi|254780772|r 154 HNIKQAYASI-GYLNVMVKVQHHSISPTTLNITYVIEE 190 (781) Q Consensus 154 ~~i~~~y~~~-Gy~~a~V~~~~~~~~~~~v~i~~~I~E 190 (781) +.+.+|-+++ |++...-..+.++.++|..+++|.|.- T Consensus 13 ~~~i~Y~k~~~~~f~~~~~l~~~EIGDGNLN~Vf~V~~ 50 (409) T PRK12396 13 NDVIVYVKEKLCKFEHAKGLQCKEIGDGNLNYVFRVWD 50 (409) T ss_pred HHHHHHHHHHCCCCCCCCCCEEEEECCCCEEEEEEEEE T ss_conf 36999999826879998885368974785689999970 No 61 >pfam10437 Lip_prot_lig_C Bacterial lipoate protein ligase C-terminus. This is the C-terminal domain of a bacterial lipoate protein ligase. There is no conservation between this C-terminus and that of vertebrate lipoate protein ligase C-termini, but both are associated with the domain BPL_LipA_LipB pfam03099, further upstream. This C-terminal domain is more stable than the BPL_LipA_LipB and the hypothesis is that the C-terminal domain has a role in recognising the lipoyl domain and/or transferring the lipoyl group onto it from the lipoyl-AMP intermediate. C-terminal fragments of length 172 to 193 amino acid residues are observed in the eubacterial enzymes whereas in their archaeal counterparts the C-terminal segment is significantly smaller, ranging in size from 87 to 107 amino acid residues. Probab=27.77 E-value=29 Score=12.43 Aligned_cols=12 Identities=33% Similarity=0.556 Sum_probs=3.8 Q ss_pred CCCEECCCCCCH Q ss_conf 996211210000 Q gi|254780772|r 310 SGNLYNPQEIKE 321 (781) Q Consensus 310 ~G~~~~~~~~~~ 321 (781) .|.+|+...+.+ T Consensus 52 ~G~~y~~~~i~~ 63 (85) T pfam10437 52 IGVKYDKEAIKK 63 (85) T ss_pred CCCCCCHHHHHH T ss_conf 799869999999 No 62 >TIGR02582 cas_TM1809 CRISPR-associated RAMP protein, Csm3 family; InterPro: IPR013412 Members of this CRISPR-associated (cas) gene family are found in the mtube subtype CRISPR/cas locus, and and designated Csm3 for CRISPR/cas Subtype Mtube, protein 3.. Probab=25.96 E-value=24 Score=13.03 Aligned_cols=10 Identities=20% Similarity=0.404 Sum_probs=5.4 Q ss_pred EECCCCCCCCC Q ss_conf 50789714021 Q gi|254780772|r 657 SVSSNYYLRGF 667 (781) Q Consensus 657 ~lGG~~svRGy 667 (781) ||||..| ||| T Consensus 234 YLGG~Gs-RGY 243 (248) T TIGR02582 234 YLGGSGS-RGY 243 (248) T ss_pred CCCCCCE-ECC T ss_conf 0688702-156 No 63 >TIGR03519 Bac_Flav_fam_1 Bacteroidetes-specific putative membrane protein. This model describes a protein family unique to, and greatly expanded in, the Bacteriodetes. Species in this lineage include several, such as Cytophaga hutchinsonii and Flavobacterium johnsoniae, that exhibit a poorly understood rapid gliding phenotype. Several members of this protein family are found in operons with other genes whose loss leads to a loss a this motility. Probab=25.73 E-value=32 Score=12.19 Aligned_cols=39 Identities=10% Similarity=0.058 Sum_probs=20.9 Q ss_pred EEEEEEEEECCCCCEEEEEEEECCCCC--CCEEEEEEEEEEECC Q ss_conf 999999975570015899887226778--760258999641059 Q gi|254780772|r 740 STGVEIMWNSPLGMMGVYYGIPLRHRE--GDKIQQFGFRIGNRM 781 (781) Q Consensus 740 s~G~Glr~~tP~Gpirld~a~pl~~~~--~D~~~~f~FsiG~~f 781 (781) +..+|+.+. ..+++.|+|-+.-.+ ....+.++++++-|| T Consensus 252 ~~~~g~~~~---~~~~~gysYd~~~s~l~~~~~gshEi~l~y~F 292 (292) T TIGR03519 252 IGLVGFNLN---KRLSIGYSYDFSTSSLSAYNGGSHEISVSYRF 292 (292) T ss_pred EEEEEEEEC---CCEEEEEEEEEECCCCCCCCCCEEEEEEEEEC T ss_conf 999979977---98999999970233101779980899999879 No 64 >pfam10566 Glyco_hydro_97 Glycoside hydrolase 97. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands. Probab=25.33 E-value=32 Score=12.14 Aligned_cols=26 Identities=15% Similarity=0.268 Sum_probs=15.1 Q ss_pred EEEECCCCCCCCCEEEEEEEEEEEEE Q ss_conf 99614565654537999999767741 Q gi|254780772|r 694 VSFPMPLVPERAGLRGAFFVDSATLY 719 (781) Q Consensus 694 ~r~pl~~~~~~~~l~~~~F~D~G~v~ 719 (781) +..||.+++........+|-|+.++- T Consensus 583 ~~v~LsFL~~g~~y~a~iy~Dg~~a~ 608 (643) T pfam10566 583 VSLDLSFLGEGKDYTAELYKDGRDAD 608 (643) T ss_pred EEEECCCCCCCCCEEEEEEECCCCCC T ss_conf 88665235899808999996087655 No 65 >COG1386 scpB Chromosome segregation and condensation protein B [DNA replication, recombination and repair] Probab=25.01 E-value=33 Score=12.11 Aligned_cols=49 Identities=24% Similarity=0.360 Sum_probs=27.1 Q ss_pred EEEEEECCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 579995654369889988730057863799999999999999999729700179 Q gi|254780772|r 117 NHLFFSGNNNLKDDQLKMIVRSRSAAAYDEDTVNADVHNIKQAYASIGYLNVMV 170 (781) Q Consensus 117 ~~i~~~Gn~~~~~~~l~~~~~~~~g~~~~~~~~~~~~~~i~~~y~~~Gy~~a~V 170 (781) ..+.|.|.+-++.++|.+.+++. +...+......+++-|.+.|.--..| T Consensus 14 Eall~a~~~pls~~~L~~il~~~-----~~~~~~~~l~~l~~~y~~rg~~L~~~ 62 (184) T COG1386 14 EALLFAGGEPLSLKELAEILGIV-----SADAIIDALAELKEEYEDRGLELVEV 62 (184) T ss_pred HHHHHHCCCCCCHHHHHHHHCCC-----CHHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 99999647997899999986888-----47889999999998625798049997 No 66 >TIGR01645 half-pint poly-U binding splicing factor, half-pint family; InterPro: IPR006532 The proteins contain three RNA recognition motifs and have been characterised as poly-pyrimidine tract binding proteins associated with RNA splicing factors , , . In the case of PUF60, in complex with p54, and in the presence of U2AF, it facilitates association of U2 snRNP with pre-mRNA . . Probab=24.94 E-value=29 Score=12.48 Aligned_cols=20 Identities=15% Similarity=0.429 Sum_probs=13.4 Q ss_pred CHHHHHHHHH-HHHHHHHHHH Q ss_conf 0378999999-9999988889 Q gi|254780772|r 4 STEDFRRIKR-LLEKYFPRSF 23 (781) Q Consensus 4 ~~~~~~~~~~-~~~~~~~~~~ 23 (781) -+++++.=++ +|..+++.++ T Consensus 71 ~~d~~qkAKKyAMe~SIK~vL 91 (791) T TIGR01645 71 QKDDIQKAKKYAMEQSIKQVL 91 (791) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 678898887887766699999 No 67 >pfam05158 RNA_pol_Rpc34 RNA polymerase Rpc34 subunit. Subunit specific to RNA Pol III, the tRNA specific polymerase. The C34 subunit of yeast RNA Pol III is part of a subcomplex of three subunits which have no counterpart in the other two nuclear RNA polymerases. This subunit interacts with TFIIIB70 and is therefore participates in Pol III recruitment. Probab=24.81 E-value=33 Score=12.08 Aligned_cols=151 Identities=15% Similarity=0.209 Sum_probs=90.9 Q ss_pred CCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEEEECC--------------EEEEEEEE Q ss_conf 0079889999748789882688999999999996798003899986998999999365--------------36579995 Q gi|254780772|r 57 TNVGKEVILSRIPVVVGQSISDADLDHAVKNIYAMGYFSNVKIKIVDSVLIIDLIERK--------------IINHLFFS 122 (781) Q Consensus 57 ~~i~~~~i~~~~~~~~G~~~~~~~i~~~i~~l~~~g~F~~v~~~~~~~~l~~~v~e~p--------------~i~~i~~~ 122 (781) +-++.+.+...++ ..+...+-.+++.|.+.|.|.=++ .++.|.++.+..- +-+-|+=. T Consensus 24 ~~~~q~dL~~~~~-----~~~~~~l~~~ln~Ll~~~~l~l~k---~~~~l~yk~~~~e~a~k~~~l~~eE~lVY~~Ie~a 95 (313) T pfam05158 24 NLFSQQDLQALVP-----GLSLNLLMSAINILLDQGLLVLLK---QNDGLVYKAVTREEAKKKATLSDEESLVYSIIEES 95 (313) T ss_pred CCCCHHHHHHHCC-----CCCHHHHHHHHHHHHHCCCEEEEE---CCCEEEEEEECHHHHHHHCCCCHHHHHHHHHHHHC T ss_conf 9837999997657-----999999999999997469789995---49988899848999876528988899999999862 Q ss_pred CCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEEEEEEECCCCEEEEEEEEE Q ss_conf 65436988998873005786379999999999999999972970017998764338984258898603663246778750 Q gi|254780772|r 123 GNNNLKDDQLKMIVRSRSAAAYDEDTVNADVHNIKQAYASIGYLNVMVKVQHHSISPTTLNITYVIEEGVKAKINSIRFV 202 (781) Q Consensus 123 Gn~~~~~~~l~~~~~~~~g~~~~~~~~~~~~~~i~~~y~~~Gy~~a~V~~~~~~~~~~~v~i~~~I~EG~~~~I~~I~f~ 202 (781) ||+-+...+|+...++. ... +.++.+.++.+++..+.-. ...++.++-+.|++... ..+. T Consensus 96 g~~GIW~k~i~~~tnL~------~~~----l~K~LK~LEsk~lIKsVks---V~~~~rKvYmL~~LePS-------~eiT 155 (313) T pfam05158 96 GNEGIWTRTIKNRTNLH------QSV----LKKCLKSLESKKYIKSVKS---VKAPTRKMYMLYNLEPS-------EEVT 155 (313) T ss_pred CCCCEEHHHHHHHCCCC------HHH----HHHHHHHHHHCCCEEEECC---CCCCCEEEEEEECCCCC-------CCCC T ss_conf 76774499999880998------899----9999999975686154214---56787059998347747-------1114 Q ss_pred CCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 44311012222111001221124456664310233334578888887542322 Q gi|254780772|r 203 GNKNYSHARLERVISIRTSGYFSFGKTDVYSKERMSFDEEAIRAFYHDRGYAA 255 (781) Q Consensus 203 Gn~~f~~~~L~~~~~~k~~~~~~~~~~~~~~~~~l~~D~~~l~~~Y~~~GY~d 255 (781) |-.-| ..+.|+.+-++.-++.+..|..++-+.. T Consensus 156 GG~Wy--------------------tD~elD~efI~~L~~~~~~~v~~k~~~~ 188 (313) T pfam05158 156 GGPWY--------------------TDGELDVEFINTLRRVIFRFVSQKSYVA 188 (313) T ss_pred CCCCC--------------------CCCCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 88651--------------------2785569999999999999998344554 No 68 >PRK05799 coproporphyrinogen III oxidase; Provisional Probab=24.68 E-value=33 Score=12.06 Aligned_cols=11 Identities=18% Similarity=0.437 Sum_probs=4.5 Q ss_pred CCCCCCHHHHH Q ss_conf 44311012222 Q gi|254780772|r 203 GNKNYSHARLE 213 (781) Q Consensus 203 Gn~~f~~~~L~ 213 (781) |-+.|++..|+ T Consensus 116 GVQSf~~~~l~ 126 (374) T PRK05799 116 GLQAWQNSLLK 126 (374) T ss_pred ECCCCCHHHHH T ss_conf 53358899999 No 69 >PRK13484 putative iron-regulated outer membrane virulence protein; Provisional Probab=23.05 E-value=36 Score=11.86 Aligned_cols=31 Identities=13% Similarity=0.075 Sum_probs=19.4 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEEEEECCCCC Q ss_conf 9999999998751146526858999665007 Q gi|254780772|r 29 ILFYAIFGLSAVYGSNTSIVRRIEIRGATNV 59 (781) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~i~~I~i~G~~~i 59 (781) +...|..+++++.|++...+.+|.|...+.- T Consensus 11 ~~~~~~~~~~~~~a~~~~~~e~vvVTAtr~~ 41 (682) T PRK13484 11 IAVMCSAVISSGYASSDKKEDTLVVTASGFT 41 (682) T ss_pred HHHHHHHHHCHHHHHCCCCCCEEEEECCCCC T ss_conf 9999988755354522268874999215873 No 70 >pfam09609 Cas_GSU0054 CRISPR-associated protein, GSU0054 family (Cas_GSU0054). This entry represents a rare CRISPR-associated protein. So far, members are found in Geobacter sulfurreducens and in two unpublished genomes: Gemmata obscuriglobus and Actinomyces naeslundii. CRISPR-associated proteins typically are found near CRISPR repeats and other CRISPR-associated proteins, have low levels of sequence identify, have sequence relationships that suggest lateral transfer, and show some sequence similarity to DNA-active proteins such as helicases and repair proteins. Probab=22.68 E-value=36 Score=11.82 Aligned_cols=22 Identities=23% Similarity=0.155 Sum_probs=16.7 Q ss_pred EEEEEEEEEECCC-CCEEEEEEE Q ss_conf 7999999975570-015899887 Q gi|254780772|r 739 VSTGVEIMWNSPL-GMMGVYYGI 760 (781) Q Consensus 739 ~s~G~Glr~~tP~-Gpirld~a~ 760 (781) .+.++-|+|..|+ |||.|..+- T Consensus 488 ~~~h~rl~F~epV~GPi~lGagr 510 (519) T pfam09609 488 PSYHVRISFPEEVKGPVIIGAGR 510 (519) T ss_pred CCCEEEEECCCCCCCCEEECCCC T ss_conf 66148997787665765624766 No 71 >TIGR00174 miaA tRNA delta(2)-isopentenylpyrophosphate transferase; InterPro: IPR002627 tRNA isopentenyltransferases 2.5.1.8 from EC also known as tRNA delta(2)-isopentenylpyrophosphate transferases or IPP transferases. These enzymes modify both cytoplasmic and mitochondrial tRNAs at A(37) to give isopentenyl A(37) .; GO: 0004811 tRNA isopentenyltransferase activity, 0005524 ATP binding, 0008033 tRNA processing. Probab=22.12 E-value=23 Score=13.10 Aligned_cols=22 Identities=23% Similarity=0.160 Sum_probs=8.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHH Q ss_conf 4310233334578888887542 Q gi|254780772|r 231 VYSKERMSFDEEAIRAFYHDRG 252 (781) Q Consensus 231 ~~~~~~l~~D~~~l~~~Y~~~G 252 (781) .|+-..+..+......--.++| T Consensus 67 ~y~~~~F~~~~~~~~~~i~~~G 88 (307) T TIGR00174 67 SYSAADFQTQALNAIADITARG 88 (307) T ss_pred CCCCHHHHHHHHHHHHHHHHCC T ss_conf 0370889999999999998569 No 72 >KOG1391 consensus Probab=21.83 E-value=38 Score=11.71 Aligned_cols=32 Identities=16% Similarity=0.230 Sum_probs=14.4 Q ss_pred HHHCCCCEEEEEEEEEECCCCCEEEEEEEECCCC Q ss_conf 9972970017998764338984258898603663 Q gi|254780772|r 160 YASIGYLNVMVKVQHHSISPTTLNITYVIEEGVK 193 (781) Q Consensus 160 y~~~Gy~~a~V~~~~~~~~~~~v~i~~~I~EG~~ 193 (781) -++.|||+.++.+-......+ ..+|.|+|.|+ T Consensus 192 aq~aG~fn~Ei~pI~~K~kkg--e~~f~vDEHpR 223 (396) T KOG1391 192 AQDAGYFNDEIAPIEVKTKKG--EQTFQVDEHPR 223 (396) T ss_pred HHHCCCCCCCCCCEEEEECCC--CEEEEECCCCC T ss_conf 654473234302269861367--25467447898 No 73 >PRK00286 xseA exodeoxyribonuclease VII large subunit; Reviewed Probab=21.48 E-value=38 Score=11.66 Aligned_cols=50 Identities=18% Similarity=0.293 Sum_probs=34.3 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCEEEEEEE--------CCEEEEEEEE-CCEEEEEEEECC Q ss_conf 82688999999999996798003899986--------9989999993-653657999565 Q gi|254780772|r 74 QSISDADLDHAVKNIYAMGYFSNVKIKIV--------DSVLIIDLIE-RKIINHLFFSGN 124 (781) Q Consensus 74 ~~~~~~~i~~~i~~l~~~g~F~~v~~~~~--------~~~l~~~v~e-~p~i~~i~~~Gn 124 (781) ..++-.+++..||++.... |..|+|.-+ .|-+-|.+++ +-.|+-+.|.|+ T Consensus 4 ~~lsVsel~~~ik~~le~~-~~~v~V~GEIS~~~~~~sGH~YF~LkD~~a~i~~v~~~~~ 62 (443) T PRK00286 4 NPLSVSELNGYVKSLLERD-LGQVWVRGEISNFTRPSSGHWYFTLKDEQAQIRCVMFRGS 62 (443) T ss_pred CCEEHHHHHHHHHHHHHHH-CCCEEEEEEECCCEECCCCEEEEEEEECCCEEEEEEECCH T ss_conf 8630999999999999834-8989999996466628996499999818928999999284 No 74 >TIGR01780 SSADH succinic semialdehyde dehydrogenase; InterPro: IPR010102 Succinic semialdehyde dehydrogenase is one of three enzymes constituting 4-aminobutyrate (GABA) degradation in both prokaryotes and eukaryotes, catalysing the (NAD(P)+)-dependent catabolism reaction of succinic semialdehyde to succinate for metabolism by the citric acid cycle. In Escherichia coli, succinic semialdehyde dehydrogenase is located in a unidirectionally transcribed gene cluster encoding enzymes for GABA degradation and is suggested to be co-transcribed with succinic semialdehyde transaminase from a common promoter upstream of SSADH . Similar gene arrangements can be found in characterised Ralstonia eutropha and the genome analysis of Bacillus subtilis. Prokaryotic succinic semialdehyde dehydrogenases (1.2.1.16 from EC) share high sequence homology to characterised succinic semialdehyde dehydrogenases from rat and human (1.2.1.24 from EC), exhibiting conservation of proposed cofactor binding residues, and putative active sites (G-237 and G-242, C-293 and G-259 respectively of rat SSADH) . Eukaryotic SSADH enzymes exclusively utilise NAD+ as a cofactor, exhibiting little to no NADP+ activity . While a NADP+ preference has been detected in prokaryotes in addition to both NADP+- and NAD+-dependencies as in Escherichia coli, Pseudomonas, and Klebsiella pneumoniae . The function of this alternative SSADH currently is unknown, but has been suggested to play a possible role in 4-hydroxyphenylacetic degradation . SSADH enzyme belongs to the aldehyde dehydrogenase family, sharing a common evolutionary origin and enzymatic mechanism with lactaldehyde dehydrogenase . Like in lactaldehyde dehydrogenase and succinate semialdehyde dehydrogenase, the mammalian catalytic glutamic acid and cysteine residues are conserved in all the enzymes of this family. ; GO: 0016620 oxidoreductase activity acting on the aldehyde or oxo group of donors NAD or NADP as acceptor. Probab=21.44 E-value=38 Score=11.66 Aligned_cols=189 Identities=13% Similarity=0.151 Sum_probs=89.0 Q ss_pred CHHHHHHHHHHCCCCCCCHHHHHHHH-HHHHHHHHHCCCCEEEEEEEEEEC---CCCCEEEEEEEECCCCEEEEEEEEEC Q ss_conf 98899887300578637999999999-999999997297001799876433---89842588986036632467787504 Q gi|254780772|r 128 KDDQLKMIVRSRSAAAYDEDTVNADV-HNIKQAYASIGYLNVMVKVQHHSI---SPTTLNITYVIEEGVKAKINSIRFVG 203 (781) Q Consensus 128 ~~~~l~~~~~~~~g~~~~~~~~~~~~-~~i~~~y~~~Gy~~a~V~~~~~~~---~~~~v~i~~~I~EG~~~~I~~I~f~G 203 (781) ..|+|.+.|+++.|.+|.+.+-+-.- .--.+.|.+++- .|.-++.+- +.+|+ +.|. +|+-|..+ T Consensus 57 nkddLA~liT~E~GKPL~eA~GEv~YaA~ylEWfaEEA~---RvyG~~~~~s~~~~~r~---~v~k--QPVGV~a~---- 124 (454) T TIGR01780 57 NKDDLAKLITLENGKPLKEAKGEVLYAASYLEWFAEEAK---RVYGDTIPSSLQSDKRL---LVIK--QPVGVCAA---- 124 (454) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCC---CCCCCCCCCCCCCCCEE---EEEE--CCCCCEEE---- T ss_conf 478999998763689626720102233245655530055---24444136787778637---9972--68766154---- Q ss_pred CCCCCHHHHHHHHHCCCCCC-CCCCCCCCCCHHH-HHHHHHHHHHHHHHHHHHHHHCCCCCCCCC--CCCCCCEEEEECC Q ss_conf 43110122221110012211-2445666431023-333457888888754232210012310124--5555410122022 Q gi|254780772|r 204 NKNYSHARLERVISIRTSGY-FSFGKTDVYSKER-MSFDEEAIRAFYHDRGYAAVKVSSQVLFDK--QKSGYVLIFQIDE 279 (781) Q Consensus 204 n~~f~~~~L~~~~~~k~~~~-~~~~~~~~~~~~~-l~~D~~~l~~~Y~~~GY~d~~V~~~~~~~~--~~~~~~i~~~I~E 279 (781) +..+--=..|-|+..+- |-=+..=++.|.. =-.--=.|..+=..-|+-+-. .+|.... +-..+== ...+ T Consensus 125 ---ITPWNFPaAMITRKagaALAAGCT~i~KPA~~TPlsALaLA~LA~~AG~P~GV--lnVi~g~~~~a~~~G~--~Lt~ 197 (454) T TIGR01780 125 ---ITPWNFPAAMITRKAGAALAAGCTVIIKPAEQTPLSALALARLAEEAGIPKGV--LNVITGSANDAEEVGK--VLTT 197 (454) T ss_pred ---ECCCCCHHHHHHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHHHHHCCCCCCC--CEEECCCCCCHHHHCC--EECC T ss_conf ---27888723677733446763068278637888847899999999857899984--1356176404355150--0026 Q ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHH------HHCCCCC-----EECCCCCCHHHHH-CCCCCCCCCCCCC Q ss_conf 22100000123334445576777753------2116996-----2112100000000-0000234456420 Q gi|254780772|r 280 GEIYTVGNISIQSTLQEIQKKTLLSL------IRIRSGN-----LYNPQEIKESSEK-ISKYFFSGERPFV 338 (781) Q Consensus 280 G~~y~i~~i~i~g~~~~~~~~~l~~~------~~~k~G~-----~~~~~~~~~~~~~-i~~~y~~~GY~fa 338 (781) -| .|++|+|+|.+. +=+-.+.+. +.+.=|- +|+-.+++++++. +..+|++.|-..+ T Consensus 198 ~P--~VrK~sFTGST~-vGKiLM~qsA~tvKkvS~ELGGnAPfIVF~DAd~D~AVE~a~AsKFRn~GQTCV 265 (454) T TIGR01780 198 SP--LVRKVSFTGSTN-VGKILMKQSASTVKKVSMELGGNAPFIVFDDADIDQAVEGAMASKFRNAGQTCV 265 (454) T ss_pred CC--CEEEEEECCCCH-HHHHHHHHHHHHHCEEHHHCCCCCCEEECCCCCHHHHHHHHHHHHHCCCCCEEE T ss_conf 88--477654137717-899999961322212123207898830259988768999999874527996377 No 75 >COG1570 XseA Exonuclease VII, large subunit [DNA replication, recombination, and repair] Probab=20.97 E-value=39 Score=11.59 Aligned_cols=51 Identities=25% Similarity=0.387 Sum_probs=36.7 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCEEEEEEE--------CCEEEEEEE-ECCEEEEEEEECCC Q ss_conf 82688999999999996798003899986--------998999999-36536579995654 Q gi|254780772|r 74 QSISDADLDHAVKNIYAMGYFSNVKIKIV--------DSVLIIDLI-ERKIINHLFFSGNN 125 (781) Q Consensus 74 ~~~~~~~i~~~i~~l~~~g~F~~v~~~~~--------~~~l~~~v~-e~p~i~~i~~~Gn~ 125 (781) .+++-..++.-||.+.... |..|.|.-+ .+-+.|.++ ++..|.-+.|.|+. T Consensus 4 ~~~sVSeln~~ik~llE~~-~~~V~v~GEISn~t~~~sgH~YFtLKD~~A~i~c~mf~~~~ 63 (440) T COG1570 4 KILSVSELNDYIKRLLERD-LGQVWVRGEISNFTRPASGHLYFTLKDERAQIRCVMFKGNN 63 (440) T ss_pred CEECHHHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCCCCEEEEECCCCCEEEEEEECCCC T ss_conf 7051999999999999840-78599999963776688860799971677168999974700 No 76 >PRK05137 tolB translocation protein TolB; Provisional Probab=20.77 E-value=40 Score=11.57 Aligned_cols=21 Identities=19% Similarity=0.509 Sum_probs=15.4 Q ss_pred HHHHHHHH-HHHHHCCCCCEEE Q ss_conf 89999999-9999679800389 Q gi|254780772|r 78 DADLDHAV-KNIYAMGYFSNVK 98 (781) Q Consensus 78 ~~~i~~~i-~~l~~~g~F~~v~ 98 (781) ..++.++| ..|-.+|+|+-+. T Consensus 56 ~~~i~~II~~DL~~SG~F~~l~ 77 (437) T PRK05137 56 GADIAQVIAADLKRSGLFAPID 77 (437) T ss_pred HHHHHHHHHHHHHHCCCCEECC T ss_conf 6889999997786388955648 Done!