BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781211|ref|YP_003065624.1| hypothetical protein CLIBASIA_05590 [Candidatus Liberibacter asiaticus str. psy62] (234 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781211|ref|YP_003065624.1| hypothetical protein CLIBASIA_05590 [Candidatus Liberibacter asiaticus str. psy62] gi|254040888|gb|ACT57684.1| hypothetical protein CLIBASIA_05590 [Candidatus Liberibacter asiaticus str. psy62] gi|317120676|gb|ADV02499.1| hypothetical protein SC1_gp095 [Liberibacter phage SC1] gi|317120820|gb|ADV02641.1| hypothetical protein SC1_gp095 [Candidatus Liberibacter asiaticus] Length = 234 Score = 479 bits (1234), Expect = e-134, Method: Compositional matrix adjust. Identities = 234/234 (100%), Positives = 234/234 (100%) Query: 1 MSDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKP 60 MSDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKP Sbjct: 1 MSDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKP 60 Query: 61 PIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEE 120 PIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEE Sbjct: 61 PIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEE 120 Query: 121 RKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESS 180 RKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESS Sbjct: 121 RKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESS 180 Query: 181 TGDTFVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKTLRQWAEKQATLNQ 234 TGDTFVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKTLRQWAEKQATLNQ Sbjct: 181 TGDTFVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKTLRQWAEKQATLNQ 234 >gi|315122898|ref|YP_004063387.1| hypothetical protein CKC_05770 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496300|gb|ADR52899.1| hypothetical protein CKC_05770 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 234 Score = 239 bits (609), Expect = 3e-61, Method: Compositional matrix adjust. Identities = 120/230 (52%), Positives = 163/230 (70%), Gaps = 6/230 (2%) Query: 9 TPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIEDYTLS 68 TP+ STP E E + N P +KE+ V+S+P + S+ +STEEA +P EDYTL+ Sbjct: 7 TPV-STPIDTEVESPEVDNTPSTKEQ-VESNPFKKYASTEDTSTEEAEASQPNSEDYTLN 64 Query: 69 CPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHG----RKIGEQFGASLEEERKLL 124 CP+Y+ + EV AH EAFKEAG+DA+ AQKV+D+LV HG ++ E+ +L E++ L Sbjct: 65 CPEYIPQEEVKAHSEAFKEAGIDAKTAQKVIDRLVAHGQENEKRSIERLDKALAEDKTAL 124 Query: 125 QTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESSTGDT 184 +++ G DY REKDIARYFR+EKI D DVQ+L+S WGF+KTF FFDRYAQ NKE+S GDT Sbjct: 125 KSEYGVDYSRREKDIARYFRQEKIADADVQALVSTWGFQKTFKFFDRYAQMNKETSVGDT 184 Query: 185 FVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKTLRQWAEKQATLNQ 234 F RSEGS+ + D+DK+FN P+F + +GD A ++QWA +QA L+Q Sbjct: 185 FARSEGSRGSQEDYDKLFNNPEFIEKRKAGDTSANNKIKQWANQQALLDQ 234 >gi|317120719|gb|ADV02541.1| hypothetical protein SC2_gp095 [Liberibacter phage SC2] gi|317120780|gb|ADV02601.1| hypothetical protein SC2_gp095 [Candidatus Liberibacter asiaticus] Length = 246 Score = 112 bits (281), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 77/228 (33%), Positives = 115/228 (50%), Gaps = 24/228 (10%) Query: 31 SKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPI-------------------EDYTLSCPD 71 +K E + +DP + S +E + KP + +DYT+ P Sbjct: 19 TKGEEINTDPSVKEMGSKVEVSEAKFDSKPDVSKKASLSDKLFSDEKVDIPKDYTIDFPK 78 Query: 72 YVSEAEVTAHIEAFKEAGVDARVAQKVVDKLV----DHGRKIGEQFGASLEEERKLLQTK 127 VSEA+ +E+F + G+ AQ + D L +H + E+ +E+ L+ + Sbjct: 79 GVSEADKKQALESFVKNGLSKADAQVLTDSLAKSFQEHIKSQEEKHNRIFDEDILKLKRE 138 Query: 128 LGSD-YETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESSTGDTFV 186 G + + K + + ++ + + I+ G F ++ ++ ++SS Sbjct: 139 YGLEGFSQLNKKVKGFVKEVGLSGETFDNFIAVAGAYNAFKLLEKLSRSTRDSSYSPPPP 198 Query: 187 RSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKTLRQWAEKQATLNQ 234 GSQEADRDFDKVF+TPDFGSRVLSGD EATKTLRQWAEKQATLNQ Sbjct: 199 SQRGSQEADRDFDKVFDTPDFGSRVLSGDMEATKTLRQWAEKQATLNQ 246 >gi|315121936|ref|YP_004062425.1| hypothetical protein CKC_00930 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495338|gb|ADR51937.1| hypothetical protein CKC_00930 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 60 Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 30/59 (50%), Positives = 42/59 (71%) Query: 176 NKESSTGDTFVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKTLRQWAEKQATLNQ 234 NKE+S GDTF RSEGS+ + D+DK+FN P+F + +GD A ++QWA +QA L+Q Sbjct: 2 NKETSVGDTFARSEGSRGSQEDYDKLFNNPEFIEKRKAGDTSANNKIKQWANQQALLDQ 60 >gi|167924173|ref|ZP_02511264.1| invertase/recombinase protein [Burkholderia pseudomallei BCC215] Length = 279 Score = 38.9 bits (89), Expect = 0.56, Method: Compositional matrix adjust. Identities = 23/66 (34%), Positives = 34/66 (51%), Gaps = 3/66 (4%) Query: 82 IEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTK---LGSDYETREKD 138 +E F+ AGV R Q+ VD GR + FGA E ER L++ + LGS + + Sbjct: 63 VERFEAAGVGLRSLQESVDTASIGGRLVFHLFGALAEFERNLIRAERLPLGSSRSAKSVN 122 Query: 139 IARYFR 144 + +FR Sbjct: 123 MLCHFR 128 >gi|222778489|ref|YP_002576126.1| DNA methylase/helicase [Campylobacter lari RM2100] gi|222539774|gb|ACM64874.1| DNA methylase/helicase [Campylobacter lari RM2100] Length = 1934 Score = 38.9 bits (89), Expect = 0.57, Method: Compositional matrix adjust. Identities = 35/122 (28%), Positives = 54/122 (44%), Gaps = 9/122 (7%) Query: 82 IEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTKLGSDYETREKDIAR 141 IE F++AG D R + D E G L LLQ +L SD E + Sbjct: 1621 IEQFRKAGADVRTLDDITSGAADAAEMKAEATGNPL----ILLQVQLSSDLRQEEMLYSG 1676 Query: 142 YFRKEKIPDNDVQSLISAWGF----EKTFNFFDRYAQQNK-ESSTGDTFVRSEGSQEADR 196 Y R+ + ++S IS F K +N Q+NK ++ G F+ +E ++E+ + Sbjct: 1677 YKRQIHNNEESLRSNISKIEFLEKETKKYNILQDIIQKNKSDNFQGKCFIYNENNEESAK 1736 Query: 197 DF 198 DF Sbjct: 1737 DF 1738 >gi|257056994|ref|YP_003134826.1| short chain enoyl-CoA hydratase /3-hydroxyacyl-CoA dehydrogenase [Saccharomonospora viridis DSM 43017] gi|256586866|gb|ACU97999.1| short chain enoyl-CoA hydratase /3-hydroxyacyl-CoA dehydrogenase [Saccharomonospora viridis DSM 43017] Length = 728 Score = 37.7 bits (86), Expect = 1.2, Method: Compositional matrix adjust. Identities = 27/89 (30%), Positives = 44/89 (49%), Gaps = 5/89 (5%) Query: 34 EAVQSDPQGRNPSSSSSSTEEAGEPKPPIE---DYTLSCPDYVSEAEVTAHIEAFKEAGV 90 EAV + +G P++ + +AG P PP++ + TL+ P + E E A +EA Sbjct: 522 EAVAALGEGVEPATIEQAGAQAGYPAPPLQLMDELTLTLPRKIRE-ETKAAVEA-AGGTW 579 Query: 91 DARVAQKVVDKLVDHGRKIGEQFGASLEE 119 A V+D+++D + G FGA E Sbjct: 580 TPHPADAVIDRMIDEFDRKGRSFGAGFYE 608 >gi|76818748|ref|YP_336347.1| invertase/recombinase protein [Burkholderia pseudomallei 1710b] gi|167744016|ref|ZP_02416790.1| invertase/recombinase protein [Burkholderia pseudomallei 14] gi|167821211|ref|ZP_02452891.1| invertase/recombinase protein [Burkholderia pseudomallei 91] gi|167851038|ref|ZP_02476546.1| invertase/recombinase protein [Burkholderia pseudomallei B7210] gi|167916321|ref|ZP_02503412.1| invertase/recombinase protein [Burkholderia pseudomallei 112] gi|76583221|gb|ABA52695.1| invertase/recombinase protein [Burkholderia pseudomallei 1710b] Length = 198 Score = 37.7 bits (86), Expect = 1.2, Method: Compositional matrix adjust. Identities = 23/66 (34%), Positives = 34/66 (51%), Gaps = 3/66 (4%) Query: 82 IEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTK---LGSDYETREKD 138 +E F+ AGV R Q+ VD GR + FGA E ER L++ + LGS + + Sbjct: 63 VERFEAAGVGLRSLQESVDTASIGGRLVFHLFGALAEFERNLIRAERLPLGSSRSAKSVN 122 Query: 139 IARYFR 144 + +FR Sbjct: 123 MLCHFR 128 >gi|302529937|ref|ZP_07282279.1| fatty oxidation complex, alpha subunit FadB [Streptomyces sp. AA4] gi|302438832|gb|EFL10648.1| fatty oxidation complex, alpha subunit FadB [Streptomyces sp. AA4] Length = 735 Score = 35.8 bits (81), Expect = 4.4, Method: Compositional matrix adjust. Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 7/90 (7%) Query: 34 EAVQSDPQGRNPSSSSSSTEEAGEPKPPIE---DYTLSCPDYVSEAEVTAHIEAFKEAGV 90 EAV + +G P+S + +AG P PP++ + TL+ P + AE A IEA E G Sbjct: 529 EAVAALGEGVEPASIEQAGSQAGYPAPPLQLMDELTLTLPRKI-RAETRAAIEA--EGGT 585 Query: 91 -DARVAQKVVDKLVDHGRKIGEQFGASLEE 119 ++ V+D ++D + G GA E Sbjct: 586 WTPHASEAVIDTMLDKYDRKGRSTGAGFYE 615 >gi|156403871|ref|XP_001640131.1| predicted protein [Nematostella vectensis] gi|156227264|gb|EDO48068.1| predicted protein [Nematostella vectensis] Length = 666 Score = 34.7 bits (78), Expect = 10.0, Method: Compositional matrix adjust. Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 8/70 (11%) Query: 95 AQKVVDKLVDHGRKIGEQFGASLEEERKLLQTK---LGSDY--ETREKDIARYFRKEK-- 147 AQ+V+DKL+ GR+ G + E K ++ K LG D+ E++ YF KE+ Sbjct: 124 AQRVLDKLIKLGRRNGLHLPVDKQNEIKSIKKKLSDLGIDFNRNCNEENTKLYFTKEELA 183 Query: 148 -IPDNDVQSL 156 +PD+ V SL Sbjct: 184 GLPDDFVNSL 193 Searching..................................................done Results from round 2 CONVERGED! >gi|254781211|ref|YP_003065624.1| hypothetical protein CLIBASIA_05590 [Candidatus Liberibacter asiaticus str. psy62] gi|254040888|gb|ACT57684.1| hypothetical protein CLIBASIA_05590 [Candidatus Liberibacter asiaticus str. psy62] gi|317120676|gb|ADV02499.1| hypothetical protein SC1_gp095 [Liberibacter phage SC1] gi|317120820|gb|ADV02641.1| hypothetical protein SC1_gp095 [Candidatus Liberibacter asiaticus] Length = 234 Score = 335 bits (859), Expect = 3e-90, Method: Composition-based stats. Identities = 234/234 (100%), Positives = 234/234 (100%) Query: 1 MSDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKP 60 MSDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKP Sbjct: 1 MSDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKP 60 Query: 61 PIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEE 120 PIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEE Sbjct: 61 PIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEE 120 Query: 121 RKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESS 180 RKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESS Sbjct: 121 RKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESS 180 Query: 181 TGDTFVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKTLRQWAEKQATLNQ 234 TGDTFVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKTLRQWAEKQATLNQ Sbjct: 181 TGDTFVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKTLRQWAEKQATLNQ 234 >gi|315122898|ref|YP_004063387.1| hypothetical protein CKC_05770 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496300|gb|ADR52899.1| hypothetical protein CKC_05770 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 234 Score = 292 bits (748), Expect = 2e-77, Method: Composition-based stats. Identities = 120/230 (52%), Positives = 163/230 (70%), Gaps = 6/230 (2%) Query: 9 TPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIEDYTLS 68 TP+ STP E E + N P +KE+ V+S+P + S+ +STEEA +P EDYTL+ Sbjct: 7 TPV-STPIDTEVESPEVDNTPSTKEQ-VESNPFKKYASTEDTSTEEAEASQPNSEDYTLN 64 Query: 69 CPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHG----RKIGEQFGASLEEERKLL 124 CP+Y+ + EV AH EAFKEAG+DA+ AQKV+D+LV HG ++ E+ +L E++ L Sbjct: 65 CPEYIPQEEVKAHSEAFKEAGIDAKTAQKVIDRLVAHGQENEKRSIERLDKALAEDKTAL 124 Query: 125 QTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESSTGDT 184 +++ G DY REKDIARYFR+EKI D DVQ+L+S WGF+KTF FFDRYAQ NKE+S GDT Sbjct: 125 KSEYGVDYSRREKDIARYFRQEKIADADVQALVSTWGFQKTFKFFDRYAQMNKETSVGDT 184 Query: 185 FVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKTLRQWAEKQATLNQ 234 F RSEGS+ + D+DK+FN P+F + +GD A ++QWA +QA L+Q Sbjct: 185 FARSEGSRGSQEDYDKLFNNPEFIEKRKAGDTSANNKIKQWANQQALLDQ 234 >gi|317120719|gb|ADV02541.1| hypothetical protein SC2_gp095 [Liberibacter phage SC2] gi|317120780|gb|ADV02601.1| hypothetical protein SC2_gp095 [Candidatus Liberibacter asiaticus] Length = 246 Score = 238 bits (607), Expect = 4e-61, Method: Composition-based stats. Identities = 77/234 (32%), Positives = 115/234 (49%), Gaps = 24/234 (10%) Query: 25 RSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPI-------------------EDY 65 +K E + +DP + S +E + KP + +DY Sbjct: 13 VDTSVSTKGEEINTDPSVKEMGSKVEVSEAKFDSKPDVSKKASLSDKLFSDEKVDIPKDY 72 Query: 66 TLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLV----DHGRKIGEQFGASLEEER 121 T+ P VSEA+ +E+F + G+ AQ + D L +H + E+ +E+ Sbjct: 73 TIDFPKGVSEADKKQALESFVKNGLSKADAQVLTDSLAKSFQEHIKSQEEKHNRIFDEDI 132 Query: 122 KLLQTKLGSD-YETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESS 180 L+ + G + + K + + ++ + + I+ G F ++ ++ ++SS Sbjct: 133 LKLKREYGLEGFSQLNKKVKGFVKEVGLSGETFDNFIAVAGAYNAFKLLEKLSRSTRDSS 192 Query: 181 TGDTFVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKTLRQWAEKQATLNQ 234 GSQEADRDFDKVF+TPDFGSRVLSGD EATKTLRQWAEKQATLNQ Sbjct: 193 YSPPPPSQRGSQEADRDFDKVFDTPDFGSRVLSGDMEATKTLRQWAEKQATLNQ 246 >gi|315121936|ref|YP_004062425.1| hypothetical protein CKC_00930 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495338|gb|ADR51937.1| hypothetical protein CKC_00930 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 60 Score = 92.6 bits (228), Expect = 4e-17, Method: Composition-based stats. Identities = 30/60 (50%), Positives = 42/60 (70%) Query: 175 QNKESSTGDTFVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKTLRQWAEKQATLNQ 234 NKE+S GDTF RSEGS+ + D+DK+FN P+F + +GD A ++QWA +QA L+Q Sbjct: 1 MNKETSVGDTFARSEGSRGSQEDYDKLFNNPEFIEKRKAGDTSANNKIKQWANQQALLDQ 60 >gi|77734533|emb|CAI59394.2| hypothetical protein pSG3.03 [Sodalis glossinidius] Length = 517 Score = 46.0 bits (107), Expect = 0.004, Method: Composition-based stats. Identities = 30/134 (22%), Positives = 58/134 (43%), Gaps = 15/134 (11%) Query: 4 ETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIE 63 E +Q P ++PP V + + +EA S P +S + ++A +P + Sbjct: 3 EAEQ--PDIASPPDVLASEPE-----TAPKEAADSAPAPDELTSHAERDDKAQQPDGAPQ 55 Query: 64 DYTLSCPDYVSEAEVTAHIEAF----KEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEE 119 +YT + P+ E + + F KE G+ AQK+VD ++ + +G + E Sbjct: 56 EYTFTAPEG-GELDA-QALGIFEPVAKELGLTQAQAQKLVDIYPQIQQRQAQAWGQQVAE 113 Query: 120 --ERKLLQTKLGSD 131 E+ ++G + Sbjct: 114 WGEQVKTDPEIGGE 127 >gi|89886023|ref|YP_516220.1| hypothetical protein SGPHI_0042 [Sodalis phage phiSG1] gi|89191758|dbj|BAE80505.1| conserved hypothetical protein [Sodalis phage phiSG1] gi|125470053|gb|ABN42245.1| gp40 [Sodalis phage phiSG1] Length = 517 Score = 46.0 bits (107), Expect = 0.004, Method: Composition-based stats. Identities = 30/134 (22%), Positives = 58/134 (43%), Gaps = 15/134 (11%) Query: 4 ETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIE 63 E +Q P ++PP V + + +EA S P +S + ++A +P + Sbjct: 3 EAEQ--PDIASPPDVLASEPE-----TAPKEAADSAPAPDELTSHAERDDKAQQPDGAPQ 55 Query: 64 DYTLSCPDYVSEAEVTAHIEAF----KEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEE 119 +YT + P+ E + + F KE G+ AQK+VD ++ + +G + E Sbjct: 56 EYTFTAPEG-GELDA-QALGIFEPVAKELGLTQAQAQKLVDIYPQIQQRQAQAWGQQVAE 113 Query: 120 --ERKLLQTKLGSD 131 E+ ++G + Sbjct: 114 WGEQVKTDPEIGGE 127 >gi|313891093|ref|ZP_07824712.1| LPXTG-motif cell wall anchor domain protein [Streptococcus pseudoporcinus SPIN 20026] gi|313120456|gb|EFR43576.1| LPXTG-motif cell wall anchor domain protein [Streptococcus pseudoporcinus SPIN 20026] Length = 1027 Score = 42.9 bits (99), Expect = 0.033, Method: Composition-based stats. Identities = 28/142 (19%), Positives = 57/142 (40%), Gaps = 17/142 (11%) Query: 1 MSDETDQLTPL------PSTPPVVECERSQRSNPPPSKE--EAVQSDPQGRNPSSSSSST 52 ++ E+ + P+ +TPP E + + +N P+KE E V +PQ R S S T Sbjct: 58 VNKESSTIDPVVTDLFSDTTPPHSEVQGQEEANTAPTKEASEEVSEEPQDRPKSHSEVKT 117 Query: 53 EEAGEP----KPPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDA--RVAQKVVDKLVDHG 106 E+ E K T D++++ + + F ++GV+ ++ G Sbjct: 118 EQDKESHLSDKVDTTAPTWEATDFITKGDT---LVGFSKSGVEKLSHTGHLLLPSRAADG 174 Query: 107 RKIGEQFGASLEEERKLLQTKL 128 + + + ++K + Sbjct: 175 TVLTQVASFAFTPDKKTAIAEY 196 >gi|156403871|ref|XP_001640131.1| predicted protein [Nematostella vectensis] gi|156227264|gb|EDO48068.1| predicted protein [Nematostella vectensis] Length = 666 Score = 42.9 bits (99), Expect = 0.033, Method: Composition-based stats. Identities = 32/126 (25%), Positives = 57/126 (45%), Gaps = 13/126 (10%) Query: 41 QGRNPSSSSSSTEEAGEPKPPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVD--ARVAQKV 98 ++ S+ + E + D+ + + ++ ++ AFKE + AQ+V Sbjct: 71 FPQHMSTDKDVRAASTEADRKLSDFEV---EMSMRKDIFDNLVAFKENNKQPISAEAQRV 127 Query: 99 VDKLVDHGRKIGEQFGASLEEERKLLQTK---LGSDYET--REKDIARYFRKE---KIPD 150 +DKL+ GR+ G + E K ++ K LG D+ E++ YF KE +PD Sbjct: 128 LDKLIKLGRRNGLHLPVDKQNEIKSIKKKLSDLGIDFNRNCNEENTKLYFTKEELAGLPD 187 Query: 151 NDVQSL 156 + V SL Sbjct: 188 DFVNSL 193 >gi|297824649|ref|XP_002880207.1| hypothetical protein ARALYDRAFT_483732 [Arabidopsis lyrata subsp. lyrata] gi|297326046|gb|EFH56466.1| hypothetical protein ARALYDRAFT_483732 [Arabidopsis lyrata subsp. lyrata] Length = 403 Score = 41.8 bits (96), Expect = 0.084, Method: Composition-based stats. Identities = 30/149 (20%), Positives = 62/149 (41%), Gaps = 6/149 (4%) Query: 32 KEEAVQSDPQGRNPS-SSSSSTEEAGEPKPPIEDYTLSCPDY-VSEAEVTAHIEAFKEAG 89 K EA+ + S SS + + P YT+S + V + V ++ KEA Sbjct: 149 KREAITGNTSLEYASTSSDQDSVRSRGSVIPSGQYTISRGNGNVYQLAVFEAEKSKKEAS 208 Query: 90 VDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTKLGSDYETREKDIARYFRKEKIP 149 ++A Q+V + + ++ E A EE ++ +T++ + + RY + +I Sbjct: 209 LEAFKHQEVEKEKNEAIKRAKEWENAYFEELKQRKETEMELKKVREKLENMRYISENRIT 268 Query: 150 DNDVQSLISAWGFEKTFNFFDRYAQQNKE 178 +S + + +N + ++ KE Sbjct: 269 ----ESYMLVQKLQDKYNLATKVLRKAKE 293 >gi|300711943|ref|YP_003737757.1| DNA mismatch repair protein MutS [Halalkalicoccus jeotgali B3] gi|299125626|gb|ADJ15965.1| DNA mismatch repair protein MutS [Halalkalicoccus jeotgali B3] Length = 882 Score = 41.4 bits (95), Expect = 0.11, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 57/179 (31%), Gaps = 29/179 (16%) Query: 63 EDYTLSCP-DYVSEAEVTAHIEAFKEAGV---------DARVAQKVVDKLVDHG------ 106 ED T + P + ++IE +AG D A VVD+ V Sbjct: 67 EDSTGTYPMAGIPIDSAESYIETLLDAGFRVAVADQVQDPEEASGVVDRAVTRIVTPGTL 126 Query: 107 ------RKIGEQFGASLEEERKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAW 160 R + A+L E+ + + G + + + I + ++ IS + Sbjct: 127 TEAELLRSEDNNYVAALAEDESGDEREYGLAF--LDVSTGDFVATGAISEETIRDEISRF 184 Query: 161 GFEKTF--NFFDRYAQQNKESSTGDTFVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKE 217 + + + + + + A D F TP+ + L+ D E Sbjct: 185 APAEAIVGPGLEGFDGFDSDCMVTPYDPSRFALEGARELLDSYFGTPE---KRLASDAE 240 >gi|303246334|ref|ZP_07332614.1| multi-sensor signal transduction histidine kinase [Desulfovibrio fructosovorans JJ] gi|302492397|gb|EFL52269.1| multi-sensor signal transduction histidine kinase [Desulfovibrio fructosovorans JJ] Length = 651 Score = 40.6 bits (93), Expect = 0.17, Method: Composition-based stats. Identities = 31/129 (24%), Positives = 52/129 (40%), Gaps = 11/129 (8%) Query: 37 QSDPQGRNPSSSSSSTEEAGEPKPPIEDYTLSCPDY-------VSEAEVTAHIEAFKEAG 89 + DP+ + + ++ + PK E TL+ V A + A + E Sbjct: 481 ELDPETVDVAEFAAGVAQTVAPKADREGVTLTSEIGENLGVMAVDAANLQAALVNILENA 540 Query: 90 VDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTKLGSDYETREKDIARYFRKEKIP 149 VDA +A DK H R + GA+L + + +G D ETREK +F + + Sbjct: 541 VDACLAD--TDKPEHHIRYRVRREGATLVFDIE--DNGMGMDQETREKAFTLFFSSKGLK 596 Query: 150 DNDVQSLIS 158 + I+ Sbjct: 597 GTGLGLFIA 605 >gi|308064155|gb|ADO06042.1| ATP-dependent protease La [Helicobacter pylori Sat464] Length = 822 Score = 40.6 bits (93), Expect = 0.19, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 47/97 (48%), Gaps = 7/97 (7%) Query: 90 VDARVAQKVVDKLVDHGRKIGEQFGASL--EEERKLLQTKLGSDYETREKDIARYFRKEK 147 ++ QK+ ++ + EQ +E+ K +Q +LG+D + R++D+ +Y++K Sbjct: 199 IEETKTQKLQKEIKSKVHQKMEQTNKEFFLKEQLKQIQKELGTD-KQRDEDLNQYYQKL- 256 Query: 148 IPDNDVQSLISAWGFEKTFNFFDRYAQQNKESSTGDT 184 V+ + F++ DR ++ + +SS T Sbjct: 257 ---ESVKPFLKEEAFKEIKKQIDRLSRTHADSSDSAT 290 >gi|188528169|ref|YP_001910856.1| ATP-dependent protease [Helicobacter pylori Shi470] gi|188144409|gb|ACD48826.1| ATP-dependent protease [Helicobacter pylori Shi470] Length = 822 Score = 40.2 bits (92), Expect = 0.23, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 47/97 (48%), Gaps = 7/97 (7%) Query: 90 VDARVAQKVVDKLVDHGRKIGEQFGASL--EEERKLLQTKLGSDYETREKDIARYFRKEK 147 ++ QK+ ++ + EQ +E+ K +Q +LG+D + R++D+ +Y++K Sbjct: 199 IEETKTQKLQKEIKSKVHQKMEQTNKEFFLKEQLKQIQKELGTD-KQRDEDLNQYYQKL- 256 Query: 148 IPDNDVQSLISAWGFEKTFNFFDRYAQQNKESSTGDT 184 V+ + F++ DR ++ + +SS T Sbjct: 257 ---ESVKPFLKEEAFKEIKKQIDRLSRTHADSSDSAT 290 >gi|308062664|gb|ADO04552.1| ATP-dependent protease La [Helicobacter pylori Cuz20] Length = 822 Score = 40.2 bits (92), Expect = 0.25, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 47/97 (48%), Gaps = 7/97 (7%) Query: 90 VDARVAQKVVDKLVDHGRKIGEQFGASL--EEERKLLQTKLGSDYETREKDIARYFRKEK 147 ++ QK+ ++ + EQ +E+ K +Q +LG+D + R++D+ +Y++K Sbjct: 199 IEETKTQKLQKEIKSKVHQKMEQTNKEFFLKEQLKQIQKELGTD-KQRDEDLNQYYQKL- 256 Query: 148 IPDNDVQSLISAWGFEKTFNFFDRYAQQNKESSTGDT 184 V+ + F++ DR ++ + +SS T Sbjct: 257 ---ESVKPFLKEEAFKEIKKQIDRLSRTHADSSDSAT 290 >gi|281203113|gb|EFA77314.1| hypothetical protein PPL_12525 [Polysphondylium pallidum PN500] Length = 426 Score = 40.2 bits (92), Expect = 0.25, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 33/82 (40%), Gaps = 4/82 (4%) Query: 2 SDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSS-STEEAGEPKP 60 + T +TPL + PP E S PP + P PS+ S ++ Sbjct: 337 TQSTSSITPLVALPPKSETSSPNVSTTPPISSSVSTTPPSLSEPSTQIEMSATDSTTTVT 396 Query: 61 PIEDYTLSCPDYVSEAEVTAHI 82 P ED T P +++++ +I Sbjct: 397 PSEDNT---PCNIAQSKHKYNI 415 >gi|297380561|gb|ADI35448.1| ATP-dependent protease La [Helicobacter pylori v225d] Length = 831 Score = 40.2 bits (92), Expect = 0.26, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 47/97 (48%), Gaps = 7/97 (7%) Query: 90 VDARVAQKVVDKLVDHGRKIGEQFGASL--EEERKLLQTKLGSDYETREKDIARYFRKEK 147 ++ QK+ ++ + EQ +E+ K +Q +LG+D + R++D+ +Y++K Sbjct: 199 IEETKTQKLQKEIKSKVHQKMEQTNKEFFLKEQLKQIQKELGTD-KQRDEDLNQYYQKL- 256 Query: 148 IPDNDVQSLISAWGFEKTFNFFDRYAQQNKESSTGDT 184 V+ + F++ DR ++ + +SS T Sbjct: 257 ---ESVKPFLKEEAFKEIKKQIDRLSRTHADSSDSAT 290 >gi|321263623|ref|XP_003196529.1| bacteriophytochrome histidine kinase [Cryptococcus gattii WM276] gi|317463006|gb|ADV24742.1| Bacteriophytochrome histidine kinase, putative [Cryptococcus gattii WM276] Length = 1858 Score = 39.8 bits (91), Expect = 0.27, Method: Composition-based stats. Identities = 32/159 (20%), Positives = 58/159 (36%), Gaps = 10/159 (6%) Query: 1 MSDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPK- 59 MSD + +P V + Q P PS+ A+ + P ++STE+ G + Sbjct: 1 MSDNST--SPTQPANSVRATYQPQSKRPSPSQSVAMPAAPSFHQHDRQNNSTEDCGSSRT 58 Query: 60 --PPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASL 117 PP + I + + V + + ++D H ++ + S+ Sbjct: 59 IVPPAASSAGPLSPGIPSESFVFPIRSVFQGMVHSDSSNSMID---GHCQRNQDSLQRSI 115 Query: 118 EEERKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSL 156 L++ SD D A E PD D+Q++ Sbjct: 116 SNNSNPLRSSAFSDARRLSTDAAESLHDE--PDADIQTI 152 >gi|58268676|ref|XP_571494.1| chitin synthase-related [Cryptococcus neoformans var. neoformans JEC21] gi|57227729|gb|AAW44187.1| chitin synthase 5 [Cryptococcus neoformans var. neoformans JEC21] Length = 1895 Score = 39.8 bits (91), Expect = 0.32, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 55/155 (35%), Gaps = 20/155 (12%) Query: 4 ETDQLTPLPSTPPVVECERSQRSNP--PPSKEEAVQSDPQG----RNPSSSSSST--EEA 55 +T +P+ S + + +S +P P AV P PS +S+ + Sbjct: 1746 QTRSTSPVSSRYQMSQFRQSPYQSPYQGPYGGSAVDFRPSRMDMAHQPSLDDTSSFHQPY 1805 Query: 56 GEPKPPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIG---EQ 112 P P Y + PD ++F VD AQ + D ++ + + Sbjct: 1806 QPPPRPQSSYAFNLPD--------PSSDSFTAPAVDYLGAQAITDSQLERSIRKICANAE 1857 Query: 113 FGASLEEE-RKLLQTKLGSDYETREKDIARYFRKE 146 ++ RK L+ + G + R + I R K Sbjct: 1858 LDKLTKKGVRKELEREYGVELTERRETINRLVEKV 1892 >gi|134113210|ref|XP_774630.1| hypothetical protein CNBF3100 [Cryptococcus neoformans var. neoformans B-3501A] gi|50257274|gb|EAL19983.1| hypothetical protein CNBF3100 [Cryptococcus neoformans var. neoformans B-3501A] Length = 1895 Score = 39.8 bits (91), Expect = 0.32, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 55/155 (35%), Gaps = 20/155 (12%) Query: 4 ETDQLTPLPSTPPVVECERSQRSNP--PPSKEEAVQSDPQG----RNPSSSSSST--EEA 55 +T +P+ S + + +S +P P AV P PS +S+ + Sbjct: 1746 QTRSTSPVSSRYQMSQFRQSPYQSPYQGPYGGSAVDFRPSRMDMAHQPSLDDTSSFHQPY 1805 Query: 56 GEPKPPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIG---EQ 112 P P Y + PD ++F VD AQ + D ++ + + Sbjct: 1806 QPPPRPQSSYAFNLPD--------PSSDSFTAPAVDYLGAQAITDSQLERSIRKICANAE 1857 Query: 113 FGASLEEE-RKLLQTKLGSDYETREKDIARYFRKE 146 ++ RK L+ + G + R + I R K Sbjct: 1858 LDKLTKKGVRKELEREYGVELTERRETINRLVEKV 1892 >gi|296803743|ref|XP_002842724.1| conserved hypothetical protein [Arthroderma otae CBS 113480] gi|238846074|gb|EEQ35736.1| conserved hypothetical protein [Arthroderma otae CBS 113480] Length = 473 Score = 39.4 bits (90), Expect = 0.39, Method: Composition-based stats. Identities = 27/135 (20%), Positives = 54/135 (40%), Gaps = 16/135 (11%) Query: 9 TPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIEDYTLS 68 TP+PS P + ++ PP + Q+ PQ +PP ++ L+ Sbjct: 272 TPVPSRPLNGATIKPPVASHPPRTQLTTQTLPQR----------PPQRATEPPRQNIPLA 321 Query: 69 CPDYVSEAEVTAHI-----EAFKEAGVDA-RVAQKVVDKLVDHGRKIGEQFGASLEEERK 122 P E + +H EA K + + + A++ KL++ K + A +++E + Sbjct: 322 DPRTQWELDAESHALKQQNEAEKRSRLKKDKEAERQTKKLLEAEEKEARKRQAQIDKETE 381 Query: 123 LLQTKLGSDYETREK 137 L+ G + + Sbjct: 382 RLKKIYGKEESKVHR 396 >gi|221112929|ref|XP_002162994.1| PREDICTED: similar to GF20795, partial [Hydra magnipapillata] Length = 935 Score = 39.4 bits (90), Expect = 0.44, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 37/86 (43%), Gaps = 6/86 (6%) Query: 74 SEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTKLGSDY- 132 S +E H+E+ + K D LV + + E+ LE++ LQ KL + Sbjct: 35 SYSEKITHLESLL---LAHDEETKATDSLVTKIKLLHEKHCKELEQQILGLQNKLQQEIL 91 Query: 133 --ETREKDIARYFRKEKIPDNDVQSL 156 E+ EK I Y R K D+Q L Sbjct: 92 KVESLEKKIKLYLRDGKSKSKDIQHL 117 >gi|302667476|ref|XP_003025321.1| hypothetical protein TRV_00501 [Trichophyton verrucosum HKI 0517] gi|291189426|gb|EFE44710.1| hypothetical protein TRV_00501 [Trichophyton verrucosum HKI 0517] Length = 834 Score = 39.1 bits (89), Expect = 0.49, Method: Composition-based stats. Identities = 23/117 (19%), Positives = 43/117 (36%), Gaps = 12/117 (10%) Query: 20 CERSQRSNPPPSKEEAVQSDPQG---------RNPSSSSSSTEEAGEPKPPIEDYTLSCP 70 E PP K + + P + +S SS + D + P Sbjct: 444 VESRNSDFPPNGKMRQLNTQPSAPANRPAVGPKRTTSKSSEYSAEIPAQGDNIDDDIDLP 503 Query: 71 DYVSEAE--VTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQ 125 +Y E + + + + GVD A+++ ++V HG ++ A LE + L+ Sbjct: 504 EYEPEEDLVLKRALSNLPK-GVDQESAKQIFKEIVVHGDEVHWDDVAGLELAKTALK 559 >gi|258566704|ref|XP_002584096.1| conserved hypothetical protein [Uncinocarpus reesii 1704] gi|237905542|gb|EEP79943.1| conserved hypothetical protein [Uncinocarpus reesii 1704] Length = 512 Score = 39.1 bits (89), Expect = 0.54, Method: Composition-based stats. Identities = 43/193 (22%), Positives = 69/193 (35%), Gaps = 14/193 (7%) Query: 2 SDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAV-QSDPQGRNPSSSSSSTEEAGEPKP 60 S + DQ TPL +PP + E S P E QS P + +S +A +P Sbjct: 208 SVDADQQTPLEKSPPAI--ETPASSGPVEGAPEITPQSTPSNADAELEEASVSKAEMTQP 265 Query: 61 PIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKL--VDHGRKIGEQFGASLE 118 E S VS VT A+ K D + VDH + + + Sbjct: 266 APEQ---SSHPLVSSRPVTIFAPPTASTPQSAQTPYKEEDYIPTVDHAKAHQRRLNTAGR 322 Query: 119 EERKLLQTKLGSDYETREKDIARYFR-KEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNK 177 R +L ++ +AR + K+ D ++S + E T + + + Sbjct: 323 NTRLAGDAELALQEAAIKERLARVTEVEIKVRFPDQSQVVSKFTREDTAHSLHEFVR--- 379 Query: 178 ESSTGDTFVRSEG 190 + DT + EG Sbjct: 380 --NCLDTPLAKEG 390 >gi|302504527|ref|XP_003014222.1| hypothetical protein ARB_07527 [Arthroderma benhamiae CBS 112371] gi|291177790|gb|EFE33582.1| hypothetical protein ARB_07527 [Arthroderma benhamiae CBS 112371] Length = 835 Score = 38.3 bits (87), Expect = 0.84, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 43/116 (37%), Gaps = 12/116 (10%) Query: 21 ERSQRSNPPPSKEEAVQSDPQG---------RNPSSSSSSTEEAGEPKPPIEDYTLSCPD 71 E PP K + + P + +S SS + D + P+ Sbjct: 446 ESRNSDFPPNGKMRQLNTQPSAPANRPAVGPKRTTSKSSKYSAEIPAQGDNIDDDIDLPE 505 Query: 72 YVSEAE--VTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQ 125 Y E + + + + GVD A+++ ++V HG ++ A LE + L+ Sbjct: 506 YEPEEDLVLKRALSNLPK-GVDQESAKQIFKEIVVHGDEVHWDDVAGLELAKTALK 560 >gi|268561066|ref|XP_002646356.1| Hypothetical protein CBG12070 [Caenorhabditis briggsae] gi|187029900|emb|CAP31110.1| CBR-UNC-89 protein [Caenorhabditis briggsae AF16] Length = 6561 Score = 38.3 bits (87), Expect = 0.99, Method: Composition-based stats. Identities = 26/146 (17%), Positives = 52/146 (35%), Gaps = 5/146 (3%) Query: 12 PSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIEDYTLSCPD 71 P + + + PP + E + + S S+TEE P + P+ Sbjct: 1613 PEKMADEKPTSPTQKDKPPEQVEKKPTSQTKKEKSPEKSTTEEVKTPIKKEKS-----PE 1667 Query: 72 YVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTKLGSD 131 + EV + + K +K ++ V K + E+ + + + + Sbjct: 1668 KSATEEVKSPTKKEKSPEKPTSPTKKTAEEAVKSPTKKEKSPEKIEEKPKSPTKEEKSPE 1727 Query: 132 YETREKDIARYFRKEKIPDNDVQSLI 157 T +D+ +KE+ PD V+ I Sbjct: 1728 KSTGAEDVKSPIKKERSPDTVVEKEI 1753 Score = 36.7 bits (83), Expect = 2.3, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 82/213 (38%), Gaps = 17/213 (7%) Query: 15 PP--VVECERSQRSNPPPSKE---EAVQSDPQGRNPSSSSSSTEEAGEPKPPIEDYTLSC 69 PP V++ ER ++ +P P K+ + SD + ++P++ T E E KP Sbjct: 1355 PPKEVIDPERKKKKSPTPEKKGKSPEMTSDEKPKSPTTE-EKTPEKVEEKPKSPTKKEKS 1413 Query: 70 PDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTKLG 129 P+ + EV + + KE + + K + K + S E+ + +++ Sbjct: 1414 PEKSTTVEVKSPTK--KEKSPEKPTSP--TKKNAEEAVKSPTKKEKSPEKSEEKVKSPTK 1469 Query: 130 SDYE-TREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESSTGDTFVRS 188 + R +++ +KEK P+ ++++ E T + A+ K S + Sbjct: 1470 KEKSPVRSEEVKSPTKKEKSPEA--KNIVEV-SSETTIKKTEATAEVTKTSEEAKSPTMK 1526 Query: 189 EGSQEADRDFDKVFNTPDFGSRVLSGDKEATKT 221 E S + K +P + TK Sbjct: 1527 EKSPGKVEEKPK---SPTKKEKSPEKPTSPTKK 1556 >gi|328859173|gb|EGG08283.1| hypothetical protein MELLADRAFT_85049 [Melampsora larici-populina 98AG31] Length = 498 Score = 37.9 bits (86), Expect = 1.2, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 69/203 (33%), Gaps = 7/203 (3%) Query: 2 SDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPS-SSSSSTEEAGEPKP 60 S T Q +P + E ++ + PPP + E P R + T+ + E Sbjct: 199 SVSTRQESPQRRRQQIQNQEPARVAQPPPRQTEVSAQAPTRRTHGLIQDNLTKASYEQPE 258 Query: 61 PIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEE 120 +ED+ P K A VD A++ L+ ++ G +E Sbjct: 259 GVEDHLPKRPRAQRSQSPKDQTSKSKSARVDQTEAERAKTMLMSRKGRMVLSNGDVIENG 318 Query: 121 RKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESS 180 R ++ D EK + + ++ I FE+ F D K++ Sbjct: 319 RLIVLD----DSTQMEKSLPELSPVLTVYMQTFKAYIPLSVFERNFLREDAKGWSTKKAP 374 Query: 181 TGDTFVRSEGSQ--EADRDFDKV 201 T + G++ D D++ Sbjct: 375 TATKILEGNGTRVYGGDPPMDEL 397 >gi|213159208|ref|YP_002321255.1| gp71 [Caviid herpesvirus 2] gi|212598213|gb|ACJ35841.1| GP71 [Caviid herpesvirus 2] gi|324039507|dbj|BAJ78529.1| GP71 [Caviid herpesvirus 2] Length = 258 Score = 37.5 bits (85), Expect = 1.3, Method: Composition-based stats. Identities = 14/56 (25%), Positives = 28/56 (50%) Query: 4 ETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPK 59 + +L P+P PP E E ++ PP+ + ++ R ++S+T+ G+ K Sbjct: 196 DISRLPPIPEEPPQDESEGTETERAPPASPDGEKTTRTKREAGGATSTTDTGGDRK 251 >gi|71911536|ref|YP_283086.1| immunogenic secreted protein [Streptococcus pyogenes MGAS5005] gi|71854318|gb|AAZ52341.1| immunogenic secreted protein [Streptococcus pyogenes MGAS5005] Length = 533 Score = 37.5 bits (85), Expect = 1.4, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 60/168 (35%), Gaps = 4/168 (2%) Query: 3 DETDQLTPLPSTPPVVECERSQRSNPPPS----KEEAVQSDPQGRNPSSSSSSTEEAGEP 58 D +DQ TP P + +++ P PS K++ P+ ++ + E + Sbjct: 112 DSSDQSTPSPKDQSSQKESQNKDGRPTPSPDQQKDQTPDKTPEKGPEKATDKTPEPNRDA 171 Query: 59 KPPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLE 118 PI+ + P ++ E + K + + + + + + Sbjct: 172 PKPIQPPLAAAPVFIPWRESDKDLSKLKPSSRSSAAYVRHWTGDSAYTHNLLSRRYGITA 231 Query: 119 EERKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTF 166 E+ LG Y+ + R EK+ DV+++++ E + Sbjct: 232 EQLDGFLNSLGIHYDKERLNGKRLLEWEKLTGLDVRAIVAIAMAESSL 279 >gi|306826557|ref|ZP_07459866.1| immunogenic secreted protein [Streptococcus pyogenes ATCC 10782] gi|304431284|gb|EFM34284.1| immunogenic secreted protein [Streptococcus pyogenes ATCC 10782] Length = 534 Score = 37.5 bits (85), Expect = 1.5, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 57/169 (33%), Gaps = 5/169 (2%) Query: 3 DETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDP-----QGRNPSSSSSSTEEAGE 57 D +DQ TP P + +++ P PS ++ P +G ++ + Sbjct: 112 DSSDQSTPSPKDQSSQKESQNKDGRPTPSPDQQKDQTPDKTPEKGPEKATEKTPEPNRDA 171 Query: 58 PKPPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASL 117 PKP + P + E + K + + + + + + Sbjct: 172 PKPIQPPLAAAAPVFAPWRESDKDLSKLKPSSRSSAAYVRHWTGDSAYTHNLLSRRYGIT 231 Query: 118 EEERKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTF 166 E+ LG Y+ + R EK+ DV+++++ E + Sbjct: 232 AEQLDGFLNSLGIHYDKERLNGKRLLEWEKLTGLDVRAIVAIAMAESSL 280 >gi|253730771|ref|ZP_04864936.1| staphylococcal exotoxin [Staphylococcus aureus subsp. aureus USA300_TCH959] gi|253725484|gb|EES94213.1| staphylococcal exotoxin [Staphylococcus aureus subsp. aureus USA300_TCH959] Length = 300 Score = 37.5 bits (85), Expect = 1.5, Method: Composition-based stats. Identities = 20/60 (33%), Positives = 28/60 (46%), Gaps = 1/60 (1%) Query: 4 ETDQLTPLPSTPPVVECERSQR-SNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPI 62 ET Q TP TPP + E Q+ +N V+ Q N ++ SS+ EA + KP Sbjct: 61 ETPQQTPNAPTPPSTKVEAPQQAANATTPSSTKVEVPKQAANATTPSSTKVEAPQSKPNA 120 >gi|308485050|ref|XP_003104724.1| CRE-LIN-59 protein [Caenorhabditis remanei] gi|308257422|gb|EFP01375.1| CRE-LIN-59 protein [Caenorhabditis remanei] Length = 1336 Score = 37.5 bits (85), Expect = 1.6, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 66/222 (29%), Gaps = 13/222 (5%) Query: 5 TDQLTPLPSTPPVVECERSQRSNPPP-----SKEEAVQSDPQGRNPSSSSSSTEEAGEPK 59 T+Q T S P + S P P +++E V N Sbjct: 170 TNQSTTKSSATPDSGIQSVPTSPPSPRMELLNEQEIVDKSRYEDNDDDDDDDDPADFTDM 229 Query: 60 PPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEE 119 P ++ +Y E ++ + E+ R V E E Sbjct: 230 PRLKPVDED-DEYFEEPSTSSASFS-TESNKSIRTINAVDSTTSTVLPSREEIPNGMSAE 287 Query: 120 ERKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKES 179 + D + + + ++ N + LI + + + S Sbjct: 288 DILTFSLAANMDTNEIVRRLIGFDPEKA---NSIAVLIKK---HNADKLKKKKDMEAEVS 341 Query: 180 STGDTFVRSEGSQEADRDFDKVFNTPDFGSRVLSGDKEATKT 221 ST T R+ GS++ + + N+PD + L + + T Sbjct: 342 STTPTTPRARGSRKRKKSATRSTNSPDVTTSNLPAETSTSLT 383 >gi|46019857|emb|CAE52380.1| hypothetical protein [Streptococcus thermophilus] gi|312279218|gb|ADQ63875.1| Putative uncharacterized protein [Streptococcus thermophilus ND03] Length = 403 Score = 37.5 bits (85), Expect = 1.6, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 33/84 (39%), Gaps = 2/84 (2%) Query: 122 KLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKESST 181 + + +++ YF + D V +I + + F DR + E +T Sbjct: 50 TAVTDGYNASWDSLADGFPDYFIEIPALDTTVAIVIGIYAANRVAKFTDRIGSEKSEKNT 109 Query: 182 GDTFVRSEGSQEADRDFDKVFNTP 205 G + S+ D+D+DK N P Sbjct: 110 GKKSLEKMISKMKDKDYDK--NNP 131 >gi|326470052|gb|EGD94061.1| hypothetical protein TESG_01588 [Trichophyton tonsurans CBS 112818] Length = 479 Score = 37.5 bits (85), Expect = 1.6, Method: Composition-based stats. Identities = 30/135 (22%), Positives = 52/135 (38%), Gaps = 16/135 (11%) Query: 9 TPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIEDYTLS 68 T +PS P S PPP ++ + PQ + +PP ++ + Sbjct: 272 TLVPSRPLNGSAGYPPVSGPPPRFQQPAKPHPQ----------QSQQRPAQPPRQNIPPA 321 Query: 69 CPDYVSEAEVTAH-----IEAFKEAGVDA-RVAQKVVDKLVDHGRKIGEQFGASLEEERK 122 P E + AH EA K A + + A+K KL++ K + A +++E + Sbjct: 322 DPRTQWELDAEAHALKQQSEAEKRARLKREKEAEKQTKKLLEAEEKEARKRQAQVDKETE 381 Query: 123 LLQTKLGSDYETREK 137 L+ G + K Sbjct: 382 RLRKIYGKEESKVLK 396 >gi|194228363|ref|XP_001499770.2| PREDICTED: similar to melanoma antigen family A, 10 [Equus caballus] Length = 534 Score = 37.5 bits (85), Expect = 1.7, Method: Composition-based stats. Identities = 34/141 (24%), Positives = 53/141 (37%), Gaps = 9/141 (6%) Query: 3 DETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPI 62 +E D +T PS P + Q ++P P+ A+ S P ++ SSSS E G Sbjct: 235 EEVDDVTGAPSPP-----QSPQSAHPSPT---AMASPPLSQSEDDSSSSRGEEGPSTSQA 286 Query: 63 EDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERK 122 YT S P V + +V +E F + + L R + F E + Sbjct: 287 LPYTASFPRNVIDGKVAELVE-FLLVKFHTKEPTTKAEMLNTVLRDYQDHFPVIFSEASE 345 Query: 123 LLQTKLGSDYETREKDIARYF 143 +Q G D + + Y Sbjct: 346 CMQLVFGVDVKEVDPSDHAYV 366 >gi|326481452|gb|EGE05462.1| hypothetical protein TEQG_04473 [Trichophyton equinum CBS 127.97] Length = 479 Score = 37.1 bits (84), Expect = 1.8, Method: Composition-based stats. Identities = 30/135 (22%), Positives = 52/135 (38%), Gaps = 16/135 (11%) Query: 9 TPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIEDYTLS 68 T +PS P S PPP ++ + PQ + +PP ++ + Sbjct: 272 TLVPSRPLNGSAGYPPVSGPPPRFQQPAKPHPQ----------QSQQRPAQPPRQNIPPA 321 Query: 69 CPDYVSEAEVTAH-----IEAFKEAGVDA-RVAQKVVDKLVDHGRKIGEQFGASLEEERK 122 P E + AH EA K A + + A+K KL++ K + A +++E + Sbjct: 322 DPRTQWELDAEAHALKQQSEAEKRARLKREKEAEKQTKKLLEAEEKEARKRQAQVDKETE 381 Query: 123 LLQTKLGSDYETREK 137 L+ G + K Sbjct: 382 RLRKIYGKEESKVLK 396 >gi|327302172|ref|XP_003235778.1| hypothetical protein TERG_02830 [Trichophyton rubrum CBS 118892] gi|326461120|gb|EGD86573.1| hypothetical protein TERG_02830 [Trichophyton rubrum CBS 118892] Length = 479 Score = 37.1 bits (84), Expect = 2.0, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 48/132 (36%), Gaps = 10/132 (7%) Query: 9 TPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIE---DY 65 T +PS P S PPP + + PQ + + P + D Sbjct: 272 TLVPSRPLNGSANHPPVSGPPPRSQPPAKPLPQQSQQRPAQPPRQNIPPADPRTQWELDA 331 Query: 66 TLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQ 125 + SEAE A ++ KE A+K KL++ K + A +++E + L+ Sbjct: 332 EANALKQQSEAEKRARLKREKE-------AEKQTKKLLEAEEKEARKRQAQVDKETERLR 384 Query: 126 TKLGSDYETREK 137 G + K Sbjct: 385 KIYGKEESKVLK 396 >gi|321458678|gb|EFX69742.1| hypothetical protein DAPPUDRAFT_113389 [Daphnia pulex] Length = 239 Score = 37.1 bits (84), Expect = 2.1, Method: Composition-based stats. Identities = 23/101 (22%), Positives = 47/101 (46%), Gaps = 7/101 (6%) Query: 13 STPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIEDYTLSCPDY 72 ++ P + + + PPS+ + +Q++ R P + + PP+ D L PD+ Sbjct: 5 NSSPNNDTQSPR----PPSRAQEMQTEYTDRYPPLEDLRQPDQTDRNPPLFD--LRPPDH 58 Query: 73 VSEAEVTAHIEAFKEAGVDARV-AQKVVDKLVDHGRKIGEQ 112 E + A K+A + + A ++ +K++ H RK G + Sbjct: 59 TDEYPLFEDTRALKQARLTKKARATRLGNKIIKHVRKKGSR 99 >gi|21227903|ref|NP_633825.1| hypothetical protein MM_1801 [Methanosarcina mazei Go1] gi|20906321|gb|AAM31497.1| conserved protein [Methanosarcina mazei Go1] Length = 213 Score = 36.7 bits (83), Expect = 2.7, Method: Composition-based stats. Identities = 44/202 (21%), Positives = 75/202 (37%), Gaps = 19/202 (9%) Query: 20 CERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGE-----PKPPIEDYTLSCPDYVS 74 E + PP SKE D G+N E+ G+ PP E++ +SC + + Sbjct: 2 VEYPEIEIPPVSKENQNSEDKPGKNYEKKDEEQEKQGKEPDIRSGPPSEEFCISCLN-MP 60 Query: 75 EAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTKLGSDYET 134 E + + ++A A KV+ + D + E E+KL+ K G E Sbjct: 61 EEQRKTMLLEIQQAAEKILKAYKVIKEPEDLLKGEWFLKLQKPEFEKKLVIAKSG---EE 117 Query: 135 REKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFFDRYAQQNKE---SSTGD------TF 185 Y + +PD + L+S +GF +++ + + GD Sbjct: 118 VFIGFYTYSGETPVPDPNF-VLLSQYGFWYPQRIEEKFEETVASFFTGAYGDYDFLNIVP 176 Query: 186 VRSEGSQEADRDFDKVFNTPDF 207 E Q RDF K+ ++ Sbjct: 177 EAVEKFQAFQRDFVKMLEDQEW 198 >gi|212546033|ref|XP_002153170.1| cAMP-dependent protein kinase pathway protein (Som1), putative [Penicillium marneffei ATCC 18224] gi|210064690|gb|EEA18785.1| cAMP-dependent protein kinase pathway protein (Som1), putative [Penicillium marneffei ATCC 18224] Length = 764 Score = 36.4 bits (82), Expect = 3.2, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 55/170 (32%), Gaps = 18/170 (10%) Query: 10 PLPSTPPV-VECERSQRSNPPPSKEEAVQSDPQGRN--------PSSSSSSTEEAGEPKP 60 P PS PP + Q+ PP+ ++A + P+G+ + ++ A Sbjct: 583 PPPSAPPQGTKPPSPQQGAAPPTPQQANKPAPKGKKDTGKERKRVTKKGAAAANANTAAT 642 Query: 61 PIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVA--QKVVDKLVDHGRKIGEQFGASLE 118 P + S H +F + GV+A A Q+ + + + Sbjct: 643 PSSEAEPPPTPTPSTPITPQHPNSFTKNGVNATTATQQQPTSAPAPPIVQQPDPNVTNFN 702 Query: 119 EERKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNF 168 + + D+ T E E + + D S ++ F F Sbjct: 703 DLSMPDPSNFNLDFSTLEN-------PEVLENFDFDSFLNTDADATGFGF 745 >gi|302503807|ref|XP_003013863.1| hypothetical protein ARB_07975 [Arthroderma benhamiae CBS 112371] gi|291177429|gb|EFE33223.1| hypothetical protein ARB_07975 [Arthroderma benhamiae CBS 112371] Length = 479 Score = 36.4 bits (82), Expect = 3.8, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 48/132 (36%), Gaps = 10/132 (7%) Query: 9 TPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIE---DY 65 T +PS P S PPP + + PQ + + P + D Sbjct: 272 TLVPSRPLNGSTGYPPVSGPPPKSQPPAKPLPQQSQQRPAQPPRQNIPPADPRTQWELDA 331 Query: 66 TLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQ 125 + SEAE A ++ KE A+K KL++ K + A +++E + L+ Sbjct: 332 EANALKQQSEAEKRARLKREKE-------AEKQTKKLLEAEEKEARKRQAQVDKETERLR 384 Query: 126 TKLGSDYETREK 137 G + K Sbjct: 385 KIYGKEESKVLK 396 >gi|50915069|ref|YP_061041.1| immunogenic secreted protein [Streptococcus pyogenes MGAS10394] gi|1399011|gb|AAB52379.1| immunogenic secreted protein precursor [Streptococcus pyogenes] gi|50904143|gb|AAT87858.1| Immunogenic secreted protein [Streptococcus pyogenes MGAS10394] Length = 534 Score = 36.0 bits (81), Expect = 4.0, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 57/169 (33%), Gaps = 5/169 (2%) Query: 3 DETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDP-----QGRNPSSSSSSTEEAGE 57 D +DQ TP P + +++ P PS ++ P +G ++ + Sbjct: 112 DSSDQSTPSPKDQSSQKESQNKDGRPTPSPDQQKDQTPDKTPEKGPEKAAEKTPEPNRDA 171 Query: 58 PKPPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQFGASL 117 PKP + P + E + K + + + + + + Sbjct: 172 PKPIQPPLGAAAPVFAPWRESDKDLSKLKPSSRSSAAYVRHWTGDSAYTHNLLSRRYGIT 231 Query: 118 EEERKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTF 166 E+ LG Y+ + R EK+ DV+++++ E + Sbjct: 232 AEQLDGFLNSLGIHYDKERLNGKRLLEWEKLTGLDVRAIVAIAMAESSL 280 >gi|328859262|gb|EGG08372.1| hypothetical protein MELLADRAFT_84916 [Melampsora larici-populina 98AG31] Length = 1147 Score = 36.0 bits (81), Expect = 4.6, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 38/94 (40%) Query: 3 DETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPI 62 D T+ +T P+ P E S R + P + P + ++ ++ G+ + Sbjct: 502 DHTEDVTSTPAAPLRSEVTSSFRKSLAPPVASSSTQVPLLSDRTTRVTAVSAEGDGQQKS 561 Query: 63 EDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQ 96 TL+ P+Y + E + + VDA ++ Sbjct: 562 SRLTLAFPEYADDDESMDVEPSKQSTTVDALSSK 595 >gi|255077330|ref|XP_002502308.1| predicted protein [Micromonas sp. RCC299] gi|226517573|gb|ACO63566.1| predicted protein [Micromonas sp. RCC299] Length = 2154 Score = 36.0 bits (81), Expect = 4.6, Method: Composition-based stats. Identities = 36/170 (21%), Positives = 73/170 (42%), Gaps = 14/170 (8%) Query: 30 PSKEEAVQSDPQGRNPSSSSSSTEEAGEPK--PPIEDYTLSCPDYVSEAEVTAHIEAFKE 87 P E V+SD + ++ + S E E K P E T S P+ +E +V A E+ + Sbjct: 395 PEGEAKVESDAEAQSEAEPDSDAEAEAESKPEPDAEAETESKPEPDAEPDVEAQAESESD 454 Query: 88 AGVDARVAQKVVDKLVDHGR-KIGEQFGASLEEERKLLQTKLGSDYETREKDIA-RYFRK 145 A V+ + G +G +F + L+ + + ++ L + E + + R+ Sbjct: 455 ADVEPKTVADAERFEAPSGDIALGSEFESILDPQPEPVEPSLKTSEEELPEGVEHRFIDD 514 Query: 146 EKIPDNDVQSLISAWGFEKTFNFFD-RYAQQNKESSTGDTFVRSEGSQEA 194 + + ++ LI FE + R ++ +K+ + R + S+E+ Sbjct: 515 DDLDLENIDQLI----FETARSLARARLSRGSKKVTF-----RGDDSRES 555 >gi|297822787|ref|XP_002879276.1| hypothetical protein ARALYDRAFT_344831 [Arabidopsis lyrata subsp. lyrata] gi|297325115|gb|EFH55535.1| hypothetical protein ARALYDRAFT_344831 [Arabidopsis lyrata subsp. lyrata] Length = 761 Score = 35.6 bits (80), Expect = 5.0, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 49/124 (39%), Gaps = 7/124 (5%) Query: 2 SDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPP 61 ++E++ +T T V E + + +++P S+ S E + Sbjct: 523 NEESESMTSNNVTGVVTEEHSDKEEH-----GNHEETEPLTSLNISNEESLLEHSDKDSS 577 Query: 62 IEDYTLSCPDYVSEAEVTAHIEAFKE--AGVDARVAQKVVDKLVDHGRKIGEQFGASLEE 119 T + E + +A +E+F + ++ +V+ + ++HGRK LEE Sbjct: 578 KVTETRDTSNGSPELKHSASVESFVSISSDIEGESLVEVLKQQLEHGRKSLRDLNKELEE 637 Query: 120 ERKL 123 ER Sbjct: 638 ERNA 641 >gi|12045125|ref|NP_072936.1| hypothetical protein MG_269 [Mycoplasma genitalium G37] gi|255660364|ref|ZP_05405773.1| hypothetical protein MgenG_02624 [Mycoplasma genitalium G37] gi|2496357|sp|Q49407|Y269_MYCGE RecName: Full=Uncharacterized protein MG269 gi|1045963|gb|AAC71491.1| conserved hypothetical protein [Mycoplasma genitalium G37] gi|166078778|gb|ABY79396.1| conserved hypothetical protein [synthetic Mycoplasma genitalium JCVI-1.0] Length = 340 Score = 35.6 bits (80), Expect = 5.4, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 79/211 (37%), Gaps = 15/211 (7%) Query: 1 MSDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQ----GRNPSSSSSSTEEAG 56 ++ + Q + L S E E Q +KE +++ + + + E+ Sbjct: 98 INRQVQQNSELFSQLKQSESEIIQMQQLVEAKEHQIEALNKQLHAIKEANKKLIEEHESI 157 Query: 57 EPKPPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKV-VDKLVDHGRKIGEQFGA 115 + +++Y + C + + + + HI+ E + + + L+ K + Sbjct: 158 NVEELLKEYEVQCNEAIYKRD--QHIQTVFEDKLALKDGEISETQSLLKTAEKEKQALKK 215 Query: 116 SLE------EERKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTFNFF 169 + + ++ + L T++ D+ ++ IA F + K P + I + K Sbjct: 216 AYKLVVNSLQKHQKLTTEIEIDFTKLDEIIATIFDETKNPKTGFTNFIKQFEKTKA-KLT 274 Query: 170 DRYAQQNKESSTGDTFVRSEGSQEADRDFDK 200 + A+ K + T + E + + + D+ Sbjct: 275 KKIAEITKLDHSTPTNYQQE-TPASQQQLDQ 304 >gi|239624078|ref|ZP_04667109.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA] gi|239522109|gb|EEQ61975.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA] Length = 356 Score = 35.6 bits (80), Expect = 5.8, Method: Composition-based stats. Identities = 31/147 (21%), Positives = 52/147 (35%), Gaps = 15/147 (10%) Query: 30 PSKEEAVQSDPQGRNPSSSSSSTEEAGEP-KPPIEDYTLSCPDYVSEAEVTAHIEAFKEA 88 P + +S P G + S S TE +P YT Y + + + Sbjct: 54 PGTGVSAESGPSGGDISISGDGTEAQDAGVEPDNSQYTTDFSHYELQKDAVPQVNLLISE 113 Query: 89 GVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTKLGSD--YETREKDIARYFRKE 146 A++ Q K+ E FG + +E + + +L S+ Y +DI Y + Sbjct: 114 YFQAKIDQ--------DAEKLFELFGKAADESLEARREELKSEAVYIEDYQDIVCYTK-- 163 Query: 147 KIPDNDVQSLISAWGFEKTFNFFDRYA 173 P S ++ +E F D A Sbjct: 164 --PGLTADSYVAYVTYEVKFRRVDTLA 188 >gi|260837573|ref|XP_002613739.1| hypothetical protein BRAFLDRAFT_84482 [Branchiostoma floridae] gi|229299128|gb|EEN69748.1| hypothetical protein BRAFLDRAFT_84482 [Branchiostoma floridae] Length = 1998 Score = 35.6 bits (80), Expect = 6.3, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 44/134 (32%), Gaps = 10/134 (7%) Query: 21 ERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIEDYTLSCPDYVSEAEVTA 80 E +R+ EAV ++ +G P S EE + V E Sbjct: 1592 ESPERNTTSIRIGEAVHTNTEGAEPEVSQRVHEEKASVDSTNTKSPTEFAESV--RERNP 1649 Query: 81 HIEAFKEAGVDARVA--QKVVDKLVDHGRKIGEQFGA----SLEEERKLLQT--KLGSDY 132 +++ K++ + + DK V G+K E+ +EE L+ ++ Sbjct: 1650 ILDSNKDSSIRKESGSPRNSTDKSVAQGKKRRERKDKPSKVMSDEEIGALEPWCSYRGEF 1709 Query: 133 ETREKDIARYFRKE 146 + ++ Sbjct: 1710 SVINNPVNAVIAEQ 1723 >gi|119498809|ref|XP_001266162.1| hypothetical protein NFIA_038390 [Neosartorya fischeri NRRL 181] gi|119414326|gb|EAW24265.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181] Length = 366 Score = 35.2 bits (79), Expect = 6.5, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 35/97 (36%), Gaps = 1/97 (1%) Query: 7 QLTPLPSTPPVVECERSQR-SNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIEDY 65 Q TP +TPP + + + N P + E ++ + ++S S T E + + D Sbjct: 261 QGTPGSNTPPNEKADLPEVTENSPAERHEEEKASSTKDDSATSHSKTSEQEDSFGSVGDM 320 Query: 66 TLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKL 102 + + E + VD R +L Sbjct: 321 AAPSAATTAAEKARKADELRRRGSVDERTTTMTNVRL 357 >gi|172087304|ref|XP_001913194.1| hypothetical protein 008-38 [Oikopleura dioica] gi|48994302|gb|AAT47876.1| hypothetical protein 008-38 [Oikopleura dioica] Length = 690 Score = 35.2 bits (79), Expect = 6.5, Method: Composition-based stats. Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 2/109 (1%) Query: 21 ERSQRSNPPP-SKEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIEDYTLSCPDYVSEAEVT 79 + Q S PPP SK + P P + E E P I D T++ + Sbjct: 154 DAPQESAPPPKSKSAKENTPPVPAFPIIKNEPIFENSEETPNIADMTINTASRMEFNNYK 213 Query: 80 AHIEAF-KEAGVDARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTK 127 + IE + ++ V +K +K+ D + ++ E+ER +K Sbjct: 214 SKIEKYRRQNAETKAVEKKNGEKVKDKSKTAVKEKLRQTEKERTKKPSK 262 >gi|139474486|ref|YP_001129202.1| immunogenic secreted protein [Streptococcus pyogenes str. Manfredo] gi|134272733|emb|CAM31006.1| immunogenic secreted protein [Streptococcus pyogenes str. Manfredo] Length = 542 Score = 35.2 bits (79), Expect = 7.8, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 57/177 (32%), Gaps = 13/177 (7%) Query: 3 DETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEE-------- 54 + +DQ TP P + +++ P PS ++ P S+ + E+ Sbjct: 112 NSSDQSTPSPKDQSSQKESQNKDGRPTPSPDQQKDQTPDKTPEKSADKTPEKGPEKATEK 171 Query: 55 -----AGEPKPPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKI 109 PKP + P + E + K + + + + + Sbjct: 172 TPEPNRDAPKPNQPPLAAAAPVFAPWRESDKDLSKLKPSSRSSAAYVRHWTGDSAYTHNL 231 Query: 110 GEQFGASLEEERKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTF 166 + E+ LG Y+ + R EK+ DV+++++ E + Sbjct: 232 LSRRYGITAEQLDGFLNSLGIHYDKERLNGKRLLEWEKLTGLDVRAIVAIAMAESSL 288 >gi|254481853|ref|ZP_05095096.1| hypothetical protein GPB2148_1544 [marine gamma proteobacterium HTCC2148] gi|214037982|gb|EEB78646.1| hypothetical protein GPB2148_1544 [marine gamma proteobacterium HTCC2148] Length = 398 Score = 35.2 bits (79), Expect = 7.8, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 54/123 (43%), Gaps = 5/123 (4%) Query: 32 KEEAVQSDPQGRNPSSSSSSTEEAGEPKPPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVD 91 K E Q++ + + + A + + + + E + +E F++ + Sbjct: 53 KLEDAQAEREKAHKKLTKEKARAADQAREEAAEELSDLKQQLGEKDEK--LEDFRKEELA 110 Query: 92 ARVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTKLGSDYETREKDIARYFRKEKIPDN 151 R A++V+D+ R + LEE++ L+ +LG++++ RE ++ + + Sbjct: 111 LRKAKQVLDQ---EKRDLELTLQRQLEEQQSALRAELGNEFQLREAELRKKIDDAHSANE 167 Query: 152 DVQ 154 D++ Sbjct: 168 DLK 170 >gi|308507413|ref|XP_003115890.1| hypothetical protein CRE_18494 [Caenorhabditis remanei] gi|308256425|gb|EFP00378.1| hypothetical protein CRE_18494 [Caenorhabditis remanei] Length = 1011 Score = 35.2 bits (79), Expect = 7.9, Method: Composition-based stats. Identities = 32/145 (22%), Positives = 54/145 (37%), Gaps = 25/145 (17%) Query: 9 TPLPSTPPVVECERSQRSNPPPSKEEA-------------------VQSDPQGRNPSSSS 49 TP+P TP + PP +A + + P+SS+ Sbjct: 527 TPIPKTPLKS-TTIPRLPFPPKLFGQAEVTSTTSTTESPPTTSSQSTTESEKTKTPTSST 585 Query: 50 SSTEEAGEPKPPIEDYTLSCPDY---VSEAEVTAHIEAFKEAGVD--ARVAQKVVDKLVD 104 S EA P P + T S P + E+ +E ++A + ++K+ +K + Sbjct: 586 KSEPEATTPSPTTSESTPSSPQTTRRILSEEIEKSVENLEKAIEESVKNGSEKMNNKTAE 645 Query: 105 HGRKIGEQFGASLEEERKLLQTKLG 129 +I + LEE K LQ +G Sbjct: 646 AVEQIDRELEKRLEEAAKQLQDSVG 670 >gi|198471049|ref|XP_002133648.1| GA22697 [Drosophila pseudoobscura pseudoobscura] gi|198145743|gb|EDY72275.1| GA22697 [Drosophila pseudoobscura pseudoobscura] Length = 1837 Score = 34.8 bits (78), Expect = 9.1, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 27/67 (40%), Gaps = 10/67 (14%) Query: 2 SDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPP 61 S +TD+ PL S P VE + +++E PS S TEE+ KP Sbjct: 1223 SQDTDEDQPLKSVSPDVEAGTPEEDETSQAEQE----------PSPSDVETEESDVEKPS 1272 Query: 62 IEDYTLS 68 D + Sbjct: 1273 ALDSQID 1279 >gi|19746914|ref|NP_608050.1| immunogenic secreted protein precursor [Streptococcus pyogenes MGAS8232] gi|19749161|gb|AAL98549.1| immunogenic secreted protein precursor [Streptococcus pyogenes MGAS8232] Length = 538 Score = 34.8 bits (78), Expect = 9.6, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 58/173 (33%), Gaps = 9/173 (5%) Query: 3 DETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGE----- 57 D +DQ T P + +++ P PS ++ P S+ E+A E Sbjct: 112 DSSDQSTSSPKDQSSQKESQNKDGRPTPSPDQQKDPTPDKTPEKSADKIPEKATEKTPEP 171 Query: 58 ----PKPPIEDYTLSCPDYVSEAEVTAHIEAFKEAGVDARVAQKVVDKLVDHGRKIGEQF 113 PKP + P ++ E + K + + + + + + Sbjct: 172 NRDAPKPIQPPLAAAAPVFIPWRESDKDLSKLKPSSRSSAAYVRHWTGDSAYTHNLLSRR 231 Query: 114 GASLEEERKLLQTKLGSDYETREKDIARYFRKEKIPDNDVQSLISAWGFEKTF 166 E+ LG Y+ + R EK+ DV+++++ E + Sbjct: 232 YGITAEQLDGFLNSLGIHYDKERLNGKRLLEWEKLTGLDVRAIVAIAMAESSL 284 >gi|301165890|emb|CBW25463.1| putative membrane protein [Bacteriovorax marinus SJ] Length = 263 Score = 34.8 bits (78), Expect = 9.8, Method: Composition-based stats. Identities = 33/130 (25%), Positives = 49/130 (37%), Gaps = 3/130 (2%) Query: 93 RVAQKVVDKLVDHGRKIGEQFGASLEEERKLLQTKLGSDYETREKDIARYFRKEKIPDND 152 R AQ+ LV++ ++LL+ K K + F KE +P Sbjct: 95 RGAQEAPLTLVEYSDFQCPFCVRGFNTVKELLK-KYDGKIRFVYKHLPLSFHKEALPAAH 153 Query: 153 VQSLISAWGFEKTFNFFDRYAQQNKESSTGDTFVRSEGSQ-EAD-RDFDKVFNTPDFGSR 210 I EK F F D ++ STG+ F++ + AD + K + R Sbjct: 154 YYEAIRLQSAEKAFKFHDEIFDNQRKLSTGEPFLKKMAKKVGADMKRLAKDVKSKAVIER 213 Query: 211 VLSGDKEATK 220 V S KEA K Sbjct: 214 VESDIKEAAK 223 >gi|170046852|ref|XP_001850961.1| conserved hypothetical protein [Culex quinquefasciatus] gi|167869467|gb|EDS32850.1| conserved hypothetical protein [Culex quinquefasciatus] Length = 736 Score = 34.8 bits (78), Expect = 9.8, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 2/87 (2%) Query: 2 SDETDQLTPLPSTPPVVECERSQRSNPPPSKEEAVQSDPQGRNPSSSSSSTEEAGEPKPP 61 +DET + + V E S+ ++ E Q D ++ + TE A E +P Sbjct: 204 ADETQAASEVSQADTTVAVEESEPNSTDAV--EESQPDTTVKDEEAQPEPTEAAEETQPE 261 Query: 62 IEDYTLSCPDYVSEAEVTAHIEAFKEA 88 T P V+EA + EA +EA Sbjct: 262 TTVATEESPSEVTEAVEESQTEATQEA 288 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.294 0.119 0.294 Lambda K H 0.267 0.0365 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 3,502,002,921 Number of Sequences: 14124377 Number of extensions: 126286530 Number of successful extensions: 658214 Number of sequences better than 10.0: 354 Number of HSP's better than 10.0 without gapping: 112 Number of HSP's successfully gapped in prelim test: 669 Number of HSP's that attempted gapping in prelim test: 655122 Number of HSP's gapped (non-prelim): 3129 length of query: 234 length of database: 4,842,793,630 effective HSP length: 135 effective length of query: 99 effective length of database: 2,936,002,735 effective search space: 290664270765 effective search space used: 290664270765 T: 11 A: 40 X1: 16 ( 6.8 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (20.9 bits) S2: 78 (34.8 bits)