BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= 537021.9.peg.1064_1 (238 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done >gi|227822435|ref|YP_002826407.1| hypothetical protein NGR_c18900 [Sinorhizobium fredii NGR234] gi|227341436|gb|ACP25654.1| hypothetical protein NGR_c18900 [Sinorhizobium fredii NGR234] Length = 453 Score = 192 bits (487), Expect = 3e-47, Method: Composition-based stats. Identities = 84/240 (35%), Positives = 122/240 (50%), Gaps = 12/240 (5%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60 MGK +A PDPKA A+ Q + N+ A AN Y N++++TPDG Y + K D Sbjct: 1 MGKSKAPTPPDPKATAAAQTATNIGTAVANGYMGNVNQVTPDGSLTYSYT-KQKWTDPLS 59 Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDF-- 118 G +P + +L +Q +I + + +L L+ L T + L ++ +K Sbjct: 60 GNVYDLPVATATQTLSEMQDKIKKQNDQASLNLATLATSQSSRLNDLLGKPMDISKAPAA 119 Query: 119 -------PPQQLRDNDVPEKPNASLEERKEILYNYPT-MGSQQYEKAFLDRLQSSLQQDR 170 PQ + + PE S+ I +Y T + +YE A + RL L++DR Sbjct: 120 GDHSKLTLPQYQQFSAGPE-LQTSVGNAGNIARSYETDFDTSKYENALMARLNPQLERDR 178 Query: 171 EDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHN 230 LET+L NQGL GS A+NRAIDE NR +D R+AA+L A EQ RL N+ + A F N Sbjct: 179 AALETRLANQGLQPGSEAYNRAIDEANRTSNDARIAAVLNAGQEQTRLANLANQKASFEN 238 >gi|265985067|ref|ZP_06097802.1| conserved hypothetical protein [Brucella sp. 83/13] gi|264663659|gb|EEZ33920.1| conserved hypothetical protein [Brucella sp. 83/13] Length = 299 Score = 173 bits (437), Expect = 2e-41, Method: Composition-based stats. Identities = 68/236 (28%), Positives = 113/236 (47%), Gaps = 1/236 (0%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60 MGK +A +PDPK ++ Q N+ A AN+Y N++++TPDG Y +G+ K D + Sbjct: 1 MGKSKAPKSPDPKETSAAQTGTNIGTAVANSYLNNVNQVTPDGSLTYSQTGMQKYYDPYT 60 Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPP 120 G+ IP + + L QQ I ++++ NL L L + L + + Sbjct: 61 GKSYDIPQFTATQQLSQQQQAIKDQEDATNLNLGKLANSQSSRLNDLLGKPFDLSGAPAA 120 Query: 121 QQLRDNDVPEKPNASLEERKEILYNYP-TMGSQQYEKAFLDRLQSSLQQDREDLETKLHN 179 + P+ + + + Y + Q+ E A + R+ L+QDR LE +L N Sbjct: 121 GNAGNMTAPQYQQYTGGPQLQTSYTDDFSADRQKVEDALMSRINPQLEQDRSALEQRLAN 180 Query: 180 QGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQ 235 QG++ GS A+ A+++ + +D R+ A+L EQ RL + A F N A Q Sbjct: 181 QGIMPGSKAFETAMNQNAQASNDARMQAILAGGQEQSRLAGLSRDQATFGNNANQQ 236 >gi|150397020|ref|YP_001327487.1| hypothetical protein Smed_1817 [Sinorhizobium medicae WSM419] gi|150028535|gb|ABR60652.1| hypothetical protein Smed_1817 [Sinorhizobium medicae WSM419] Length = 532 Score = 170 bits (430), Expect = 1e-40, Method: Composition-based stats. Identities = 80/272 (29%), Positives = 116/272 (42%), Gaps = 40/272 (14%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 A APDPK AS Q + N+ A AN N +++TPDG Y + K D G+E Sbjct: 6 SAPEAPDPKQTASAQTATNIGTAVANNVMGNANQVTPDGNLTYTYN-TQKWTDPLSGKEY 64 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDF------ 118 + + +L P QQ I ++++ L L+ L + L + + + Sbjct: 65 DLKVPTATQTLSPAQQAIKDQEDAAQLNLATLANTQSGKLNGLLASKFDISGAPAAGKSD 124 Query: 119 ---PPQQLRDNDVP-----------------------------EKPNASLEERKEILYNY 146 PQ P K SL I +Y Sbjct: 125 AIGLPQYQSFTSGPKLQTSLANAGNVQSSIAGAGSIQSQVADSGKIQTSLGNAGNITESY 184 Query: 147 P-TMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRL 205 + + +YE+A +DRL +++DR LETKL NQGL GS A++RA+DE NR +D R+ Sbjct: 185 DFDIDTSKYEQALMDRLSPQIERDRAALETKLTNQGLQPGSEAYDRAMDEANRAANDARI 244 Query: 206 AAMLKASDEQERLDNIQEKHAYFHNLAQAQGL 237 A L A EQ R+ + + A F N AQ Q Sbjct: 245 GATLSAGQEQSRIAGLAQNQAQFQNSAQQQAY 276 >gi|315122526|ref|YP_004063015.1| hypothetical protein CKC_03890 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495928|gb|ADR52527.1| hypothetical protein CKC_03890 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 389 Score = 169 bits (427), Expect = 3e-40, Method: Composition-based stats. Identities = 128/241 (53%), Positives = 171/241 (70%), Gaps = 3/241 (1%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60 MGKQ++ L+PDPKA+AS+QLS N+ N+ N+ R N++ +TPDGI +Y GVDK+ID F Sbjct: 1 MGKQQSFLSPDPKAVASMQLSENINNSLFNSSRANMNEITPDGILRYTQEGVDKMIDPFS 60 Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFP- 119 G+E+SIP Y +SY L P+ Q ++NR+N N++L S+LLTQR+Q+ +P + + Sbjct: 61 GQELSIPRYSRSYELSPVAQDLYNRRNANHILFSNLLTQRLQNFMPSPQNNSMNLQQPLA 120 Query: 120 -PQQLRDNDVPEKPNASLEERKE-ILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKL 177 P + + S E++E ILY+Y QQYE LDRLQ L+QDREDLET+L Sbjct: 121 IPDPAHNPIPEGTNHFSQPEQEEGILYDYGKNNGQQYENTLLDRLQPRLKQDREDLETRL 180 Query: 178 HNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGL 237 NQGL+ GSV+WNR IDE NRKL+D RLAA+LK+S+EQERLDN++EK AYFHN AQAQ Sbjct: 181 SNQGLMPGSVSWNRTIDENNRKLNDARLAALLKSSEEQERLDNMREKQAYFHNFAQAQSH 240 Query: 238 Q 238 Q Sbjct: 241 Q 241 >gi|110632598|ref|YP_672806.1| hypothetical protein Meso_0237 [Mesorhizobium sp. BNC1] gi|110283582|gb|ABG61641.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 322 Score = 168 bits (424), Expect = 6e-40, Method: Composition-based stats. Identities = 44/222 (19%), Positives = 73/222 (32%), Gaps = 34/222 (15%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 +A APDP A+ Q + N A I + TP G Y+ +G I D G+ I Sbjct: 2 KAPKAPDPWQTAAAQGAWNSFTAQQQQSMNMIGQNTPWGSLDYQQTGSTWITDP-TGKRI 60 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124 +P Y + +L P QQ I R L+ + + Sbjct: 61 EMPTYTANVNLSPEQQAIFERTQAAEGNLAQIAQDQS----------------------- 97 Query: 125 DNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLD-RLQSSLQQDREDLETKLHNQGLV 183 L E + + + ++++ R+ +Q+++ L T+L N GL Sbjct: 98 ---------EWLGEYLQEPFEFNNRDAEEWVWDLASPRILQQQEQNQQALRTQLINSGLR 148 Query: 184 SGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKH 225 G+ AW+ + D L Q Sbjct: 149 PGTTAWDAEMTRLTNANTDQMNQLALTGRQMAFNEALAQRNQ 190 >gi|316933872|ref|YP_004108854.1| hypothetical protein Rpdx1_2530 [Rhodopseudomonas palustris DX-1] gi|315601586|gb|ADU44121.1| hypothetical protein Rpdx1_2530 [Rhodopseudomonas palustris DX-1] Length = 341 Score = 161 bits (406), Expect = 7e-38, Method: Composition-based stats. Identities = 48/238 (20%), Positives = 73/238 (30%), Gaps = 32/238 (13%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60 M APDP A Q NL +D++TP G Y +G + Sbjct: 1 MDTPEPPAAPDPVKTAEAQGQMNLTTGVQQQLLNMVDQVTPTGSLTYSQNGTTSFV-GAD 59 Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPP 120 G+ ++P + + +L P QQ + + N L + + + T++ Sbjct: 60 GKTYTVPRFTSTQTLTPAQQALLDLSNKTQANLGQIGVDQSAKIGSLLGTNLKL------ 113 Query: 121 QQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQ 180 E A L E RL Q E L T+L NQ Sbjct: 114 -------GNEATEARLMELGS------------------ARLDPKFAQSEEALRTRLANQ 148 Query: 181 GLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238 G+ GS AWN + + +D +L + A G Q Sbjct: 149 GIQPGSAAWNAEMKSFSEGKNDAYNQLLLSGRQLANTEIQAERNAPINEITALLSGSQ 206 >gi|218673260|ref|ZP_03522929.1| hypothetical protein RetlG_17541 [Rhizobium etli GR56] Length = 334 Score = 157 bits (395), Expect = 2e-36, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 85/236 (36%), Gaps = 32/236 (13%) Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62 +A APDP A+ Q + N+ A ANA + ++ TPDG +YK +G + D G+ Sbjct: 4 TPKAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTGYQTMKDQ-NGK 62 Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122 +P Y + P Q I+++ L L+ L + + T+V+ + + Sbjct: 63 SYQLPTYSAYQTYSPENQAIYDQTQQTQLGLARLANDQTGKISGILGTNVDLSAGNVDKY 122 Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182 + ++ + + +D+ LE L ++G+ Sbjct: 123 VNNH-------------------------------WQSGFNNQWDRDQASLEQSLADKGI 151 Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238 GS A++ A+ + + + + + + A G Q Sbjct: 152 SMGSAAYDNAMRDFSTRKQAASDQYLGDMYSNAQNSILTERNQPLNEISALMSGSQ 207 >gi|327191473|gb|EGE58493.1| hypothetical protein RHECNPAF_300003 [Rhizobium etli CNPAF512] Length = 335 Score = 156 bits (394), Expect = 2e-36, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 84/236 (35%), Gaps = 32/236 (13%) Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62 +A APDP A+ Q + N+ A ANA + ++ TPDG +YK + + D G+ Sbjct: 4 TPKAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTSKSIMKDQ-NGK 62 Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122 +P Y + P Q I+++ L L+ L + + T+V+ + + Sbjct: 63 TYELPVYSAYQTYSPENQAIYDQTQQTQLGLARLANDQTGKISGILGTNVDLSAGNVDKY 122 Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182 + ++ + + +D+ LE L ++G+ Sbjct: 123 VNNH-------------------------------WQSGFDNQWNRDQASLEQSLADKGI 151 Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238 GS A++ A+ + + + + + + A G Q Sbjct: 152 AMGSAAYDNAMRDFSTRKQAASDQYLGDMYSNAQNAILTERNQPLNEISALMSGSQ 207 >gi|209548343|ref|YP_002280260.1| hypothetical protein Rleg2_0738 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534099|gb|ACI54034.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 334 Score = 155 bits (390), Expect = 6e-36, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 84/236 (35%), Gaps = 32/236 (13%) Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62 +A APDP A+ Q + N+ A ANA +++ TPDG +YK +G + D G+ Sbjct: 4 TPKAPKAPDPTQTAAAQTATNVDTAIANAGLSYVNQYTPDGSLEYKVTGQQTMTDQ-NGK 62 Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122 IP + P Q I+++ L L+ L + + T+V+ + + Sbjct: 63 TYQIPIRSAYQTYSPQNQAIYDQTQQTQLGLAKLANDQTGKISGILGTNVDLSAGNVDKY 122 Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182 + D+ + + +D+ L+ L ++G+ Sbjct: 123 VNDH-------------------------------WQSGFNNQWDRDQASLDQSLADKGI 151 Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238 GS A++ A+ + + + + + + A G Q Sbjct: 152 SMGSAAYDNAMRDFSTRKQAASDQYLGDMYSNAQNSILTERNQPLNEISALMSGSQ 207 >gi|116253668|ref|YP_769506.1| hypothetical protein RL3928 [Rhizobium leguminosarum bv. viciae 3841] gi|115258316|emb|CAK09418.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 335 Score = 155 bits (390), Expect = 6e-36, Method: Composition-based stats. Identities = 46/236 (19%), Positives = 85/236 (36%), Gaps = 32/236 (13%) Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62 +A APDP A+ Q + N+ A ANA + ++ TPDG +YK SG + D G+ Sbjct: 4 TPKAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVSGYQTMKDQ-NGK 62 Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122 +P Y + P Q I+++ L LS L ++ + T+V+ + + Sbjct: 63 TYQLPTYSAYQTYSPQNQAIYDQTQQTQLGLSKLANEQTGKISGILGTNVDLSAGNVDKY 122 Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182 D+ + + +D+ L+ L ++G+ Sbjct: 123 ANDH-------------------------------WQGGFNNQWDRDQASLDQSLADKGI 151 Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238 GS A+N A+ + + + + + + A G Q Sbjct: 152 SMGSEAYNNALRDFSTRKQAASDQFLGDMYSNAQNSILTERNQPLNEISALMSGSQ 207 >gi|86356745|ref|YP_468637.1| hypothetical protein RHE_CH01103 [Rhizobium etli CFN 42] gi|86280847|gb|ABC89910.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 334 Score = 150 bits (378), Expect = 1e-34, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 83/236 (35%), Gaps = 32/236 (13%) Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62 + APDP A+ Q + N+ A ANA + ++ TPDG +YK +G + D G+ Sbjct: 4 TPKPPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTGYQTMTDQ-NGK 62 Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122 +P Y + P Q I+++ L L+ L + + ++V+ + + Sbjct: 63 TYKLPTYSAYQTYSPENQAIYDQTQQTQLGLARLANDQTAKVSGILGSNVDLSAGNVDKY 122 Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182 + D+ + + +D+ LE L ++G+ Sbjct: 123 VNDH-------------------------------WQSGFNNQWDRDQASLEQSLADKGI 151 Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238 GS A++ A+ + + + + + A G Q Sbjct: 152 AIGSAAYDNAMRDFTTRKQAASDQYLGDMHSNAQNSILTERNQPLNEISALMSGSQ 207 >gi|319783503|ref|YP_004142979.1| hypothetical protein Mesci_3812 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169391|gb|ADV12929.1| hypothetical protein Mesci_3812 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 330 Score = 143 bits (359), Expect = 2e-32, Method: Composition-based stats. Identities = 48/226 (21%), Positives = 85/226 (37%), Gaps = 31/226 (13%) Query: 13 KAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQS 72 K ++ + N+ A ANA N++++TPDG Y +G K D + G+ IP Y + Sbjct: 13 KETSAASTATNVGTAIANANLGNVNQVTPDGSLNYSQTGTYKWNDPYTGKSYDIPTYTAT 72 Query: 73 YSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKP 132 +L Q I ++ + L L +L + L V+ + D +L D Sbjct: 73 QTLSGTGQAIKDQTDQAKLNLGELAAGQSSFLKDWLAKPVDLSNDATEGRLMDLG----- 127 Query: 133 NASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRA 192 + RLQ +L R+ E L N+G+ GS + +A Sbjct: 128 --------------------------MKRLQPALDARRQANEADLINRGIRPGSDNYAQA 161 Query: 193 IDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238 + ++ +D + +L + + Q + A G Q Sbjct: 162 QNIQDQGENDAYNSLLLSGRGQAVQEALAQNSAPINNLTALLSGSQ 207 >gi|218510551|ref|ZP_03508429.1| hypothetical protein RetlB5_25766 [Rhizobium etli Brasil 5] Length = 271 Score = 137 bits (343), Expect = 2e-30, Method: Composition-based stats. Identities = 44/236 (18%), Positives = 87/236 (36%), Gaps = 32/236 (13%) Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62 +A APDP A+ Q + N+ A ANA + ++ TPDG +YK +G + D G+ Sbjct: 4 TPKAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTGKSTMTDQ-NGK 62 Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122 ++P Y +L P Q I+++ L L+ L + Q + T+V+ + + Sbjct: 63 TYNLPVYSAYQTLSPQNQAIYDQSQQTQLGLAKLANDQTQKVSGILGTNVDLSSGNVDKY 122 Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182 + D+ + + +++ L+ L ++G+ Sbjct: 123 VNDH-------------------------------WRAGFDNQWDREQASLDQSLADKGI 151 Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238 GS A++ A+ + + + + + A G Q Sbjct: 152 AMGSAAYDNAMRDFTTRKQAAADQYLGDMYSNAQNSILTERNQPLNEISALMSGSQ 207 >gi|126443127|ref|YP_001063336.1| hypothetical protein BURPS668_A2342 [Burkholderia pseudomallei 668] gi|126222618|gb|ABN86123.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 408 Score = 122 bits (305), Expect = 4e-26, Method: Composition-based stats. Identities = 48/235 (20%), Positives = 75/235 (31%), Gaps = 35/235 (14%) Query: 6 ASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREIS 65 A APDP A+A+ N A N + P G Q G D G Sbjct: 36 APAAPDPYAVANATTQTNNQTAQFNKALNLNNYSNPFGSQQSTQIG----TDPATG---- 87 Query: 66 IPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRD 125 I+N + L L+ + + T N R Sbjct: 88 --------------APIYNTNITASGPLQSLINSTMGSAGNANSTVNNALFGLGGLTARY 133 Query: 126 NDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDR----LQSSLQQDREDLETKLHNQG 181 + + K A +I N + Q+ + A L Q + LE++L NQG Sbjct: 134 DALNGKLGAL---AGQIDPNAAQLAGQRGQNAAYAAQTQYLDPRFSQGQTSLESQLANQG 190 Query: 182 LVSGSVAWNRAIDET----NRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLA 232 L GS A++ A+ N+ D ++L ++ +Q + A A Sbjct: 191 LTPGSQAYDNAMKNFNLSKNQAYSDAANQSILTGQQIGTQM--LQNELAAVGTQA 243 >gi|167907339|ref|ZP_02494544.1| hypothetical protein BpseN_34235 [Burkholderia pseudomallei NCTC 13177] Length = 399 Score = 119 bits (296), Expect = 5e-25, Method: Composition-based stats. Identities = 48/235 (20%), Positives = 75/235 (31%), Gaps = 35/235 (14%) Query: 6 ASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREIS 65 A APDP A+A+ N A N + P G Q G D G Sbjct: 27 APAAPDPYAVANATTQTNNQTAQFNKALNLNNYSNPFGSQQSTQIG----TDPATG---- 78 Query: 66 IPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRD 125 I+N + L L+ + + T N R Sbjct: 79 --------------APIYNTNITASGPLQSLINSTMGSAGNANSTVNNALFGLGGLTARY 124 Query: 126 NDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDR----LQSSLQQDREDLETKLHNQG 181 + + K A +I N + Q+ + A L Q + LE++L NQG Sbjct: 125 DALNGKLGAL---AGQIDPNAAQLAGQRGQNAAYAAQTQYLDPRFSQGQTSLESQLANQG 181 Query: 182 LVSGSVAWNRAIDET----NRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLA 232 L GS A++ A+ N+ D ++L ++ +Q + A A Sbjct: 182 LTPGSQAYDNAMKNFNLSKNQAYSDAANQSILTGQQIGTQM--LQNELAAVGTQA 234 >gi|152982946|ref|YP_001353886.1| hypothetical protein mma_2196 [Janthinobacterium sp. Marseille] gi|151283023|gb|ABR91433.1| Hypothetical protein mma_2196 [Janthinobacterium sp. Marseille] Length = 305 Score = 101 bits (250), Expect = 9e-20, Method: Composition-based stats. Identities = 50/226 (22%), Positives = 82/226 (36%), Gaps = 35/226 (15%) Query: 2 GKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIG 61 G APD A NL A A A ++++TP G Y D Sbjct: 24 GSPSPPPAPDYAGAAQQTAQGNLEAARAAAEANRVNQVTPYGNLTYSRDPNASTPDG--- 80 Query: 62 REISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQ 121 + + +L P QQ + ++QN +L L+ L + + + Q Sbjct: 81 ------GWTATQTLLPAQQALLDQQNKTSLGLAGLADRGLG---------------YVDQ 119 Query: 122 QLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQG 181 L +N A + Q + A + R Q ++Q R+ L+ +L NQG Sbjct: 120 ALSNNITAADLPADMVNAG-----------QTGQDALMARFQPQMEQSRKALDAQLANQG 168 Query: 182 LVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAY 227 + GS A+N A+ + +D+R A L + N Q + Sbjct: 169 ITQGSEAYNNAMRTQQQGENDLRSQAALNGIAVGQNAQNQQLQVKT 214 >gi|15320624|ref|NP_203468.1| hypothetical protein Mx8p54 [Myxococcus phage Mx8] gi|15281734|gb|AAK94389.1|AF396866_54 p54 [Myxococcus phage Mx8] Length = 333 Score = 99.0 bits (244), Expect = 5e-19, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 60/232 (25%), Gaps = 51/232 (21%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60 MGKQ A PD + A Q A+ + + + TP Q+ Sbjct: 1 MGKQ-APAPPDFRGAAEQQSQASQQSINQQTQANRPNINTPWASQQWTQGPNGSW----- 54 Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPP 120 + T L + Sbjct: 55 ----------------------------------GMQTSFNGPLGDASNAVQQQLATSLS 80 Query: 121 QQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQ 180 Q L + +P + + I Y RL Q+ + T+L NQ Sbjct: 81 QPLDFSGLPGVSSGDAARNQAIESAYSQAT---------SRLDPQWQRREDAERTRLLNQ 131 Query: 181 GLVSGSVAWNRAIDETNRKLHDVRLAAMLK--ASDEQERLDNIQEKHAYFHN 230 GL GS A+ A E ++ +D +AM + A N Sbjct: 132 GLSEGSEAYRNAQSEFGQQRNDAYTSAMASAIGQGTAAGQAVFNQDMAARQN 183 >gi|117924321|ref|YP_864938.1| hypothetical protein Mmc1_1014 [Magnetococcus sp. MC-1] gi|117608077|gb|ABK43532.1| hypothetical protein Mmc1_1014 [Magnetococcus sp. MC-1] Length = 381 Score = 97.1 bits (239), Expect = 2e-18, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 57/204 (27%), Gaps = 37/204 (18%) Query: 28 SANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHNRQN 87 + +A + TP G+ + EI P +L Q+ + Q Sbjct: 30 NESAKVNQFRQETPYGVLDWS-------------GEIGTPDRTMKVTLSEDAQRAYGDQQ 76 Query: 88 INNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILYNYP 147 L+ + R+ + D + L Sbjct: 77 AIAANLAQIAMGRMGQI--------------------DAGPFSLDGVAQVPNGASLEQAR 116 Query: 148 TMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVR--- 204 Q+Y L ++ L+ +L QG+ S A+ +A+ + + H+ Sbjct: 117 NQAMQEYYAHGSQFLDKRTANEQSKLQDRLIQQGVGLDSRAYRQAMQDFQEQSHEAYAEL 176 Query: 205 -LAAMLKASDEQERLDNIQEKHAY 227 A L S E + + + Sbjct: 177 ESRARLAGSSEASQQYQLGRQMRN 200 >gi|13470675|ref|NP_102244.1| hypothetical protein mll0449 [Mesorhizobium loti MAFF303099] gi|14021417|dbj|BAB48030.1| mll0449 [Mesorhizobium loti MAFF303099] Length = 230 Score = 62.8 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 35/92 (38%) Query: 147 PTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLA 206 + + +RL L Q + L+T+L NQG+ GS A++RA+ + +D Sbjct: 9 NDATESRLLQLGRERLDPILAQQSDALQTQLSNQGIKLGSAAYDRAMTQQALHANDATDQ 68 Query: 207 AMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238 +L+ + A G Q Sbjct: 69 LILQGHGQAFAEGQAIRNQPINEITALLSGSQ 100 >gi|312214728|emb|CBX94682.1| predicted protein [Leptosphaeria maculans] Length = 592 Score = 47.0 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 34/236 (14%), Positives = 60/236 (25%), Gaps = 30/236 (12%) Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62 A P P A++ N A I +S D D+ G Sbjct: 230 TSSAPGVPKPAAVSWQSADWNQPLGQA------ISAFPTFTTQVSSSSNKDIAADTLPGS 283 Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122 +P + P QQ + L H PP Sbjct: 284 SAIMPSLSNHSANKPTQQAVF-------------------PLAMQWGPHSTGL--PPPDN 322 Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDR--LQSSLQQDREDLETKLHNQ 180 L P + Y+Y S +++ A + + + N+ Sbjct: 323 LLYTSGPNPAGVYDLPPGVMPYSY-NHSSLKWKDALAAETNMDKLTALKKAAKQASTANK 381 Query: 181 GLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQG 236 V + N +D+ ++ R L + + E +A + +G Sbjct: 382 SSVKAAEPTNNELDDKQERIKQEREQKRLVSRISSTMSSALAELYARYIKETSERG 437 >gi|313885186|ref|ZP_07818938.1| efflux ABC transporter, permease protein [Eremococcus coleocola ACS-139-V-Col8] gi|312619877|gb|EFR31314.1| efflux ABC transporter, permease protein [Eremococcus coleocola ACS-139-V-Col8] Length = 1145 Score = 42.8 bits (98), Expect = 0.038, Method: Composition-based stats. Identities = 19/154 (12%), Positives = 43/154 (27%), Gaps = 9/154 (5%) Query: 80 QQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEER 139 Q+I + S L + Q + + + + E + Sbjct: 327 QEIQSASQKLEDGRSQLAASKSQ----LDAAADQINQGYAQLEPEKAKLDEVAAQLAGPQ 382 Query: 140 KEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRK 199 ++ + S + Q + L ++L QG+ + +A + Sbjct: 383 AQLDQAKADLDSSMSQLDQAQAQIDEGQAQLDALASQLQEQGIDPATSPDYQA----GQT 438 Query: 200 LHDVRLAAMLKAS-DEQERLDNIQEKHAYFHNLA 232 D + + + L QE+ A F + Sbjct: 439 NLDSQKQTLAAGQAQYEAGLAQYQEQKALFGQES 472 >gi|149477002|ref|XP_001516414.1| PREDICTED: similar to catenin (cadherin-associated protein), alpha 1, 102kDa [Ornithorhynchus anatinus] Length = 732 Score = 42.8 bits (98), Expect = 0.038, Method: Composition-based stats. Identities = 28/138 (20%), Positives = 44/138 (31%), Gaps = 4/138 (2%) Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 +QQ I N SD +Q+ + N K L ++ +P+ Sbjct: 246 YKQLQQAITGISNAAQATASDDASQQQGAGGELAYALNNFDKQIIVDPLSFSEERFRPSL 305 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190 I S +R+ + R+ L+ L N G S A N Sbjct: 306 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 365 Query: 191 RAIDETNRKLHDVRLAAM 208 AID+ +K D+R Sbjct: 366 SAIDKMTKKTRDLRRQLR 383 >gi|237746507|ref|ZP_04576987.1| predicted protein [Oxalobacter formigenes HOxBLS] gi|229377858|gb|EEO27949.1| predicted protein [Oxalobacter formigenes HOxBLS] Length = 552 Score = 41.2 bits (94), Expect = 0.13, Method: Composition-based stats. Identities = 25/219 (11%), Positives = 54/219 (24%), Gaps = 11/219 (5%) Query: 7 SLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISI 66 P + S + + N RE + +P + + + Sbjct: 50 PSPSSPTLASYRAASGSSDTVNQNLSRELTRQASPGLSPVIDNTAFSDKTTPPVSNSATS 109 Query: 67 PHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDN 126 + + P ++ +N + + P H + D Sbjct: 110 SAIRGTETFSPQRKGSSFGRNNTAFKPASAGSDTFPTTDPRHTDAIRYGSDTTTGTSSRL 169 Query: 127 DVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSG- 185 ++ P Q +Q+ L+ L N+G Sbjct: 170 RSQGDTPDYAPAAGNEGFHLPAEIHPGLVSPDSPGQQERTRQETARLQHSLGNEGFTLSP 229 Query: 186 -----SVAWNRAIDETNRKLHDVRLAAMLKASDEQERLD 219 + + A++ T R+ AML + + R Sbjct: 230 DIPRQAARFRAAMEATGRQA-----GAMLSGQERETRFA 263 >gi|118388201|ref|XP_001027200.1| Adenylate and Guanylate cyclase catalytic domain containing protein [Tetrahymena thermophila] gi|89308970|gb|EAS06958.1| Adenylate and Guanylate cyclase catalytic domain containing protein [Tetrahymena thermophila SB210] Length = 3203 Score = 40.4 bits (92), Expect = 0.20, Method: Composition-based stats. Identities = 21/190 (11%), Positives = 54/190 (28%), Gaps = 17/190 (8%) Query: 45 WQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDL 104 + +II R + + + Q ++ N L Sbjct: 413 LKQSQYNNSQIIKPHKLRIFNENGFNLGSDVSIPQSEVKNDTEQFKSQSEQQSKDPSPPL 472 Query: 105 LPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQS 164 ++ N + + Q +D++ A ++ +I Y + + D + Sbjct: 473 KQKNNQKYNNSFNNSLQSQQDSN----TKADKDQTDQIGYEHQETNRELVLHH--DFISP 526 Query: 165 SLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEK 224 + + K +G G+ D + L + ++ + N+Q Sbjct: 527 QITSRENQILQKSSKEGGSLGTEG-----------NSDTESQSPLDSPQKRRQRQNMQSH 575 Query: 225 HAYFHNLAQA 234 + ++ Q Sbjct: 576 QDEYDDIPQE 585 >gi|193204962|ref|NP_494177.3| Prion-like-(Q/N-rich)-domain-bearing protein family member (pqn-66) [Caenorhabditis elegans] gi|163644489|gb|AAB37876.4| Prion-like-(q/n-rich)-domain-bearing protein protein 66 [Caenorhabditis elegans] Length = 898 Score = 40.1 bits (91), Expect = 0.29, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 46/162 (28%), Gaps = 8/162 (4%) Query: 4 QRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGRE 63 A A + K A+ Q + N NA +A +N+ D Q + + Sbjct: 690 PNAPNAQNSKDDANAQNAQNDQNAPNDANGQNVQIDRNDSNAQNGQNAPNDQNAQNDPNA 749 Query: 64 ISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQL 123 + P+ S + Q N QN N + + + P Sbjct: 750 QNAPNVQNSQN-TRNAQNSQNAQNARNAPNAQIAQ-------NDPNAPNAQIAQNAPNAQ 801 Query: 124 RDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSS 165 D + P NA + +++ EK L Sbjct: 802 NDINAPNVQNAQKAPNAQNAQEQQEAQAKELEKEIGQFLCKR 843 >gi|145485313|ref|XP_001428665.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124395752|emb|CAK61267.1| unnamed protein product [Paramecium tetraurelia] Length = 2080 Score = 39.3 bits (89), Expect = 0.40, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 52/218 (23%), Gaps = 18/218 (8%) Query: 28 SANAYRENIDRMTPDGI----------WQYKTSGVDKIIDSFIGREISIPHYLQSYSLHP 77 N EN D P Q + D I S H Sbjct: 1543 VQNNQFENPDDEPPYASPGSENFSSVKSQSSQHCQNSFNDQSQSPLKDISQIQDSEEPHE 1602 Query: 78 IQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLE 137 Q + N T ++Q + + H N F + +ND + + Sbjct: 1603 NSQLSIFDEEKNKSPSKQQKTLQLQKIQDYPPDHYNIVPTFENEYDNENDPQQINQQVEK 1662 Query: 138 ERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETN 197 N + + Q + +SS + + N+G NRA+ Sbjct: 1663 ADSFCKKNQSQLSNNQGDNNLPSNRKSSQSRRELAKSAQFANEG--------NRALQSHQ 1714 Query: 198 RKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQ 235 + A Q + + Q Sbjct: 1715 SNSRESLSVQTNLAQQGQYTQQVSNQDKPLTQSFVYQQ 1752 >gi|307197463|gb|EFN78697.1| Probable exonuclease mut-7-like protein [Harpegnathos saltator] Length = 1058 Score = 39.3 bits (89), Expect = 0.42, Method: Composition-based stats. Identities = 29/217 (13%), Positives = 60/217 (27%), Gaps = 21/217 (9%) Query: 17 SLQLSANLANASANAYRENIDRMTPDGIWQYKT-SGVDKIIDSFIGR-EISIPHYLQSYS 74 S + + N + A+ G Y +G D I + Q+ Sbjct: 579 SQKTNTNKSTYKKPAHLNLATDNR--GNENYPMNTGAVPKYDGMTNHGSIPKHGFSQNNQ 636 Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 ++ +++Q N H + + P +N+ + N Sbjct: 637 HRHDNRKKYDKQKKYNYN-------------KHDSYNKYDNYNKPDSYNGNNNHSKYENH 683 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAID 194 + N +Q + +R + + D + + NQ ++ Sbjct: 684 NRYNNYNKNDNCNKRENQNKQTHSQNRYDNQSRYDDQS---RYDNQNKRDNHNRYDNQDR 740 Query: 195 ETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNL 231 + D R L++ +IQ K F N Sbjct: 741 RDIQNRQDTRNRQDLQSKKNTRSRQDIQGKQ-DFQNK 776 >gi|50754810|ref|XP_414513.1| PREDICTED: similar to alpha-catenin [Gallus gallus] Length = 905 Score = 39.3 bits (89), Expect = 0.43, Method: Composition-based stats. Identities = 26/138 (18%), Positives = 42/138 (30%), Gaps = 4/138 (2%) Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 +QQ + N SD Q+ + N K ++ +P+ Sbjct: 245 YKQLQQAVTGISNAAQATASDDAAQQQGGGGELAYALNNFDKQIIVDPSTFSEERFRPSL 304 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190 I S +R+ + R+ L+ L N G S A N Sbjct: 305 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 364 Query: 191 RAIDETNRKLHDVRLAAM 208 AID+ +K D+R Sbjct: 365 SAIDKMTKKTRDLRRQLR 382 >gi|224068486|ref|XP_002187404.1| PREDICTED: catenin (cadherin-associated protein), alpha 1, 102kDa [Taeniopygia guttata] Length = 905 Score = 39.3 bits (89), Expect = 0.44, Method: Composition-based stats. Identities = 26/138 (18%), Positives = 42/138 (30%), Gaps = 4/138 (2%) Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 +QQ + N SD Q+ + N K ++ +P+ Sbjct: 245 YKQLQQAVSGISNAAQATASDDAAQQQGGGGELAYALNNFDKQIIVDPSTFSEERFRPSL 304 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190 I S +R+ + R+ L+ L N G S A N Sbjct: 305 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 364 Query: 191 RAIDETNRKLHDVRLAAM 208 AID+ +K D+R Sbjct: 365 SAIDKMTKKTRDLRRQLR 382 >gi|326928261|ref|XP_003210299.1| PREDICTED: catenin alpha-1-like isoform 1 [Meleagris gallopavo] Length = 905 Score = 39.3 bits (89), Expect = 0.44, Method: Composition-based stats. Identities = 26/138 (18%), Positives = 42/138 (30%), Gaps = 4/138 (2%) Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 +QQ + N SD Q+ + N K ++ +P+ Sbjct: 245 YKQLQQAVTGISNAAQATASDDAAQQQGGGGELAYALNNFDKQIIVDPSTFSEERFRPSL 304 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190 I S +R+ + R+ L+ L N G S A N Sbjct: 305 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 364 Query: 191 RAIDETNRKLHDVRLAAM 208 AID+ +K D+R Sbjct: 365 SAIDKMTKKTRDLRRQLR 382 >gi|26249230|ref|NP_755270.1| hypothetical protein c3395 [Escherichia coli CFT073] gi|227888365|ref|ZP_04006170.1| conserved hypothetical protein [Escherichia coli 83972] gi|26109637|gb|AAN81840.1|AE016765_242 Hypothetical protein c3395 [Escherichia coli CFT073] gi|222034514|emb|CAP77256.1| hypothetical protein LF82_435 [Escherichia coli LF82] gi|227834634|gb|EEJ45100.1| conserved hypothetical protein [Escherichia coli 83972] gi|307554795|gb|ADN47570.1| hypothetical protein ECABU_c30980 [Escherichia coli ABU 83972] gi|312947351|gb|ADR28178.1| hypothetical protein NRG857_13830 [Escherichia coli O83:H1 str. NRG 857C] Length = 658 Score = 38.9 bits (88), Expect = 0.52, Method: Composition-based stats. Identities = 16/138 (11%), Positives = 35/138 (25%), Gaps = 2/138 (1%) Query: 25 ANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHN 84 +A + N+ P D G+ ++I L + + + Sbjct: 425 GTPTAQTHFSNLGDGKPFWDSTTTLLQRATWPDPDSGQTLTINAPQVPEPLTAEELKNFD 484 Query: 85 RQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILY 144 + + S + P V+T + D P + Sbjct: 485 QDYARDEKQSGGAGYAYGQINPETKKPVDTDYRYYISLYGYFDRKMVPKKDSGYYQSGPG 544 Query: 145 NYPTMGSQQYEKAFLDRL 162 + +YEK + + Sbjct: 545 S--KEDRVKYEKQSQEEM 560 >gi|149726843|ref|XP_001504306.1| PREDICTED: similar to Catenin alpha-1 (Cadherin-associated protein) (Alpha E-catenin) (NY-REN-13 antigen) [Equus caballus] Length = 905 Score = 38.9 bits (88), Expect = 0.57, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 43/138 (31%), Gaps = 4/138 (2%) Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 +QQ + N SD +Q + N K L ++ +P+ Sbjct: 245 YKQLQQAVTGISNAAQATASDDASQHQGGGGELAYALNNFDKQIIVDPLSFSEERFRPSL 304 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190 I S +R+ + R+ L+ L N G S A N Sbjct: 305 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 364 Query: 191 RAIDETNRKLHDVRLAAM 208 AID+ +K D+R Sbjct: 365 SAIDKMTKKTRDLRRQLR 382 >gi|297676097|ref|XP_002815982.1| PREDICTED: catenin alpha-1-like isoform 1 [Pongo abelii] gi|297676099|ref|XP_002815983.1| PREDICTED: catenin alpha-1-like isoform 2 [Pongo abelii] Length = 905 Score = 38.9 bits (88), Expect = 0.59, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 43/138 (31%), Gaps = 4/138 (2%) Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 +QQ + N SD +Q + N K L ++ +P+ Sbjct: 245 YKQLQQAVTGISNAAQATASDDASQHQGGGGELAYALNNFDKQIIVDPLSFSEERFRPSL 304 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190 I S +R+ + R+ L+ L N G S A N Sbjct: 305 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 364 Query: 191 RAIDETNRKLHDVRLAAM 208 AID+ +K D+R Sbjct: 365 SAIDKMTKKTRDLRRQLR 382 >gi|118389547|ref|XP_001027857.1| hypothetical protein TTHERM_00919640 [Tetrahymena thermophila] gi|89309627|gb|EAS07615.1| hypothetical protein TTHERM_00919640 [Tetrahymena thermophila SB210] Length = 3637 Score = 38.9 bits (88), Expect = 0.59, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 59/177 (33%), Gaps = 9/177 (5%) Query: 59 FIGREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDF 118 G + + P L QQ + N N N + QR T+ +T Sbjct: 2945 ANGSQATSPRIQDLSQLTSDQQSLLNNLNFQN----KIQLQRNSFSQDLLKTNNDTHF-- 2998 Query: 119 PPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH 178 Q++R K S + + +Q L Q D L KL Sbjct: 2999 -EQRIRPFSGVSKIEDSQIRKTSLQLKQQNYAKKQNLSLNQYDLDKIQQNDNHQLIQKLG 3057 Query: 179 NQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQ 235 N+ + +V N+ + + + + + L + Q R+ Q K A F+NL Q Sbjct: 3058 NKNYL--NVNLNQIQNASPSQNNFSKSNTKLDSQRRQTRMTQSQSKIASFNNLNHQQ 3112 >gi|326928263|ref|XP_003210300.1| PREDICTED: catenin alpha-1-like isoform 2 [Meleagris gallopavo] Length = 860 Score = 38.9 bits (88), Expect = 0.60, Method: Composition-based stats. Identities = 26/138 (18%), Positives = 42/138 (30%), Gaps = 4/138 (2%) Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 +QQ + N SD Q+ + N K ++ +P+ Sbjct: 245 YKQLQQAVTGISNAAQATASDDAAQQQGGGGELAYALNNFDKQIIVDPSTFSEERFRPSL 304 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190 I S +R+ + R+ L+ L N G S A N Sbjct: 305 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 364 Query: 191 RAIDETNRKLHDVRLAAM 208 AID+ +K D+R Sbjct: 365 SAIDKMTKKTRDLRRQLR 382 >gi|297676101|ref|XP_002815984.1| PREDICTED: catenin alpha-1-like isoform 3 [Pongo abelii] Length = 890 Score = 38.9 bits (88), Expect = 0.66, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 43/138 (31%), Gaps = 4/138 (2%) Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 +QQ + N SD +Q + N K L ++ +P+ Sbjct: 230 YKQLQQAVTGISNAAQATASDDASQHQGGGGELAYALNNFDKQIIVDPLSFSEERFRPSL 289 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190 I S +R+ + R+ L+ L N G S A N Sbjct: 290 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 349 Query: 191 RAIDETNRKLHDVRLAAM 208 AID+ +K D+R Sbjct: 350 SAIDKMTKKTRDLRRQLR 367 >gi|297676105|ref|XP_002815986.1| PREDICTED: catenin alpha-1-like isoform 5 [Pongo abelii] Length = 782 Score = 38.5 bits (87), Expect = 0.69, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 43/138 (31%), Gaps = 4/138 (2%) Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 +QQ + N SD +Q + N K L ++ +P+ Sbjct: 122 YKQLQQAVTGISNAAQATASDDASQHQGGGGELAYALNNFDKQIIVDPLSFSEERFRPSL 181 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190 I S +R+ + R+ L+ L N G S A N Sbjct: 182 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 241 Query: 191 RAIDETNRKLHDVRLAAM 208 AID+ +K D+R Sbjct: 242 SAIDKMTKKTRDLRRQLR 259 >gi|297676103|ref|XP_002815985.1| PREDICTED: catenin alpha-1-like isoform 4 [Pongo abelii] Length = 802 Score = 38.5 bits (87), Expect = 0.71, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 43/138 (31%), Gaps = 4/138 (2%) Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 +QQ + N SD +Q + N K L ++ +P+ Sbjct: 142 YKQLQQAVTGISNAAQATASDDASQHQGGGGELAYALNNFDKQIIVDPLSFSEERFRPSL 201 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190 I S +R+ + R+ L+ L N G S A N Sbjct: 202 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 261 Query: 191 RAIDETNRKLHDVRLAAM 208 AID+ +K D+R Sbjct: 262 SAIDKMTKKTRDLRRQLR 279 >gi|156395696|ref|XP_001637246.1| predicted protein [Nematostella vectensis] gi|156224357|gb|EDO45183.1| predicted protein [Nematostella vectensis] Length = 1945 Score = 38.5 bits (87), Expect = 0.85, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 47/209 (22%), Gaps = 42/209 (20%) Query: 68 HYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDND 127 Q+ +L P Q ++ L + T T P + Sbjct: 1532 QPGQAVTLRPEQPYGFSQAQRP-TNLQIPARPQ-GPAR--ASTPNTPTSMGLPTSAGSMN 1587 Query: 128 VPEK--PNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH------- 178 P A + + + QQ+ +Q Q L+ L Sbjct: 1588 PPSVYGTQAYQGGNQGLTHPMQQQQQQQFTLQQQRPMQPQAPQGTAILQHPLQAGTQQQQ 1647 Query: 179 ----------------------NQGLVSGSVAWNRAIDETNRKLHDVRLAA-----MLKA 211 NQG ++ + A+ + N+ M + Sbjct: 1648 GGQMNQGIPMSQMSQGMQLPVMNQGGQISQMSQSGAMTQINQGQISQMSQGGQLNQMNQG 1707 Query: 212 SDEQERLDNIQEKHAYFHNLAQA--QGLQ 238 + +Q QG Q Sbjct: 1708 GQMSQMNQGMQMPQMSQGGQMPQMNQGGQ 1736 >gi|300980554|ref|ZP_07175080.1| conserved hypothetical protein [Escherichia coli MS 45-1] gi|300409254|gb|EFJ92792.1| conserved hypothetical protein [Escherichia coli MS 45-1] Length = 439 Score = 38.1 bits (86), Expect = 0.91, Method: Composition-based stats. Identities = 13/131 (9%), Positives = 31/131 (23%) Query: 25 ANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHN 84 +A + N+ P D G+ ++I L + + + Sbjct: 206 GTPTAQTHFSNLGDGKPFWDSTTTLLQRATWPDPDSGQTLTINAPQVPEPLTAEELKNFD 265 Query: 85 RQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILY 144 + + S + P V+T + D P + Sbjct: 266 QDYARDEKQSGGAGYAYGQINPETKKPVDTDYRYYISLYGYFDRKMVPKKDSGYYQSGPG 325 Query: 145 NYPTMGSQQYE 155 + + + Sbjct: 326 SKEDRVKYEKQ 336 >gi|301049406|ref|ZP_07196370.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|300298848|gb|EFJ55233.1| conserved hypothetical protein [Escherichia coli MS 185-1] Length = 440 Score = 38.1 bits (86), Expect = 0.91, Method: Composition-based stats. Identities = 13/131 (9%), Positives = 31/131 (23%) Query: 25 ANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHN 84 +A + N+ P D G+ ++I L + + + Sbjct: 207 GTPTAQTHFSNLGDGKPFWDSTTTLLQRATWPDPDSGQTLTINAPQVPEPLTAEELKNFD 266 Query: 85 RQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILY 144 + + S + P V+T + D P + Sbjct: 267 QDYARDEKQSGGAGYAYGQINPETKKPVDTDYRYYISLYGYFDRKMVPKKDSGYYQSGPG 326 Query: 145 NYPTMGSQQYE 155 + + + Sbjct: 327 SKEDRVKYEKQ 337 >gi|315293827|gb|EFU53179.1| conserved hypothetical protein [Escherichia coli MS 153-1] Length = 441 Score = 38.1 bits (86), Expect = 0.93, Method: Composition-based stats. Identities = 13/131 (9%), Positives = 31/131 (23%) Query: 25 ANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHN 84 +A + N+ P D G+ ++I L + + + Sbjct: 208 GTPTAQTHFSNLGDGKPFWDSTTTLLQRATWPDPDSGQTLTINAPQVPEPLTAEELKNFD 267 Query: 85 RQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILY 144 + + S + P V+T + D P + Sbjct: 268 QDYARDEKQSGGAGYAYGQINPETKKPVDTDYRYYISLYGYFDRKMVPKKDSGYYQSGPG 327 Query: 145 NYPTMGSQQYE 155 + + + Sbjct: 328 SKEDRVKYEKQ 338 >gi|297621819|ref|YP_003709956.1| hypothetical protein wcw_1605 [Waddlia chondrophila WSU 86-1044] gi|297377120|gb|ADI38950.1| putative membrane protein [Waddlia chondrophila WSU 86-1044] Length = 1019 Score = 38.1 bits (86), Expect = 1.1, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 53/179 (29%), Gaps = 21/179 (11%) Query: 18 LQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYS--- 74 ++ N+ + + + I++M +G + G ++ + Sbjct: 339 AEIPENIKSMTQTVEQNAINQMNAEG-----WNIPQSYTPPSNGLSYNMRMQNSADEMFE 393 Query: 75 ---------LHPIQQQ----IHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQ 121 L P QQ+ ++ L+ +L Q + FP Sbjct: 394 GMLQNWDPPLTPDQQKALRNMYYGVEKPAGDLAAVLQQIESGVAAELAAAFGLPDGFPVP 453 Query: 122 QLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQ 180 + + + E+L P +A D + ++ + + L KL NQ Sbjct: 454 KGSFSHQGNINGQFQMKFLELLNALPADQKAAVLQAINDPMNPAISAETKALLNKLFNQ 512 >gi|294636984|ref|ZP_06715306.1| outer membrane usher protein [Edwardsiella tarda ATCC 23685] gi|291089812|gb|EFE22373.1| outer membrane usher protein [Edwardsiella tarda ATCC 23685] Length = 817 Score = 37.4 bits (84), Expect = 1.6, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 64/223 (28%), Gaps = 17/223 (7%) Query: 27 ASANAYRENIDRMTP--DGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSL--HPIQQQI 82 N Y N+ + P G + + D D ++ + + L Q Sbjct: 477 GRKNNYAINLSQTLPPGWGSVFFSGTWRDYWGDGTRRQDYQVSYSNSWQQLNYTLAASQT 536 Query: 83 HNRQNIN--------NLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 +++ + L LS +R L + + ++ N Sbjct: 537 YDQGLNSDRRVYLYFTLPLSFGEPRRSLYLSNATTVDRDGYQSNNASLSGYAGEWQQFNY 596 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAID 194 S+ + +GS +A L +S Q ++ +T + G+ G VA+ + Sbjct: 597 SVSLNNQRQDRLTALGSNLSYRARAVTLNASYSQSQDYRQTSV---GISGGVVAYRGGVL 653 Query: 195 ETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGL 237 + L D + ++ A L Sbjct: 654 -FSNALTDTMAIVDAPGLRDAS-VNGYGYHATNGAGQALYAAL 694 >gi|259502965|ref|ZP_05745867.1| hypothetical protein HMPREF0494_1261 [Lactobacillus antri DSM 16041] gi|259169090|gb|EEW53585.1| hypothetical protein HMPREF0494_1261 [Lactobacillus antri DSM 16041] Length = 617 Score = 37.4 bits (84), Expect = 1.8, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 49/151 (32%), Gaps = 12/151 (7%) Query: 81 QIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERK 140 I+ ++ N LLT ++ L + K D +K SLE+ Sbjct: 422 AIYRQELQQN-----LLTDQLG-LPFYLPNKDQLLKYRLSGYQEDVLAVQKYQQSLEQNA 475 Query: 141 EILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLV----SGSVAWNRAIDET 196 + + + + F + S + LE +L QG + W A+ E Sbjct: 476 HVPRADALQWTSRVRRLFNHQFIQSFDDSQAALERELTAQGYTWTNPADREQWRAALREL 535 Query: 197 --NRKLHDVRLAAMLKASDEQERLDNIQEKH 225 +L R M + + +D +Q Sbjct: 536 VPGLRLFVRRGLTMAERNQRASVIDEVQRHQ 566 >gi|269139573|ref|YP_003296274.1| putative outer membrane protein [Edwardsiella tarda EIB202] gi|267985234|gb|ACY85063.1| putative outer membrane protein [Edwardsiella tarda EIB202] gi|304559461|gb|ADM42125.1| Fimbriae usher protein StcC [Edwardsiella tarda FL6-60] Length = 817 Score = 37.0 bits (83), Expect = 2.0, Method: Composition-based stats. Identities = 29/223 (13%), Positives = 63/223 (28%), Gaps = 17/223 (7%) Query: 27 ASANAYRENIDRMTP--DGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHP--IQQQI 82 N Y N+ + P G + + D D ++ + + L Q Sbjct: 477 GRKNNYAINLSQTLPQGWGSVFFSGTWRDYWGDGARRQDYQVSYSNSWQQLSYTLAASQT 536 Query: 83 HNRQNIN--------NLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134 +++ + L LS +R L + + ++ N Sbjct: 537 YDQGLNSDRRFYLYFTLPLSVGEPRRTLYLSNATTFDRDGYQSNNASLSGYAGEWQQFNY 596 Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAID 194 S+ + +G+ +A L +S Q ++ +T G+ G +A+ + Sbjct: 597 SVSLNNQRQDRLTALGTNLSYRARSATLSASYSQSQDYRQTS---AGISGGVLAYRGGVL 653 Query: 195 ETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGL 237 + L D + ++ A L Sbjct: 654 -FSNALTDTMAIVDAPGLRDAS-VNGYGYHATNGAGQALYAAL 694 >gi|238881789|gb|EEQ45427.1| conserved hypothetical protein [Candida albicans WO-1] Length = 985 Score = 37.0 bits (83), Expect = 2.2, Method: Composition-based stats. Identities = 20/184 (10%), Positives = 55/184 (29%), Gaps = 6/184 (3%) Query: 55 IIDSFIGREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNT 114 DS + + QS Q +++N L + H Sbjct: 10 FADSNSNDDFLNSIFDQSQGEQQAPQVAQVSTSMSNPPLQSQSASSTSRISQAHTPMYQQ 69 Query: 115 TKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLE 174 + + + +P+ S+ + Q+ + ++ QQ ++ + Sbjct: 70 S------PVTAHTIPQNSPQSMPNQVAQPQQQIPPPPSQHLQQTTAQMLPQQQQQQQQQQ 123 Query: 175 TKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQA 234 + Q + + + + ++ + + M A ++ I + N ++ Sbjct: 124 QQKQEQLYRMKQQIYQQQMLKKQQENMSRQPSPMNSAGHNTQQNTPITQNAKTPQNNSKL 183 Query: 235 QGLQ 238 Q +Q Sbjct: 184 QSMQ 187 >gi|225159124|ref|ZP_03725430.1| OmpA/MotB domain protein [Opitutaceae bacterium TAV2] gi|224802279|gb|EEG20545.1| OmpA/MotB domain protein [Opitutaceae bacterium TAV2] Length = 199 Score = 37.0 bits (83), Expect = 2.5, Method: Composition-based stats. Identities = 11/103 (10%), Positives = 31/103 (30%), Gaps = 9/103 (8%) Query: 2 GKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIG 61 G ++ + PDP++ Q ++ TP G + D+ G Sbjct: 22 GCKKKPVRPDPQSTLIGQTPGGNDT--HSSGLN----TTPFGDLTPSPLPAGLVSDTGSG 75 Query: 62 REISIPHYLQSYSLHPIQQQIH---NRQNINNLLLSDLLTQRI 101 ++ + Q ++ ++ + + + + Sbjct: 76 LQLGTTDASHGNQIRDAVQSVYFAFDQSAVRQEERAKIQDAQN 118 >gi|229593727|ref|XP_001026894.2| CAF1 family ribonuclease containing protein [Tetrahymena thermophila] gi|225567345|gb|EAS06649.2| CAF1 family ribonuclease containing protein [Tetrahymena thermophila SB210] Length = 1272 Score = 36.6 bits (82), Expect = 2.8, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 68/223 (30%), Gaps = 23/223 (10%) Query: 18 LQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISI---PHYLQSYS 74 Q L S + + TP S + + + I + P+ + + Sbjct: 705 QQTQPQLVTYSYQPAMSYVSQTTPTNTIPIVQSYIQPVPIQVPNQNIVVQNPPNITYTTT 764 Query: 75 LHPIQQQIHNRQNINNLLLSD---LLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEK 131 P Q+H + L+ L T +I+ L P V T + + L N P Sbjct: 765 SVPNTTQVHLVPQKTSYLIESKPILQTSQIRILSPISSNRVQTNDEDFTKPLFTNKSPYS 824 Query: 132 PNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNR 191 + + + L++ + + Q ++++ +T+ N A+ R Sbjct: 825 KKYDEQRSQRWQEFSKD-------DSRLNQFEYNRQYNQQNEQTRQFN--------AYQR 869 Query: 192 AIDETNRKLHDVRLAAMLK--ASDEQERLDNIQEKHAYFHNLA 232 ++ N + R + + + + A Sbjct: 870 SVTNENNQRSTYRFEERNQFEGQQNYKNQQLYSQNVSQVAPPA 912 >gi|312219889|emb|CBX99831.1| similar to nuclear pore protein (Nic96) [Leptosphaeria maculans] Length = 1018 Score = 36.6 bits (82), Expect = 3.3, Method: Composition-based stats. Identities = 19/153 (12%), Positives = 48/153 (31%), Gaps = 3/153 (1%) Query: 83 HNRQNINNLLLSDLL--TQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERK 140 + L L D+ + + P + +D L K ++L + Sbjct: 73 FDELPSLQLGLGDIARKVRNLGSGGPSADQVQDRAQDRAAHYLLSASGV-KMGSTLRDLN 131 Query: 141 EILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKL 200 + + Q + F D + + L +GL ++ +++ + Sbjct: 132 QFSTQAGIPTNGQAQNLFDDDVDGYISNLHSQSTLALIQEGLEQSKRDFDTFLEDNVQIE 191 Query: 201 HDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQ 233 D + + + + +++ A F N A+ Sbjct: 192 WDKQRQRIYEHFGLGRQSEDMAASQATFGNTAR 224 >gi|91793868|ref|YP_563519.1| chromosome segregation protein SMC [Shewanella denitrificans OS217] gi|91715870|gb|ABE55796.1| Chromosome segregation protein SMC [Shewanella denitrificans OS217] Length = 1138 Score = 36.2 bits (81), Expect = 3.6, Method: Composition-based stats. Identities = 13/109 (11%), Positives = 39/109 (35%) Query: 72 SYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEK 131 +L ++ ++ + L+ + + + Q + V +D+++ Sbjct: 675 KQALSSEMAKLLHQDDAKETNLAKIASSQAQLEQQREDSQVQLLALMTLLDSQDDELQGL 734 Query: 132 PNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQ 180 E +E L + + ++ D ++ + R+ L T++ Q Sbjct: 735 SKKQQELNQEWLSVSAQLRQAKAQRIEQDNIKRQHEHARQTLSTQVALQ 783 >gi|14133650|gb|AAK54090.1|AF362371_1 histidine kinase DhkI [Dictyostelium discoideum] Length = 1736 Score = 36.2 bits (81), Expect = 4.0, Method: Composition-based stats. Identities = 14/82 (17%), Positives = 26/82 (31%), Gaps = 9/82 (10%) Query: 7 SLAPDPKAIASLQLSANLANASANAYRENIDRMTPDG---------IWQYKTSGVDKIID 57 + + + Q S+N+ N + N + +TP+G Q +S Sbjct: 1280 PNSSNSTSTNVTQSSSNIINNGNSITIINNNPVTPNGKKIVIVPLLSLQSASSPKQSQRG 1339 Query: 58 SFIGREISIPHYLQSYSLHPIQ 79 ++ S Y P Q Sbjct: 1340 YSPKQQYSPKQYSPKQQYSPKQ 1361 >gi|66822471|ref|XP_644590.1| histidine kinase [Dictyostelium discoideum AX4] gi|74860532|sp|Q86AT9|DHKI_DICDI RecName: Full=Hybrid signal transduction histidine kinase I gi|60472742|gb|EAL70692.1| histidine kinase [Dictyostelium discoideum AX4] Length = 1736 Score = 36.2 bits (81), Expect = 4.0, Method: Composition-based stats. Identities = 14/82 (17%), Positives = 26/82 (31%), Gaps = 9/82 (10%) Query: 7 SLAPDPKAIASLQLSANLANASANAYRENIDRMTPDG---------IWQYKTSGVDKIID 57 + + + Q S+N+ N + N + +TP+G Q +S Sbjct: 1280 PNSSNSTSTNVTQSSSNIINNGNSITIINNNPVTPNGKKIVIVPLLSLQSASSPKQSQRG 1339 Query: 58 SFIGREISIPHYLQSYSLHPIQ 79 ++ S Y P Q Sbjct: 1340 YSPKQQYSPKQYSPKQQYSPKQ 1361 >gi|17313245|ref|NP_490625.1| hypothetical protein phiCTXp28 [Pseudomonas phage phiCTX] gi|4063799|dbj|BAA36253.1| unnamed protein product [Pseudomonas phage phiCTX] Length = 904 Score = 36.2 bits (81), Expect = 4.2, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 47/158 (29%), Gaps = 19/158 (12%) Query: 88 INNLLLSDLLTQRIQ--DLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILYN 145 L L Q+ + QQ R + ++ A+ + + + Sbjct: 37 ATRERLKQLNAQQSDVRAFRTQRGALEQVSTALAAQQARVKALAQQMAAAGNPTRALTRD 96 Query: 146 YPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRL 205 Y E FL + + L T+L N G+ + + D+R Sbjct: 97 YNRAIR---EAGFLKQQHLQHSHALQQLRTRLSNAGIST---------RNLGQHERDLRA 144 Query: 206 AAMLKAS---DEQERLDNIQEKHA--YFHNLAQAQGLQ 238 + +RL N+ ++ ++G+Q Sbjct: 145 QIQAANGAINSQAQRLRNLSQQQERLTQARNTYSRGIQ 182 >gi|268638179|ref|XP_002649186.1| histidine kinase [Dictyostelium discoideum AX4] gi|256013041|gb|EEU04134.1| histidine kinase [Dictyostelium discoideum AX4] Length = 1732 Score = 36.2 bits (81), Expect = 4.3, Method: Composition-based stats. Identities = 14/82 (17%), Positives = 26/82 (31%), Gaps = 9/82 (10%) Query: 7 SLAPDPKAIASLQLSANLANASANAYRENIDRMTPDG---------IWQYKTSGVDKIID 57 + + + Q S+N+ N + N + +TP+G Q +S Sbjct: 1280 PNSSNSTSTNVTQSSSNIINNGNSITIINNNPVTPNGKKIVIVPLLSLQSASSPKQSQRG 1339 Query: 58 SFIGREISIPHYLQSYSLHPIQ 79 ++ S Y P Q Sbjct: 1340 YSPKQQYSPKQYSPKQQYSPKQ 1361 >gi|42519054|ref|NP_964984.1| hypothetical protein LJ1128 [Lactobacillus johnsonii NCC 533] gi|41583341|gb|AAS08950.1| hypothetical protein LJ_1128 [Lactobacillus johnsonii NCC 533] Length = 4734 Score = 35.4 bits (79), Expect = 6.2, Method: Composition-based stats. Identities = 24/223 (10%), Positives = 50/223 (22%), Gaps = 4/223 (1%) Query: 16 ASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSL 75 A+ + NA A T Q + +S Sbjct: 1030 ATATQITDALNAINTAKGNLKGEATDKAALQTAVDNSATVKESNNYTNADQTQKTAYDKA 1089 Query: 76 HPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNAS 135 Q + ++ N ++ L + + + D P N Sbjct: 1090 VTAAQTVLDKTNATQAEVNQALQDLETANRNLNGDAKTEAANKAALEAAVKDAPNVRNTP 1149 Query: 136 LEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQ----GLVSGSVAWNR 191 + + L+ S + + L+ + G + + A Sbjct: 1150 AYYNGSEETQTAYNNAITAGQTVLNEANPSASEVKNALDAINAAKDNLKGKATNTEALET 1209 Query: 192 AIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQA 234 A+ N +A+ E I + + A Sbjct: 1210 ALTNANNAKETGNYTNADQANQEALNNAIIAGQEILKNTSATQ 1252 >gi|144898504|emb|CAM75368.1| RTX toxins and related Ca2+-binding proteins [Magnetospirillum gryphiswaldense MSR-1] Length = 897 Score = 35.0 bits (78), Expect = 7.7, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 42/145 (28%), Gaps = 1/145 (0%) Query: 22 ANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQ 81 A+L + A N + ++PDG WQY + G +D + S L + + Sbjct: 202 ADLGTLAGVAVIGN-NTVSPDGAWQYSSDGGTTWVDVGGVNDNSSALALSASTKLRFNAA 260 Query: 82 IHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKE 141 +L + L + V T P + + + + Sbjct: 261 PDFHGTAPSLYVRGLDNSYAGGWSSSTGSAVYTNTSSPGGSSAIAAAATELSTDVNAVND 320 Query: 142 ILYNYPTMGSQQYEKAFLDRLQSSL 166 + + E ++ L Sbjct: 321 APTSSAVTLTAGVENVLYTFTETQL 345 >gi|116494973|ref|YP_806707.1| ATP-dependent exoDNAse (exonuclease V) beta subunit [Lactobacillus casei ATCC 334] gi|122263609|sp|Q038V7|ADDA_LACC3 RecName: Full=ATP-dependent helicase/nuclease subunit A; AltName: Full=ATP-dependent helicase/nuclease AddA gi|116105123|gb|ABJ70265.1| DNA helicase/exodeoxyribonuclease V, subunit A [Lactobacillus casei ATCC 334] Length = 1234 Score = 35.0 bits (78), Expect = 7.7, Method: Composition-based stats. Identities = 18/142 (12%), Positives = 34/142 (23%), Gaps = 13/142 (9%) Query: 28 SANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQ------SYSLHPIQQQ 81 AN + D+ T I K K +D E +P Y + +L + Sbjct: 816 QANKHFNMSDQ-TGTAILT-KQGIGIKWLDPETRVEYELPQYQAAKAARQNQTLAEEMRL 873 Query: 82 IHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA-----SL 136 ++ L + + L V + R + A + Sbjct: 874 LYVALTRAQQRLYVVGATMSGNQLTSADKTVEKWAAAAEGEARVLAPQVRSGATSYLDWI 933 Query: 137 EERKEILYNYPTMGSQQYEKAF 158 + + A Sbjct: 934 GPALIRHPQARGLAETTIKPAL 955 >gi|307186073|gb|EFN71805.1| Pre-mRNA cleavage complex 2 protein Pcf11 [Camponotus floridanus] Length = 1838 Score = 35.0 bits (78), Expect = 8.3, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 52/188 (27%), Gaps = 11/188 (5%) Query: 2 GKQRASLAPD--PKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSF 59 GK A + P+ PK +A AN A A + P + + D Sbjct: 210 GKPVAPVVPNQTPKQVAKQNAGANSGPRIAPASSIAVASAKPV-SRDPRLKPTPAVHDVT 268 Query: 60 IGREISIPHYLQSYSLHP-----IQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNT 114 I + + S Q + N +N L L ++ + + + Sbjct: 269 TVPTIDLRQRPGTTSPKELRNEGQTQPVVNTIVTSNQLKQQLPSKPA--VTSTINKPPAS 326 Query: 115 TKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLE 174 L + + +L ++ + + L + +L Sbjct: 327 PAGSDNPTLLNVINNNHADTNLNNSNNKTFS-GNINKDAVSHRTSQKKDPRLTSNSVNLN 385 Query: 175 TKLHNQGL 182 + QGL Sbjct: 386 SSKIGQGL 393 >gi|17231202|ref|NP_487750.1| heterocyst specific ABC-transporter, membrane fusion protein [Nostoc sp. PCC 7120] gi|1490222|emb|CAA67985.1| devB [Nostoc sp. PCC 7120] gi|17132844|dbj|BAB75409.1| heterocyst specific ABC-transporter, membrane fusion protein [Nostoc sp. PCC 7120] Length = 474 Score = 35.0 bits (78), Expect = 8.8, Method: Composition-based stats. Identities = 20/148 (13%), Positives = 51/148 (34%), Gaps = 8/148 (5%) Query: 71 QSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPE 130 Q+ + +Q Q+ ++ + +Q + + + QQ Sbjct: 149 QTAVIARLQAQLVGEMGAQQASITRIASQLSGEKVAQQALVNRLEAELVGQQDSLRATLN 208 Query: 131 KPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWN 190 + A + + +Y+ + + S ++DR L NQ ++ A Sbjct: 209 RIRAEQRNA--------QVDAGRYDFLYREGAISQQERDRRRLTATTANQQVIESQAALR 260 Query: 191 RAIDETNRKLHDVRLAAMLKASDEQERL 218 +A+ +++ + R M + Q++L Sbjct: 261 QALATLRQQVAEARANQMKTLASLQQQL 288 >gi|331694576|ref|YP_004330815.1| putative ECF subfamily RNA polymerase sigma-24 subunit [Pseudonocardia dioxanivorans CB1190] gi|326949265|gb|AEA22962.1| putative RNA polymerase, sigma-24 subunit, ECF subfamily [Pseudonocardia dioxanivorans CB1190] Length = 383 Score = 35.0 bits (78), Expect = 9.2, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 46/149 (30%), Gaps = 4/149 (2%) Query: 40 TPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQ 99 +P V + I +P + + +++I L+ +L Sbjct: 107 SPAAAVALTLRAVGGLTTRQIAAAHMVPEATMAQRISRAKRRIEGLPLDAPGDLTTVLRV 166 Query: 100 RIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFL 159 + V+ + + + +P A+ +L++ A Sbjct: 167 LYLVFNEGYGGDVDLAAEAIRLARQLAALSAEPEAAGLLALMLLHHARRASRT----APD 222 Query: 160 DRLQSSLQQDREDLETKLHNQGLVSGSVA 188 RL +QDR +T++ QG+ A Sbjct: 223 GRLVPLAEQDRSSWDTRMIEQGVAILQAA 251 >gi|285817113|gb|ADC37600.1| Putative Staphylococcal surface anchored protein; adhesin emb [Staphylococcus aureus 04-02981] Length = 970 Score = 35.0 bits (78), Expect = 9.4, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 64/194 (32%), Gaps = 5/194 (2%) Query: 21 SANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQ 80 + N A AN+ NI++ T + + + I+ EI + + Q Sbjct: 108 AKNKAEELANSIINNINKATSNQAVSQVQTAGNHAIEQVHANEIPKAKIDANKDVDKQVQ 167 Query: 81 QIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLE-ER 139 + + + N L + N + ++ + + A E + Sbjct: 168 ALIDEIDRNPNLTDKEKQALKDRINQILQQGHNDINNALTKEEIEQAKAQLAQALKEIKD 227 Query: 140 KEILYNYPTMGSQQYEKAFLDRLQ---SSLQQDREDLETKLHNQGLVSGSVAWNRAIDET 196 + +A +D + + ++++ L+ + NQ L G N A+ + Sbjct: 228 LVKAKENAKQDVDKQVQALIDEIDQNPNLTDKEKQALKDR-INQILQQGHNDINNAMTKE 286 Query: 197 NRKLHDVRLAAMLK 210 + +LA L+ Sbjct: 287 EIEQAKAQLAQALQ 300 >gi|253732074|ref|ZP_04866239.1| possible cell wall associated fibronectin-binding protein [Staphylococcus aureus subsp. aureus USA300_TCH959] gi|253724190|gb|EES92919.1| possible cell wall associated fibronectin-binding protein [Staphylococcus aureus subsp. aureus USA300_TCH959] Length = 1136 Score = 35.0 bits (78), Expect = 9.5, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 64/194 (32%), Gaps = 5/194 (2%) Query: 21 SANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQ 80 + N A AN+ NI++ T + + + I+ EI + + Q Sbjct: 197 AKNKAEELANSIINNINKATSNQAVSQVQTAGNHAIEQVHANEIPKAKIDANKDVDKQVQ 256 Query: 81 QIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLE-ER 139 + + + N L + N + ++ + + A + + Sbjct: 257 ALIDEIDRNPNLTDKEKQALKDRINQILQQGHNDINNALTKEEIEQAKAQLAQALQDIKD 316 Query: 140 KEILYNYPTMGSQQYEKAFLDRLQ---SSLQQDREDLETKLHNQGLVSGSVAWNRAIDET 196 + +A +D + + ++++ L+ + NQ L G N A+ + Sbjct: 317 LVKAKEDAKQDVDKQVQALIDEIDQNPNLTDKEKQALKDR-INQILQQGHNDINNAMTKE 375 Query: 197 NRKLHDVRLAAMLK 210 + RLA L+ Sbjct: 376 AIEQAKERLAQALQ 389 >gi|227512827|ref|ZP_03942876.1| conserved hypothetical protein [Lactobacillus buchneri ATCC 11577] gi|227083827|gb|EEI19139.1| conserved hypothetical protein [Lactobacillus buchneri ATCC 11577] Length = 461 Score = 34.7 bits (77), Expect = 9.9, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 57/226 (25%), Gaps = 12/226 (5%) Query: 9 APDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVD-KIIDSFIGREISIP 67 APDP S + N+ ID TP G K D S Sbjct: 32 APDPANNISQVNAGNVLKDYTQKNLNVIDNTTPKGNMDRKYIERTIDKNDPGTVESYSTT 91 Query: 68 HYLQSYSLHPIQQQIHNRQNINN---LLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124 + + + + N+ N + ++ +T + Sbjct: 92 PDSTQQTTLQTKLYLPDGFNVTNYQHGNFQSVTLDDSGNMYFIESNGSDTNLGVIVKY-N 150 Query: 125 DNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVS 184 D+ + S +NY + + + + E L+ N L Sbjct: 151 LADLNKLGAGSDPMIVWNAFNYFNPYTDEGVQH-----NQQYEDAYEQLKA--PNADLKK 203 Query: 185 GSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHN 230 T++K L A + Q D Q K N Sbjct: 204 VKSEVQNLQSTTSKKDATKANRQKLSALENQLETDQKQIKRIKQQN 249 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.292 0.102 0.232 Lambda K H 0.267 0.0313 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 802,609,314 Number of Sequences: 13984884 Number of extensions: 15977103 Number of successful extensions: 214152 Number of sequences better than 10.0: 4523 Number of HSP's better than 10.0 without gapping: 1033 Number of HSP's successfully gapped in prelim test: 3490 Number of HSP's that attempted gapping in prelim test: 159775 Number of HSP's gapped (non-prelim): 23237 length of query: 238 length of database: 4,792,584,752 effective HSP length: 135 effective length of query: 103 effective length of database: 2,904,625,412 effective search space: 299176417436 effective search space used: 299176417436 T: 11 A: 40 X1: 16 ( 6.8 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.0 bits) S2: 78 (35.0 bits)