BLASTP 2.2.22 [Sep-27-2009]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= 537021.9.peg.1064_1
(238 letters)
Database: nr
13,984,884 sequences; 4,792,584,752 total letters
Searching..................................................done
>gi|227822435|ref|YP_002826407.1| hypothetical protein NGR_c18900 [Sinorhizobium fredii NGR234]
gi|227341436|gb|ACP25654.1| hypothetical protein NGR_c18900 [Sinorhizobium fredii NGR234]
Length = 453
Score = 192 bits (487), Expect = 3e-47, Method: Composition-based stats.
Identities = 84/240 (35%), Positives = 122/240 (50%), Gaps = 12/240 (5%)
Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60
MGK +A PDPKA A+ Q + N+ A AN Y N++++TPDG Y + K D
Sbjct: 1 MGKSKAPTPPDPKATAAAQTATNIGTAVANGYMGNVNQVTPDGSLTYSYT-KQKWTDPLS 59
Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDF-- 118
G +P + +L +Q +I + + +L L+ L T + L ++ +K
Sbjct: 60 GNVYDLPVATATQTLSEMQDKIKKQNDQASLNLATLATSQSSRLNDLLGKPMDISKAPAA 119
Query: 119 -------PPQQLRDNDVPEKPNASLEERKEILYNYPT-MGSQQYEKAFLDRLQSSLQQDR 170
PQ + + PE S+ I +Y T + +YE A + RL L++DR
Sbjct: 120 GDHSKLTLPQYQQFSAGPE-LQTSVGNAGNIARSYETDFDTSKYENALMARLNPQLERDR 178
Query: 171 EDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHN 230
LET+L NQGL GS A+NRAIDE NR +D R+AA+L A EQ RL N+ + A F N
Sbjct: 179 AALETRLANQGLQPGSEAYNRAIDEANRTSNDARIAAVLNAGQEQTRLANLANQKASFEN 238
>gi|265985067|ref|ZP_06097802.1| conserved hypothetical protein [Brucella sp. 83/13]
gi|264663659|gb|EEZ33920.1| conserved hypothetical protein [Brucella sp. 83/13]
Length = 299
Score = 173 bits (437), Expect = 2e-41, Method: Composition-based stats.
Identities = 68/236 (28%), Positives = 113/236 (47%), Gaps = 1/236 (0%)
Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60
MGK +A +PDPK ++ Q N+ A AN+Y N++++TPDG Y +G+ K D +
Sbjct: 1 MGKSKAPKSPDPKETSAAQTGTNIGTAVANSYLNNVNQVTPDGSLTYSQTGMQKYYDPYT 60
Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPP 120
G+ IP + + L QQ I ++++ NL L L + L + +
Sbjct: 61 GKSYDIPQFTATQQLSQQQQAIKDQEDATNLNLGKLANSQSSRLNDLLGKPFDLSGAPAA 120
Query: 121 QQLRDNDVPEKPNASLEERKEILYNYP-TMGSQQYEKAFLDRLQSSLQQDREDLETKLHN 179
+ P+ + + + Y + Q+ E A + R+ L+QDR LE +L N
Sbjct: 121 GNAGNMTAPQYQQYTGGPQLQTSYTDDFSADRQKVEDALMSRINPQLEQDRSALEQRLAN 180
Query: 180 QGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQ 235
QG++ GS A+ A+++ + +D R+ A+L EQ RL + A F N A Q
Sbjct: 181 QGIMPGSKAFETAMNQNAQASNDARMQAILAGGQEQSRLAGLSRDQATFGNNANQQ 236
>gi|150397020|ref|YP_001327487.1| hypothetical protein Smed_1817 [Sinorhizobium medicae WSM419]
gi|150028535|gb|ABR60652.1| hypothetical protein Smed_1817 [Sinorhizobium medicae WSM419]
Length = 532
Score = 170 bits (430), Expect = 1e-40, Method: Composition-based stats.
Identities = 80/272 (29%), Positives = 116/272 (42%), Gaps = 40/272 (14%)
Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64
A APDPK AS Q + N+ A AN N +++TPDG Y + K D G+E
Sbjct: 6 SAPEAPDPKQTASAQTATNIGTAVANNVMGNANQVTPDGNLTYTYN-TQKWTDPLSGKEY 64
Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDF------ 118
+ + +L P QQ I ++++ L L+ L + L + + +
Sbjct: 65 DLKVPTATQTLSPAQQAIKDQEDAAQLNLATLANTQSGKLNGLLASKFDISGAPAAGKSD 124
Query: 119 ---PPQQLRDNDVP-----------------------------EKPNASLEERKEILYNY 146
PQ P K SL I +Y
Sbjct: 125 AIGLPQYQSFTSGPKLQTSLANAGNVQSSIAGAGSIQSQVADSGKIQTSLGNAGNITESY 184
Query: 147 P-TMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRL 205
+ + +YE+A +DRL +++DR LETKL NQGL GS A++RA+DE NR +D R+
Sbjct: 185 DFDIDTSKYEQALMDRLSPQIERDRAALETKLTNQGLQPGSEAYDRAMDEANRAANDARI 244
Query: 206 AAMLKASDEQERLDNIQEKHAYFHNLAQAQGL 237
A L A EQ R+ + + A F N AQ Q
Sbjct: 245 GATLSAGQEQSRIAGLAQNQAQFQNSAQQQAY 276
>gi|315122526|ref|YP_004063015.1| hypothetical protein CKC_03890 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313495928|gb|ADR52527.1| hypothetical protein CKC_03890 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 389
Score = 169 bits (427), Expect = 3e-40, Method: Composition-based stats.
Identities = 128/241 (53%), Positives = 171/241 (70%), Gaps = 3/241 (1%)
Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60
MGKQ++ L+PDPKA+AS+QLS N+ N+ N+ R N++ +TPDGI +Y GVDK+ID F
Sbjct: 1 MGKQQSFLSPDPKAVASMQLSENINNSLFNSSRANMNEITPDGILRYTQEGVDKMIDPFS 60
Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFP- 119
G+E+SIP Y +SY L P+ Q ++NR+N N++L S+LLTQR+Q+ +P + +
Sbjct: 61 GQELSIPRYSRSYELSPVAQDLYNRRNANHILFSNLLTQRLQNFMPSPQNNSMNLQQPLA 120
Query: 120 -PQQLRDNDVPEKPNASLEERKE-ILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKL 177
P + + S E++E ILY+Y QQYE LDRLQ L+QDREDLET+L
Sbjct: 121 IPDPAHNPIPEGTNHFSQPEQEEGILYDYGKNNGQQYENTLLDRLQPRLKQDREDLETRL 180
Query: 178 HNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGL 237
NQGL+ GSV+WNR IDE NRKL+D RLAA+LK+S+EQERLDN++EK AYFHN AQAQ
Sbjct: 181 SNQGLMPGSVSWNRTIDENNRKLNDARLAALLKSSEEQERLDNMREKQAYFHNFAQAQSH 240
Query: 238 Q 238
Q
Sbjct: 241 Q 241
>gi|110632598|ref|YP_672806.1| hypothetical protein Meso_0237 [Mesorhizobium sp. BNC1]
gi|110283582|gb|ABG61641.1| conserved hypothetical protein [Chelativorans sp. BNC1]
Length = 322
Score = 168 bits (424), Expect = 6e-40, Method: Composition-based stats.
Identities = 44/222 (19%), Positives = 73/222 (32%), Gaps = 34/222 (15%)
Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64
+A APDP A+ Q + N A I + TP G Y+ +G I D G+ I
Sbjct: 2 KAPKAPDPWQTAAAQGAWNSFTAQQQQSMNMIGQNTPWGSLDYQQTGSTWITDP-TGKRI 60
Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124
+P Y + +L P QQ I R L+ + +
Sbjct: 61 EMPTYTANVNLSPEQQAIFERTQAAEGNLAQIAQDQS----------------------- 97
Query: 125 DNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLD-RLQSSLQQDREDLETKLHNQGLV 183
L E + + + ++++ R+ +Q+++ L T+L N GL
Sbjct: 98 ---------EWLGEYLQEPFEFNNRDAEEWVWDLASPRILQQQEQNQQALRTQLINSGLR 148
Query: 184 SGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKH 225
G+ AW+ + D L Q
Sbjct: 149 PGTTAWDAEMTRLTNANTDQMNQLALTGRQMAFNEALAQRNQ 190
>gi|316933872|ref|YP_004108854.1| hypothetical protein Rpdx1_2530 [Rhodopseudomonas palustris DX-1]
gi|315601586|gb|ADU44121.1| hypothetical protein Rpdx1_2530 [Rhodopseudomonas palustris DX-1]
Length = 341
Score = 161 bits (406), Expect = 7e-38, Method: Composition-based stats.
Identities = 48/238 (20%), Positives = 73/238 (30%), Gaps = 32/238 (13%)
Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60
M APDP A Q NL +D++TP G Y +G +
Sbjct: 1 MDTPEPPAAPDPVKTAEAQGQMNLTTGVQQQLLNMVDQVTPTGSLTYSQNGTTSFV-GAD 59
Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPP 120
G+ ++P + + +L P QQ + + N L + + + T++
Sbjct: 60 GKTYTVPRFTSTQTLTPAQQALLDLSNKTQANLGQIGVDQSAKIGSLLGTNLKL------ 113
Query: 121 QQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQ 180
E A L E RL Q E L T+L NQ
Sbjct: 114 -------GNEATEARLMELGS------------------ARLDPKFAQSEEALRTRLANQ 148
Query: 181 GLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238
G+ GS AWN + + +D +L + A G Q
Sbjct: 149 GIQPGSAAWNAEMKSFSEGKNDAYNQLLLSGRQLANTEIQAERNAPINEITALLSGSQ 206
>gi|218673260|ref|ZP_03522929.1| hypothetical protein RetlG_17541 [Rhizobium etli GR56]
Length = 334
Score = 157 bits (395), Expect = 2e-36, Method: Composition-based stats.
Identities = 43/236 (18%), Positives = 85/236 (36%), Gaps = 32/236 (13%)
Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62
+A APDP A+ Q + N+ A ANA + ++ TPDG +YK +G + D G+
Sbjct: 4 TPKAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTGYQTMKDQ-NGK 62
Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122
+P Y + P Q I+++ L L+ L + + T+V+ + +
Sbjct: 63 SYQLPTYSAYQTYSPENQAIYDQTQQTQLGLARLANDQTGKISGILGTNVDLSAGNVDKY 122
Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182
+ ++ + + +D+ LE L ++G+
Sbjct: 123 VNNH-------------------------------WQSGFNNQWDRDQASLEQSLADKGI 151
Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238
GS A++ A+ + + + + + + A G Q
Sbjct: 152 SMGSAAYDNAMRDFSTRKQAASDQYLGDMYSNAQNSILTERNQPLNEISALMSGSQ 207
>gi|327191473|gb|EGE58493.1| hypothetical protein RHECNPAF_300003 [Rhizobium etli CNPAF512]
Length = 335
Score = 156 bits (394), Expect = 2e-36, Method: Composition-based stats.
Identities = 42/236 (17%), Positives = 84/236 (35%), Gaps = 32/236 (13%)
Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62
+A APDP A+ Q + N+ A ANA + ++ TPDG +YK + + D G+
Sbjct: 4 TPKAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTSKSIMKDQ-NGK 62
Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122
+P Y + P Q I+++ L L+ L + + T+V+ + +
Sbjct: 63 TYELPVYSAYQTYSPENQAIYDQTQQTQLGLARLANDQTGKISGILGTNVDLSAGNVDKY 122
Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182
+ ++ + + +D+ LE L ++G+
Sbjct: 123 VNNH-------------------------------WQSGFDNQWNRDQASLEQSLADKGI 151
Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238
GS A++ A+ + + + + + + A G Q
Sbjct: 152 AMGSAAYDNAMRDFSTRKQAASDQYLGDMYSNAQNAILTERNQPLNEISALMSGSQ 207
>gi|209548343|ref|YP_002280260.1| hypothetical protein Rleg2_0738 [Rhizobium leguminosarum bv.
trifolii WSM2304]
gi|209534099|gb|ACI54034.1| conserved hypothetical protein [Rhizobium leguminosarum bv.
trifolii WSM2304]
Length = 334
Score = 155 bits (390), Expect = 6e-36, Method: Composition-based stats.
Identities = 43/236 (18%), Positives = 84/236 (35%), Gaps = 32/236 (13%)
Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62
+A APDP A+ Q + N+ A ANA +++ TPDG +YK +G + D G+
Sbjct: 4 TPKAPKAPDPTQTAAAQTATNVDTAIANAGLSYVNQYTPDGSLEYKVTGQQTMTDQ-NGK 62
Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122
IP + P Q I+++ L L+ L + + T+V+ + +
Sbjct: 63 TYQIPIRSAYQTYSPQNQAIYDQTQQTQLGLAKLANDQTGKISGILGTNVDLSAGNVDKY 122
Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182
+ D+ + + +D+ L+ L ++G+
Sbjct: 123 VNDH-------------------------------WQSGFNNQWDRDQASLDQSLADKGI 151
Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238
GS A++ A+ + + + + + + A G Q
Sbjct: 152 SMGSAAYDNAMRDFSTRKQAASDQYLGDMYSNAQNSILTERNQPLNEISALMSGSQ 207
>gi|116253668|ref|YP_769506.1| hypothetical protein RL3928 [Rhizobium leguminosarum bv. viciae
3841]
gi|115258316|emb|CAK09418.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length = 335
Score = 155 bits (390), Expect = 6e-36, Method: Composition-based stats.
Identities = 46/236 (19%), Positives = 85/236 (36%), Gaps = 32/236 (13%)
Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62
+A APDP A+ Q + N+ A ANA + ++ TPDG +YK SG + D G+
Sbjct: 4 TPKAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVSGYQTMKDQ-NGK 62
Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122
+P Y + P Q I+++ L LS L ++ + T+V+ + +
Sbjct: 63 TYQLPTYSAYQTYSPQNQAIYDQTQQTQLGLSKLANEQTGKISGILGTNVDLSAGNVDKY 122
Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182
D+ + + +D+ L+ L ++G+
Sbjct: 123 ANDH-------------------------------WQGGFNNQWDRDQASLDQSLADKGI 151
Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238
GS A+N A+ + + + + + + A G Q
Sbjct: 152 SMGSEAYNNALRDFSTRKQAASDQFLGDMYSNAQNSILTERNQPLNEISALMSGSQ 207
>gi|86356745|ref|YP_468637.1| hypothetical protein RHE_CH01103 [Rhizobium etli CFN 42]
gi|86280847|gb|ABC89910.1| hypothetical conserved protein [Rhizobium etli CFN 42]
Length = 334
Score = 150 bits (378), Expect = 1e-34, Method: Composition-based stats.
Identities = 42/236 (17%), Positives = 83/236 (35%), Gaps = 32/236 (13%)
Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62
+ APDP A+ Q + N+ A ANA + ++ TPDG +YK +G + D G+
Sbjct: 4 TPKPPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTGYQTMTDQ-NGK 62
Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122
+P Y + P Q I+++ L L+ L + + ++V+ + +
Sbjct: 63 TYKLPTYSAYQTYSPENQAIYDQTQQTQLGLARLANDQTAKVSGILGSNVDLSAGNVDKY 122
Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182
+ D+ + + +D+ LE L ++G+
Sbjct: 123 VNDH-------------------------------WQSGFNNQWDRDQASLEQSLADKGI 151
Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238
GS A++ A+ + + + + + A G Q
Sbjct: 152 AIGSAAYDNAMRDFTTRKQAASDQYLGDMHSNAQNSILTERNQPLNEISALMSGSQ 207
>gi|319783503|ref|YP_004142979.1| hypothetical protein Mesci_3812 [Mesorhizobium ciceri biovar
biserrulae WSM1271]
gi|317169391|gb|ADV12929.1| hypothetical protein Mesci_3812 [Mesorhizobium ciceri biovar
biserrulae WSM1271]
Length = 330
Score = 143 bits (359), Expect = 2e-32, Method: Composition-based stats.
Identities = 48/226 (21%), Positives = 85/226 (37%), Gaps = 31/226 (13%)
Query: 13 KAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQS 72
K ++ + N+ A ANA N++++TPDG Y +G K D + G+ IP Y +
Sbjct: 13 KETSAASTATNVGTAIANANLGNVNQVTPDGSLNYSQTGTYKWNDPYTGKSYDIPTYTAT 72
Query: 73 YSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKP 132
+L Q I ++ + L L +L + L V+ + D +L D
Sbjct: 73 QTLSGTGQAIKDQTDQAKLNLGELAAGQSSFLKDWLAKPVDLSNDATEGRLMDLG----- 127
Query: 133 NASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRA 192
+ RLQ +L R+ E L N+G+ GS + +A
Sbjct: 128 --------------------------MKRLQPALDARRQANEADLINRGIRPGSDNYAQA 161
Query: 193 IDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238
+ ++ +D + +L + + Q + A G Q
Sbjct: 162 QNIQDQGENDAYNSLLLSGRGQAVQEALAQNSAPINNLTALLSGSQ 207
>gi|218510551|ref|ZP_03508429.1| hypothetical protein RetlB5_25766 [Rhizobium etli Brasil 5]
Length = 271
Score = 137 bits (343), Expect = 2e-30, Method: Composition-based stats.
Identities = 44/236 (18%), Positives = 87/236 (36%), Gaps = 32/236 (13%)
Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62
+A APDP A+ Q + N+ A ANA + ++ TPDG +YK +G + D G+
Sbjct: 4 TPKAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTGKSTMTDQ-NGK 62
Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122
++P Y +L P Q I+++ L L+ L + Q + T+V+ + +
Sbjct: 63 TYNLPVYSAYQTLSPQNQAIYDQSQQTQLGLAKLANDQTQKVSGILGTNVDLSSGNVDKY 122
Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182
+ D+ + + +++ L+ L ++G+
Sbjct: 123 VNDH-------------------------------WRAGFDNQWDREQASLDQSLADKGI 151
Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238
GS A++ A+ + + + + + A G Q
Sbjct: 152 AMGSAAYDNAMRDFTTRKQAAADQYLGDMYSNAQNSILTERNQPLNEISALMSGSQ 207
>gi|126443127|ref|YP_001063336.1| hypothetical protein BURPS668_A2342 [Burkholderia pseudomallei 668]
gi|126222618|gb|ABN86123.1| conserved hypothetical protein [Burkholderia pseudomallei 668]
Length = 408
Score = 122 bits (305), Expect = 4e-26, Method: Composition-based stats.
Identities = 48/235 (20%), Positives = 75/235 (31%), Gaps = 35/235 (14%)
Query: 6 ASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREIS 65
A APDP A+A+ N A N + P G Q G D G
Sbjct: 36 APAAPDPYAVANATTQTNNQTAQFNKALNLNNYSNPFGSQQSTQIG----TDPATG---- 87
Query: 66 IPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRD 125
I+N + L L+ + + T N R
Sbjct: 88 --------------APIYNTNITASGPLQSLINSTMGSAGNANSTVNNALFGLGGLTARY 133
Query: 126 NDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDR----LQSSLQQDREDLETKLHNQG 181
+ + K A +I N + Q+ + A L Q + LE++L NQG
Sbjct: 134 DALNGKLGAL---AGQIDPNAAQLAGQRGQNAAYAAQTQYLDPRFSQGQTSLESQLANQG 190
Query: 182 LVSGSVAWNRAIDET----NRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLA 232
L GS A++ A+ N+ D ++L ++ +Q + A A
Sbjct: 191 LTPGSQAYDNAMKNFNLSKNQAYSDAANQSILTGQQIGTQM--LQNELAAVGTQA 243
>gi|167907339|ref|ZP_02494544.1| hypothetical protein BpseN_34235 [Burkholderia pseudomallei NCTC
13177]
Length = 399
Score = 119 bits (296), Expect = 5e-25, Method: Composition-based stats.
Identities = 48/235 (20%), Positives = 75/235 (31%), Gaps = 35/235 (14%)
Query: 6 ASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREIS 65
A APDP A+A+ N A N + P G Q G D G
Sbjct: 27 APAAPDPYAVANATTQTNNQTAQFNKALNLNNYSNPFGSQQSTQIG----TDPATG---- 78
Query: 66 IPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRD 125
I+N + L L+ + + T N R
Sbjct: 79 --------------APIYNTNITASGPLQSLINSTMGSAGNANSTVNNALFGLGGLTARY 124
Query: 126 NDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDR----LQSSLQQDREDLETKLHNQG 181
+ + K A +I N + Q+ + A L Q + LE++L NQG
Sbjct: 125 DALNGKLGAL---AGQIDPNAAQLAGQRGQNAAYAAQTQYLDPRFSQGQTSLESQLANQG 181
Query: 182 LVSGSVAWNRAIDET----NRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLA 232
L GS A++ A+ N+ D ++L ++ +Q + A A
Sbjct: 182 LTPGSQAYDNAMKNFNLSKNQAYSDAANQSILTGQQIGTQM--LQNELAAVGTQA 234
>gi|152982946|ref|YP_001353886.1| hypothetical protein mma_2196 [Janthinobacterium sp. Marseille]
gi|151283023|gb|ABR91433.1| Hypothetical protein mma_2196 [Janthinobacterium sp. Marseille]
Length = 305
Score = 101 bits (250), Expect = 9e-20, Method: Composition-based stats.
Identities = 50/226 (22%), Positives = 82/226 (36%), Gaps = 35/226 (15%)
Query: 2 GKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIG 61
G APD A NL A A A ++++TP G Y D
Sbjct: 24 GSPSPPPAPDYAGAAQQTAQGNLEAARAAAEANRVNQVTPYGNLTYSRDPNASTPDG--- 80
Query: 62 REISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQ 121
+ + +L P QQ + ++QN +L L+ L + + + Q
Sbjct: 81 ------GWTATQTLLPAQQALLDQQNKTSLGLAGLADRGLG---------------YVDQ 119
Query: 122 QLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQG 181
L +N A + Q + A + R Q ++Q R+ L+ +L NQG
Sbjct: 120 ALSNNITAADLPADMVNAG-----------QTGQDALMARFQPQMEQSRKALDAQLANQG 168
Query: 182 LVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAY 227
+ GS A+N A+ + +D+R A L + N Q +
Sbjct: 169 ITQGSEAYNNAMRTQQQGENDLRSQAALNGIAVGQNAQNQQLQVKT 214
>gi|15320624|ref|NP_203468.1| hypothetical protein Mx8p54 [Myxococcus phage Mx8]
gi|15281734|gb|AAK94389.1|AF396866_54 p54 [Myxococcus phage Mx8]
Length = 333
Score = 99.0 bits (244), Expect = 5e-19, Method: Composition-based stats.
Identities = 39/232 (16%), Positives = 60/232 (25%), Gaps = 51/232 (21%)
Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60
MGKQ A PD + A Q A+ + + + TP Q+
Sbjct: 1 MGKQ-APAPPDFRGAAEQQSQASQQSINQQTQANRPNINTPWASQQWTQGPNGSW----- 54
Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPP 120
+ T L +
Sbjct: 55 ----------------------------------GMQTSFNGPLGDASNAVQQQLATSLS 80
Query: 121 QQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQ 180
Q L + +P + + I Y RL Q+ + T+L NQ
Sbjct: 81 QPLDFSGLPGVSSGDAARNQAIESAYSQAT---------SRLDPQWQRREDAERTRLLNQ 131
Query: 181 GLVSGSVAWNRAIDETNRKLHDVRLAAMLK--ASDEQERLDNIQEKHAYFHN 230
GL GS A+ A E ++ +D +AM + A N
Sbjct: 132 GLSEGSEAYRNAQSEFGQQRNDAYTSAMASAIGQGTAAGQAVFNQDMAARQN 183
>gi|117924321|ref|YP_864938.1| hypothetical protein Mmc1_1014 [Magnetococcus sp. MC-1]
gi|117608077|gb|ABK43532.1| hypothetical protein Mmc1_1014 [Magnetococcus sp. MC-1]
Length = 381
Score = 97.1 bits (239), Expect = 2e-18, Method: Composition-based stats.
Identities = 29/204 (14%), Positives = 57/204 (27%), Gaps = 37/204 (18%)
Query: 28 SANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHNRQN 87
+ +A + TP G+ + EI P +L Q+ + Q
Sbjct: 30 NESAKVNQFRQETPYGVLDWS-------------GEIGTPDRTMKVTLSEDAQRAYGDQQ 76
Query: 88 INNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILYNYP 147
L+ + R+ + D + L
Sbjct: 77 AIAANLAQIAMGRMGQI--------------------DAGPFSLDGVAQVPNGASLEQAR 116
Query: 148 TMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVR--- 204
Q+Y L ++ L+ +L QG+ S A+ +A+ + + H+
Sbjct: 117 NQAMQEYYAHGSQFLDKRTANEQSKLQDRLIQQGVGLDSRAYRQAMQDFQEQSHEAYAEL 176
Query: 205 -LAAMLKASDEQERLDNIQEKHAY 227
A L S E + + +
Sbjct: 177 ESRARLAGSSEASQQYQLGRQMRN 200
>gi|13470675|ref|NP_102244.1| hypothetical protein mll0449 [Mesorhizobium loti MAFF303099]
gi|14021417|dbj|BAB48030.1| mll0449 [Mesorhizobium loti MAFF303099]
Length = 230
Score = 62.8 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 20/92 (21%), Positives = 35/92 (38%)
Query: 147 PTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLA 206
+ + +RL L Q + L+T+L NQG+ GS A++RA+ + +D
Sbjct: 9 NDATESRLLQLGRERLDPILAQQSDALQTQLSNQGIKLGSAAYDRAMTQQALHANDATDQ 68
Query: 207 AMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238
+L+ + A G Q
Sbjct: 69 LILQGHGQAFAEGQAIRNQPINEITALLSGSQ 100
>gi|312214728|emb|CBX94682.1| predicted protein [Leptosphaeria maculans]
Length = 592
Score = 47.0 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 34/236 (14%), Positives = 60/236 (25%), Gaps = 30/236 (12%)
Query: 3 KQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGR 62
A P P A++ N A I +S D D+ G
Sbjct: 230 TSSAPGVPKPAAVSWQSADWNQPLGQA------ISAFPTFTTQVSSSSNKDIAADTLPGS 283
Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122
+P + P QQ + L H PP
Sbjct: 284 SAIMPSLSNHSANKPTQQAVF-------------------PLAMQWGPHSTGL--PPPDN 322
Query: 123 LRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDR--LQSSLQQDREDLETKLHNQ 180
L P + Y+Y S +++ A + + + N+
Sbjct: 323 LLYTSGPNPAGVYDLPPGVMPYSY-NHSSLKWKDALAAETNMDKLTALKKAAKQASTANK 381
Query: 181 GLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQG 236
V + N +D+ ++ R L + + E +A + +G
Sbjct: 382 SSVKAAEPTNNELDDKQERIKQEREQKRLVSRISSTMSSALAELYARYIKETSERG 437
>gi|313885186|ref|ZP_07818938.1| efflux ABC transporter, permease protein [Eremococcus coleocola
ACS-139-V-Col8]
gi|312619877|gb|EFR31314.1| efflux ABC transporter, permease protein [Eremococcus coleocola
ACS-139-V-Col8]
Length = 1145
Score = 42.8 bits (98), Expect = 0.038, Method: Composition-based stats.
Identities = 19/154 (12%), Positives = 43/154 (27%), Gaps = 9/154 (5%)
Query: 80 QQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEER 139
Q+I + S L + Q + + + + E +
Sbjct: 327 QEIQSASQKLEDGRSQLAASKSQ----LDAAADQINQGYAQLEPEKAKLDEVAAQLAGPQ 382
Query: 140 KEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRK 199
++ + S + Q + L ++L QG+ + +A +
Sbjct: 383 AQLDQAKADLDSSMSQLDQAQAQIDEGQAQLDALASQLQEQGIDPATSPDYQA----GQT 438
Query: 200 LHDVRLAAMLKAS-DEQERLDNIQEKHAYFHNLA 232
D + + + L QE+ A F +
Sbjct: 439 NLDSQKQTLAAGQAQYEAGLAQYQEQKALFGQES 472
>gi|149477002|ref|XP_001516414.1| PREDICTED: similar to catenin (cadherin-associated protein), alpha
1, 102kDa [Ornithorhynchus anatinus]
Length = 732
Score = 42.8 bits (98), Expect = 0.038, Method: Composition-based stats.
Identities = 28/138 (20%), Positives = 44/138 (31%), Gaps = 4/138 (2%)
Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
+QQ I N SD +Q+ + N K L ++ +P+
Sbjct: 246 YKQLQQAITGISNAAQATASDDASQQQGAGGELAYALNNFDKQIIVDPLSFSEERFRPSL 305
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190
I S +R+ + R+ L+ L N G S A N
Sbjct: 306 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 365
Query: 191 RAIDETNRKLHDVRLAAM 208
AID+ +K D+R
Sbjct: 366 SAIDKMTKKTRDLRRQLR 383
>gi|237746507|ref|ZP_04576987.1| predicted protein [Oxalobacter formigenes HOxBLS]
gi|229377858|gb|EEO27949.1| predicted protein [Oxalobacter formigenes HOxBLS]
Length = 552
Score = 41.2 bits (94), Expect = 0.13, Method: Composition-based stats.
Identities = 25/219 (11%), Positives = 54/219 (24%), Gaps = 11/219 (5%)
Query: 7 SLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISI 66
P + S + + N RE + +P + + +
Sbjct: 50 PSPSSPTLASYRAASGSSDTVNQNLSRELTRQASPGLSPVIDNTAFSDKTTPPVSNSATS 109
Query: 67 PHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDN 126
+ + P ++ +N + + P H + D
Sbjct: 110 SAIRGTETFSPQRKGSSFGRNNTAFKPASAGSDTFPTTDPRHTDAIRYGSDTTTGTSSRL 169
Query: 127 DVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSG- 185
++ P Q +Q+ L+ L N+G
Sbjct: 170 RSQGDTPDYAPAAGNEGFHLPAEIHPGLVSPDSPGQQERTRQETARLQHSLGNEGFTLSP 229
Query: 186 -----SVAWNRAIDETNRKLHDVRLAAMLKASDEQERLD 219
+ + A++ T R+ AML + + R
Sbjct: 230 DIPRQAARFRAAMEATGRQA-----GAMLSGQERETRFA 263
>gi|118388201|ref|XP_001027200.1| Adenylate and Guanylate cyclase catalytic domain containing protein
[Tetrahymena thermophila]
gi|89308970|gb|EAS06958.1| Adenylate and Guanylate cyclase catalytic domain containing protein
[Tetrahymena thermophila SB210]
Length = 3203
Score = 40.4 bits (92), Expect = 0.20, Method: Composition-based stats.
Identities = 21/190 (11%), Positives = 54/190 (28%), Gaps = 17/190 (8%)
Query: 45 WQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDL 104
+ +II R + + + Q ++ N L
Sbjct: 413 LKQSQYNNSQIIKPHKLRIFNENGFNLGSDVSIPQSEVKNDTEQFKSQSEQQSKDPSPPL 472
Query: 105 LPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQS 164
++ N + + Q +D++ A ++ +I Y + + D +
Sbjct: 473 KQKNNQKYNNSFNNSLQSQQDSN----TKADKDQTDQIGYEHQETNRELVLHH--DFISP 526
Query: 165 SLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEK 224
+ + K +G G+ D + L + ++ + N+Q
Sbjct: 527 QITSRENQILQKSSKEGGSLGTEG-----------NSDTESQSPLDSPQKRRQRQNMQSH 575
Query: 225 HAYFHNLAQA 234
+ ++ Q
Sbjct: 576 QDEYDDIPQE 585
>gi|193204962|ref|NP_494177.3| Prion-like-(Q/N-rich)-domain-bearing protein family member (pqn-66)
[Caenorhabditis elegans]
gi|163644489|gb|AAB37876.4| Prion-like-(q/n-rich)-domain-bearing protein protein 66
[Caenorhabditis elegans]
Length = 898
Score = 40.1 bits (91), Expect = 0.29, Method: Composition-based stats.
Identities = 27/162 (16%), Positives = 46/162 (28%), Gaps = 8/162 (4%)
Query: 4 QRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGRE 63
A A + K A+ Q + N NA +A +N+ D Q + +
Sbjct: 690 PNAPNAQNSKDDANAQNAQNDQNAPNDANGQNVQIDRNDSNAQNGQNAPNDQNAQNDPNA 749
Query: 64 ISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQL 123
+ P+ S + Q N QN N + + + P
Sbjct: 750 QNAPNVQNSQN-TRNAQNSQNAQNARNAPNAQIAQ-------NDPNAPNAQIAQNAPNAQ 801
Query: 124 RDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSS 165
D + P NA + +++ EK L
Sbjct: 802 NDINAPNVQNAQKAPNAQNAQEQQEAQAKELEKEIGQFLCKR 843
>gi|145485313|ref|XP_001428665.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395752|emb|CAK61267.1| unnamed protein product [Paramecium tetraurelia]
Length = 2080
Score = 39.3 bits (89), Expect = 0.40, Method: Composition-based stats.
Identities = 31/218 (14%), Positives = 52/218 (23%), Gaps = 18/218 (8%)
Query: 28 SANAYRENIDRMTPDGI----------WQYKTSGVDKIIDSFIGREISIPHYLQSYSLHP 77
N EN D P Q + D I S H
Sbjct: 1543 VQNNQFENPDDEPPYASPGSENFSSVKSQSSQHCQNSFNDQSQSPLKDISQIQDSEEPHE 1602
Query: 78 IQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLE 137
Q + N T ++Q + + H N F + +ND + +
Sbjct: 1603 NSQLSIFDEEKNKSPSKQQKTLQLQKIQDYPPDHYNIVPTFENEYDNENDPQQINQQVEK 1662
Query: 138 ERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETN 197
N + + Q + +SS + + N+G NRA+
Sbjct: 1663 ADSFCKKNQSQLSNNQGDNNLPSNRKSSQSRRELAKSAQFANEG--------NRALQSHQ 1714
Query: 198 RKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQ 235
+ A Q + + Q
Sbjct: 1715 SNSRESLSVQTNLAQQGQYTQQVSNQDKPLTQSFVYQQ 1752
>gi|307197463|gb|EFN78697.1| Probable exonuclease mut-7-like protein [Harpegnathos saltator]
Length = 1058
Score = 39.3 bits (89), Expect = 0.42, Method: Composition-based stats.
Identities = 29/217 (13%), Positives = 60/217 (27%), Gaps = 21/217 (9%)
Query: 17 SLQLSANLANASANAYRENIDRMTPDGIWQYKT-SGVDKIIDSFIGR-EISIPHYLQSYS 74
S + + N + A+ G Y +G D I + Q+
Sbjct: 579 SQKTNTNKSTYKKPAHLNLATDNR--GNENYPMNTGAVPKYDGMTNHGSIPKHGFSQNNQ 636
Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
++ +++Q N H + + P +N+ + N
Sbjct: 637 HRHDNRKKYDKQKKYNYN-------------KHDSYNKYDNYNKPDSYNGNNNHSKYENH 683
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAID 194
+ N +Q + +R + + D + + NQ ++
Sbjct: 684 NRYNNYNKNDNCNKRENQNKQTHSQNRYDNQSRYDDQS---RYDNQNKRDNHNRYDNQDR 740
Query: 195 ETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNL 231
+ D R L++ +IQ K F N
Sbjct: 741 RDIQNRQDTRNRQDLQSKKNTRSRQDIQGKQ-DFQNK 776
>gi|50754810|ref|XP_414513.1| PREDICTED: similar to alpha-catenin [Gallus gallus]
Length = 905
Score = 39.3 bits (89), Expect = 0.43, Method: Composition-based stats.
Identities = 26/138 (18%), Positives = 42/138 (30%), Gaps = 4/138 (2%)
Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
+QQ + N SD Q+ + N K ++ +P+
Sbjct: 245 YKQLQQAVTGISNAAQATASDDAAQQQGGGGELAYALNNFDKQIIVDPSTFSEERFRPSL 304
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190
I S +R+ + R+ L+ L N G S A N
Sbjct: 305 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 364
Query: 191 RAIDETNRKLHDVRLAAM 208
AID+ +K D+R
Sbjct: 365 SAIDKMTKKTRDLRRQLR 382
>gi|224068486|ref|XP_002187404.1| PREDICTED: catenin (cadherin-associated protein), alpha 1, 102kDa
[Taeniopygia guttata]
Length = 905
Score = 39.3 bits (89), Expect = 0.44, Method: Composition-based stats.
Identities = 26/138 (18%), Positives = 42/138 (30%), Gaps = 4/138 (2%)
Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
+QQ + N SD Q+ + N K ++ +P+
Sbjct: 245 YKQLQQAVSGISNAAQATASDDAAQQQGGGGELAYALNNFDKQIIVDPSTFSEERFRPSL 304
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190
I S +R+ + R+ L+ L N G S A N
Sbjct: 305 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 364
Query: 191 RAIDETNRKLHDVRLAAM 208
AID+ +K D+R
Sbjct: 365 SAIDKMTKKTRDLRRQLR 382
>gi|326928261|ref|XP_003210299.1| PREDICTED: catenin alpha-1-like isoform 1 [Meleagris gallopavo]
Length = 905
Score = 39.3 bits (89), Expect = 0.44, Method: Composition-based stats.
Identities = 26/138 (18%), Positives = 42/138 (30%), Gaps = 4/138 (2%)
Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
+QQ + N SD Q+ + N K ++ +P+
Sbjct: 245 YKQLQQAVTGISNAAQATASDDAAQQQGGGGELAYALNNFDKQIIVDPSTFSEERFRPSL 304
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190
I S +R+ + R+ L+ L N G S A N
Sbjct: 305 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 364
Query: 191 RAIDETNRKLHDVRLAAM 208
AID+ +K D+R
Sbjct: 365 SAIDKMTKKTRDLRRQLR 382
>gi|26249230|ref|NP_755270.1| hypothetical protein c3395 [Escherichia coli CFT073]
gi|227888365|ref|ZP_04006170.1| conserved hypothetical protein [Escherichia coli 83972]
gi|26109637|gb|AAN81840.1|AE016765_242 Hypothetical protein c3395 [Escherichia coli CFT073]
gi|222034514|emb|CAP77256.1| hypothetical protein LF82_435 [Escherichia coli LF82]
gi|227834634|gb|EEJ45100.1| conserved hypothetical protein [Escherichia coli 83972]
gi|307554795|gb|ADN47570.1| hypothetical protein ECABU_c30980 [Escherichia coli ABU 83972]
gi|312947351|gb|ADR28178.1| hypothetical protein NRG857_13830 [Escherichia coli O83:H1 str. NRG
857C]
Length = 658
Score = 38.9 bits (88), Expect = 0.52, Method: Composition-based stats.
Identities = 16/138 (11%), Positives = 35/138 (25%), Gaps = 2/138 (1%)
Query: 25 ANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHN 84
+A + N+ P D G+ ++I L + + +
Sbjct: 425 GTPTAQTHFSNLGDGKPFWDSTTTLLQRATWPDPDSGQTLTINAPQVPEPLTAEELKNFD 484
Query: 85 RQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILY 144
+ + S + P V+T + D P +
Sbjct: 485 QDYARDEKQSGGAGYAYGQINPETKKPVDTDYRYYISLYGYFDRKMVPKKDSGYYQSGPG 544
Query: 145 NYPTMGSQQYEKAFLDRL 162
+ +YEK + +
Sbjct: 545 S--KEDRVKYEKQSQEEM 560
>gi|149726843|ref|XP_001504306.1| PREDICTED: similar to Catenin alpha-1 (Cadherin-associated protein)
(Alpha E-catenin) (NY-REN-13 antigen) [Equus caballus]
Length = 905
Score = 38.9 bits (88), Expect = 0.57, Method: Composition-based stats.
Identities = 27/138 (19%), Positives = 43/138 (31%), Gaps = 4/138 (2%)
Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
+QQ + N SD +Q + N K L ++ +P+
Sbjct: 245 YKQLQQAVTGISNAAQATASDDASQHQGGGGELAYALNNFDKQIIVDPLSFSEERFRPSL 304
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190
I S +R+ + R+ L+ L N G S A N
Sbjct: 305 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 364
Query: 191 RAIDETNRKLHDVRLAAM 208
AID+ +K D+R
Sbjct: 365 SAIDKMTKKTRDLRRQLR 382
>gi|297676097|ref|XP_002815982.1| PREDICTED: catenin alpha-1-like isoform 1 [Pongo abelii]
gi|297676099|ref|XP_002815983.1| PREDICTED: catenin alpha-1-like isoform 2 [Pongo abelii]
Length = 905
Score = 38.9 bits (88), Expect = 0.59, Method: Composition-based stats.
Identities = 27/138 (19%), Positives = 43/138 (31%), Gaps = 4/138 (2%)
Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
+QQ + N SD +Q + N K L ++ +P+
Sbjct: 245 YKQLQQAVTGISNAAQATASDDASQHQGGGGELAYALNNFDKQIIVDPLSFSEERFRPSL 304
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190
I S +R+ + R+ L+ L N G S A N
Sbjct: 305 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 364
Query: 191 RAIDETNRKLHDVRLAAM 208
AID+ +K D+R
Sbjct: 365 SAIDKMTKKTRDLRRQLR 382
>gi|118389547|ref|XP_001027857.1| hypothetical protein TTHERM_00919640 [Tetrahymena thermophila]
gi|89309627|gb|EAS07615.1| hypothetical protein TTHERM_00919640 [Tetrahymena thermophila SB210]
Length = 3637
Score = 38.9 bits (88), Expect = 0.59, Method: Composition-based stats.
Identities = 36/177 (20%), Positives = 59/177 (33%), Gaps = 9/177 (5%)
Query: 59 FIGREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDF 118
G + + P L QQ + N N N + QR T+ +T
Sbjct: 2945 ANGSQATSPRIQDLSQLTSDQQSLLNNLNFQN----KIQLQRNSFSQDLLKTNNDTHF-- 2998
Query: 119 PPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH 178
Q++R K S + + +Q L Q D L KL
Sbjct: 2999 -EQRIRPFSGVSKIEDSQIRKTSLQLKQQNYAKKQNLSLNQYDLDKIQQNDNHQLIQKLG 3057
Query: 179 NQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQ 235
N+ + +V N+ + + + + + L + Q R+ Q K A F+NL Q
Sbjct: 3058 NKNYL--NVNLNQIQNASPSQNNFSKSNTKLDSQRRQTRMTQSQSKIASFNNLNHQQ 3112
>gi|326928263|ref|XP_003210300.1| PREDICTED: catenin alpha-1-like isoform 2 [Meleagris gallopavo]
Length = 860
Score = 38.9 bits (88), Expect = 0.60, Method: Composition-based stats.
Identities = 26/138 (18%), Positives = 42/138 (30%), Gaps = 4/138 (2%)
Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
+QQ + N SD Q+ + N K ++ +P+
Sbjct: 245 YKQLQQAVTGISNAAQATASDDAAQQQGGGGELAYALNNFDKQIIVDPSTFSEERFRPSL 304
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190
I S +R+ + R+ L+ L N G S A N
Sbjct: 305 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 364
Query: 191 RAIDETNRKLHDVRLAAM 208
AID+ +K D+R
Sbjct: 365 SAIDKMTKKTRDLRRQLR 382
>gi|297676101|ref|XP_002815984.1| PREDICTED: catenin alpha-1-like isoform 3 [Pongo abelii]
Length = 890
Score = 38.9 bits (88), Expect = 0.66, Method: Composition-based stats.
Identities = 27/138 (19%), Positives = 43/138 (31%), Gaps = 4/138 (2%)
Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
+QQ + N SD +Q + N K L ++ +P+
Sbjct: 230 YKQLQQAVTGISNAAQATASDDASQHQGGGGELAYALNNFDKQIIVDPLSFSEERFRPSL 289
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190
I S +R+ + R+ L+ L N G S A N
Sbjct: 290 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 349
Query: 191 RAIDETNRKLHDVRLAAM 208
AID+ +K D+R
Sbjct: 350 SAIDKMTKKTRDLRRQLR 367
>gi|297676105|ref|XP_002815986.1| PREDICTED: catenin alpha-1-like isoform 5 [Pongo abelii]
Length = 782
Score = 38.5 bits (87), Expect = 0.69, Method: Composition-based stats.
Identities = 27/138 (19%), Positives = 43/138 (31%), Gaps = 4/138 (2%)
Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
+QQ + N SD +Q + N K L ++ +P+
Sbjct: 122 YKQLQQAVTGISNAAQATASDDASQHQGGGGELAYALNNFDKQIIVDPLSFSEERFRPSL 181
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190
I S +R+ + R+ L+ L N G S A N
Sbjct: 182 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 241
Query: 191 RAIDETNRKLHDVRLAAM 208
AID+ +K D+R
Sbjct: 242 SAIDKMTKKTRDLRRQLR 259
>gi|297676103|ref|XP_002815985.1| PREDICTED: catenin alpha-1-like isoform 4 [Pongo abelii]
Length = 802
Score = 38.5 bits (87), Expect = 0.71, Method: Composition-based stats.
Identities = 27/138 (19%), Positives = 43/138 (31%), Gaps = 4/138 (2%)
Query: 75 LHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
+QQ + N SD +Q + N K L ++ +P+
Sbjct: 142 YKQLQQAVTGISNAAQATASDDASQHQGGGGELAYALNNFDKQIIVDPLSFSEERFRPSL 201
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH----NQGLVSGSVAWN 190
I S +R+ + R+ L+ L N G S A N
Sbjct: 202 EERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALN 261
Query: 191 RAIDETNRKLHDVRLAAM 208
AID+ +K D+R
Sbjct: 262 SAIDKMTKKTRDLRRQLR 279
>gi|156395696|ref|XP_001637246.1| predicted protein [Nematostella vectensis]
gi|156224357|gb|EDO45183.1| predicted protein [Nematostella vectensis]
Length = 1945
Score = 38.5 bits (87), Expect = 0.85, Method: Composition-based stats.
Identities = 26/209 (12%), Positives = 47/209 (22%), Gaps = 42/209 (20%)
Query: 68 HYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDND 127
Q+ +L P Q ++ L + T T P +
Sbjct: 1532 QPGQAVTLRPEQPYGFSQAQRP-TNLQIPARPQ-GPAR--ASTPNTPTSMGLPTSAGSMN 1587
Query: 128 VPEK--PNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH------- 178
P A + + + QQ+ +Q Q L+ L
Sbjct: 1588 PPSVYGTQAYQGGNQGLTHPMQQQQQQQFTLQQQRPMQPQAPQGTAILQHPLQAGTQQQQ 1647
Query: 179 ----------------------NQGLVSGSVAWNRAIDETNRKLHDVRLAA-----MLKA 211
NQG ++ + A+ + N+ M +
Sbjct: 1648 GGQMNQGIPMSQMSQGMQLPVMNQGGQISQMSQSGAMTQINQGQISQMSQGGQLNQMNQG 1707
Query: 212 SDEQERLDNIQEKHAYFHNLAQA--QGLQ 238
+ +Q QG Q
Sbjct: 1708 GQMSQMNQGMQMPQMSQGGQMPQMNQGGQ 1736
>gi|300980554|ref|ZP_07175080.1| conserved hypothetical protein [Escherichia coli MS 45-1]
gi|300409254|gb|EFJ92792.1| conserved hypothetical protein [Escherichia coli MS 45-1]
Length = 439
Score = 38.1 bits (86), Expect = 0.91, Method: Composition-based stats.
Identities = 13/131 (9%), Positives = 31/131 (23%)
Query: 25 ANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHN 84
+A + N+ P D G+ ++I L + + +
Sbjct: 206 GTPTAQTHFSNLGDGKPFWDSTTTLLQRATWPDPDSGQTLTINAPQVPEPLTAEELKNFD 265
Query: 85 RQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILY 144
+ + S + P V+T + D P +
Sbjct: 266 QDYARDEKQSGGAGYAYGQINPETKKPVDTDYRYYISLYGYFDRKMVPKKDSGYYQSGPG 325
Query: 145 NYPTMGSQQYE 155
+ + +
Sbjct: 326 SKEDRVKYEKQ 336
>gi|301049406|ref|ZP_07196370.1| conserved hypothetical protein [Escherichia coli MS 185-1]
gi|300298848|gb|EFJ55233.1| conserved hypothetical protein [Escherichia coli MS 185-1]
Length = 440
Score = 38.1 bits (86), Expect = 0.91, Method: Composition-based stats.
Identities = 13/131 (9%), Positives = 31/131 (23%)
Query: 25 ANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHN 84
+A + N+ P D G+ ++I L + + +
Sbjct: 207 GTPTAQTHFSNLGDGKPFWDSTTTLLQRATWPDPDSGQTLTINAPQVPEPLTAEELKNFD 266
Query: 85 RQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILY 144
+ + S + P V+T + D P +
Sbjct: 267 QDYARDEKQSGGAGYAYGQINPETKKPVDTDYRYYISLYGYFDRKMVPKKDSGYYQSGPG 326
Query: 145 NYPTMGSQQYE 155
+ + +
Sbjct: 327 SKEDRVKYEKQ 337
>gi|315293827|gb|EFU53179.1| conserved hypothetical protein [Escherichia coli MS 153-1]
Length = 441
Score = 38.1 bits (86), Expect = 0.93, Method: Composition-based stats.
Identities = 13/131 (9%), Positives = 31/131 (23%)
Query: 25 ANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHN 84
+A + N+ P D G+ ++I L + + +
Sbjct: 208 GTPTAQTHFSNLGDGKPFWDSTTTLLQRATWPDPDSGQTLTINAPQVPEPLTAEELKNFD 267
Query: 85 RQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILY 144
+ + S + P V+T + D P +
Sbjct: 268 QDYARDEKQSGGAGYAYGQINPETKKPVDTDYRYYISLYGYFDRKMVPKKDSGYYQSGPG 327
Query: 145 NYPTMGSQQYE 155
+ + +
Sbjct: 328 SKEDRVKYEKQ 338
>gi|297621819|ref|YP_003709956.1| hypothetical protein wcw_1605 [Waddlia chondrophila WSU 86-1044]
gi|297377120|gb|ADI38950.1| putative membrane protein [Waddlia chondrophila WSU 86-1044]
Length = 1019
Score = 38.1 bits (86), Expect = 1.1, Method: Composition-based stats.
Identities = 24/179 (13%), Positives = 53/179 (29%), Gaps = 21/179 (11%)
Query: 18 LQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYS--- 74
++ N+ + + + I++M +G + G ++ +
Sbjct: 339 AEIPENIKSMTQTVEQNAINQMNAEG-----WNIPQSYTPPSNGLSYNMRMQNSADEMFE 393
Query: 75 ---------LHPIQQQ----IHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQ 121
L P QQ+ ++ L+ +L Q + FP
Sbjct: 394 GMLQNWDPPLTPDQQKALRNMYYGVEKPAGDLAAVLQQIESGVAAELAAAFGLPDGFPVP 453
Query: 122 QLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQ 180
+ + + E+L P +A D + ++ + + L KL NQ
Sbjct: 454 KGSFSHQGNINGQFQMKFLELLNALPADQKAAVLQAINDPMNPAISAETKALLNKLFNQ 512
>gi|294636984|ref|ZP_06715306.1| outer membrane usher protein [Edwardsiella tarda ATCC 23685]
gi|291089812|gb|EFE22373.1| outer membrane usher protein [Edwardsiella tarda ATCC 23685]
Length = 817
Score = 37.4 bits (84), Expect = 1.6, Method: Composition-based stats.
Identities = 31/223 (13%), Positives = 64/223 (28%), Gaps = 17/223 (7%)
Query: 27 ASANAYRENIDRMTP--DGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSL--HPIQQQI 82
N Y N+ + P G + + D D ++ + + L Q
Sbjct: 477 GRKNNYAINLSQTLPPGWGSVFFSGTWRDYWGDGTRRQDYQVSYSNSWQQLNYTLAASQT 536
Query: 83 HNRQNIN--------NLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
+++ + L LS +R L + + ++ N
Sbjct: 537 YDQGLNSDRRVYLYFTLPLSFGEPRRSLYLSNATTVDRDGYQSNNASLSGYAGEWQQFNY 596
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAID 194
S+ + +GS +A L +S Q ++ +T + G+ G VA+ +
Sbjct: 597 SVSLNNQRQDRLTALGSNLSYRARAVTLNASYSQSQDYRQTSV---GISGGVVAYRGGVL 653
Query: 195 ETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGL 237
+ L D + ++ A L
Sbjct: 654 -FSNALTDTMAIVDAPGLRDAS-VNGYGYHATNGAGQALYAAL 694
>gi|259502965|ref|ZP_05745867.1| hypothetical protein HMPREF0494_1261 [Lactobacillus antri DSM
16041]
gi|259169090|gb|EEW53585.1| hypothetical protein HMPREF0494_1261 [Lactobacillus antri DSM
16041]
Length = 617
Score = 37.4 bits (84), Expect = 1.8, Method: Composition-based stats.
Identities = 27/151 (17%), Positives = 49/151 (32%), Gaps = 12/151 (7%)
Query: 81 QIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERK 140
I+ ++ N LLT ++ L + K D +K SLE+
Sbjct: 422 AIYRQELQQN-----LLTDQLG-LPFYLPNKDQLLKYRLSGYQEDVLAVQKYQQSLEQNA 475
Query: 141 EILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLV----SGSVAWNRAIDET 196
+ + + + F + S + LE +L QG + W A+ E
Sbjct: 476 HVPRADALQWTSRVRRLFNHQFIQSFDDSQAALERELTAQGYTWTNPADREQWRAALREL 535
Query: 197 --NRKLHDVRLAAMLKASDEQERLDNIQEKH 225
+L R M + + +D +Q
Sbjct: 536 VPGLRLFVRRGLTMAERNQRASVIDEVQRHQ 566
>gi|269139573|ref|YP_003296274.1| putative outer membrane protein [Edwardsiella tarda EIB202]
gi|267985234|gb|ACY85063.1| putative outer membrane protein [Edwardsiella tarda EIB202]
gi|304559461|gb|ADM42125.1| Fimbriae usher protein StcC [Edwardsiella tarda FL6-60]
Length = 817
Score = 37.0 bits (83), Expect = 2.0, Method: Composition-based stats.
Identities = 29/223 (13%), Positives = 63/223 (28%), Gaps = 17/223 (7%)
Query: 27 ASANAYRENIDRMTP--DGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHP--IQQQI 82
N Y N+ + P G + + D D ++ + + L Q
Sbjct: 477 GRKNNYAINLSQTLPQGWGSVFFSGTWRDYWGDGARRQDYQVSYSNSWQQLSYTLAASQT 536
Query: 83 HNRQNIN--------NLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA 134
+++ + L LS +R L + + ++ N
Sbjct: 537 YDQGLNSDRRFYLYFTLPLSVGEPRRTLYLSNATTFDRDGYQSNNASLSGYAGEWQQFNY 596
Query: 135 SLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAID 194
S+ + +G+ +A L +S Q ++ +T G+ G +A+ +
Sbjct: 597 SVSLNNQRQDRLTALGTNLSYRARSATLSASYSQSQDYRQTS---AGISGGVLAYRGGVL 653
Query: 195 ETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGL 237
+ L D + ++ A L
Sbjct: 654 -FSNALTDTMAIVDAPGLRDAS-VNGYGYHATNGAGQALYAAL 694
>gi|238881789|gb|EEQ45427.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 985
Score = 37.0 bits (83), Expect = 2.2, Method: Composition-based stats.
Identities = 20/184 (10%), Positives = 55/184 (29%), Gaps = 6/184 (3%)
Query: 55 IIDSFIGREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNT 114
DS + + QS Q +++N L + H
Sbjct: 10 FADSNSNDDFLNSIFDQSQGEQQAPQVAQVSTSMSNPPLQSQSASSTSRISQAHTPMYQQ 69
Query: 115 TKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLE 174
+ + + +P+ S+ + Q+ + ++ QQ ++ +
Sbjct: 70 S------PVTAHTIPQNSPQSMPNQVAQPQQQIPPPPSQHLQQTTAQMLPQQQQQQQQQQ 123
Query: 175 TKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQA 234
+ Q + + + + ++ + + M A ++ I + N ++
Sbjct: 124 QQKQEQLYRMKQQIYQQQMLKKQQENMSRQPSPMNSAGHNTQQNTPITQNAKTPQNNSKL 183
Query: 235 QGLQ 238
Q +Q
Sbjct: 184 QSMQ 187
>gi|225159124|ref|ZP_03725430.1| OmpA/MotB domain protein [Opitutaceae bacterium TAV2]
gi|224802279|gb|EEG20545.1| OmpA/MotB domain protein [Opitutaceae bacterium TAV2]
Length = 199
Score = 37.0 bits (83), Expect = 2.5, Method: Composition-based stats.
Identities = 11/103 (10%), Positives = 31/103 (30%), Gaps = 9/103 (8%)
Query: 2 GKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIG 61
G ++ + PDP++ Q ++ TP G + D+ G
Sbjct: 22 GCKKKPVRPDPQSTLIGQTPGGNDT--HSSGLN----TTPFGDLTPSPLPAGLVSDTGSG 75
Query: 62 REISIPHYLQSYSLHPIQQQIH---NRQNINNLLLSDLLTQRI 101
++ + Q ++ ++ + + + +
Sbjct: 76 LQLGTTDASHGNQIRDAVQSVYFAFDQSAVRQEERAKIQDAQN 118
>gi|229593727|ref|XP_001026894.2| CAF1 family ribonuclease containing protein [Tetrahymena
thermophila]
gi|225567345|gb|EAS06649.2| CAF1 family ribonuclease containing protein [Tetrahymena
thermophila SB210]
Length = 1272
Score = 36.6 bits (82), Expect = 2.8, Method: Composition-based stats.
Identities = 31/223 (13%), Positives = 68/223 (30%), Gaps = 23/223 (10%)
Query: 18 LQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISI---PHYLQSYS 74
Q L S + + TP S + + + I + P+ + +
Sbjct: 705 QQTQPQLVTYSYQPAMSYVSQTTPTNTIPIVQSYIQPVPIQVPNQNIVVQNPPNITYTTT 764
Query: 75 LHPIQQQIHNRQNINNLLLSD---LLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEK 131
P Q+H + L+ L T +I+ L P V T + + L N P
Sbjct: 765 SVPNTTQVHLVPQKTSYLIESKPILQTSQIRILSPISSNRVQTNDEDFTKPLFTNKSPYS 824
Query: 132 PNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNR 191
+ + + L++ + + Q ++++ +T+ N A+ R
Sbjct: 825 KKYDEQRSQRWQEFSKD-------DSRLNQFEYNRQYNQQNEQTRQFN--------AYQR 869
Query: 192 AIDETNRKLHDVRLAAMLK--ASDEQERLDNIQEKHAYFHNLA 232
++ N + R + + + + A
Sbjct: 870 SVTNENNQRSTYRFEERNQFEGQQNYKNQQLYSQNVSQVAPPA 912
>gi|312219889|emb|CBX99831.1| similar to nuclear pore protein (Nic96) [Leptosphaeria maculans]
Length = 1018
Score = 36.6 bits (82), Expect = 3.3, Method: Composition-based stats.
Identities = 19/153 (12%), Positives = 48/153 (31%), Gaps = 3/153 (1%)
Query: 83 HNRQNINNLLLSDLL--TQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERK 140
+ L L D+ + + P + +D L K ++L +
Sbjct: 73 FDELPSLQLGLGDIARKVRNLGSGGPSADQVQDRAQDRAAHYLLSASGV-KMGSTLRDLN 131
Query: 141 EILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKL 200
+ + Q + F D + + L +GL ++ +++ +
Sbjct: 132 QFSTQAGIPTNGQAQNLFDDDVDGYISNLHSQSTLALIQEGLEQSKRDFDTFLEDNVQIE 191
Query: 201 HDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQ 233
D + + + + +++ A F N A+
Sbjct: 192 WDKQRQRIYEHFGLGRQSEDMAASQATFGNTAR 224
>gi|91793868|ref|YP_563519.1| chromosome segregation protein SMC [Shewanella denitrificans OS217]
gi|91715870|gb|ABE55796.1| Chromosome segregation protein SMC [Shewanella denitrificans OS217]
Length = 1138
Score = 36.2 bits (81), Expect = 3.6, Method: Composition-based stats.
Identities = 13/109 (11%), Positives = 39/109 (35%)
Query: 72 SYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEK 131
+L ++ ++ + L+ + + + Q + V +D+++
Sbjct: 675 KQALSSEMAKLLHQDDAKETNLAKIASSQAQLEQQREDSQVQLLALMTLLDSQDDELQGL 734
Query: 132 PNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQ 180
E +E L + + ++ D ++ + R+ L T++ Q
Sbjct: 735 SKKQQELNQEWLSVSAQLRQAKAQRIEQDNIKRQHEHARQTLSTQVALQ 783
>gi|14133650|gb|AAK54090.1|AF362371_1 histidine kinase DhkI [Dictyostelium discoideum]
Length = 1736
Score = 36.2 bits (81), Expect = 4.0, Method: Composition-based stats.
Identities = 14/82 (17%), Positives = 26/82 (31%), Gaps = 9/82 (10%)
Query: 7 SLAPDPKAIASLQLSANLANASANAYRENIDRMTPDG---------IWQYKTSGVDKIID 57
+ + + Q S+N+ N + N + +TP+G Q +S
Sbjct: 1280 PNSSNSTSTNVTQSSSNIINNGNSITIINNNPVTPNGKKIVIVPLLSLQSASSPKQSQRG 1339
Query: 58 SFIGREISIPHYLQSYSLHPIQ 79
++ S Y P Q
Sbjct: 1340 YSPKQQYSPKQYSPKQQYSPKQ 1361
>gi|66822471|ref|XP_644590.1| histidine kinase [Dictyostelium discoideum AX4]
gi|74860532|sp|Q86AT9|DHKI_DICDI RecName: Full=Hybrid signal transduction histidine kinase I
gi|60472742|gb|EAL70692.1| histidine kinase [Dictyostelium discoideum AX4]
Length = 1736
Score = 36.2 bits (81), Expect = 4.0, Method: Composition-based stats.
Identities = 14/82 (17%), Positives = 26/82 (31%), Gaps = 9/82 (10%)
Query: 7 SLAPDPKAIASLQLSANLANASANAYRENIDRMTPDG---------IWQYKTSGVDKIID 57
+ + + Q S+N+ N + N + +TP+G Q +S
Sbjct: 1280 PNSSNSTSTNVTQSSSNIINNGNSITIINNNPVTPNGKKIVIVPLLSLQSASSPKQSQRG 1339
Query: 58 SFIGREISIPHYLQSYSLHPIQ 79
++ S Y P Q
Sbjct: 1340 YSPKQQYSPKQYSPKQQYSPKQ 1361
>gi|17313245|ref|NP_490625.1| hypothetical protein phiCTXp28 [Pseudomonas phage phiCTX]
gi|4063799|dbj|BAA36253.1| unnamed protein product [Pseudomonas phage phiCTX]
Length = 904
Score = 36.2 bits (81), Expect = 4.2, Method: Composition-based stats.
Identities = 23/158 (14%), Positives = 47/158 (29%), Gaps = 19/158 (12%)
Query: 88 INNLLLSDLLTQRIQ--DLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILYN 145
L L Q+ + QQ R + ++ A+ + + +
Sbjct: 37 ATRERLKQLNAQQSDVRAFRTQRGALEQVSTALAAQQARVKALAQQMAAAGNPTRALTRD 96
Query: 146 YPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRL 205
Y E FL + + L T+L N G+ + + D+R
Sbjct: 97 YNRAIR---EAGFLKQQHLQHSHALQQLRTRLSNAGIST---------RNLGQHERDLRA 144
Query: 206 AAMLKAS---DEQERLDNIQEKHA--YFHNLAQAQGLQ 238
+ +RL N+ ++ ++G+Q
Sbjct: 145 QIQAANGAINSQAQRLRNLSQQQERLTQARNTYSRGIQ 182
>gi|268638179|ref|XP_002649186.1| histidine kinase [Dictyostelium discoideum AX4]
gi|256013041|gb|EEU04134.1| histidine kinase [Dictyostelium discoideum AX4]
Length = 1732
Score = 36.2 bits (81), Expect = 4.3, Method: Composition-based stats.
Identities = 14/82 (17%), Positives = 26/82 (31%), Gaps = 9/82 (10%)
Query: 7 SLAPDPKAIASLQLSANLANASANAYRENIDRMTPDG---------IWQYKTSGVDKIID 57
+ + + Q S+N+ N + N + +TP+G Q +S
Sbjct: 1280 PNSSNSTSTNVTQSSSNIINNGNSITIINNNPVTPNGKKIVIVPLLSLQSASSPKQSQRG 1339
Query: 58 SFIGREISIPHYLQSYSLHPIQ 79
++ S Y P Q
Sbjct: 1340 YSPKQQYSPKQYSPKQQYSPKQ 1361
>gi|42519054|ref|NP_964984.1| hypothetical protein LJ1128 [Lactobacillus johnsonii NCC 533]
gi|41583341|gb|AAS08950.1| hypothetical protein LJ_1128 [Lactobacillus johnsonii NCC 533]
Length = 4734
Score = 35.4 bits (79), Expect = 6.2, Method: Composition-based stats.
Identities = 24/223 (10%), Positives = 50/223 (22%), Gaps = 4/223 (1%)
Query: 16 ASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSL 75
A+ + NA A T Q + +S
Sbjct: 1030 ATATQITDALNAINTAKGNLKGEATDKAALQTAVDNSATVKESNNYTNADQTQKTAYDKA 1089
Query: 76 HPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNAS 135
Q + ++ N ++ L + + + D P N
Sbjct: 1090 VTAAQTVLDKTNATQAEVNQALQDLETANRNLNGDAKTEAANKAALEAAVKDAPNVRNTP 1149
Query: 136 LEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQ----GLVSGSVAWNR 191
+ + L+ S + + L+ + G + + A
Sbjct: 1150 AYYNGSEETQTAYNNAITAGQTVLNEANPSASEVKNALDAINAAKDNLKGKATNTEALET 1209
Query: 192 AIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQA 234
A+ N +A+ E I + + A
Sbjct: 1210 ALTNANNAKETGNYTNADQANQEALNNAIIAGQEILKNTSATQ 1252
>gi|144898504|emb|CAM75368.1| RTX toxins and related Ca2+-binding proteins [Magnetospirillum
gryphiswaldense MSR-1]
Length = 897
Score = 35.0 bits (78), Expect = 7.7, Method: Composition-based stats.
Identities = 21/145 (14%), Positives = 42/145 (28%), Gaps = 1/145 (0%)
Query: 22 ANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQ 81
A+L + A N + ++PDG WQY + G +D + S L + +
Sbjct: 202 ADLGTLAGVAVIGN-NTVSPDGAWQYSSDGGTTWVDVGGVNDNSSALALSASTKLRFNAA 260
Query: 82 IHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKE 141
+L + L + V T P + + + +
Sbjct: 261 PDFHGTAPSLYVRGLDNSYAGGWSSSTGSAVYTNTSSPGGSSAIAAAATELSTDVNAVND 320
Query: 142 ILYNYPTMGSQQYEKAFLDRLQSSL 166
+ + E ++ L
Sbjct: 321 APTSSAVTLTAGVENVLYTFTETQL 345
>gi|116494973|ref|YP_806707.1| ATP-dependent exoDNAse (exonuclease V) beta subunit [Lactobacillus
casei ATCC 334]
gi|122263609|sp|Q038V7|ADDA_LACC3 RecName: Full=ATP-dependent helicase/nuclease subunit A; AltName:
Full=ATP-dependent helicase/nuclease AddA
gi|116105123|gb|ABJ70265.1| DNA helicase/exodeoxyribonuclease V, subunit A [Lactobacillus casei
ATCC 334]
Length = 1234
Score = 35.0 bits (78), Expect = 7.7, Method: Composition-based stats.
Identities = 18/142 (12%), Positives = 34/142 (23%), Gaps = 13/142 (9%)
Query: 28 SANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQ------SYSLHPIQQQ 81
AN + D+ T I K K +D E +P Y + +L +
Sbjct: 816 QANKHFNMSDQ-TGTAILT-KQGIGIKWLDPETRVEYELPQYQAAKAARQNQTLAEEMRL 873
Query: 82 IHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNA-----SL 136
++ L + + L V + R + A +
Sbjct: 874 LYVALTRAQQRLYVVGATMSGNQLTSADKTVEKWAAAAEGEARVLAPQVRSGATSYLDWI 933
Query: 137 EERKEILYNYPTMGSQQYEKAF 158
+ + A
Sbjct: 934 GPALIRHPQARGLAETTIKPAL 955
>gi|307186073|gb|EFN71805.1| Pre-mRNA cleavage complex 2 protein Pcf11 [Camponotus floridanus]
Length = 1838
Score = 35.0 bits (78), Expect = 8.3, Method: Composition-based stats.
Identities = 27/188 (14%), Positives = 52/188 (27%), Gaps = 11/188 (5%)
Query: 2 GKQRASLAPD--PKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSF 59
GK A + P+ PK +A AN A A + P + + D
Sbjct: 210 GKPVAPVVPNQTPKQVAKQNAGANSGPRIAPASSIAVASAKPV-SRDPRLKPTPAVHDVT 268
Query: 60 IGREISIPHYLQSYSLHP-----IQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNT 114
I + + S Q + N +N L L ++ + + +
Sbjct: 269 TVPTIDLRQRPGTTSPKELRNEGQTQPVVNTIVTSNQLKQQLPSKPA--VTSTINKPPAS 326
Query: 115 TKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLE 174
L + + +L ++ + + L + +L
Sbjct: 327 PAGSDNPTLLNVINNNHADTNLNNSNNKTFS-GNINKDAVSHRTSQKKDPRLTSNSVNLN 385
Query: 175 TKLHNQGL 182
+ QGL
Sbjct: 386 SSKIGQGL 393
>gi|17231202|ref|NP_487750.1| heterocyst specific ABC-transporter, membrane fusion protein
[Nostoc sp. PCC 7120]
gi|1490222|emb|CAA67985.1| devB [Nostoc sp. PCC 7120]
gi|17132844|dbj|BAB75409.1| heterocyst specific ABC-transporter, membrane fusion protein
[Nostoc sp. PCC 7120]
Length = 474
Score = 35.0 bits (78), Expect = 8.8, Method: Composition-based stats.
Identities = 20/148 (13%), Positives = 51/148 (34%), Gaps = 8/148 (5%)
Query: 71 QSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPE 130
Q+ + +Q Q+ ++ + +Q + + + QQ
Sbjct: 149 QTAVIARLQAQLVGEMGAQQASITRIASQLSGEKVAQQALVNRLEAELVGQQDSLRATLN 208
Query: 131 KPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWN 190
+ A + + +Y+ + + S ++DR L NQ ++ A
Sbjct: 209 RIRAEQRNA--------QVDAGRYDFLYREGAISQQERDRRRLTATTANQQVIESQAALR 260
Query: 191 RAIDETNRKLHDVRLAAMLKASDEQERL 218
+A+ +++ + R M + Q++L
Sbjct: 261 QALATLRQQVAEARANQMKTLASLQQQL 288
>gi|331694576|ref|YP_004330815.1| putative ECF subfamily RNA polymerase sigma-24 subunit
[Pseudonocardia dioxanivorans CB1190]
gi|326949265|gb|AEA22962.1| putative RNA polymerase, sigma-24 subunit, ECF subfamily
[Pseudonocardia dioxanivorans CB1190]
Length = 383
Score = 35.0 bits (78), Expect = 9.2, Method: Composition-based stats.
Identities = 21/149 (14%), Positives = 46/149 (30%), Gaps = 4/149 (2%)
Query: 40 TPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQ 99
+P V + I +P + + +++I L+ +L
Sbjct: 107 SPAAAVALTLRAVGGLTTRQIAAAHMVPEATMAQRISRAKRRIEGLPLDAPGDLTTVLRV 166
Query: 100 RIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFL 159
+ V+ + + + +P A+ +L++ A
Sbjct: 167 LYLVFNEGYGGDVDLAAEAIRLARQLAALSAEPEAAGLLALMLLHHARRASRT----APD 222
Query: 160 DRLQSSLQQDREDLETKLHNQGLVSGSVA 188
RL +QDR +T++ QG+ A
Sbjct: 223 GRLVPLAEQDRSSWDTRMIEQGVAILQAA 251
>gi|285817113|gb|ADC37600.1| Putative Staphylococcal surface anchored protein; adhesin emb
[Staphylococcus aureus 04-02981]
Length = 970
Score = 35.0 bits (78), Expect = 9.4, Method: Composition-based stats.
Identities = 28/194 (14%), Positives = 64/194 (32%), Gaps = 5/194 (2%)
Query: 21 SANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQ 80
+ N A AN+ NI++ T + + + I+ EI + + Q
Sbjct: 108 AKNKAEELANSIINNINKATSNQAVSQVQTAGNHAIEQVHANEIPKAKIDANKDVDKQVQ 167
Query: 81 QIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLE-ER 139
+ + + N L + N + ++ + + A E +
Sbjct: 168 ALIDEIDRNPNLTDKEKQALKDRINQILQQGHNDINNALTKEEIEQAKAQLAQALKEIKD 227
Query: 140 KEILYNYPTMGSQQYEKAFLDRLQ---SSLQQDREDLETKLHNQGLVSGSVAWNRAIDET 196
+ +A +D + + ++++ L+ + NQ L G N A+ +
Sbjct: 228 LVKAKENAKQDVDKQVQALIDEIDQNPNLTDKEKQALKDR-INQILQQGHNDINNAMTKE 286
Query: 197 NRKLHDVRLAAMLK 210
+ +LA L+
Sbjct: 287 EIEQAKAQLAQALQ 300
>gi|253732074|ref|ZP_04866239.1| possible cell wall associated fibronectin-binding protein
[Staphylococcus aureus subsp. aureus USA300_TCH959]
gi|253724190|gb|EES92919.1| possible cell wall associated fibronectin-binding protein
[Staphylococcus aureus subsp. aureus USA300_TCH959]
Length = 1136
Score = 35.0 bits (78), Expect = 9.5, Method: Composition-based stats.
Identities = 28/194 (14%), Positives = 64/194 (32%), Gaps = 5/194 (2%)
Query: 21 SANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQ 80
+ N A AN+ NI++ T + + + I+ EI + + Q
Sbjct: 197 AKNKAEELANSIINNINKATSNQAVSQVQTAGNHAIEQVHANEIPKAKIDANKDVDKQVQ 256
Query: 81 QIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLE-ER 139
+ + + N L + N + ++ + + A + +
Sbjct: 257 ALIDEIDRNPNLTDKEKQALKDRINQILQQGHNDINNALTKEEIEQAKAQLAQALQDIKD 316
Query: 140 KEILYNYPTMGSQQYEKAFLDRLQ---SSLQQDREDLETKLHNQGLVSGSVAWNRAIDET 196
+ +A +D + + ++++ L+ + NQ L G N A+ +
Sbjct: 317 LVKAKEDAKQDVDKQVQALIDEIDQNPNLTDKEKQALKDR-INQILQQGHNDINNAMTKE 375
Query: 197 NRKLHDVRLAAMLK 210
+ RLA L+
Sbjct: 376 AIEQAKERLAQALQ 389
>gi|227512827|ref|ZP_03942876.1| conserved hypothetical protein [Lactobacillus buchneri ATCC 11577]
gi|227083827|gb|EEI19139.1| conserved hypothetical protein [Lactobacillus buchneri ATCC 11577]
Length = 461
Score = 34.7 bits (77), Expect = 9.9, Method: Composition-based stats.
Identities = 34/226 (15%), Positives = 57/226 (25%), Gaps = 12/226 (5%)
Query: 9 APDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVD-KIIDSFIGREISIP 67
APDP S + N+ ID TP G K D S
Sbjct: 32 APDPANNISQVNAGNVLKDYTQKNLNVIDNTTPKGNMDRKYIERTIDKNDPGTVESYSTT 91
Query: 68 HYLQSYSLHPIQQQIHNRQNINN---LLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124
+ + + + N+ N + ++ +T +
Sbjct: 92 PDSTQQTTLQTKLYLPDGFNVTNYQHGNFQSVTLDDSGNMYFIESNGSDTNLGVIVKY-N 150
Query: 125 DNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVS 184
D+ + S +NY + + + + E L+ N L
Sbjct: 151 LADLNKLGAGSDPMIVWNAFNYFNPYTDEGVQH-----NQQYEDAYEQLKA--PNADLKK 203
Query: 185 GSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHN 230
T++K L A + Q D Q K N
Sbjct: 204 VKSEVQNLQSTTSKKDATKANRQKLSALENQLETDQKQIKRIKQQN 249
Database: nr
Posted date: May 13, 2011 4:10 AM
Number of letters in database: 999,999,932
Number of sequences in database: 2,987,209
Database: /data/usr2/db/fasta/nr.01
Posted date: May 13, 2011 4:17 AM
Number of letters in database: 999,998,956
Number of sequences in database: 2,896,973
Database: /data/usr2/db/fasta/nr.02
Posted date: May 13, 2011 4:23 AM
Number of letters in database: 999,999,979
Number of sequences in database: 2,907,862
Database: /data/usr2/db/fasta/nr.03
Posted date: May 13, 2011 4:29 AM
Number of letters in database: 999,999,513
Number of sequences in database: 2,932,190
Database: /data/usr2/db/fasta/nr.04
Posted date: May 13, 2011 4:33 AM
Number of letters in database: 792,586,372
Number of sequences in database: 2,260,650
Lambda K H
0.292 0.102 0.232
Lambda K H
0.267 0.0313 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 802,609,314
Number of Sequences: 13984884
Number of extensions: 15977103
Number of successful extensions: 214152
Number of sequences better than 10.0: 4523
Number of HSP's better than 10.0 without gapping: 1033
Number of HSP's successfully gapped in prelim test: 3490
Number of HSP's that attempted gapping in prelim test: 159775
Number of HSP's gapped (non-prelim): 23237
length of query: 238
length of database: 4,792,584,752
effective HSP length: 135
effective length of query: 103
effective length of database: 2,904,625,412
effective search space: 299176417436
effective search space used: 299176417436
T: 11
A: 40
X1: 16 ( 6.8 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.0 bits)
S2: 78 (35.0 bits)