BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 029556
(191 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255560013|ref|XP_002521025.1| conserved hypothetical protein [Ricinus communis]
gi|223539862|gb|EEF41442.1| conserved hypothetical protein [Ricinus communis]
Length = 408
Score = 198 bits (504), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 106/195 (54%), Positives = 126/195 (64%), Gaps = 45/195 (23%)
Query: 14 LLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS 73
LL L + V GL+LEDGYTVTT+IDGH+LEINPH+V+ RP SSDLI+LDSS S YT+S
Sbjct: 18 LLHLPSNYVLGGLILEDGYTVTTIIDGHKLEINPHAVLSRPQSSDLILLDSSHSTIYTIS 77
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN- 132
FP+S+ESVVKRL+GDGV G SDGEPGSARF+KP+SFAVD KGNIYVAD+ N IRKITN
Sbjct: 78 FPISQESVVKRLSGDGVAGLSDGEPGSARFNKPRSFAVDNKGNIYVADRLNGTIRKITNS 137
Query: 133 ------------------------------------------LDHGNQLIRQINLKPEDC 150
DHGNQL+R++ LKP+DC
Sbjct: 138 GVSTIAGGYSKGFGREDGPAQNATFSSDFEVAFVAEECALLISDHGNQLVRRLPLKPDDC 197
Query: 151 SKSSQSGSGM--FWV 163
+ +S S G FWV
Sbjct: 198 ATASHSALGAVSFWV 212
>gi|224079468|ref|XP_002305875.1| predicted protein [Populus trichocarpa]
gi|222848839|gb|EEE86386.1| predicted protein [Populus trichocarpa]
Length = 241
Score = 175 bits (443), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 95/183 (51%), Positives = 115/183 (62%), Gaps = 48/183 (26%)
Query: 27 LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV-VKRL 85
+LEDGY VTTV+DGH+L INPH+V R SS+++VLDSSRS FYTL FP+S+ SV VKRL
Sbjct: 1 MLEDGYMVTTVLDGHKLNINPHAVQLR--SSEIVVLDSSRSVFYTLPFPISQASVMVKRL 58
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
+G+G GY DGEPG ARF+KPKSFAVD++GN+YVAD+ NH +RKI+N
Sbjct: 59 SGEGKTGYIDGEPGLARFNKPKSFAVDLRGNVYVADQQNHAVRKISNSGVTSTIVGNYSQ 118
Query: 133 ------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSGM-- 160
DHGNQL+RQI+LKPEDC SQS G
Sbjct: 119 TGRQDGPGKTATFSSDFEVLFVPQICALLISDHGNQLLRQIDLKPEDCVIGSQSALGAVK 178
Query: 161 FWV 163
FWV
Sbjct: 179 FWV 181
>gi|297850862|ref|XP_002893312.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297339154|gb|EFH69571.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 398
Score = 168 bits (425), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 90/199 (45%), Positives = 122/199 (61%), Gaps = 45/199 (22%)
Query: 7 YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
+SS+++ L ++F+LVS ++LEDGY VTTV+DGH+ +NP+++ PGSS+LIVLDSS
Sbjct: 5 FSSVSLFLF-VVFNLVSGKIVLEDGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63
Query: 67 SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
S FYT SFPLS +SV+ R AGDG GY DG+ G++RF KP+ FA+D KGN+YVAD+SN
Sbjct: 64 STFYTTSFPLSFDSVIHRFAGDGTSGYVDGKAGNSRFKKPRGFAIDAKGNVYVADRSNKA 123
Query: 127 IRKITN--------------------------------------------LDHGNQLIRQ 142
IRKI++ DHGN+++RQ
Sbjct: 124 IRKISSSGYVTTIAGGISKEFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMVRQ 183
Query: 143 INLKPEDCSKSSQSGSGMF 161
INLK EDC +SS S G +
Sbjct: 184 INLKEEDCLESSHSNLGAY 202
>gi|225424715|ref|XP_002265272.1| PREDICTED: uncharacterized protein LOC100243227 [Vitis vinifera]
Length = 438
Score = 167 bits (423), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 128/213 (60%), Gaps = 53/213 (24%)
Query: 1 MASHFVYSSLAITLLSLLFSLV----SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
MA++ SL++ ++ +F++ S+ L+LEDGYTV TV DG++LEINPHS++ R GS
Sbjct: 1 MATYL--QSLSLFIVFTIFTIAAIHGSADLVLEDGYTVRTVFDGNKLEINPHSILPRYGS 58
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
SD I+LDSS+S FYT+S PLS+ES +KRL+G G+SDG+ SA F KP+SFAVD+KGN
Sbjct: 59 SDFIILDSSKSVFYTVSSPLSQESEIKRLSGSSA-GFSDGDSASATFSKPRSFAVDLKGN 117
Query: 117 IYVADKSNHVIRKITNL------------------------------------------- 133
+YVAD+SN VIRKITN
Sbjct: 118 VYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPEKCAVLV 177
Query: 134 -DHGNQLIRQINLKPEDCSKSSQS--GSGMFWV 163
D G+QL+RQI+LK EDC +S QS G WV
Sbjct: 178 SDRGSQLVRQIDLKVEDCRRSPQSVLGGAFLWV 210
>gi|296086532|emb|CBI32121.3| unnamed protein product [Vitis vinifera]
Length = 369
Score = 167 bits (423), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 128/213 (60%), Gaps = 53/213 (24%)
Query: 1 MASHFVYSSLAITLLSLLFSLV----SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
MA++ SL++ ++ +F++ S+ L+LEDGYTV TV DG++LEINPHS++ R GS
Sbjct: 1 MATYL--QSLSLFIVFTIFTIAAIHGSADLVLEDGYTVRTVFDGNKLEINPHSILPRYGS 58
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
SD I+LDSS+S FYT+S PLS+ES +KRL+G G+SDG+ SA F KP+SFAVD+KGN
Sbjct: 59 SDFIILDSSKSVFYTVSSPLSQESEIKRLSGSSA-GFSDGDSASATFSKPRSFAVDLKGN 117
Query: 117 IYVADKSNHVIRKITNL------------------------------------------- 133
+YVAD+SN VIRKITN
Sbjct: 118 VYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPEKCAVLV 177
Query: 134 -DHGNQLIRQINLKPEDCSKSSQS--GSGMFWV 163
D G+QL+RQI+LK EDC +S QS G WV
Sbjct: 178 SDRGSQLVRQIDLKVEDCRRSPQSVLGGAFLWV 210
>gi|224065156|ref|XP_002301692.1| predicted protein [Populus trichocarpa]
gi|222843418|gb|EEE80965.1| predicted protein [Populus trichocarpa]
Length = 264
Score = 165 bits (418), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 90/184 (48%), Positives = 114/184 (61%), Gaps = 48/184 (26%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV-VKR 84
++LEDGY VTTV+DGH+L +NPH+V R SSDL+VLDSS+S FYTL FP+S++ V VKR
Sbjct: 23 IMLEDGYMVTTVMDGHKLNVNPHAVQLR--SSDLVVLDSSKSVFYTLPFPISQDGVMVKR 80
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------ 132
L+G G +GY DGEPG ARF+KPKSF VD++GN+YVAD+ NH +RKI++
Sbjct: 81 LSGSGDKGYIDGEPGLARFNKPKSFTVDLRGNVYVADQLNHAVRKISSSGMTTTIAGNYS 140
Query: 133 -------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSGM- 160
DHGNQL+RQ++LK EDC SQS G
Sbjct: 141 QIGRQDGPGETATFSTDFEVLFVPQICALLISDHGNQLLRQVDLKQEDCIIGSQSALGAV 200
Query: 161 -FWV 163
FWV
Sbjct: 201 KFWV 204
>gi|18395326|ref|NP_564209.1| NHL domain-containing protein [Arabidopsis thaliana]
gi|15724248|gb|AAL06517.1|AF412064_1 At1g23890/T23E23_13 [Arabidopsis thaliana]
gi|16974539|gb|AAL31179.1| At1g23890/T23E23_13 [Arabidopsis thaliana]
gi|110742589|dbj|BAE99208.1| hypothetical protein [Arabidopsis thaliana]
gi|332192326|gb|AEE30447.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 261
Score = 165 bits (417), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 90/199 (45%), Positives = 122/199 (61%), Gaps = 45/199 (22%)
Query: 7 YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
+SS+++ L ++F+LVS ++LE+GY VTTV+DGH+ +NP+++ PGSS+LIVLDSS
Sbjct: 5 FSSVSLFLF-VVFNLVSGKIVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63
Query: 67 SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
S FYT SFPLS +SV+ R AGDG G+ DG+ G++RF KP+ FAVD KGN+YVADKSN
Sbjct: 64 STFYTTSFPLSVDSVINRFAGDGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKA 123
Query: 127 IRKITN--------------------------------------------LDHGNQLIRQ 142
IRKI++ DHGN++IRQ
Sbjct: 124 IRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQ 183
Query: 143 INLKPEDCSKSSQSGSGMF 161
INLK EDC ++S S G +
Sbjct: 184 INLKEEDCLENSHSNLGTY 202
>gi|42571623|ref|NP_973902.1| NHL domain-containing protein [Arabidopsis thaliana]
gi|332192325|gb|AEE30446.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 400
Score = 165 bits (417), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/199 (45%), Positives = 122/199 (61%), Gaps = 45/199 (22%)
Query: 7 YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
+SS+++ L ++F+LVS ++LE+GY VTTV+DGH+ +NP+++ PGSS+LIVLDSS
Sbjct: 5 FSSVSLFLF-VVFNLVSGKIVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63
Query: 67 SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
S FYT SFPLS +SV+ R AGDG G+ DG+ G++RF KP+ FAVD KGN+YVADKSN
Sbjct: 64 STFYTTSFPLSVDSVINRFAGDGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKA 123
Query: 127 IRKITN--------------------------------------------LDHGNQLIRQ 142
IRKI++ DHGN++IRQ
Sbjct: 124 IRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQ 183
Query: 143 INLKPEDCSKSSQSGSGMF 161
INLK EDC ++S S G +
Sbjct: 184 INLKEEDCLENSHSNLGTY 202
>gi|9369410|gb|AAF87158.1|AC002423_23 T23E23.6 [Arabidopsis thaliana]
Length = 370
Score = 163 bits (413), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/194 (45%), Positives = 120/194 (61%), Gaps = 45/194 (23%)
Query: 7 YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
+SS+++ L ++F+LVS ++LE+GY VTTV+DGH+ +NP+++ PGSS+LIVLDSS
Sbjct: 5 FSSVSLFLF-VVFNLVSGKIVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63
Query: 67 SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
S FYT SFPLS +SV+ R AGDG G+ DG+ G++RF KP+ FAVD KGN+YVADKSN
Sbjct: 64 STFYTTSFPLSVDSVINRFAGDGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKA 123
Query: 127 IRKITN--------------------------------------------LDHGNQLIRQ 142
IRKI++ DHGN++IRQ
Sbjct: 124 IRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQ 183
Query: 143 INLKPEDCSKSSQS 156
INLK EDC ++S S
Sbjct: 184 INLKEEDCLENSHS 197
>gi|147798774|emb|CAN69795.1| hypothetical protein VITISV_022545 [Vitis vinifera]
Length = 459
Score = 162 bits (411), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 126/213 (59%), Gaps = 53/213 (24%)
Query: 1 MASHFVYSSLAITLLSLLFSLV----SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
MA++ SL++ ++ +F++ S+ L+LEDGYTV TV DG++LEINPHS++ R GS
Sbjct: 1 MATYL--QSLSLFIVFTIFTIAAIHGSADLVLEDGYTVRTVFDGNKLEINPHSILPRYGS 58
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
SD I+LDSS+S FYT+ PLS+ES + RL+G G+SDG+ SA F KP+SFAVD+KGN
Sbjct: 59 SDFIILDSSKSVFYTVXSPLSQESEINRLSGSSA-GFSDGDSASAMFSKPRSFAVDLKGN 117
Query: 117 IYVADKSNHVIRKITNL------------------------------------------- 133
+YVAD+SN VIRKITN
Sbjct: 118 VYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPEKCAVLV 177
Query: 134 -DHGNQLIRQINLKPEDCSKSSQS--GSGMFWV 163
D G+QL+RQI+LK EDC +S QS G WV
Sbjct: 178 SDRGSQLVRQIDLKVEDCRRSPQSVLGGAFLWV 210
>gi|388505270|gb|AFK40701.1| unknown [Medicago truncatula]
Length = 384
Score = 157 bits (398), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 83/179 (46%), Positives = 107/179 (59%), Gaps = 45/179 (25%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD 88
E+GYT+TT++DGH+L INP S++ RP SSDLIVLDS+ S FYT+ P+S+ESV KR +G+
Sbjct: 31 EEGYTITTILDGHKLHINPFSILQRPISSDLIVLDSTNSTFYTVQLPISQESVFKRFSGN 90
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------------- 132
G GY DG+ G ARFDKP+SFAVD +GN+YVAD+ N VIRKI+
Sbjct: 91 GSPGYEDGDVGLARFDKPRSFAVDFRGNVYVADRVNKVIRKISTNGVTTIAGGSSEKSSI 150
Query: 133 ---------------------------LDHGNQLIRQINLKPEDCSKSSQS--GSGMFW 162
DH +QL+ QINLK EDC+ S+S G+ M W
Sbjct: 151 KDGPVQNASFSNDFELTFIPALCALLVSDHMHQLVHQINLKGEDCTLGSKSALGAVMTW 209
>gi|356526828|ref|XP_003532018.1| PREDICTED: uncharacterized protein LOC100816542 [Glycine max]
Length = 400
Score = 153 bits (386), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 85/184 (46%), Positives = 105/184 (57%), Gaps = 46/184 (25%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
++ E+GYTVTTV DGH+ I P +V+ RP SSDLI+LDS S FYT FP++EESV RL
Sbjct: 24 VITEEGYTVTTVFDGHKPHIFPFTVLQRPFSSDLILLDSVNSTFYTAQFPITEESVFTRL 83
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL------------ 133
+GDG GYSDG+ GSARF KP+SFA DM+GN+YVADKSN IRKI+
Sbjct: 84 SGDGSVGYSDGDVGSARFAKPRSFAFDMRGNVYVADKSNRAIRKISAKGVTTIAGGEFSE 143
Query: 134 --------------------------------DHGNQLIRQINLKPEDCSKSSQSGSG-- 159
DH ++L+RQINL EDC+ S+ G G
Sbjct: 144 KSRTKDGPALNASFSNDFDLTFIPGLCALLVSDHMHRLVRQINLMEEDCTLGSKPGLGAV 203
Query: 160 MFWV 163
M W
Sbjct: 204 MTWT 207
>gi|148909539|gb|ABR17864.1| unknown [Picea sitchensis]
Length = 264
Score = 120 bits (300), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 77/211 (36%), Positives = 112/211 (53%), Gaps = 55/211 (26%)
Query: 14 LLSLLFSLVS---SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFY 70
LL +LFSL++ L+ E+GY V TV+ G+++++ PHS++ P + D+++LDS S +
Sbjct: 8 LLIILFSLITFTEPSLIPEEGYRVETVLYGNKMDVYPHSIL--PFNGDILLLDSVNSTLF 65
Query: 71 TLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+ PLS++S +K AG + V G++DG A+F+ PKSF +D KGNIYVAD++NH IR
Sbjct: 66 RIGLPLSQDSTIKVFAGSRNTVPGFADGGFLDAQFNHPKSFTLDSKGNIYVADRANHAIR 125
Query: 129 KITN-------------------------------------------LDHGNQLIRQINL 145
KI+ D GN++IRQI L
Sbjct: 126 KISKSGVSTIAGGTAGKTGHADGPSQEAMFSNDYDLTFIPSICALLVSDRGNRMIRQIKL 185
Query: 146 KPEDCSKSSQSGSGMFWVTVFSPYLISCKSF 176
P DC + SGSG WVT P + S SF
Sbjct: 186 PPGDCVQ--HSGSGTHWVT---PVIASGVSF 211
>gi|115476416|ref|NP_001061804.1| Os08g0414600 [Oryza sativa Japonica Group]
gi|37806208|dbj|BAC99711.1| unknown protein [Oryza sativa Japonica Group]
gi|113623773|dbj|BAF23718.1| Os08g0414600 [Oryza sativa Japonica Group]
gi|215715379|dbj|BAG95130.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215715380|dbj|BAG95131.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 275
Score = 108 bits (271), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 69/193 (35%), Positives = 95/193 (49%), Gaps = 50/193 (25%)
Query: 25 GLLLEDGYTVTTVID----GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
GL+L+DGYTVTT D G +P++++ RP + DL++LDS+ SA YTL+ PLS +
Sbjct: 26 GLVLDDGYTVTTAADLNHPGAPAAPHPYALLPRPRAGDLVLLDSAASALYTLALPLSGGA 85
Query: 81 VVKRLAGDGVQGYSDG--EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------ 132
+ LAG G EP A FD+P+S AVD N+YVAD+ N +RKI
Sbjct: 86 AARTLAGGGGGPAGFADGEPRDAAFDRPRSLAVDHADNVYVADRMNGAVRKIAPSGFTTT 145
Query: 133 --------------------------------------LDHGNQLIRQINLKPEDCSKSS 154
D GN+LIRQINLK EDC++ +
Sbjct: 146 IAGGRSKGPGRKDGPAQNATFSSDFELVYVPKMCALLVTDRGNRLIRQINLKQEDCARET 205
Query: 155 QSGSGMFWVTVFS 167
Q G G V++ +
Sbjct: 206 QPGLGTTSVSIIA 218
>gi|222640551|gb|EEE68683.1| hypothetical protein OsJ_27311 [Oryza sativa Japonica Group]
Length = 420
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/193 (35%), Positives = 95/193 (49%), Gaps = 50/193 (25%)
Query: 25 GLLLEDGYTVTTVID----GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
GL+L+DGYTVTT D G +P++++ RP + DL++LDS+ SA YTL+ PLS +
Sbjct: 26 GLVLDDGYTVTTAADLNHPGAPAAPHPYALLPRPRAGDLVLLDSAASALYTLALPLSGGA 85
Query: 81 VVKRLAGDGVQGYSDG--EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------ 132
+ LAG G EP A FD+P+S AVD N+YVAD+ N +RKI
Sbjct: 86 AARTLAGGGGGPAGFADGEPRDAAFDRPRSLAVDHADNVYVADRMNGAVRKIAPSGFTTT 145
Query: 133 --------------------------------------LDHGNQLIRQINLKPEDCSKSS 154
D GN+LIRQINLK EDC++ +
Sbjct: 146 IAGGRSKGPGRKDGPAQNATFSSDFELVYVPKMCALLVTDRGNRLIRQINLKQEDCARET 205
Query: 155 QSGSGMFWVTVFS 167
Q G G V++ +
Sbjct: 206 QPGLGTTSVSIIA 218
>gi|414870561|tpg|DAA49118.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 280
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 91/188 (48%), Gaps = 47/188 (25%)
Query: 27 LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+LEDGYTVTT D + L +P++V+ R + DL++LDS+ SA YT+S S +
Sbjct: 28 VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
G G++DG PG A FD+P+S AVD N+YVAD+ + +RK+
Sbjct: 88 LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
D GN+++RQINLKPEDC+ QSG G
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGLG 207
Query: 160 MFWVTVFS 167
V+ +
Sbjct: 208 NTLVSFIA 215
>gi|226495023|ref|NP_001144050.1| uncharacterized protein LOC100276874 precursor [Zea mays]
gi|195636116|gb|ACG37526.1| hypothetical protein [Zea mays]
gi|414870559|tpg|DAA49116.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 272
Score = 105 bits (263), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 91/188 (48%), Gaps = 47/188 (25%)
Query: 27 LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+LEDGYTVTT D + L +P++V+ R + DL++LDS+ SA YT+S S +
Sbjct: 28 VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
G G++DG PG A FD+P+S AVD N+YVAD+ + +RK+
Sbjct: 88 LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
D GN+++RQINLKPEDC+ QSG G
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGLG 207
Query: 160 MFWVTVFS 167
V+ +
Sbjct: 208 NTLVSFIA 215
>gi|224028415|gb|ACN33283.1| unknown [Zea mays]
gi|414870560|tpg|DAA49117.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 418
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 90/185 (48%), Gaps = 47/185 (25%)
Query: 27 LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+LEDGYTVTT D + L +P++V+ R + DL++LDS+ SA YT+S S +
Sbjct: 28 VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
G G++DG PG A FD+P+S AVD N+YVAD+ + +RK+
Sbjct: 88 LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
D GN+++RQINLKPEDC+ QSG G
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGLG 207
Query: 160 MFWVT 164
V+
Sbjct: 208 NTLVS 212
>gi|414870562|tpg|DAA49119.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 207
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 87/178 (48%), Gaps = 47/178 (26%)
Query: 27 LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+LEDGYTVTT D + L +P++V+ R + DL++LDS+ SA YT+S S +
Sbjct: 28 VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
G G++DG PG A FD+P+S AVD N+YVAD+ + +RK+
Sbjct: 88 LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
D GN+++RQINLKPEDC+ QSG
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSG 205
>gi|242079119|ref|XP_002444328.1| hypothetical protein SORBIDRAFT_07g020240 [Sorghum bicolor]
gi|241940678|gb|EES13823.1| hypothetical protein SORBIDRAFT_07g020240 [Sorghum bicolor]
Length = 416
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 66/189 (34%), Positives = 95/189 (50%), Gaps = 52/189 (27%)
Query: 27 LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+LEDGYTVTT + + L +P++++ RP + DL++LDS+ SA YTLS S +
Sbjct: 29 VLEDGYTVTTFANFNPLPASGPHPYAILPRPRAGDLLLLDSAGSALYTLSLS-SSPGEPR 87
Query: 84 RLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
RLAG + G+ DG+ A FD+P+S AVD N+YVAD+ + +RK+
Sbjct: 88 RLAGGKRRSGFDDGD---AAFDRPRSVAVDAADNVYVADQRHGAVRKVAPSGYTTTVAGG 144
Query: 133 ----------------------------------LDHGNQLIRQINLKPEDCSKSSQSGS 158
D GN+++RQINLKPEDC+ QSG
Sbjct: 145 LSSGPGHRDGLAQNATFSADFELVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGL 204
Query: 159 GMFWVTVFS 167
G V+V +
Sbjct: 205 GTTSVSVIA 213
>gi|296087695|emb|CBI34951.3| unnamed protein product [Vitis vinifera]
Length = 811
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 94/201 (46%), Gaps = 56/201 (27%)
Query: 6 VYSSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
+ S++ L+ L+SL S S + EDGYTV TV DG +L I P+S ++ S
Sbjct: 34 IVSNVVSALVKWLWSLKSTTNTAISSRSKMNFEDGYTVETVFDGSKLGIEPYS-LEVSTS 92
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
+L+VLDS S Y +S PLS S K +AG DG G+ DG+ AR + PK +D +
Sbjct: 93 GELLVLDSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDER 152
Query: 115 GNIYVADKSNHVIRKITN------------------------------------------ 132
GNIY+AD N IRKI++
Sbjct: 153 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSL 212
Query: 133 --LDHGNQLIRQINLKPEDCS 151
+D GNQ IR+I L EDC+
Sbjct: 213 LVIDRGNQAIREIQLHYEDCA 233
>gi|225452518|ref|XP_002274991.1| PREDICTED: uncharacterized protein LOC100266244 [Vitis vinifera]
Length = 677
Score = 96.3 bits (238), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 94/201 (46%), Gaps = 56/201 (27%)
Query: 6 VYSSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
+ S++ L+ L+SL S S + EDGYTV TV DG +L I P+S ++ S
Sbjct: 34 IVSNVVSALVKWLWSLKSTTNTAISSRSKMNFEDGYTVETVFDGSKLGIEPYS-LEVSTS 92
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
+L+VLDS S Y +S PLS S K +AG DG G+ DG+ AR + PK +D +
Sbjct: 93 GELLVLDSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDER 152
Query: 115 GNIYVADKSNHVIRKITN------------------------------------------ 132
GNIY+AD N IRKI++
Sbjct: 153 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSL 212
Query: 133 --LDHGNQLIRQINLKPEDCS 151
+D GNQ IR+I L EDC+
Sbjct: 213 LVIDRGNQAIREIQLHYEDCA 233
>gi|147860395|emb|CAN80443.1| hypothetical protein VITISV_043282 [Vitis vinifera]
Length = 527
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 94/201 (46%), Gaps = 56/201 (27%)
Query: 6 VYSSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
+ S++ L+ L+SL S S + EDGYTV TV DG +L I P+S ++ S
Sbjct: 17 IVSNVVSALVKWLWSLKSTTNTAISSXSKMNFEDGYTVETVFDGSKLGIEPYS-LEVSTS 75
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
+L+VLDS S Y +S PLS S K +AG DG G+ DG+ AR + PK +D +
Sbjct: 76 GELLVLDSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDER 135
Query: 115 GNIYVADKSNHVIRKITN------------------------------------------ 132
GNIY+AD N IRKI++
Sbjct: 136 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSL 195
Query: 133 --LDHGNQLIRQINLKPEDCS 151
+D GNQ IR+I L EDC+
Sbjct: 196 LVIDRGNQAIREIQLHYEDCA 216
>gi|115448467|ref|NP_001048013.1| Os02g0730400 [Oryza sativa Japonica Group]
gi|46390465|dbj|BAD15926.1| putative NHL repeat-containing protein [Oryza sativa Japonica
Group]
gi|46390861|dbj|BAD16365.1| putative NHL repeat-containing protein [Oryza sativa Japonica
Group]
gi|113537544|dbj|BAF09927.1| Os02g0730400 [Oryza sativa Japonica Group]
gi|125541002|gb|EAY87397.1| hypothetical protein OsI_08804 [Oryza sativa Indica Group]
gi|125583566|gb|EAZ24497.1| hypothetical protein OsJ_08258 [Oryza sativa Japonica Group]
gi|215715228|dbj|BAG94979.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 508
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 95/204 (46%), Gaps = 55/204 (26%)
Query: 8 SSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
S+ A +++ L+SL S S + E GYTV TV DG +L I P++V G +
Sbjct: 43 SNAASSVMKRLWSLKSTTKTGSGGKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQGG-E 101
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
L+V+DS S Y ++ PLS S K +AG +G G+ DG P AR + PK F VD +GN
Sbjct: 102 LLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGFPGHVDGRPREARMNHPKGFTVDGRGN 161
Query: 117 IYVADKSNHVIRKITN-------------------------------------------L 133
IYVAD N IRKI++ +
Sbjct: 162 IYVADAMNMAIRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIGSSCSLLVI 221
Query: 134 DHGNQLIRQINLKPEDCSKSSQSG 157
D GNQ IR+I L +DC ++G
Sbjct: 222 DRGNQAIREIQLNFDDCVYQYEAG 245
>gi|242095458|ref|XP_002438219.1| hypothetical protein SORBIDRAFT_10g009760 [Sorghum bicolor]
gi|241916442|gb|EER89586.1| hypothetical protein SORBIDRAFT_10g009760 [Sorghum bicolor]
Length = 507
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/174 (36%), Positives = 81/174 (46%), Gaps = 46/174 (26%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GY V TV DG +L I PH+V P + DL+VLDS S Y + PLS S K LAG
Sbjct: 76 EGGYAVDTVFDGSKLGIEPHAVEITP-AGDLLVLDSINSNIYRVQLPLSPYSRPKLLAGS 134
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
+G+ G+ DG AR + P F VD KGNIYVAD N IRKI++
Sbjct: 135 PEGLSGHVDGRLREARMNHPNGFTVDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSIRG 194
Query: 133 -----------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC+ ++G
Sbjct: 195 GHIDGPSDDAKFSTDFEIQYISSSCSLLVIDRGNQAIREIPLNDDDCAYQYETG 248
>gi|255552640|ref|XP_002517363.1| conserved hypothetical protein [Ricinus communis]
gi|223543374|gb|EEF44905.1| conserved hypothetical protein [Ricinus communis]
Length = 494
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 68/194 (35%), Positives = 88/194 (45%), Gaps = 56/194 (28%)
Query: 13 TLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLD 63
L+ L+SL S S + E GYTV TV DG +L I PH+V P S +L+VLD
Sbjct: 41 ALVKWLWSLKSNTKTAASSRSMMKFEGGYTVETVFDGSKLGIEPHTVDVSP-SGELLVLD 99
Query: 64 SSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
S S Y +S PLS S K +AG +G G+ DG AR + PK VD +GNIY+AD
Sbjct: 100 SENSNIYKISTPLSRYSRPKLIAGSPEGYSGHVDGRLREARMNHPKGLTVDNRGNIYIAD 159
Query: 122 KSNHVIRKITN--------------------------------------------LDHGN 137
N IRKI++ +D GN
Sbjct: 160 TMNMAIRKISDGGVTTIAGGKWTRAGGHVDGPSEDAKFSNDFDIAYIGSSCSLLVIDRGN 219
Query: 138 QLIRQINLKPEDCS 151
Q IR+I L +DC+
Sbjct: 220 QAIREIQLNDDDCN 233
>gi|9755656|emb|CAC01808.1| putative protein [Arabidopsis thaliana]
Length = 733
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 89/192 (46%), Gaps = 48/192 (25%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GY + TV DG +L I P+++ P +LIVLDS S + +S PLS K L+G
Sbjct: 54 ESGYNMETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYGKPKLLSGS 113
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
+G G+ DG+ AR ++P+ A+D +GNIYVAD N IRKI++
Sbjct: 114 QEGYTGHVDGKLKEARMNRPRGLAMDDRGNIYVADTINMAIRKISDDGVSTIAAGGRWSG 173
Query: 133 --------------------------LDHGNQLIRQINLKPEDCSKSSQS------GSGM 160
+D GNQLI++I L DCS+ G+ +
Sbjct: 174 GSKEESMRFSDDFDLIYVSSSCSLLVIDRGNQLIKEIQLHDHDCSQPEPDTDSLHLGTAL 233
Query: 161 FWVTVFSPYLIS 172
VF Y+++
Sbjct: 234 LVAAVFFGYMLA 245
>gi|293332279|ref|NP_001170567.1| uncharacterized protein LOC100384592 precursor [Zea mays]
gi|238006082|gb|ACR34076.1| unknown [Zea mays]
Length = 505
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 83/177 (46%), Gaps = 50/177 (28%)
Query: 29 EDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E GY V TV DG +L I PH+ I+R G DL++LDS S Y + PLS S K LAG
Sbjct: 76 EGGYAVDTVFDGSKLGIEPHAAQINRAG--DLLLLDSINSNIYRVQLPLSPYSRPKLLAG 133
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
+G+ G+ DG AR + PK F VD +GNIYVAD N IRKI++
Sbjct: 134 SPEGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSIR 193
Query: 133 --------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC+ ++G
Sbjct: 194 GGYIDEPSVSDDAKFSTDFEVQYISSTCSLLVIDRGNQAIREIPLNDDDCAYQYEAG 250
>gi|30685100|ref|NP_196993.2| NHL domain-containing protein [Arabidopsis thaliana]
gi|332004703|gb|AED92086.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 754
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 89/192 (46%), Gaps = 48/192 (25%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GY + TV DG +L I P+++ P +LIVLDS S + +S PLS K L+G
Sbjct: 75 ESGYNMETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYGKPKLLSGS 134
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
+G G+ DG+ AR ++P+ A+D +GNIYVAD N IRKI++
Sbjct: 135 QEGYTGHVDGKLKEARMNRPRGLAMDDRGNIYVADTINMAIRKISDDGVSTIAAGGRWSG 194
Query: 133 --------------------------LDHGNQLIRQINLKPEDCSKSSQS------GSGM 160
+D GNQLI++I L DCS+ G+ +
Sbjct: 195 GSKEESMRFSDDFDLIYVSSSCSLLVIDRGNQLIKEIQLHDHDCSQPEPDTDSLHLGTAL 254
Query: 161 FWVTVFSPYLIS 172
VF Y+++
Sbjct: 255 LVAAVFFGYMLA 266
>gi|413953662|gb|AFW86311.1| hypothetical protein ZEAMMB73_666053 [Zea mays]
Length = 506
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 83/177 (46%), Gaps = 50/177 (28%)
Query: 29 EDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E GY V TV DG +L I PH+ I+R G DL++LDS S Y + PLS S K LAG
Sbjct: 76 EGGYAVDTVFDGSKLGIEPHAAQINRAG--DLLLLDSINSNIYRVQLPLSPYSRPKLLAG 133
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
+G+ G+ DG AR + PK F VD +GNIYVAD N IRKI++
Sbjct: 134 SPEGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSIR 193
Query: 133 --------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC+ ++G
Sbjct: 194 GGYIDEPSVSDDAKFSTDFEVQYISSTCSLLVIDRGNQAIREIPLNDDDCAYQYEAG 250
>gi|326503362|dbj|BAJ99306.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 452
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 90/188 (47%), Gaps = 53/188 (28%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+GL+LEDGYTV+TV D + +P++++ RP + DL++LDS+ S YTL P+S ++ +
Sbjct: 54 AGLVLEDGYTVSTVSDLNPSGTHPYALLPRPRAGDLVLLDSAGSTLYTLPLPVSADAGPR 113
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
RLAG + P+S AVD N+YVAD++N IRK+
Sbjct: 114 RLAGGAG---------ALGDGHPRSIAVDGADNVYVADRANGSIRKVAPSGYTTTIAGAY 164
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
D GN+LIRQI LKPEDC+ +Q G G
Sbjct: 165 SAGTGHRDEPAQNATFSADFELIYVPQICALLVADRGNRLIRQIKLKPEDCAHENQKGLG 224
Query: 160 MFWVTVFS 167
++ +
Sbjct: 225 TTSTSIIA 232
>gi|195621162|gb|ACG32411.1| hypothetical protein [Zea mays]
gi|219884599|gb|ACL52674.1| unknown [Zea mays]
gi|413944257|gb|AFW76906.1| hypothetical protein ZEAMMB73_849037 [Zea mays]
Length = 508
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 81/174 (46%), Gaps = 46/174 (26%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GY V TV DG +L I PH+V P + +L+VLDS S Y + PLS S K LAG
Sbjct: 73 EGGYAVDTVFDGSKLGIEPHAVEITP-AGNLLVLDSINSNIYRVQLPLSPYSRPKLLAGS 131
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
G+ G+ DG AR + PK F VD +GNIYVAD N IRKI++
Sbjct: 132 PKGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADGMNMAIRKISDTGVTTIAGGKSIRG 191
Query: 133 -----------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC+ ++G
Sbjct: 192 GHIDGPSDDAKFSTDFEIKYISSSCSLLVIDRGNQAIREIPLNDDDCAYQYEAG 245
>gi|218197942|gb|EEC80369.1| hypothetical protein OsI_22479 [Oryza sativa Indica Group]
Length = 517
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 92/207 (44%), Gaps = 58/207 (28%)
Query: 8 SSLAITLLSLLFSLVSSG------------LLLEDGYTVTTVIDGHQLEINPHSVIDRPG 55
SS A ++ L+SL S+ + E GY V TV DG +L I PH V P
Sbjct: 45 SSTASAVVKQLWSLKSTATRTGSAAAGRSMVKYEGGYAVETVFDGSKLGIEPHDVEVTP- 103
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDM 113
S +L+VLDS S Y + PLS S K +AG +G+ G+ DG A+ + PK F VD
Sbjct: 104 SGELLVLDSMNSNVYRVQLPLSRYSRPKLVAGSPEGLSGHVDGRLREAKMNHPKGFTVDD 163
Query: 114 KGNIYVADKSNHVIRKITN----------------------------------------- 132
+GNIYVAD N IRKI++
Sbjct: 164 RGNIYVADAMNMAIRKISDTGVTTIAGGKSMRGGHLDGPSDDAKFSTDFEIRYISSSCSL 223
Query: 133 --LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC ++G
Sbjct: 224 LVIDRGNQAIREIALHNDDCEYQYEAG 250
>gi|115467514|ref|NP_001057356.1| Os06g0269300 [Oryza sativa Japonica Group]
gi|53793193|dbj|BAD54399.1| NHL repeat-containing protein-like [Oryza sativa Japonica Group]
gi|113595396|dbj|BAF19270.1| Os06g0269300 [Oryza sativa Japonica Group]
gi|215768552|dbj|BAH00781.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222635365|gb|EEE65497.1| hypothetical protein OsJ_20925 [Oryza sativa Japonica Group]
Length = 517
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 92/207 (44%), Gaps = 58/207 (28%)
Query: 8 SSLAITLLSLLFSLVSSG------------LLLEDGYTVTTVIDGHQLEINPHSVIDRPG 55
SS A ++ L+SL S+ + E GY V TV DG +L I PH V P
Sbjct: 45 SSTASAVVKQLWSLKSTATRTGSAAAGRSMVKYEGGYAVETVFDGSKLGIEPHDVEVTP- 103
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDM 113
S +L+VLDS S Y + PLS S K +AG +G+ G+ DG A+ + PK F VD
Sbjct: 104 SGELLVLDSMNSNVYRVQLPLSRYSRPKLVAGSPEGLSGHVDGRLREAKMNHPKGFTVDD 163
Query: 114 KGNIYVADKSNHVIRKITN----------------------------------------- 132
+GNIYVAD N IRKI++
Sbjct: 164 RGNIYVADAMNMAIRKISDTGVTTIAGGKSMRGGHLDGPSDDAKFSTDFEIRYISSSCSL 223
Query: 133 --LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC ++G
Sbjct: 224 LVIDRGNQAIREIALHNDDCEYQYEAG 250
>gi|413923863|gb|AFW63795.1| hypothetical protein ZEAMMB73_706161 [Zea mays]
Length = 515
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 70/204 (34%), Positives = 95/204 (46%), Gaps = 55/204 (26%)
Query: 8 SSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
S+ A ++ L+SL S S + E GYTV TV DG +L I P+SV + S +
Sbjct: 44 SNAASAVVKRLWSLKSTTKTASGGKSMVKYEGGYTVETVFDGSKLGIEPYSV-EVTQSGE 102
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
L+V+DS S Y ++ PLS S K +AG +G G+ DG+ AR + PK F VD KGN
Sbjct: 103 LLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDKGN 162
Query: 117 IYVADKSNHVIRKITN-------------------------------------------L 133
IYVAD N IRKI++ +
Sbjct: 163 IYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLVI 222
Query: 134 DHGNQLIRQINLKPEDCSKSSQSG 157
D GNQ IR+I L +DC ++G
Sbjct: 223 DRGNQAIREIQLHFDDCVYQYEAG 246
>gi|226530993|ref|NP_001143690.1| uncharacterized protein LOC100276420 precursor [Zea mays]
gi|195624742|gb|ACG34201.1| hypothetical protein [Zea mays]
Length = 515
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 83/174 (47%), Gaps = 46/174 (26%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GYTV TV DG +L I P+SV + S +L+V+DS S Y ++ PLS S K +AG
Sbjct: 75 EGGYTVETVFDGSKLGIEPYSV-EVTQSGELLVMDSMNSNIYRMALPLSRYSRPKLVAGS 133
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
+G G+ DG+ AR + PK F VD KGNIYVAD N IRKI++
Sbjct: 134 PEGFPGHVDGKLREARMNHPKGFTVDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSGRG 193
Query: 133 -----------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC ++G
Sbjct: 194 GHVDGPSDEAKFSTDFEVRYIGSSCSLLVIDRGNQAIREIQLHFDDCVYQYEAG 247
>gi|413923861|gb|AFW63793.1| hypothetical protein ZEAMMB73_706161 [Zea mays]
Length = 516
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 83/174 (47%), Gaps = 46/174 (26%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GYTV TV DG +L I P+SV + S +L+V+DS S Y ++ PLS S K +AG
Sbjct: 75 EGGYTVETVFDGSKLGIEPYSV-EVTQSGELLVMDSMNSNIYRMALPLSRYSRPKLVAGS 133
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
+G G+ DG+ AR + PK F VD KGNIYVAD N IRKI++
Sbjct: 134 PEGFPGHVDGKLREARMNHPKGFTVDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSGRG 193
Query: 133 -----------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC ++G
Sbjct: 194 GHVDGPSDEAKFSTDFEVRYIGSSCSLLVIDRGNQAIREIQLHFDDCVYQYEAG 247
>gi|297807503|ref|XP_002871635.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317472|gb|EFH47894.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 704
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 80/165 (48%), Gaps = 42/165 (25%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GY + TV DG +L I P+++ P +LIVLDS S + +S PLS + K ++G
Sbjct: 75 ESGYNIETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYAKPKLVSGS 134
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
+G G+ DG+ A+ ++P+ A+D GNIYVAD +N IRKI++
Sbjct: 135 QEGYTGHVDGKLKEAKMNRPRGLAIDDSGNIYVADTNNMAIRKISDDGVSTITAGGRWSG 194
Query: 133 --------------------------LDHGNQLIRQINLKPEDCS 151
+D GNQ+I++I L DCS
Sbjct: 195 GSKEESMRFSDDFDLIYVSSSCSLLVIDRGNQMIKEIQLHDHDCS 239
>gi|326512916|dbj|BAK03365.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326529495|dbj|BAK04694.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326529987|dbj|BAK08273.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 503
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 94/204 (46%), Gaps = 55/204 (26%)
Query: 8 SSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
S+ A +L L+SL S+ + E GYTV TV DG +L I P++V G +
Sbjct: 40 SNAASAVLKRLWSLKSTAKTASGAKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQGG-E 98
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
L+V+DS S Y ++ PLS S K +AG +G+ G+ DG A+ + PK F VD +GN
Sbjct: 99 LLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGIPGHIDGRLREAKMNHPKGFTVDGRGN 158
Query: 117 IYVADKSNHVIRKITN-------------------------------------------L 133
IYVAD N +RKI++ +
Sbjct: 159 IYVADAMNMAVRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIASSCSLLVI 218
Query: 134 DHGNQLIRQINLKPEDCSKSSQSG 157
D GNQ IR+I L +DC ++G
Sbjct: 219 DRGNQAIREIQLNFDDCVYQYEAG 242
>gi|293331401|ref|NP_001168100.1| uncharacterized protein LOC100381837 precursor [Zea mays]
gi|223946007|gb|ACN27087.1| unknown [Zea mays]
gi|413938717|gb|AFW73268.1| hypothetical protein ZEAMMB73_298159 [Zea mays]
Length = 510
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 96/204 (47%), Gaps = 55/204 (26%)
Query: 8 SSLAITLLSLLFSL-----VSSG----LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
S+ A ++ L+SL ++SG + E GYTV TV DG +L I P+SV G +
Sbjct: 44 SNAASAVVKRLWSLKSTTKIASGGKSMVKYEGGYTVETVFDGSKLGIEPYSVEVTQGG-E 102
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
L+V+DS S Y ++ PLS S K +AG +G G+ DG+ AR + PK F VD +GN
Sbjct: 103 LLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDRGN 162
Query: 117 IYVADKSNHVIRKITN-------------------------------------------L 133
IYVAD N IRKI++ +
Sbjct: 163 IYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLVI 222
Query: 134 DHGNQLIRQINLKPEDCSKSSQSG 157
D GNQ IR+I L +DC ++G
Sbjct: 223 DRGNQAIREIQLHFDDCVYQYEAG 246
>gi|413944258|gb|AFW76907.1| hypothetical protein ZEAMMB73_849037 [Zea mays]
Length = 187
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 52/106 (49%), Positives = 64/106 (60%), Gaps = 3/106 (2%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GY V TV DG +L I PH+V P + +L+VLDS S Y + PLS S K LAG
Sbjct: 73 EGGYAVDTVFDGSKLGIEPHAVEITP-AGNLLVLDSINSNIYRVQLPLSPYSRPKLLAGS 131
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
G+ G+ DG AR + PK F VD +GNIYVAD N IRKI++
Sbjct: 132 PKGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADGMNMAIRKISD 177
>gi|413938718|gb|AFW73269.1| hypothetical protein ZEAMMB73_298159 [Zea mays]
Length = 511
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 94/205 (45%), Gaps = 56/205 (27%)
Query: 8 SSLAITLLSLLFSLVSSGLLL----------EDGYTVTTVIDGHQLEINPHSVIDRPGSS 57
S+ A ++ L+SL S+ + E GYTV TV DG +L I P+SV G
Sbjct: 44 SNAASAVVKRLWSLKSTTKIAASGGKSMVKYEGGYTVETVFDGSKLGIEPYSVEVTQGG- 102
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+L+V+DS S Y ++ PLS S K +AG +G G+ DG+ AR + PK F VD +G
Sbjct: 103 ELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDRG 162
Query: 116 NIYVADKSNHVIRKITN------------------------------------------- 132
NIYVAD N IRKI++
Sbjct: 163 NIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLV 222
Query: 133 LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC ++G
Sbjct: 223 IDRGNQAIREIQLHFDDCVYQYEAG 247
>gi|242066098|ref|XP_002454338.1| hypothetical protein SORBIDRAFT_04g029020 [Sorghum bicolor]
gi|241934169|gb|EES07314.1| hypothetical protein SORBIDRAFT_04g029020 [Sorghum bicolor]
Length = 518
Score = 89.4 bits (220), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 82/174 (47%), Gaps = 46/174 (26%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GYTV TV DG +L I P+SV G +L+V+DS S Y ++ PLS S K +AG
Sbjct: 75 EGGYTVETVFDGSKLGIEPYSVEVTQGG-ELLVMDSMNSNIYRMALPLSRYSRPKLVAGS 133
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
+G G+ DG+ AR + PK F VD +GNIYVAD N IRKI++
Sbjct: 134 PEGFPGHVDGKLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSGRG 193
Query: 133 -----------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC ++G
Sbjct: 194 GHVDGPSDEAKFSTDFEVRYVGSSCSLLVIDRGNQAIREIQLHFDDCVYQYEAG 247
>gi|9369390|gb|AAF87138.1|AC002423_3 T23E23.5 [Arabidopsis thaliana]
Length = 493
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 70/232 (30%), Positives = 102/232 (43%), Gaps = 60/232 (25%)
Query: 1 MASHFVYSSLAITLLSLLFSLVSSG-----------LLLEDGYTVTTVIDGHQLEINPHS 49
M+ HF++ + I L S + S + E+GY+V TV+DG +L I P+S
Sbjct: 1 MSRHFLFLGIIILLFSAFVASAPSSTSPATVPTKSMVKFENGYSVETVLDGSKLGIEPYS 60
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPK 107
I + +L++LDS S Y +S LS S + + G +G G+ DG AR + PK
Sbjct: 61 -IQVLSNGELLILDSQNSNIYQISSSLSLYSRPRLVTGSPEGYPGHVDGRLRDARLNNPK 119
Query: 108 SFAVDMKGNIYVADKSNHVIRKITN----------------------------------- 132
VD +GNIYVAD N+ IRKI+
Sbjct: 120 GLTVDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVRGGGHVDGPSEDAKFSNDFDVVY 179
Query: 133 ---------LDHGNQLIRQINLKPEDCSKSSQSG--SGMFWVTVFSPYLISC 173
+D GNQ IR+I L +DC+ SG G ++ FS ++ SC
Sbjct: 180 LGSSCSLLVIDRGNQAIREIQLHFDDCADQYGSGFPLGNIYIPKFSVWVPSC 231
>gi|357137830|ref|XP_003570502.1| PREDICTED: uncharacterized protein LOC100839308 [Brachypodium
distachyon]
Length = 495
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 93/204 (45%), Gaps = 55/204 (26%)
Query: 8 SSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
S+ A +L L+SL S S + E GYTV TV DG +L I P++V G +
Sbjct: 39 SNAASAVLKRLWSLKSTTKTATGAKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQGG-E 97
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
L+V+DS S Y ++ PLS S K +AG +G G+ DG A+ + PK F VD +GN
Sbjct: 98 LLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGYPGHVDGRLREAKMNHPKGFTVDGRGN 157
Query: 117 IYVADKSNHVIRKITN-------------------------------------------L 133
IYVAD N +RKI++ +
Sbjct: 158 IYVADAMNMAVRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIASSCSLLVI 217
Query: 134 DHGNQLIRQINLKPEDCSKSSQSG 157
D GNQ IR+I L +DC ++G
Sbjct: 218 DRGNQAIREIQLHFDDCVYQYEAG 241
>gi|356573394|ref|XP_003554846.1| PREDICTED: uncharacterized protein LOC100793916 [Glycine max]
Length = 528
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 77/170 (45%), Gaps = 47/170 (27%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E GY+V T+ DG QL I PHSV P + + +VLDS S Y +S +S S K LAG
Sbjct: 70 FESGYSVETIFDGSQLGIEPHSVKISP-NGEFLVLDSENSNIYKVSGSMSRYSRPKLLAG 128
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
+G G+ DG P AR + PK VD +GNIY+AD N IRKI++
Sbjct: 129 SAEGNIGHIDGRPREARMNHPKGLTVDDRGNIYIADTLNMAIRKISDEGVTTIAGGKRGY 188
Query: 133 -------------------------------LDHGNQLIRQINLKPEDCS 151
+D GN IR+I L +DC+
Sbjct: 189 AGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVVDRGNHAIREIQLHQDDCT 238
>gi|449446849|ref|XP_004141183.1| PREDICTED: uncharacterized protein LOC101216407 [Cucumis sativus]
gi|449489545|ref|XP_004158343.1| PREDICTED: uncharacterized protein LOC101227198 [Cucumis sativus]
Length = 417
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 81/172 (47%), Gaps = 46/172 (26%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
+ E GY V T++DG ++ I P+SV P S +L++LD+ S + +S P+S+ K
Sbjct: 2 IKFERGYVVETLLDGSKMGIEPYSVGVSP-SGELLILDAENSNVHKISMPVSQFCRPKLF 60
Query: 86 AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
AG +G G+ DG+ AR P+ VD +GNIY+AD N IRKI++
Sbjct: 61 AGSSEGYSGHVDGKLRDARMSHPRGLTVDQRGNIYIADTKNKAIRKISDAGVTTIAGGKW 120
Query: 133 --------------------------------LDHGNQLIRQINLKPEDCSK 152
+D GNQ IR+I L+ EDC++
Sbjct: 121 RKSGHIDGPSEESKFSNDFDLVYVGSSCSLLVVDRGNQAIREIQLRAEDCTE 172
>gi|147798773|emb|CAN69794.1| hypothetical protein VITISV_022544 [Vitis vinifera]
Length = 491
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 86/182 (47%), Gaps = 49/182 (26%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E GYTV TV DG +L I P+S I+ GS +L++LDS+ S Y +S LS+ + K
Sbjct: 62 SMMKFESGYTVETVFDGSKLGIEPYS-IEVLGSGELLILDSANSNLYKISSSLSQYTRPK 120
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
+ G +G G+ DG AR + PK VD +GNIYVAD N IRKI++
Sbjct: 121 LVTGSAEGYSGHVDGRLREARMNHPKGLTVDDRGNIYVADTLNMAIRKISDAGVTTIAGG 180
Query: 133 -----------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC + Q G
Sbjct: 181 KLGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHFDDC--AYQYG 238
Query: 158 SG 159
SG
Sbjct: 239 SG 240
>gi|224102017|ref|XP_002312513.1| predicted protein [Populus trichocarpa]
gi|222852333|gb|EEE89880.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 99/208 (47%), Gaps = 57/208 (27%)
Query: 6 VYSSLAITLLSLLFSL------VSSG---LLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
++S++ L+ L+SL V SG + E GYTV TV DG +L I P+SV P S
Sbjct: 38 LFSNVVSALMKWLWSLKATTKTVISGRPMMKFESGYTVETVFDGSKLGIEPYSVQVLP-S 96
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
+L++LDS+ S Y +S LS S K +AG +G G+ DG+ A+ + PK VD +
Sbjct: 97 GELLILDSANSNIYRISASLSLYSRPKLVAGSHEGYSGHVDGKLREAKMNHPKGLTVDDR 156
Query: 115 GNIYVADKSNHVIRKITN------------------------------------------ 132
GNIYVAD N IRKI++
Sbjct: 157 GNIYVADTMNMAIRKISDAGVTTIAGGKWGRGSHVDGASEDANFSNDFDVVYIGSSCSLL 216
Query: 133 -LDHGNQLIRQINLKPEDCSKSSQSGSG 159
+D GN+ IR+I L +DC + Q GSG
Sbjct: 217 VIDRGNRAIREIQLHFDDC--AYQYGSG 242
>gi|225424717|ref|XP_002265467.1| PREDICTED: uncharacterized protein LOC100263777 [Vitis vinifera]
gi|296086531|emb|CBI32120.3| unnamed protein product [Vitis vinifera]
Length = 491
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 86/182 (47%), Gaps = 49/182 (26%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E GYTV TV DG +L I P+S I+ GS +L++LDS+ S Y +S LS+ + K
Sbjct: 62 SMMKFESGYTVETVFDGSKLGIEPYS-IEVLGSGELLILDSANSNLYKISSSLSQYTRPK 120
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
+ G +G G+ DG AR + PK VD +GNIYVAD N IRKI++
Sbjct: 121 LVTGSAEGYSGHVDGRLREARMNHPKGLTVDDRGNIYVADTLNMAIRKISDAGVTTIAGG 180
Query: 133 -----------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC + Q G
Sbjct: 181 KLGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHFDDC--AYQYG 238
Query: 158 SG 159
SG
Sbjct: 239 SG 240
>gi|225461726|ref|XP_002285515.1| PREDICTED: uncharacterized protein LOC100245328 [Vitis vinifera]
gi|302142876|emb|CBI20171.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 89/186 (47%), Gaps = 50/186 (26%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L EDGY V TV++G++L + PHS+ + G +L +D+ ++ ++ PLS+ S +
Sbjct: 57 LQFEDGYLVETVVEGNELGVVPHSIRVSEDG--ELFAVDAVKNNIVRITPPLSQYSRARL 114
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
+AG G G+ DG+P ARF+ PK +D KGN+YVAD SN IRKI +
Sbjct: 115 VAGSFQGHTGHVDGKPSDARFNGPKGVTMDDKGNVYVADTSNLAIRKIGDSGVTTIAGGK 174
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDC--SKSSQSG 157
+D GN +RQI+L EDC SS S
Sbjct: 175 SNVAGYRDGPSEDAKFSSDFDVVYVRPTCSLLVVDRGNAALRQISLNQEDCDYQNSSISA 234
Query: 158 SGMFWV 163
+ +F V
Sbjct: 235 TDIFMV 240
>gi|449492692|ref|XP_004159073.1| PREDICTED: uncharacterized protein LOC101226879 [Cucumis sativus]
Length = 516
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 94/201 (46%), Gaps = 56/201 (27%)
Query: 6 VYSSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
V S++ +L+ L+S+ SS + E GY+V TV DG +L I+P+SV P S
Sbjct: 34 VLSNVVSSLVKKLWSMKSSAKTAVSSRSMMKFESGYSVETVFDGSKLGIDPYSVEMSP-S 92
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
+L++LD+ S + +S PLS S K ++G +G G+ DG AR + PK +D +
Sbjct: 93 GELLILDAENSNIHKISMPLSRFSRPKLVSGSAEGYSGHVDGHSREARMNHPKGLTLDER 152
Query: 115 GNIYVADKSNHVIRKITN------------------------------------------ 132
GNIY+AD N IRKI++
Sbjct: 153 GNIYIADTMNMAIRKISDTGVTTIAGGRWNQGSGHIDGPSEDAKFSNDFDVVYVGSSCSL 212
Query: 133 --LDHGNQLIRQINLKPEDCS 151
+D GN+ IR+I L +DC+
Sbjct: 213 LVIDRGNKAIREIELNYDDCN 233
>gi|195609352|gb|ACG26506.1| hypothetical protein [Zea mays]
gi|414870563|tpg|DAA49120.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 159
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 49/109 (44%), Positives = 70/109 (64%), Gaps = 5/109 (4%)
Query: 27 LLEDGYTVTTVIDGHQLEIN---PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+LEDGYTVTT D + L + P++V+ R + DL++LDS+ SA YT+S S +
Sbjct: 28 VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISL-SSSPGEPR 86
Query: 84 RLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
RLAG G++DG PG A FD+P+S AVD N+YVAD+ + +RK+
Sbjct: 87 RLAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVA 135
>gi|224108107|ref|XP_002314724.1| predicted protein [Populus trichocarpa]
gi|222863764|gb|EEF00895.1| predicted protein [Populus trichocarpa]
Length = 487
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/208 (32%), Positives = 96/208 (46%), Gaps = 57/208 (27%)
Query: 6 VYSSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
V+S++ + L+S+ S+ + E GYTV TV DG +L I P+SV P S
Sbjct: 41 VFSNVVPAFMKWLWSMKSTTKTVISGRPMMKFESGYTVETVFDGSKLGIEPYSVQVLP-S 99
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
+L++LDS S Y +S LS S K +AG +G G+ DG+ A+ + PK VD +
Sbjct: 100 GELLILDSVNSNIYRMSSSLSLYSRPKLVAGSHEGYSGHVDGKLREAKMNHPKGLTVDDR 159
Query: 115 GNIYVADKSNHVIRKITN------------------------------------------ 132
GNIY+AD N IRKI++
Sbjct: 160 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGSHVDGASEDAKFSNDFDVLYIGSSCSLL 219
Query: 133 -LDHGNQLIRQINLKPEDCSKSSQSGSG 159
+D GNQ IR+I L +DC+ Q GSG
Sbjct: 220 VIDRGNQAIREIQLHFDDCAY--QYGSG 245
>gi|357118152|ref|XP_003560822.1| PREDICTED: uncharacterized protein LOC100834977 [Brachypodium
distachyon]
Length = 544
Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 59/174 (33%), Positives = 81/174 (46%), Gaps = 46/174 (26%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E Y V TV DG +L I P++V P + +L+VLDS S Y + PLS S K ++G
Sbjct: 79 EGWYEVETVFDGSKLGIEPYAVEVTP-AGELLVLDSMNSNIYRVQLPLSRYSRPKLVSGS 137
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
+G+ G+ DG AR + PK F VD +GNIYVAD N IRKI++
Sbjct: 138 PEGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSVRG 197
Query: 133 -----------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L+ +DC ++G
Sbjct: 198 GHTDGPSDDAKFSTDFEIRYISSSCSLLVIDRGNQAIREIPLQLDDCEYQHEAG 251
>gi|255560011|ref|XP_002521024.1| conserved hypothetical protein [Ricinus communis]
gi|223539861|gb|EEF41441.1| conserved hypothetical protein [Ricinus communis]
Length = 500
Score = 85.9 bits (211), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 83/178 (46%), Gaps = 47/178 (26%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
+ E GY V TV DG +L I P+SV P + +L++LDS+ S Y +S LS S K +
Sbjct: 72 MKFEGGYNVETVFDGSKLGIEPYSVEVLP-NGELLILDSANSNIYRISSSLSLYSRPKLV 130
Query: 86 AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
AG DG G+ DG+ AR + PK VD +GNIY+AD N IRKI++
Sbjct: 131 AGSPDGYSGHVDGKHREARMNHPKGLTVDDRGNIYIADAMNMAIRKISDAGVTTIAGGKW 190
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GN+ IR+I L +DC+ +SG
Sbjct: 191 GRGGSHVDGASEDAKFSNDFDVVYIGSSCSLLVIDRGNRAIREIQLHFDDCAYQYESG 248
>gi|356497860|ref|XP_003517774.1| PREDICTED: uncharacterized protein LOC100799483 [Glycine max]
Length = 508
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 84/180 (46%), Gaps = 47/180 (26%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E GYTV TV DG +L I P++V P + +L++LDS+ S Y +S LS S K
Sbjct: 62 SMMKFESGYTVETVFDGSKLGIEPYAVEVLP-NGELLILDSANSNIYRISSSLSLNSRPK 120
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
+AG +G G+ DG AR + PK VD +GNIYVAD N IRKI++
Sbjct: 121 LVAGSAEGYSGHVDGRFREARMNHPKGITVDNRGNIYVADIMNMAIRKISDSGVTTIAGG 180
Query: 133 -----------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L +DC+ ++G
Sbjct: 181 KWSRGGGHVDGPSEEAKFSNDFDVVYVGSSCSLLVIDRGNQAIREIQLHFDDCAYQYENG 240
>gi|356501873|ref|XP_003519748.1| PREDICTED: uncharacterized protein LOC100797570 [Glycine max]
Length = 507
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 85/182 (46%), Gaps = 47/182 (25%)
Query: 22 VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV 81
S + E GYTV TV DG +L I P++V + + +L++LDS+ S Y +S LS +
Sbjct: 60 AKSMMKFESGYTVETVFDGSKLGIEPYAV-EVLANGELLILDSANSNIYRISSSLSLNTR 118
Query: 82 VKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------- 132
K +AG +G G+ DG AR + PK AVD +GNIY+AD N IRKI++
Sbjct: 119 PKLVAGSAEGYSGHVDGRLREARMNHPKGIAVDNRGNIYIADIMNMAIRKISDSGVTTIA 178
Query: 133 -------------------------------------LDHGNQLIRQINLKPEDCSKSSQ 155
+D GNQ IR+I L +DC+ +
Sbjct: 179 GGKWSRGGGHIDGPSEEAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHFDDCAYQYE 238
Query: 156 SG 157
+G
Sbjct: 239 NG 240
>gi|317015034|gb|ADU85604.1| NHL repeat-containing protein, partial [Wolffia arrhiza]
Length = 239
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/107 (43%), Positives = 65/107 (60%), Gaps = 3/107 (2%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E GY V T+ DG++L + PH++ P +LI +DS S ++ PLS S + +AG
Sbjct: 60 FESGYLVETIADGNRLGLTPHAIRVSP-DGELIAVDSVNSNIVRITPPLSAFSRGRLVAG 118
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
G G DG+P ARF++P+ AVD +GNIYVAD SN IRKI +
Sbjct: 119 SFQGRSGLIDGKPSEARFNQPRGVAVDRRGNIYVADVSNLAIRKIGD 165
>gi|449449204|ref|XP_004142355.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101209861 [Cucumis sativus]
Length = 454
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 82/174 (47%), Gaps = 47/174 (27%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E GY+V TV DG +L I+P+SV P S +L++LD+ S + +S PLS S K
Sbjct: 13 SMMKFESGYSVETVFDGSKLGIDPYSVEMSP-SGELLILDAENSNIHKISMPLSRFSRPK 71
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
++G +G G+ DG AR + PK +D +GNIY+AD N IRKI++
Sbjct: 72 LVSGSAEGYSGHVDGHSREARMNHPKGLTLDERGNIYIADTMNMAIRKISDTGVTTIAGG 131
Query: 133 -----------------------------------LDHGNQLIRQINLKPEDCS 151
+D GN+ IR+I L +DC+
Sbjct: 132 RWNQGSGHIDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIELNYDDCN 185
>gi|297845436|ref|XP_002890599.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297336441|gb|EFH66858.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 90/197 (45%), Gaps = 49/197 (24%)
Query: 9 SLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSA 68
SL+ ++ S + E+GY+V TV+DG +L I P+S+ P + +L++LDS S
Sbjct: 49 SLSFKTTTITAVSTKSMVKFENGYSVETVLDGSKLGIEPYSLQVLP-NGELLILDSQNSN 107
Query: 69 FYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
Y +S LS S + + G +G G+ DG AR + PK VD +GNIYVAD N+
Sbjct: 108 IYKISSSLSLYSRPRLITGSPEGYPGHVDGRLRDARLNNPKGLTVDDRGNIYVADTVNNA 167
Query: 127 IRKITN--------------------------------------------LDHGNQLIRQ 142
IRKI+ +D GNQ IR+
Sbjct: 168 IRKISEAGVTTIAGGKMVHGGGHVDGPSEDAKFSNDFDVVYLGSSCSLLVIDRGNQAIRE 227
Query: 143 INLKPEDCSKSSQSGSG 159
I L +DC + Q GSG
Sbjct: 228 IQLHFDDC--ADQYGSG 242
>gi|356549639|ref|XP_003543199.1| PREDICTED: uncharacterized protein LOC100797841 [Glycine max]
Length = 487
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 85/191 (44%), Gaps = 45/191 (23%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
L E+GY V TV++G+++ + P+ + +L +D+ S ++ PLS+ S + +
Sbjct: 57 LQFENGYVVETVVEGNEIGVVPYRIRVSEEDGELFAVDAINSNIVRITPPLSQYSRGRLV 116
Query: 86 AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
AG G G+ DG+P ARF+ PK VD KGN+YVAD N IRKI +
Sbjct: 117 AGSFQGYTGHVDGKPSDARFNHPKGITVDDKGNVYVADTQNLAIRKIGDAGVTTIAGGKS 176
Query: 133 --------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSGM 160
+D GN +RQI+L EDC S S S
Sbjct: 177 NVAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLDQEDCDYQSNSISST 236
Query: 161 FWVTVFSPYLI 171
+TV ++
Sbjct: 237 DILTVVGAVIV 247
>gi|22329761|ref|NP_173800.2| NHL domain-containing protein [Arabidopsis thaliana]
gi|18700091|gb|AAL77657.1| At1g23880/T23E23_8 [Arabidopsis thaliana]
gi|20856013|gb|AAM26643.1| At1g23880/T23E23_8 [Arabidopsis thaliana]
gi|332192324|gb|AEE30445.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 545
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 82/178 (46%), Gaps = 49/178 (27%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E+GY+V TV+DG +L I P+S I + +L++LDS S Y +S LS S + + G
Sbjct: 128 FENGYSVETVLDGSKLGIEPYS-IQVLSNGELLILDSQNSNIYQISSSLSLYSRPRLVTG 186
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
+G G+ DG AR + PK VD +GNIYVAD N+ IRKI+
Sbjct: 187 SPEGYPGHVDGRLRDARLNNPKGLTVDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVR 246
Query: 133 -------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
+D GNQ IR+I L +DC + Q GSG
Sbjct: 247 GGGHVDGPSEDAKFSNDFDVVYLGSSCSLLVIDRGNQAIREIQLHFDDC--ADQYGSG 302
>gi|356565931|ref|XP_003551189.1| PREDICTED: uncharacterized protein LOC100781836 [Glycine max]
Length = 493
Score = 82.8 bits (203), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 86/182 (47%), Gaps = 49/182 (26%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E GY V TV DG +L I P++V P + +L++LDS+ S Y +S LS S K
Sbjct: 66 SMMKFESGYNVETVFDGSKLGIEPYAVEVLP-NGELLILDSANSNIYRISSSLSLYSRPK 124
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
+AG +G G+ DG+ AR + PK VD +GNIYVAD +N IRKI++
Sbjct: 125 LVAGSAEGYSGHVDGKLREARMNHPKGITVDDRGNIYVADTTNMAIRKISDSGVTTIAGG 184
Query: 133 -----------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GN+ IR+I L +DC + Q G
Sbjct: 185 KWNRGGGHVDGPSEEAKFSDDLDVVYVGSSCSLLVIDRGNRAIREIQLHFDDC--AYQYG 242
Query: 158 SG 159
SG
Sbjct: 243 SG 244
>gi|224117004|ref|XP_002317450.1| predicted protein [Populus trichocarpa]
gi|222860515|gb|EEE98062.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 63/207 (30%), Positives = 88/207 (42%), Gaps = 54/207 (26%)
Query: 3 SHFVYS-SLAITLLSLLFSLVSSG-------LLLEDGYTVTTVIDGHQLEINPHSVIDRP 54
SHF Y+ LA+ L+ F + L EDGY V TV+ G+ + + P+ I
Sbjct: 4 SHFSYTLVLAVLSLACTFQFQAHAAPPDENVLQFEDGYLVETVVKGNAMGVVPYK-IRLS 62
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVD 112
+L +D S ++ PLS+ S + +AG G G+ DG+P RF+ P+ +D
Sbjct: 63 EDGELYAVDEVNSNVVKITPPLSQYSRARLVAGSFQGYTGHIDGKPNEVRFNHPRGLTMD 122
Query: 113 MKGNIYVADKSNHVIRKITN---------------------------------------- 132
KGNIYVAD NH IRKI +
Sbjct: 123 DKGNIYVADSLNHAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVHSTCS 182
Query: 133 ---LDHGNQLIRQINLKPEDCSKSSQS 156
+D GN +RQI+L EDC S S
Sbjct: 183 LLVVDRGNAALRQISLNQEDCDYQSSS 209
>gi|413936635|gb|AFW71186.1| hypothetical protein ZEAMMB73_395034 [Zea mays]
Length = 501
Score = 82.4 bits (202), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 80/176 (45%), Gaps = 46/176 (26%)
Query: 22 VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEES 80
V L E GY V T+++G +L + PH+V P +L+ +DS+ S ++ PLSE S
Sbjct: 67 VGVALQFESGYFVETLVEGDKLGVTPHTVRVSPVEGGELLAVDSAHSNIVRITPPLSEYS 126
Query: 81 VVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------ 132
+ +AG G G+ DG+P ARF +P AVD GN+YVAD +N IRKI
Sbjct: 127 RGRLVAGSFQGHSGHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTI 186
Query: 133 -------------------------------------LDHGNQLIRQINLKPEDCS 151
+D GN +R+I+L EDC+
Sbjct: 187 AGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCT 242
>gi|224117390|ref|XP_002331700.1| predicted protein [Populus trichocarpa]
gi|222874306|gb|EEF11437.1| predicted protein [Populus trichocarpa]
Length = 495
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 92/213 (43%), Gaps = 53/213 (24%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L EDGY V TV++G+ + + P+ + + G +L +D S ++ PLS+ S R
Sbjct: 61 LQFEDGYLVETVVEGNAMGVVPYKIRVSEDG--ELYAVDEVNSNVVKITPPLSQYSRA-R 117
Query: 85 LAGDGVQGYS---DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
LA QGY+ DG+P ARF+ P+ +D KGNIYVAD N IRKI +
Sbjct: 118 LAAGSFQGYTGHIDGKPNEARFNHPRGLTMDDKGNIYVADTLNLAIRKIGDAGVTTIAGG 177
Query: 133 ----------------------------------LDHGNQLIRQINLKPEDCSKSSQSGS 158
+D GN +RQI+L EDC S+
Sbjct: 178 KSNVAGFRDGPSEDAKFSNDFDVVYLHSTCSLLVVDRGNAALRQISLNQEDCDYQSKCTR 237
Query: 159 GMFWVTVFSPYLISCKSFWVHLYLYFCSFKGSI 191
T+ S L+S W L+ Y G++
Sbjct: 238 K---TTIESTRLVSKTRCWGVLFDYVIMVVGAV 267
>gi|226504928|ref|NP_001143593.1| uncharacterized protein LOC100276297 precursor [Zea mays]
gi|195623016|gb|ACG33338.1| hypothetical protein [Zea mays]
Length = 501
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 80/176 (45%), Gaps = 46/176 (26%)
Query: 22 VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEES 80
V L E GY V T+++G +L + PH++ P +L+ +DS+ S ++ PLSE S
Sbjct: 67 VGVALQFESGYFVETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYS 126
Query: 81 VVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------ 132
+ +AG G G+ DG+P ARF +P AVD GN+YVAD +N IRKI
Sbjct: 127 RGRLVAGSFQGHSGHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTI 186
Query: 133 -------------------------------------LDHGNQLIRQINLKPEDCS 151
+D GN +R+I+L EDC+
Sbjct: 187 AGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCT 242
>gi|242061092|ref|XP_002451835.1| hypothetical protein SORBIDRAFT_04g008420 [Sorghum bicolor]
gi|241931666|gb|EES04811.1| hypothetical protein SORBIDRAFT_04g008420 [Sorghum bicolor]
Length = 495
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 79/172 (45%), Gaps = 46/172 (26%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L E GY V T+++G +L + PH++ P +L+ +DS+ S ++ PLSE S +
Sbjct: 70 LQFESGYFVETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYSRGRL 129
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
+AG G G+ DG+P ARF +P AVD GN+YVAD +N IRKI
Sbjct: 130 VAGSFQGHSGHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTIAGGK 189
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCS 151
+D GN +R+I+L EDC+
Sbjct: 190 SNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCT 241
>gi|357140648|ref|XP_003571876.1| PREDICTED: uncharacterized protein LOC100824092 [Brachypodium
distachyon]
Length = 491
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 79/173 (45%), Gaps = 46/173 (26%)
Query: 25 GLLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
L E GY V T+++G +L + PH++ P +L+ +DS+ S ++ PLSE S +
Sbjct: 65 ALQFESGYFVETLVEGDKLGVTPHTIRVSPLEGGELLAVDSAHSNIVRITPPLSEYSRAR 124
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
+AG G G+ DG+P ARF +P AVD GN+Y+AD +N IRKI +
Sbjct: 125 LVAGSFQGHAGHIDGKPIDARFKRPTGVAVDDTGNVYIADTANLAIRKIGDSGVTTIAGG 184
Query: 133 ----------------------------------LDHGNQLIRQINLKPEDCS 151
+D GN +R+I L EDC+
Sbjct: 185 KSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKIALPQEDCT 237
>gi|115445197|ref|NP_001046378.1| Os02g0234200 [Oryza sativa Japonica Group]
gi|50251216|dbj|BAD27660.1| putative NHL repeat-containing protein [Oryza sativa Japonica
Group]
gi|113535909|dbj|BAF08292.1| Os02g0234200 [Oryza sativa Japonica Group]
gi|125538730|gb|EAY85125.1| hypothetical protein OsI_06475 [Oryza sativa Indica Group]
gi|222622487|gb|EEE56619.1| hypothetical protein OsJ_06002 [Oryza sativa Japonica Group]
Length = 493
Score = 80.1 bits (196), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 78/172 (45%), Gaps = 46/172 (26%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L E GY V T+++G +L + PH++ P +L+ +DS+ S ++ PLSE S +
Sbjct: 69 LQFESGYFVETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYSRGRL 128
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
+AG G G+ DG+P ARF +P AVD GN+YVAD +N IRKI
Sbjct: 129 VAGSFQGHSGHIDGKPSDARFKRPTGVAVDDTGNVYVADTANLAIRKIGESGVTTIAGGK 188
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCS 151
+D GN +R+I L EDC+
Sbjct: 189 SNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKIALPQEDCT 240
>gi|449456567|ref|XP_004146020.1| PREDICTED: uncharacterized protein LOC101206392 [Cucumis sativus]
Length = 477
Score = 79.7 bits (195), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 80/173 (46%), Gaps = 48/173 (27%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
+ + E GY V T+++G+++ + P+ + + G +L +DS S +S PLS S
Sbjct: 55 NAIQFESGYLVETIVEGNEIGMVPYKIRVSEDG--ELFAVDSVNSNVVKVSPPLSRYSRA 112
Query: 83 KRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------- 132
+ +AG G +G+ DG+P ARF++PK +D KGN+YVAD N IRKI +
Sbjct: 113 RLVAGSFQGYKGHVDGKPSDARFNQPKGITIDDKGNVYVADTLNLAIRKIVDAGVTTIAG 172
Query: 133 -----------------------------------LDHGNQLIRQINLKPEDC 150
+D GN +RQI+L EDC
Sbjct: 173 GKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVVDRGNAALRQISLNKEDC 225
>gi|449518065|ref|XP_004166064.1| PREDICTED: uncharacterized protein LOC101228723 [Cucumis sativus]
Length = 449
Score = 79.3 bits (194), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 80/173 (46%), Gaps = 48/173 (27%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
+ + E GY V T+++G+++ + P+ + + G +L +DS S +S PLS S
Sbjct: 27 NAIQFESGYLVETIVEGNEIGMVPYKIRVSEDG--ELFAVDSVNSNVVKVSPPLSRYSRA 84
Query: 83 KRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------- 132
+ +AG G +G+ DG+P ARF++PK +D KGN+YVAD N IRKI +
Sbjct: 85 RLVAGSFQGYKGHVDGKPSDARFNQPKGITIDDKGNVYVADTLNLAIRKIVDAGVTTIAG 144
Query: 133 -----------------------------------LDHGNQLIRQINLKPEDC 150
+D GN +RQI+L EDC
Sbjct: 145 GKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVVDRGNAALRQISLNKEDC 197
>gi|224140207|ref|XP_002323476.1| predicted protein [Populus trichocarpa]
gi|222868106|gb|EEF05237.1| predicted protein [Populus trichocarpa]
Length = 453
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 62/108 (57%), Gaps = 3/108 (2%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
+ E GYTV TV DG +L I P+SV P S +L++LDS S Y +S S+ K +
Sbjct: 9 MKFEGGYTVETVFDGSKLGIEPYSVEVSP-SGELLLLDSENSNIYKISTQFSKYGRPKLI 67
Query: 86 AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G +G G+ DG+ AR + PK VD GNIYVAD N IRKI+
Sbjct: 68 TGSFEGYPGHVDGKLRDARMNHPKGLTVDEGGNIYVADTMNMAIRKIS 115
>gi|449438911|ref|XP_004137231.1| PREDICTED: uncharacterized protein LOC101210502 [Cucumis sativus]
Length = 480
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 80/171 (46%), Gaps = 48/171 (28%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L E+GY V TV++G+++ + PH + + + G +L V+DS S ++ PLS+ + +
Sbjct: 59 LQFENGYLVGTVVEGNEIGVLPHKIHVSKDG--ELFVVDSVNSNIVKITPPLSKYTRARL 116
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
+AG G+ DG+P ARF+ P+ VD KGN+YVAD N IRKI +
Sbjct: 117 VAGSFQSHTGHIDGKPNDARFNHPRGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGK 176
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDC 150
+D GN IRQI+L EDC
Sbjct: 177 SNVVGYRDGPGEDAKFSNDFDVMYVRSTCSLLVIDRGNAAIRQISLNQEDC 227
>gi|449530424|ref|XP_004172195.1| PREDICTED: uncharacterized LOC101210502, partial [Cucumis sativus]
Length = 437
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 80/171 (46%), Gaps = 48/171 (28%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L E+GY V TV++G+++ + PH + + + G +L V+DS S ++ PLS+ + +
Sbjct: 16 LQFENGYLVGTVVEGNEIGVLPHKIHVSKDG--ELFVVDSVNSNIVKITPPLSKYTRARL 73
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
+AG G+ DG+P ARF+ P+ VD KGN+YVAD N IRKI +
Sbjct: 74 VAGSFQSHTGHIDGKPNDARFNHPRGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGK 133
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDC 150
+D GN IRQI+L EDC
Sbjct: 134 SNVVGYRDGPGEDAKFSNDFDVMYVRSTCSLLVIDRGNAAIRQISLNQEDC 184
>gi|297841763|ref|XP_002888763.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297334604|gb|EFH65022.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 503
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 82/178 (46%), Gaps = 49/178 (27%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E+GY+V TV DG +L I P+S+ P + +L++LDS S Y +S LS S + + G
Sbjct: 66 FENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILDSENSNIYKISSSLSLYSRPRLVTG 124
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
+G G+ DG A+ + PK VD +GNIYVAD N+ IRKI+
Sbjct: 125 SPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVR 184
Query: 133 -------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
+D GN+ IR+I L +DC+ Q GSG
Sbjct: 185 NGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDCAY--QYGSG 240
>gi|30698020|ref|NP_177185.3| NHL domain-containing protein [Arabidopsis thaliana]
gi|332196920|gb|AEE35041.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 509
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 82/178 (46%), Gaps = 49/178 (27%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E+GY+V TV DG +L I P+S+ P + +L++LDS S Y +S LS S + + G
Sbjct: 66 FENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILDSENSNIYKISSSLSLYSRPRLVTG 124
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
+G G+ DG A+ + PK VD +GNIYVAD N+ IRKI+
Sbjct: 125 SPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVR 184
Query: 133 -------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
+D GN+ IR+I L +DC+ Q GSG
Sbjct: 185 NGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDCAY--QYGSG 240
>gi|3176691|gb|AAC18814.1| Contains homology to serine/threonine protein kinase gb|X99618 from
Mycobacterium tuberculosis. ESTs gb|F14403, gb|F14404,
and gb|N96730 come from this gene [Arabidopsis thaliana]
Length = 493
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 82/178 (46%), Gaps = 49/178 (27%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E+GY+V TV DG +L I P+S+ P + +L++LDS S Y +S LS S + + G
Sbjct: 39 FENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILDSENSNIYKISSSLSLYSRPRLVTG 97
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
+G G+ DG A+ + PK VD +GNIYVAD N+ IRKI+
Sbjct: 98 SPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVR 157
Query: 133 -------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
+D GN+ IR+I L +DC + Q GSG
Sbjct: 158 NGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDC--AYQYGSG 213
>gi|30698018|ref|NP_850974.1| NHL domain-containing protein [Arabidopsis thaliana]
gi|17065224|gb|AAL32766.1| Unknown protein [Arabidopsis thaliana]
gi|21387163|gb|AAM47985.1| unknown protein [Arabidopsis thaliana]
gi|332196919|gb|AEE35040.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 447
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 82/178 (46%), Gaps = 49/178 (27%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E+GY+V TV DG +L I P+S+ P + +L++LDS S Y +S LS S + + G
Sbjct: 4 FENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILDSENSNIYKISSSLSLYSRPRLVTG 62
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
+G G+ DG A+ + PK VD +GNIYVAD N+ IRKI+
Sbjct: 63 SPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVR 122
Query: 133 -------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
+D GN+ IR+I L +DC + Q GSG
Sbjct: 123 NGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDC--AYQYGSG 178
>gi|255567088|ref|XP_002524526.1| conserved hypothetical protein [Ricinus communis]
gi|223536200|gb|EEF37853.1| conserved hypothetical protein [Ricinus communis]
Length = 484
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/177 (30%), Positives = 80/177 (45%), Gaps = 48/177 (27%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L EDGY V TV++G+++ + P+ + + G +L +D S ++ PLS+ S +
Sbjct: 60 LQFEDGYLVETVVEGNEIGVVPYKIRVSEDG--ELYAVDEVNSNIVKITPPLSQYSRARL 117
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
+AG G +G+ DG+ ARF+ PK +D KGN+YVAD N IRKI +
Sbjct: 118 VAGSFQGYKGHVDGKSNEARFNHPKGITMDDKGNVYVADTLNLAIRKIGDAGVTTIAGGK 177
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQS 156
+D GN +RQI+L EDC S S
Sbjct: 178 SNTAGYRDGPSEDAKFSTDFDVVYVHSTCSLLVVDRGNVALRQISLNQEDCDYQSSS 234
>gi|357452081|ref|XP_003596317.1| hypothetical protein MTR_2g075860 [Medicago truncatula]
gi|355485365|gb|AES66568.1| hypothetical protein MTR_2g075860 [Medicago truncatula]
Length = 493
Score = 75.9 bits (185), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 63/108 (58%), Gaps = 4/108 (3%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
L E+GY V TV++G+++ + P+ + +L +D S ++ PLS+ S + +
Sbjct: 63 LQFENGYVVETVVEGNEIGVIPYRIRVSEEDGELFAVDEINSNIVRITPPLSQYSRGRLV 122
Query: 86 AGDGVQGYSD---GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AG QGY+D G+P ARF+ PK +D KGN+YVAD N IRKI
Sbjct: 123 AG-SFQGYTDHVDGKPSDARFNHPKGITMDDKGNVYVADTQNLAIRKI 169
>gi|168018878|ref|XP_001761972.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686689|gb|EDQ73076.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 386
Score = 75.9 bits (185), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 64/106 (60%), Gaps = 6/106 (5%)
Query: 31 GYTVTTVIDGHQLEINPHSV---IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
GY VTTV++G+ ++ + + DR + IVLDS++S + PLS++SVV+ +AG
Sbjct: 44 GYDVTTVLNGNLRGLSFYCIDEATDR-APAWAIVLDSTKSKVLRVQLPLSQDSVVEHIAG 102
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+G GY DG G A F+ PK +D GNIYVAD N IR IT
Sbjct: 103 SLEGKAGYQDGRGGDALFNHPKMLTLDSDGNIYVADVRNTAIRMIT 148
>gi|357485963|ref|XP_003613269.1| hypothetical protein MTR_5g034750 [Medicago truncatula]
gi|355514604|gb|AES96227.1| hypothetical protein MTR_5g034750 [Medicago truncatula]
Length = 589
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 88/208 (42%), Gaps = 65/208 (31%)
Query: 14 LLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS 73
+L+L S + E GY V TV DG +L I P++V + + +L++LDS S Y +S
Sbjct: 66 ILNLQTIAGKSMMKFESGYNVETVFDGSKLGIEPYAV-EVLSNGELLILDSENSNIYKIS 124
Query: 74 FPLS------------------EESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDM 113
LS S K +AG +G G+ DG+ AR + PK VD
Sbjct: 125 SSLSLCKCANVITSPMMSLVAPTNSRPKLVAGSAEGYSGHVDGKLREARMNHPKGITVDD 184
Query: 114 KGNIYVADKSNHVIRKITN----------------------------------------- 132
+GNIYVAD N IRKI++
Sbjct: 185 RGNIYVADIMNMAIRKISDSGVTTIAGGKLSRGGGHVDGPSEEAKFSNDFDVVYVGSSCS 244
Query: 133 ---LDHGNQLIRQINLKPEDCSKSSQSG 157
+D GNQ IR+I L+ +DC+ +SG
Sbjct: 245 LLVIDRGNQAIREIQLRFDDCAYQYESG 272
>gi|11994597|dbj|BAB02652.1| unnamed protein product [Arabidopsis thaliana]
Length = 511
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 76/170 (44%), Gaps = 46/170 (27%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
L E+GY V TV++G+ + + P+ I +L +D S ++ PLS+ S + +
Sbjct: 71 LQFENGYLVETVVEGNDIGVVPYK-IRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLV 129
Query: 86 AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
AG G G++DG+P ARF+ P+ +D KGN+YVAD N IRKI +
Sbjct: 130 AGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKS 189
Query: 133 --------------------------------LDHGNQLIRQINLKPEDC 150
+D GN +RQI+L EDC
Sbjct: 190 NIAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDC 239
>gi|22331093|ref|NP_188104.2| NHL domain-containing protein [Arabidopsis thaliana]
gi|20260314|gb|AAM13055.1| unknown protein [Arabidopsis thaliana]
gi|25083516|gb|AAN72090.1| unknown protein [Arabidopsis thaliana]
gi|332642056|gb|AEE75577.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 492
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 76/170 (44%), Gaps = 46/170 (27%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
L E+GY V TV++G+ + + P+ I +L +D S ++ PLS+ S + +
Sbjct: 71 LQFENGYLVETVVEGNDIGVVPYK-IRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLV 129
Query: 86 AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
AG G G++DG+P ARF+ P+ +D KGN+YVAD N IRKI +
Sbjct: 130 AGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKS 189
Query: 133 --------------------------------LDHGNQLIRQINLKPEDC 150
+D GN +RQI+L EDC
Sbjct: 190 NIAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDC 239
>gi|42572439|ref|NP_974315.1| NHL domain-containing protein [Arabidopsis thaliana]
gi|332642057|gb|AEE75578.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 493
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 76/170 (44%), Gaps = 46/170 (27%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
L E+GY V TV++G+ + + P+ I +L +D S ++ PLS+ S + +
Sbjct: 71 LQFENGYLVETVVEGNDIGVVPYK-IRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLV 129
Query: 86 AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
AG G G++DG+P ARF+ P+ +D KGN+YVAD N IRKI +
Sbjct: 130 AGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKS 189
Query: 133 --------------------------------LDHGNQLIRQINLKPEDC 150
+D GN +RQI+L EDC
Sbjct: 190 NIAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDC 239
>gi|297834352|ref|XP_002885058.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297330898|gb|EFH61317.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 50/171 (29%), Positives = 78/171 (45%), Gaps = 48/171 (28%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L E+GY V TV++G+++ + P+ + + G +L +D S ++ PLS+ S +
Sbjct: 67 LQFENGYLVETVVEGNEIGVVPYKIRVSHDG--ELYAVDELNSNIMKITPPLSQYSRGRL 124
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
+AG G G++DG+P ARF+ P+ +D KGN+YV D N IRKI +
Sbjct: 125 VAGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVGDTLNLAIRKIGDSGVTTIAGGK 184
Query: 133 ---------------------------------LDHGNQLIRQINLKPEDC 150
+D GN +RQI+L EDC
Sbjct: 185 SNIAGYRDGPSEDAKFSNDFDVVYVRSTCSLLVIDRGNAALRQISLSDEDC 235
>gi|302768697|ref|XP_002967768.1| hypothetical protein SELMODRAFT_67906 [Selaginella moellendorffii]
gi|300164506|gb|EFJ31115.1| hypothetical protein SELMODRAFT_67906 [Selaginella moellendorffii]
Length = 186
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 64/113 (56%), Gaps = 2/113 (1%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
L+ + +L+E G+TV T+ D + N ++ P + ++ LDSS + + L PLS+ S
Sbjct: 7 LLKTFVLVEPGFTVKTLFDLGKHSANAFALYPDPRNKFVLALDSSGNRIWKLRLPLSQNS 66
Query: 81 VVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ AG G GY DG + F++P+S A+ G ++VAD N IRKI+
Sbjct: 67 SLEAFAGSFVGESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKIS 119
>gi|302821461|ref|XP_002992393.1| hypothetical protein SELMODRAFT_47902 [Selaginella moellendorffii]
gi|300139809|gb|EFJ06543.1| hypothetical protein SELMODRAFT_47902 [Selaginella moellendorffii]
Length = 183
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 63/113 (55%), Gaps = 2/113 (1%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
L+ + +L+E G+TV T+ D + N ++ P ++ LDSS + + L PLS+ S
Sbjct: 7 LLKTFVLVEPGFTVKTLFDLGKHSANAFALYPDPRKKFVLALDSSGNRIWKLRLPLSQNS 66
Query: 81 VVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ AG G GY DG + F++P+S ++ G ++VAD N IRKI+
Sbjct: 67 SLEAFAGSFVGESGYVDGPAAKSLFNRPQSLSICDNGAVFVADTRNLAIRKIS 119
>gi|414870491|tpg|DAA49048.1| TPA: hypothetical protein ZEAMMB73_516870 [Zea mays]
Length = 271
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 79/165 (47%), Gaps = 25/165 (15%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GYTV ++D +L I P+S+ G S+++++DS+ Y+++ PLS +S K +AG
Sbjct: 105 ECGYTVEKILDYSKLGIEPYSMEVTQG-SEILIMDSN---IYSMALPLSRDSRPKLVAGS 160
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
+G G+ DG R PK VD KG D + + T+
Sbjct: 161 LEGFPGHVDGNLREGRIYHPKGVTVDDKGRGGQVDGPSDEAKSSTDFEVCYYIGSSCSLL 220
Query: 133 -LDHGNQLIRQINLKPEDCSKSSQS----GSGMFWVTVFSPYLIS 172
+D GNQ IR+I L +DC ++ G + V F Y+++
Sbjct: 221 VIDRGNQTIREIQLHFDDCVYQHEADFPLGVALLAVAAFLGYMLA 265
>gi|124002516|ref|ZP_01687369.1| hypothetical protein M23134_05219 [Microscilla marina ATCC 23134]
gi|123992345|gb|EAY31713.1| hypothetical protein M23134_05219 [Microscilla marina ATCC 23134]
Length = 2385
Score = 63.9 bits (154), Expect = 3e-08, Method: Composition-based stats.
Identities = 30/62 (48%), Positives = 40/62 (64%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
V +AG GV G+ D A+F+ P A+DM GN+YVADK+NH IR ITN G+ ++R
Sbjct: 631 VSIVAGSGVAGFKDDNGSLAQFNYPSGLALDMAGNLYVADKNNHAIRMITNPSGGSPVVR 690
Query: 142 QI 143
I
Sbjct: 691 TI 692
Score = 57.4 bits (137), Expect = 3e-06, Method: Composition-based stats.
Identities = 32/82 (39%), Positives = 45/82 (54%), Gaps = 5/82 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-----GVQGYSDGEPGSARFDKPKSFA 110
+ +L V D + A ++ P VV+ +AG+ V G G A+F++P A
Sbjct: 663 AGNLYVADKNNHAIRMITNPSGGSPVVRTIAGNSSYPTAVSGNVTGALAVAKFNEPSGVA 722
Query: 111 VDMKGNIYVADKSNHVIRKITN 132
VD GNIYVADK+NH I+KI N
Sbjct: 723 VDAAGNIYVADKNNHRIKKIAN 744
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 86 AGD---GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
AGD G+ G+ DG SA+F P V+ G++Y+AD N VIRKI+
Sbjct: 812 AGDVANGIAGHQDGNAASAKFRSPAGITVNAVGDVYIADTHNQVIRKIS 860
Score = 41.6 bits (96), Expect = 0.16, Method: Composition-based stats.
Identities = 23/65 (35%), Positives = 38/65 (58%), Gaps = 9/65 (13%)
Query: 71 TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRK 129
TL+ P+++ + + G +DG +ARF P S A+D+ G +YVADK N++IR+
Sbjct: 749 TLAGPMNDAA--------SIPGRTDGAADAARFFFPTSVALDITGAQLYVADKLNNIIRQ 800
Query: 130 ITNLD 134
+ D
Sbjct: 801 VNTAD 805
>gi|325102830|ref|YP_004272484.1| cell surface receptor IPT/TIG domain protein [Pedobacter saltans
DSM 12145]
gi|324971678|gb|ADY50662.1| cell surface receptor IPT/TIG domain protein [Pedobacter saltans
DSM 12145]
Length = 491
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 46/84 (54%), Gaps = 4/84 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P+ V P + DLIV D + A +S + +V ++AG G G DG+ A F+ P
Sbjct: 170 PNGVSFDPTTGDLIVADRTAQAIKRIS----KAGIVTKIAGTGTAGRVDGDISIASFNNP 225
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
AVD GNIYVAD NH IRKI
Sbjct: 226 YKTAVDKYGNIYVADNGNHRIRKI 249
Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 30/43 (69%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
GY +G PG +RF P AVD +GN+++AD +N+VIRKI +
Sbjct: 443 NGYVNGLPGISRFYNPTDVAVDEEGNVFIADMNNYVIRKIVKI 485
Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 3/87 (3%)
Query: 46 NPH-SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
NP+ + +D+ G ++ V D+ + + S + AG GY+DG + +
Sbjct: 224 NPYKTAVDKYG--NIYVADNGNHRIRKIDLSTNTVSTIAGGAGAATSGYTDGIGANGLLN 281
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
P AVD N+YVAD +NH +RKIT
Sbjct: 282 TPTGLAVDDDLNVYVADAANHAVRKIT 308
>gi|223935789|ref|ZP_03627704.1| NHL repeat containing protein [bacterium Ellin514]
gi|223895390|gb|EEF61836.1| NHL repeat containing protein [bacterium Ellin514]
Length = 755
Score = 62.4 bits (150), Expect = 8e-08, Method: Composition-based stats.
Identities = 34/68 (50%), Positives = 43/68 (63%), Gaps = 3/68 (4%)
Query: 67 SAFYTLSF-PLSEES--VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
+ F +L+F PLS V LAG+ QG +DG SARF+ P AVD GN+YVAD +
Sbjct: 9 TVFLSLAFVPLSHSQPLTVSTLAGNAGQGSADGNNSSARFNLPGGVAVDKTGNLYVADTA 68
Query: 124 NHVIRKIT 131
NH IRKI+
Sbjct: 69 NHTIRKIS 76
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 34/56 (60%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S V LAG G +DG GSA+F PK AVD GN++V+D NH IRKIT
Sbjct: 343 ISSGGAVTTLAGSASIGNADGTGGSAKFYWPKGTAVDASGNVFVSDTFNHTIRKIT 398
Score = 53.5 bits (127), Expect = 5e-05, Method: Composition-based stats.
Identities = 32/81 (39%), Positives = 47/81 (58%), Gaps = 8/81 (9%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSF 109
+D+ G +L V D++ +S VV AG GV G ++G+ +ARF++P+
Sbjct: 56 VDKTG--NLYVADTANHTIRKIS-----GGVVSTFAGLAGVSGSANGKGSAARFNQPQGV 108
Query: 110 AVDMKGNIYVADKSNHVIRKI 130
AVD G +YVAD NH+IRKI
Sbjct: 109 AVDTNGIVYVADTGNHIIRKI 129
Score = 50.1 bits (118), Expect = 4e-04, Method: Composition-based stats.
Identities = 30/77 (38%), Positives = 45/77 (58%), Gaps = 6/77 (7%)
Query: 62 LDSSRSAFYTLSF-----PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKG 115
+D+S + F + +F ++ V LAG G G ++G G A+F P+ AVD G
Sbjct: 378 VDASGNVFVSDTFNHTIRKITAAGTVSTLAGTAGSSGTNNGVGGGAQFYAPQGIAVDTGG 437
Query: 116 NIYVADKSNHVIRKITN 132
N YVAD +N+VIRK+T+
Sbjct: 438 NAYVADTANNVIRKVTS 454
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 24/51 (47%), Positives = 33/51 (64%), Gaps = 1/51 (1%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG GV+G DG +A+F P++ A+D N+YV+D NH IRKI+
Sbjct: 458 VTTLAGTAGVEGQGDGTGSNAQFSGPQAVALDGAANVYVSDTGNHTIRKIS 508
Score = 45.4 bits (106), Expect = 0.010, Method: Composition-based stats.
Identities = 20/32 (62%), Positives = 22/32 (68%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ARF P AVD GN+YVAD NH IRKIT
Sbjct: 537 TARFYSPSGLAVDSSGNVYVADTGNHTIRKIT 568
Score = 44.7 bits (104), Expect = 0.019, Method: Composition-based stats.
Identities = 26/85 (30%), Positives = 46/85 (54%), Gaps = 2/85 (2%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
++ + V LAG GV G +DG ARF +P+ ++D +GN++V D NH +R +
Sbjct: 567 ITADGSVSTLAGLPGVWGNADGTNRDARFFQPEGISIDSQGNLFVMDSGNHTMRMLI-AS 625
Query: 135 HGNQLIRQINLKPEDCSKSSQSGSG 159
N ++ I +P+ + +G+G
Sbjct: 626 GTNWIVTTIAGQPDLGGAADGTGNG 650
Score = 40.4 bits (93), Expect = 0.33, Method: Composition-based stats.
Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 1/52 (1%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
V LAG G G ++G +A+F +P AV G +YVAD NH IR +T+
Sbjct: 240 VTTLAGSPGNYGSTNGTGSAAQFYQPMGVAVAANGTVYVADNLNHTIRAVTS 291
>gi|323454387|gb|EGB10257.1| hypothetical protein AURANDRAFT_62887 [Aureococcus anophagefferens]
Length = 434
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 45/79 (56%), Gaps = 13/79 (16%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRL----------AGDGVQGYSDGEPGSARFDKPKSF 109
+ LD++R+ Y S V+RL AGDG +GY+DG+ G ARFD+P +
Sbjct: 247 VCLDAARNCLYVAD---SGNHCVRRLDLASGAVTTVAGDGTRGYADGDAGGARFDEPTAV 303
Query: 110 AVDMKGNIYVADKSNHVIR 128
A+D G +YVAD+ N +R
Sbjct: 304 ALDADGALYVADQENRRVR 322
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 40/87 (45%), Gaps = 7/87 (8%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG-YSDGEPGSARFD 104
P V P + D+ V S ++ + V LAG G +G +DG +A F
Sbjct: 188 GPWGVAAHPATGDVYV-----SEYFGCKIRVIRRGDVTTLAGSGPRGGRADGVGPAASFA 242
Query: 105 KPKSFAVDMKGN-IYVADKSNHVIRKI 130
P+ +D N +YVAD NH +R++
Sbjct: 243 GPRGVCLDAARNCLYVADSGNHCVRRL 269
>gi|373955434|ref|ZP_09615394.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
gi|373892034|gb|EHQ27931.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
Length = 459
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 33/56 (58%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S VV LAG G GY+DG A F P AVD GNIYVAD+ + IRK+T
Sbjct: 243 ISAAGVVTTLAGKGSAGYADGTGADAVFKSPAGLAVDASGNIYVADQGTNTIRKVT 298
Score = 57.4 bits (137), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 34/56 (60%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ V AG G GY DG +A+F PK A D +GNIYVAD N++IRKI+
Sbjct: 189 ITAAGVTTTYAGTGTLGYLDGPAATAQFYAPKGVAADAQGNIYVADMGNNMIRKIS 244
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV LAG G D + ARF P VD GN+YVAD +NH IRK+T
Sbjct: 297 VTSAGVVTTLAGAAASGQVDATTNTDARFSSPSGVTVDASGNVYVADLANHAIRKVT 353
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 69 FYTLSFPLSEESVVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
F T S + + V AG+ G + +G +A+F P+ D +GN+YVAD N+VI
Sbjct: 127 FKTNSATFAITATVSTFAGNANGGGFVNGTGVNAQFHNPQGICTDAQGNMYVADSYNNVI 186
Query: 128 RKIT 131
RKIT
Sbjct: 187 RKIT 190
Score = 43.9 bits (102), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
Query: 78 EESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+++ LAG G G+++G +A F+ P++ +D +GNIYV D N++IRKI
Sbjct: 400 TTNIIYSLAGVAGTSGFANGTNINALFNGPQALTLDSQGNIYVVDYYNNMIRKI 453
>gi|255531263|ref|YP_003091635.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
gi|255344247|gb|ACU03573.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
Length = 646
Score = 60.5 bits (145), Expect = 4e-07, Method: Composition-based stats.
Identities = 27/52 (51%), Positives = 36/52 (69%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+V AG G G +G SARFD P+ A+D GN+Y+ADK+N+VIRKIT+
Sbjct: 327 LVSNYAGTGTAGLVNGAKASARFDSPEGLAIDASGNMYIADKNNNVIRKITS 378
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 27/56 (48%), Positives = 36/56 (64%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV +AG+G GY++G +A+F P VD GNIYVAD+ NH IRKIT
Sbjct: 483 ITAAGVVTTIAGNGTAGYTNGTGTAAQFKNPSGVDVDASGNIYVADRLNHRIRKIT 538
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 34/56 (60%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ V AG GV G++DG A+F+ P AVD GN+YVAD+ N IRKIT
Sbjct: 376 ITSTGTVSTFAGTGVAGFADGAGSIAKFNGPWKVAVDATGNVYVADRDNFKIRKIT 431
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ +V LAG GY+DG +A+F +P A+D GN+ VAD ++H IRKIT
Sbjct: 430 ITPAGIVSTLAGS-TAGYADGTGSAAKFMQPLDVAIDPSGNVIVADNTSHRIRKIT 484
Score = 39.3 bits (90), Expect = 0.88, Method: Composition-based stats.
Identities = 26/57 (45%), Positives = 35/57 (61%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ VV LAG G G +DG GSA+F P VD+ GN+YVAD + IRKI++
Sbjct: 537 ITTSGVVSSLAGTGTSGTTDGAAGSAKFSDPYGVTVDVSGNVYVADLISSRIRKISS 593
>gi|325104746|ref|YP_004274400.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
gi|324973594|gb|ADY52578.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
Length = 1769
Score = 60.1 bits (144), Expect = 4e-07, Method: Composition-based stats.
Identities = 47/135 (34%), Positives = 67/135 (49%), Gaps = 15/135 (11%)
Query: 52 DRPGSSDL------IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGS 100
D PG+S + +V+D+S + + + S ++ V +AG GVQG DG
Sbjct: 1098 DGPGTSAMFNSPQAMVMDASDNIYVSESSNHRIRKITPAGEVSTVAGKGVQGNRDGTKEE 1157
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--LDHGNQLIRQINLKPEDCSKSSQSGS 158
A+F P A+DM GNIYVA+ SNH IRKIT G I + + P S+ GS
Sbjct: 1158 AQFWGPVGIALDMSGNIYVAEWSNHRIRKITQGITLTGTPGITDVGINPVALKASNTVGS 1217
Query: 159 G--MFWVTVFSPYLI 171
F +TV P ++
Sbjct: 1218 TSQQFDITVSGPAIL 1232
Score = 55.8 bits (133), Expect = 8e-06, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 35/56 (62%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ V AG+G GY+DG SA F+ P++ +D NIYV++ SNH IRKIT
Sbjct: 1079 ITPNGTVSTFAGNGTAGYADGPGTSAMFNSPQAMVMDASDNIYVSESSNHRIRKIT 1134
Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 36/56 (64%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + +V +AG G G S+G +A F+KP A+D GNIY+AD NH IRKIT
Sbjct: 972 ITPQGLVSTIAGSGTNGNSNGVGTAASFNKPTGIAIDPWGNIYIADDLNHSIRKIT 1027
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ V AG+G GY+DG ARF++P D GN+YV++ SN+ IRKIT
Sbjct: 1026 ITPNGTVSTFAGNGTAGYADGVGVLARFNRPTGIITDALGNVYVSESSNY-IRKIT 1080
Score = 46.2 bits (108), Expect = 0.006, Method: Composition-based stats.
Identities = 22/60 (36%), Positives = 36/60 (60%), Gaps = 8/60 (13%)
Query: 80 SVVKRLAGDG--------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
S+++++ DG GY+DG +A+F+ P+ A+D GNIY+A++ IRKIT
Sbjct: 914 SLIRKITADGQVSTFAGSTFGYADGIGTAAKFNGPQGMAIDASGNIYIAERGASRIRKIT 973
>gi|405345633|ref|ZP_11022426.1| hypothetical protein A176_0011 [Chondromyces apiculatus DSM 436]
gi|397093682|gb|EJJ24379.1| hypothetical protein A176_0011 [Myxococcus sp. (contaminant ex DSM
436)]
Length = 870
Score = 60.1 bits (144), Expect = 4e-07, Method: Composition-based stats.
Identities = 36/87 (41%), Positives = 49/87 (56%), Gaps = 5/87 (5%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES-VVKRLAGDG-VQGYSDGEPGSARF 103
+P +V+ P + +L V DS L EE V+ AG G V G++DG P ARF
Sbjct: 611 SPTAVVAGP-TGELYVADSGNHVIRRLD--RGEEGWQVRTWAGQGFVAGFADGGPARARF 667
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI 130
+P + AVD GN+YVAD+ NH IR +
Sbjct: 668 SRPMALAVDAAGNVYVADQDNHRIRMV 694
Score = 43.1 bits (100), Expect = 0.053, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 29/58 (50%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQL 139
V +AG G G+ DG A P + G +YVAD NHVIR++ + G Q+
Sbjct: 588 VSTIAGSGELGHRDGAGSQALLRSPTAVVAGPTGELYVADSGNHVIRRLDRGEEGWQV 645
>gi|431798813|ref|YP_007225717.1| NHL repeat protein [Echinicola vietnamensis DSM 17526]
gi|430789578|gb|AGA79707.1| NHL repeat protein [Echinicola vietnamensis DSM 17526]
Length = 655
Score = 60.1 bits (144), Expect = 4e-07, Method: Composition-based stats.
Identities = 23/48 (47%), Positives = 37/48 (77%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+AG G+ G+ +G +A+FD P+ A+D +GNI+VAD+ N+VIRKI++
Sbjct: 338 IAGTGIAGFQNGSSSTAKFDNPEGIAIDQQGNIFVADRDNNVIRKISS 385
Score = 59.7 bits (143), Expect = 6e-07, Method: Composition-based stats.
Identities = 27/56 (48%), Positives = 36/56 (64%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S V AG G+ G++DG G A+F+ P AVD +GN+ VAD+ NH IRKIT
Sbjct: 383 ISSSGDVSTFAGTGIAGFTDGVAGVAKFNSPWKVAVDNQGNVIVADRGNHSIRKIT 438
Score = 49.3 bits (116), Expect = 8e-04, Method: Composition-based stats.
Identities = 23/49 (46%), Positives = 28/49 (57%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V +AG G GY DG A+F P D GN++VAD+ NH IRKI
Sbjct: 496 VSTIAGTGNGGYVDGAGSQAQFYYPSGIDTDPNGNLFVADRKNHAIRKI 544
Score = 46.6 bits (109), Expect = 0.006, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ V LAG GY DG A+FD+P AV GNI +AD NH IR I
Sbjct: 437 ITPNGTVSTLAGT-TNGYQDGSGNQAKFDQPTDVAVLPNGNIVIADNRNHCIRMI 490
Score = 40.4 bits (93), Expect = 0.32, Method: Composition-based stats.
Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 5/101 (4%)
Query: 40 GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
G Q + S ID + +L V D A + S +V G+G +G +G
Sbjct: 512 GSQAQFYYPSGIDTDPNGNLFVADRKNHAIRKID---SYHNVSTVAGGNG-EGIQNGGIA 567
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
A+FD P AV G + VAD N+VIR+I N D+ + +I
Sbjct: 568 VAKFDDPYGVAVGQNGKVLVADLDNNVIREI-NGDYVSTII 607
>gi|430809992|ref|ZP_19437107.1| NHL repeat containing protein [Cupriavidus sp. HMR-1]
gi|429497580|gb|EKZ96110.1| NHL repeat containing protein [Cupriavidus sp. HMR-1]
Length = 536
Score = 59.3 bits (142), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 29/56 (51%), Positives = 35/56 (62%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S V LAG GV G +DG +A F+ P AVD GN+YVAD NH+IRKIT
Sbjct: 263 ISPYGAVTTLAGSGVAGSTDGTGTAASFNLPSGVAVDGAGNVYVADTYNHLIRKIT 318
Score = 57.4 bits (137), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 35/56 (62%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV LAG G G++DG +A F++P AVD GNIYV+D N IRKI+
Sbjct: 371 VTPTGVVTTLAGTGTAGFADGPAQAAAFNRPAGIAVDAAGNIYVSDLGNARIRKIS 426
Score = 43.1 bits (100), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 30/56 (53%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S +VV LAG +G +DG +A F A D GN+Y D ++ +RK+T
Sbjct: 425 ISPANVVTTLAGSTTRGSADGTGAAASFTSLLRIATDAAGNVYAVDAGSNAVRKVT 480
Score = 36.6 bits (83), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 16/26 (61%), Positives = 18/26 (69%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
P AVD GN+YVAD NH IRKI+
Sbjct: 239 PIDVAVDGTGNVYVADYGNHRIRKIS 264
Score = 35.8 bits (81), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 14/32 (43%), Positives = 21/32 (65%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+A F P VD+ GN++VAD NH++R +T
Sbjct: 341 AASFYYPSDIVVDVAGNVFVADTYNHLVRMVT 372
>gi|302791603|ref|XP_002977568.1| hypothetical protein SELMODRAFT_417419 [Selaginella moellendorffii]
gi|300154938|gb|EFJ21572.1| hypothetical protein SELMODRAFT_417419 [Selaginella moellendorffii]
Length = 288
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 39/166 (23%)
Query: 31 GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
G+TV T+ D + +N + P + ++ DSS + + L PLS+ S ++ AG
Sbjct: 38 GFTVKTLFDLGKHSVNAFGLYPDPRNKFMLAPDSSGNRIWKLRLPLSQNSSLEAFAG-WE 96
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL----------------- 133
GY DG + F++P+S A+ G I+VAD + IRKI+ +
Sbjct: 97 SGYIDGPAAKSLFNRPQSLAMCGNGAIFVAD-TRMAIRKISKVTTIAGGSSRKPGIADGT 155
Query: 134 -------------------DHGNQLIRQINL-KPEDCSKSSQSGSG 159
D GN LIR+I + P+ C S + SG
Sbjct: 156 ARFSSEFSLACSCGSLPIADRGNWLIREIQIDDPKSCDSSDSTVSG 201
>gi|255531264|ref|YP_003091636.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
gi|255344248|gb|ACU03574.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
Length = 439
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 38/68 (55%)
Query: 64 SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
S+ +T + E +V AGDG G +G A+F P+ AVD +GN+ VAD+
Sbjct: 101 SAEGPVFTYKAAPASEYIVSTFAGDGTAGLFNGAAADAQFRNPEGLAVDAQGNVIVADRQ 160
Query: 124 NHVIRKIT 131
NH IRKI+
Sbjct: 161 NHTIRKIS 168
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 32/56 (57%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S V LAGDG GY D A+F P VD +GNI VAD+ NH IRKIT
Sbjct: 274 VSPSGTVTTLAGDGTAGYLDATGVQAKFRNPSGLTVDQQGNIIVADRLNHRIRKIT 329
Score = 53.5 bits (127), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 46/86 (53%), Gaps = 7/86 (8%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
NP + +D G ++IV D +S P + V +AGDG GY+DG +A+F
Sbjct: 142 NPEGLAVDAQG--NVIVADRQNHTIRKIS-PAGQ---VTTIAGDGTAGYADGTGTAAKFS 195
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKI 130
P A+D GNI VAD+ N IRKI
Sbjct: 196 SPWKLAIDPLGNIIVADRDNFKIRKI 221
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + V LAG G++DG +A+F +P D +GNIYVAD + H IRK++
Sbjct: 221 IAPDGAVTTLAGS-TAGFADGTGSAAKFMQPLDVVADAQGNIYVADNTAHRIRKVS 275
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 55/104 (52%), Gaps = 11/104 (10%)
Query: 31 GYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIV-LDSSRSAFYTLSFPLSEESVVKRLA 86
G T ++DG L +P+ V G + L+ L ++R T P+ + V LA
Sbjct: 339 GAGTTGLLDGDALTAKFADPYGVAVDAGGNILVAELTNARIRKIT---PVGQ---VSTLA 392
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G G++DG +A+F++P +D KGNIYVA+ +NH IR I
Sbjct: 393 GSSA-GFADGLSVNAKFNQPTDLDIDAKGNIYVAEVTNHRIRMI 435
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 32/56 (57%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ V +AG G G DG+ +A+F P AVD GNI VA+ +N IRKIT
Sbjct: 328 ITPAGTVSTIAGAGTTGLLDGDALTAKFADPYGVAVDAGGNILVAELTNARIRKIT 383
>gi|402814801|ref|ZP_10864394.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
gi|402507172|gb|EJW17694.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
Length = 571
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 28/40 (70%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
QG+ DGE +FD+P A D KGN+YVAD NHVIRKI
Sbjct: 129 QGWEDGEQAQVQFDQPTGLAADKKGNLYVADAGNHVIRKI 168
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 33/49 (67%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V +AGDG+ G+ DG G+ARF P++ AV G IYV D NHVIR+I
Sbjct: 174 VTTVAGDGIPGWKDGAAGTARFHTPRAIAVADDGAIYVTDSLNHVIRRI 222
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 50/117 (42%), Gaps = 13/117 (11%)
Query: 14 LLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS 73
L +F+ S L+ DG ++ + ID S + SS +A YTL
Sbjct: 257 LTEAMFNEPSGIALMNDGKLAVADTGNQRIRL-----IDLKQGSVSTIAGSSSTAGYTLP 311
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V + L G GY DG ++ F+ P A+ + I VAD+ NHV+R I
Sbjct: 312 ------GVQRVLYAPG--GYRDGAASNSIFNSPTGIAITNENGIVVADRWNHVVRYI 360
>gi|455647406|gb|EMF26379.1| hypothetical protein H114_24527 [Streptomyces gancidicus BKS 13-15]
Length = 627
Score = 58.5 bits (140), Expect = 1e-06, Method: Composition-based stats.
Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 18/116 (15%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ P S +V D++R L+ E+V++R G G +G DG PG ARF +P
Sbjct: 186 PGKVLRLPSSGTFLVTDTTRHQVVELA--EDGETVLRRF-GSGGRGLEDGPPGRARFQEP 242
Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFW 162
+ A+ G + VAD NH +R+ L PE ++ +G+G W
Sbjct: 243 QGLALLPDGAVVVADTVNHALRR---------------LDPESGEVTTLAGTGSPW 283
>gi|325104251|ref|YP_004273905.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
gi|324973099|gb|ADY52083.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
Length = 532
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 65/128 (50%), Gaps = 20/128 (15%)
Query: 15 LSLLFSLVSSGLLLEDGYTVTTV--------IDGHQLEI---NPHSV-IDRPGSSDLIVL 62
L+L+ L + + YTVTTV +DG+ L+ NP + +D G D+I+
Sbjct: 6 LTLILMLGTYMVTKSQNYTVTTVAGSTTFGFVDGNGLDARFRNPDGILVDMNG--DIIIT 63
Query: 63 DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK 122
D + + ++ VV LAG GV GY++G+PG +F+ P VD GNI V +K
Sbjct: 64 DRTNHSIR----KMTTAGVVSTLAGTGVSGYANGKPG--QFNTPWQSTVDAAGNIIVVEK 117
Query: 123 SNHVIRKI 130
IRKI
Sbjct: 118 DGARIRKI 125
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 6/106 (5%)
Query: 48 HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK 107
S +D G+ ++ D +R L + V +AG G GYSDG ARFD
Sbjct: 103 QSTVDAAGNIIVVEKDGARIRKIAL------DGTVSTIAGTGSAGYSDGAVSVARFDNAL 156
Query: 108 SFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKS 153
VD GNI++AD++N +RKIT GN ++ D + S
Sbjct: 157 DAVVDSDGNIFIADRNNRRVRKITPGAGGNWTTATVSTVAGDGTTS 202
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 31/55 (56%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
S+ V +AG G+ DG ARF P VDM G+I + D++NH IRK+T
Sbjct: 20 SQNYTVTTVAGSTTFGFVDGNGLDARFRNPDGILVDMNGDIIITDRTNHSIRKMT 74
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 28/51 (54%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V +AG G G DG A F++P D GNIYVAD NH IRK+
Sbjct: 299 ATVTTIAGTGSTGRIDGLGNVATFNQPYDVVTDENGNIYVADNVNHSIRKL 349
>gi|255532869|ref|YP_003093241.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
gi|255345853|gb|ACU05179.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
Length = 2296
Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats.
Identities = 27/51 (52%), Positives = 33/51 (64%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
VV AG GV+G+ DG +A+F P A D GN+YVAD+ NH IRKIT
Sbjct: 642 VVSTFAGSGVKGFKDGTGTAAQFAFPTGLAADAAGNVYVADRDNHRIRKIT 692
Score = 55.8 bits (133), Expect = 9e-06, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 37/56 (66%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV LAG+G ++DG +ARF+ P AVD GN++VAD++N IRK+T
Sbjct: 691 ITAAGVVTTLAGNGTADFADGTQANARFNMPVGVAVDANGNVFVADRNNFRIRKVT 746
Score = 50.8 bits (120), Expect = 3e-04, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 1/51 (1%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
VVKR G + GY++G SA F+ P A+D GN Y+AD+ NH IRKI+
Sbjct: 481 VVKRFVGKDL-GYANGTGTSAMFNSPAGIAMDAAGNTYLADRGNHRIRKIS 530
Score = 40.4 bits (93), Expect = 0.34, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 35/56 (62%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S VV LAG G G++DG +A+F+ P AVD GN+YV+D N +RKIT
Sbjct: 529 ISPAGVVTTLAGSGTAGFADGAGAAAQFNDPSGVAVDAAGNVYVSDTRNFRVRKIT 584
Score = 37.7 bits (86), Expect = 2.1, Method: Composition-based stats.
Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 3/57 (5%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
S+ V AG GV GY +G+ +A+F D N+YV D N+ IRKI
Sbjct: 33 SQAQTVSPYAGTGVDGYYEGDNNDAITAQFSYIGGITTDANNNLYVTDSRNYCIRKI 89
Score = 36.2 bits (82), Expect = 6.1, Method: Composition-based stats.
Identities = 21/50 (42%), Positives = 28/50 (56%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V AG G GY++G +A F A+D GN+YV D+ + IRKIT
Sbjct: 589 VSTFAGTGTYGYANGTTTTATFTYLLGIAIDNSGNLYVTDQDVNSIRKIT 638
>gi|421740691|ref|ZP_16178929.1| NHL repeat protein [Streptomyces sp. SM8]
gi|406690906|gb|EKC94689.1| NHL repeat protein [Streptomyces sp. SM8]
Length = 444
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 47/76 (61%), Gaps = 3/76 (3%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S + +V D++R L+ ESVV+R+ G+G +G +DG P ARF +P+ A+ +G
Sbjct: 16 SGNFLVSDTTRHQLVELAE--DGESVVRRIGGEGERGLADGGPAEARFSEPQGLALTPEG 73
Query: 116 N-IYVADKSNHVIRKI 130
+ + VAD NH +R++
Sbjct: 74 DAVIVADTVNHALRRV 89
>gi|348676369|gb|EGZ16187.1| hypothetical protein PHYSODRAFT_506757 [Phytophthora sojae]
Length = 1981
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 39/63 (61%), Gaps = 2/63 (3%)
Query: 70 YTLSFPLS--EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
Y+ +F S + V +AG+G G+ DG +ARF+ P+ AVD G +YVAD +NH I
Sbjct: 190 YSAAFGASGRAQGRVTTVAGEGTAGFLDGPAATARFNHPRGVAVDSNGVVYVADTANHRI 249
Query: 128 RKI 130
RKI
Sbjct: 250 RKI 252
Score = 50.8 bits (120), Expect = 3e-04, Method: Composition-based stats.
Identities = 21/39 (53%), Positives = 27/39 (69%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G +DG+P ARFD P AVD G ++VAD NH+IR+I
Sbjct: 344 GLADGDPLGARFDSPMGVAVDADGVVFVADTGNHLIRRI 382
Score = 43.9 bits (102), Expect = 0.033, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 6/56 (10%)
Query: 81 VVKRLAGDGVQGYSDGEP-GSARFDKPKSFAV-DMKG----NIYVADKSNHVIRKI 130
+V LAGDG++G++DG +ARF P AV + G ++VAD NH IR+I
Sbjct: 258 MVSTLAGDGIEGFADGAALSAARFSYPSDVAVLETNGGATVTVFVADTGNHRIRQI 313
Score = 40.0 bits (92), Expect = 0.43, Method: Composition-based stats.
Identities = 24/71 (33%), Positives = 34/71 (47%), Gaps = 5/71 (7%)
Query: 66 RSAFYTLSFPLSEESV-----VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
R Y F + E+V V LAG V G+ DGE A F+ P A G +YVA
Sbjct: 450 RRVTYDGGFASTIETVTSTNRVVTLAGSNVPGHIDGEGNEATFNAPAGVAFAADGRVYVA 509
Query: 121 DKSNHVIRKIT 131
++ +R+I+
Sbjct: 510 SSTDCTVRQIS 520
>gi|344924046|ref|ZP_08777507.1| NHL repeat containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 368
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 60 IVLDSSRSAFYTLSFP-----LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
+ +DSS + + T ++ ++ V LAG GY++G SARF P AVD
Sbjct: 222 VAVDSSGTVYVTDTYTQRVRKITSGGVTSLLAGSNTIGYAEGTGASARFSSPYGIAVDSS 281
Query: 115 GNIYVADKSNHVIRKITN 132
G YVAD NH IRKIT+
Sbjct: 282 GTAYVADSDNHRIRKITS 299
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ V LAG QGY++G ARFD+P S AVD G +YVAD N IR+IT+
Sbjct: 136 ITSAGVTSLLAGS-AQGYAEGTGSGARFDRPYSVAVDSSGTVYVADFFNSRIRRITS 191
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+V LAG GY++G SARF+ P AV G IYVAD +NH IR I++
Sbjct: 34 IVSTLAGS-TGGYAEGTGASARFNYPYGIAVHSSGTIYVADSANHRIRSISS 84
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 52 DRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
DRP S + +DSS + + F ++ V LAG GY +G G+A+F P
Sbjct: 163 DRPYS---VAVDSSGTVYVADFFNSRIRRITSAGVTSTLAGSSTGGYLEGTGGAAQFGTP 219
Query: 107 KSFAVDMKGNIYVADKSNHVIRKITN 132
AVD G +YV D +RKIT+
Sbjct: 220 IDVAVDSSGTVYVTDTYTQRVRKITS 245
Score = 40.4 bits (93), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
SA+F+ P AVD G +YV++ +NH IRKIT+
Sbjct: 106 SAQFNNPYGVAVDSSGTLYVSEYTNHRIRKITS 138
Score = 39.3 bits (90), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+F+ P VD G++Y+AD SNH+IRKI L
Sbjct: 323 QFNYPAGIDVDSSGSLYIADSSNHLIRKIDAL 354
>gi|302848538|ref|XP_002955801.1| hypothetical protein VOLCADRAFT_119185 [Volvox carteri f.
nagariensis]
gi|300258994|gb|EFJ43226.1| hypothetical protein VOLCADRAFT_119185 [Volvox carteri f.
nagariensis]
Length = 1149
Score = 57.4 bits (137), Expect = 3e-06, Method: Composition-based stats.
Identities = 29/71 (40%), Positives = 42/71 (59%), Gaps = 9/71 (12%)
Query: 73 SFPLSEESVVKRLAGDG---------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
S +S++ ++R+A DG G++DG GSARF + AVD GNI+VAD S
Sbjct: 241 SLVVSDQHCLRRVASDGFVTTIAGGTTPGHADGPSGSARFYNLRGVAVDGDGNIFVADSS 300
Query: 124 NHVIRKITNLD 134
NH +R++T D
Sbjct: 301 NHCVRQLTAAD 311
Score = 41.6 bits (96), Expect = 0.16, Method: Composition-based stats.
Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 5/78 (6%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
G ++ V DSS L+ + +S+V L G G G+ DG ARF P AV++
Sbjct: 290 GDGNIFVADSSNHCVRQLT---AADSIVSTLVGSPGNAGFRDGTGTDARFRNPCGAAVNL 346
Query: 114 K-GNIYVADKSNHVIRKI 130
+ G + VAD N+ +RK+
Sbjct: 347 QDGTLVVADAGNNRLRKV 364
>gi|291452573|ref|ZP_06591963.1| NHL repeat containing protein [Streptomyces albus J1074]
gi|291355522|gb|EFE82424.1| NHL repeat containing protein [Streptomyces albus J1074]
Length = 613
Score = 57.4 bits (137), Expect = 3e-06, Method: Composition-based stats.
Identities = 28/76 (36%), Positives = 47/76 (61%), Gaps = 3/76 (3%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S + +V D++R L+ ESVV+R+ G+G +G +DG P ARF +P+ A+ +G
Sbjct: 185 SGNFLVSDTTRHQLVELA--EDGESVVRRIGGEGERGLADGGPAEARFSEPQGLALTPEG 242
Query: 116 N-IYVADKSNHVIRKI 130
+ + VAD NH +R++
Sbjct: 243 DAVIVADTVNHALRRV 258
Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats.
Identities = 25/77 (32%), Positives = 36/77 (46%), Gaps = 6/77 (7%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVD 112
G + L V DS SA + E V + G G+ G+ DGE A F P + V
Sbjct: 354 GDARLWVADSETSALRWVD----PEGTVHTVVGTGLFDFGHRDGEAAQALFQHPLAVTVL 409
Query: 113 MKGNIYVADKSNHVIRK 129
G++ V+D NH +R+
Sbjct: 410 PDGSVAVSDTYNHALRR 426
>gi|409204160|ref|ZP_11232357.1| putative outer membrane adhesin-like protein [Pseudoalteromonas
flavipulchra JG1]
Length = 2380
Score = 56.6 bits (135), Expect = 4e-06, Method: Composition-based stats.
Identities = 39/102 (38%), Positives = 52/102 (50%), Gaps = 3/102 (2%)
Query: 32 YTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYT-LSFPLSEESVVKRLAGDG 89
+T T V + + +S+ P D+ V S+ + + LS S E+ V LAG
Sbjct: 489 FTSTAVTASTEDSVYKYSIATSDPDKDDVTVSVSASTPLPSWLSLNTSTEATVSTLAGQS 548
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GY+DG +A F P A D N+YVAD SNHVIRKIT
Sbjct: 549 -SGYADGTGTAASFKSPYDVATDSNNNVYVADYSNHVIRKIT 589
Score = 56.2 bits (134), Expect = 6e-06, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 36/56 (64%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ E VV LAG G G +G +A F+ PK+ VD GN+YVAD SN+ IRKIT
Sbjct: 588 ITPEGVVTTLAGSGTAGSDEGTGSAASFNFPKAVTVDASGNVYVADSSNNKIRKIT 643
Score = 49.3 bits (116), Expect = 7e-04, Method: Composition-based stats.
Identities = 27/77 (35%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
++DL + D + A ++ SVV AG G G ++G A F+KP A+D
Sbjct: 735 NNDLFIADRNNHAIRKVT----SASVVTTFAGTGSAGSANGTGTQASFNKPYDIALDSAD 790
Query: 116 NIYVADKSNHVIRKITN 132
N+YV++++ H IRKIT+
Sbjct: 791 NLYVSEQTGHTIRKITS 807
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 27/57 (47%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV AG GV G +DG ARF +P AVD G +YVAD NH IRKI+
Sbjct: 805 ITSAGVVTTFAGSAGVSGSTDGLASVARFSQPFGIAVDSNGIVYVADTGNHRIRKIS 861
Score = 45.8 bits (107), Expect = 0.009, Method: Composition-based stats.
Identities = 25/77 (32%), Positives = 39/77 (50%), Gaps = 4/77 (5%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
S ++ V DSS + ++ VV AG G G +DG +A F P +D
Sbjct: 625 ASGNVYVADSSNNKIRKIT----PAGVVTTFAGSGSPGSTDGTGTAATFAAPTGITIDSN 680
Query: 115 GNIYVADKSNHVIRKIT 131
GN++V + + H++RKIT
Sbjct: 681 GNLFVVETNPHIVRKIT 697
Score = 41.6 bits (96), Expect = 0.17, Method: Composition-based stats.
Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 6/79 (7%)
Query: 60 IVLDSSRSAFYTLSFP-----LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
I +DS+ + F + P ++ VV AG G G++D SA+F+ P +
Sbjct: 675 ITIDSNGNLFVVETNPHIVRKITPAGVVTTFAGSKGSSGFTDATGTSAKFNFPYNGGSSS 734
Query: 114 KGNIYVADKSNHVIRKITN 132
++++AD++NH IRK+T+
Sbjct: 735 NNDLFIADRNNHAIRKVTS 753
>gi|392540748|ref|ZP_10287885.1| Ig family protein [Pseudoalteromonas piscicida JCM 20779]
Length = 2384
Score = 56.6 bits (135), Expect = 5e-06, Method: Composition-based stats.
Identities = 38/102 (37%), Positives = 48/102 (47%), Gaps = 3/102 (2%)
Query: 32 YTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYT--LSFPLSEESVVKRLAGDG 89
+T T V + + +SV D + + S S LS S E+ V LAG
Sbjct: 493 FTSTAVTASTEDSVYKYSVTTSDPDKDAVTVSVSASTPLPSWLSLNTSTEATVSTLAGQS 552
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GY+DG +A F P D GN+YVAD NHVIRKIT
Sbjct: 553 -SGYADGTGTAASFKSPYDLVTDSNGNVYVADYGNHVIRKIT 593
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/56 (48%), Positives = 36/56 (64%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ E VV LAG G G DG +A F+ PK+ +D GN+YVAD SN+ IRK+T
Sbjct: 592 ITPEGVVTTLAGSGSAGSDDGTGSAASFNFPKAVTLDSSGNVYVADSSNNNIRKVT 647
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 4/77 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
++DL + D + A ++ SVV AG G G ++G A F+KP A D
Sbjct: 739 NNDLFIADRNNHAIRKVT----SASVVTTFAGTGSAGSTNGTGTQASFNKPYDVAADSAD 794
Query: 116 NIYVADKSNHVIRKITN 132
N+YV +++ H IRKIT+
Sbjct: 795 NLYVTEQAAHTIRKITS 811
Score = 46.2 bits (108), Expect = 0.006, Method: Composition-based stats.
Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 4/76 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S ++ V DSS + ++ VV AG G G DG +A F P +D G
Sbjct: 630 SGNVYVADSSNNNIRKVT----PAGVVTTFAGSGTYGSDDGTGTAATFAAPTGITIDSNG 685
Query: 116 NIYVADKSNHVIRKIT 131
N+YV + + H++RKIT
Sbjct: 686 NLYVVETNPHIVRKIT 701
Score = 43.9 bits (102), Expect = 0.032, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV AG G G +DG ARF +P AVD +YVAD NH IRKI+
Sbjct: 809 ITSTGVVTTYAGSAGASGNTDGLVSVARFSQPYGIAVDSNDVVYVADTGNHRIRKIS 865
Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats.
Identities = 22/79 (27%), Positives = 42/79 (53%), Gaps = 6/79 (7%)
Query: 60 IVLDSSRSAFYTLSFP-----LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
I +DS+ + + + P ++ VV AG G++D SA F+ P + +
Sbjct: 679 ITIDSNGNLYVVETNPHIVRKITPAGVVTTFAGSKNSSGFTDATGTSATFNFPYNGGSNS 738
Query: 114 KGNIYVADKSNHVIRKITN 132
++++AD++NH IRK+T+
Sbjct: 739 NNDLFIADRNNHAIRKVTS 757
>gi|340788705|ref|YP_004754170.1| hypothetical protein CFU_3523 [Collimonas fungivorans Ter331]
gi|340553972|gb|AEK63347.1| hypothetical protein CFU_3523 [Collimonas fungivorans Ter331]
Length = 394
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 43/77 (55%), Gaps = 5/77 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
+ ++ V DS T+S VV LAG G G +DG ARF++P+ A D
Sbjct: 297 AGNIRVSDSGNQLIRTVSL----TGVVSTLAGAAGTAGSTDGSGNKARFNQPEGIAADAA 352
Query: 115 GNIYVADKSNHVIRKIT 131
NIYVAD SN++IRK+T
Sbjct: 353 NNIYVADTSNNLIRKVT 369
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV +AG G G +DG G ARF++P A D GN+YV D N +RKIT
Sbjct: 137 ITSAGVVTTIAGSAGQAGQNDGTAGDARFNQPWGVARDGAGNLYVTDTGNATVRKIT 193
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 28/40 (70%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G SDG + RF +P + AVD GN++VAD +N+ IRKIT
Sbjct: 44 GSSDGSGSNVRFSQPGAIAVDAGGNLFVADTANNTIRKIT 83
Score = 45.1 bits (105), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
ARF++P AVD GN+Y+AD NH IR IT
Sbjct: 108 ARFNQPGGIAVDAGGNLYLADTQNHTIRMIT 138
Score = 36.2 bits (82), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
ARF++P A+D GNI V+D N +IR ++
Sbjct: 284 ARFNQPYGIALDSAGNIRVSDSGNQLIRTVS 314
>gi|124008176|ref|ZP_01692873.1| hypothetical protein M23134_05970 [Microscilla marina ATCC 23134]
gi|123986275|gb|EAY26097.1| hypothetical protein M23134_05970 [Microscilla marina ATCC 23134]
Length = 2548
Score = 56.6 bits (135), Expect = 5e-06, Method: Composition-based stats.
Identities = 26/51 (50%), Positives = 34/51 (66%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V +AGDG G + G SARF +P AVD GN+YVADK+NH I+K+
Sbjct: 715 ATVTTVAGDGTSGDAIGASTSARFREPSDVAVDFSGNLYVADKNNHKIKKV 765
Score = 50.1 bits (118), Expect = 4e-04, Method: Composition-based stats.
Identities = 34/89 (38%), Positives = 45/89 (50%), Gaps = 9/89 (10%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG---YSDGEPGSAR 102
P S+ +DR G +L V D + ++ + AGD Q Y DG SAR
Sbjct: 801 PTSIALDRSG--NLFVADRHNNLIRQVA---TSSGATSTYAGDISQTNALYVDGAAASAR 855
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
F+ P VDM G++YVAD N VIRKI+
Sbjct: 856 FNHPTGITVDMVGDVYVADTRNQVIRKIS 884
Score = 46.6 bits (109), Expect = 0.005, Method: Composition-based stats.
Identities = 27/58 (46%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 79 ESVVKRLAGD-GVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
+S V +AG G G + G+ ARF+ P A+D G +YVADK NHVIR I N D
Sbjct: 655 DSTVSIVAGSVGASGLVNDPVGTNARFNHPSGLAIDDAGILYVADKDNHVIRAIANPD 712
Score = 41.6 bits (96), Expect = 0.14, Method: Composition-based stats.
Identities = 18/39 (46%), Positives = 27/39 (69%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G +DG ARF P S A+D GN++VAD+ N++IR++
Sbjct: 787 GATDGTASIARFFFPTSIALDRSGNLFVADRHNNLIRQV 825
Score = 38.1 bits (87), Expect = 1.7, Method: Composition-based stats.
Identities = 15/39 (38%), Positives = 26/39 (66%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+G ++G +A+F+ P S D++ NIYV DK N ++R+
Sbjct: 898 RGLTNGTAQAAKFNYPGSVYADLEQNIYVGDKVNQLVRR 936
>gi|223939820|ref|ZP_03631690.1| NHL repeat containing protein [bacterium Ellin514]
gi|223891508|gb|EEF57999.1| NHL repeat containing protein [bacterium Ellin514]
Length = 805
Score = 55.8 bits (133), Expect = 7e-06, Method: Composition-based stats.
Identities = 33/82 (40%), Positives = 49/82 (59%), Gaps = 4/82 (4%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSF 109
ID+ G +L+V D+ S ++ P+ VV LAG G +DG ARF+ P+
Sbjct: 358 IDKAG--NLLVADTQNSEIRKIA-PVGTNYVVTTLAGFAQNVGGADGTNAVARFNSPRGI 414
Query: 110 AVDMKGNIYVADKSNHVIRKIT 131
AVD GN++VAD++N+ IRK+T
Sbjct: 415 AVDAAGNVFVADQNNNTIRKLT 436
Score = 53.1 bits (126), Expect = 6e-05, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 2/78 (2%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMK 114
S L V DSS ++ ++ P+ VV +AG V G +DG G A F++P AVD
Sbjct: 187 SGHLFVGDSSNNSIRKMT-PVGTNWVVTTIAGSSPVSGSNDGTNGFAHFNQPCGLAVDAA 245
Query: 115 GNIYVADKSNHVIRKITN 132
G+I+VAD N IRKIT+
Sbjct: 246 GSIFVADYFNSTIRKITS 263
Score = 50.4 bits (119), Expect = 3e-04, Method: Composition-based stats.
Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 2/78 (2%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
+ ++ V D + + L+ P+ V +AG G+ Y DG +A F+ P AVD
Sbjct: 418 AAGNVFVADQNNNTIRKLT-PVGTNWAVTTIAGQAGMAYYGDGNGTNAYFNYPAGIAVDA 476
Query: 114 KGNIYVADKSNHVIRKIT 131
GN++V D NHVIRK+T
Sbjct: 477 SGNLFVTDAGNHVIRKLT 494
Score = 49.3 bits (116), Expect = 8e-04, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 30/46 (65%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG G +DG ARF+ P++ A D GNIYVAD NHVIRK+
Sbjct: 42 IAGVASIGSNDGTNRVARFNNPEATASDSAGNIYVADSGNHVIRKM 87
Score = 46.2 bits (108), Expect = 0.006, Method: Composition-based stats.
Identities = 25/54 (46%), Positives = 34/54 (62%), Gaps = 1/54 (1%)
Query: 81 VVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
VV +AG G G +DG +A+F P + AVD GN++VAD +N+ IRKI L
Sbjct: 95 VVTTIAGLPGNPGSADGTNSAAQFYYPAAVAVDNSGNVFVADSANYTIRKIAPL 148
Score = 46.2 bits (108), Expect = 0.006, Method: Composition-based stats.
Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 2/77 (2%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMK 114
S ++ V DS+ ++ PL VV +AG G G SDG A F P+ A D
Sbjct: 129 SGNVFVADSANYTIRKIA-PLGTNWVVTTIAGVPGYHGSSDGTNTDALFFYPEGLAFDSS 187
Query: 115 GNIYVADKSNHVIRKIT 131
G+++V D SN+ IRK+T
Sbjct: 188 GHLFVGDSSNNSIRKMT 204
Score = 43.5 bits (101), Expect = 0.039, Method: Composition-based stats.
Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 1/52 (1%)
Query: 81 VVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+V +AG GV ++G +A F+ P + AVD GN++VAD N+ IRK+T
Sbjct: 269 LVTTIAGKVGVADSAEGTGTNAVFNYPHAVAVDTNGNVFVADSENYTIRKLT 320
Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats.
Identities = 18/42 (42%), Positives = 24/42 (57%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
G+ DG ARF P A+D +YV D+ N+ IRK+T L
Sbjct: 630 GFMDGTNSDARFAYPTGIAIDTNDTLYVTDQGNNTIRKVTPL 671
Score = 40.8 bits (94), Expect = 0.30, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 30/56 (53%), Gaps = 1/56 (1%)
Query: 75 PLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
PL +V LAG G +DG +A F+ P A+D GN++VAD N IRK
Sbjct: 670 PLGTNWMVTTLAGIHASTGSADGAGSAALFNGPFGIAIDKTGNLFVADLQNSSIRK 725
>gi|373488690|ref|ZP_09579354.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
gi|372005635|gb|EHP06271.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
Length = 438
Score = 55.8 bits (133), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 44/77 (57%), Gaps = 3/77 (3%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRL-AG-DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
L V D++R L P S+ +V + L AG DG+ G D SA F+ P+ A+D GN
Sbjct: 132 LYVSDATRHTISKL-VPNSDGTVTQSLLAGSDGISGSEDKSGSSASFNSPEGLALDASGN 190
Query: 117 IYVADKSNHVIRKITNL 133
+YVAD NH IRKI L
Sbjct: 191 LYVADYGNHTIRKIDTL 207
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
Query: 82 VKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ LAG G G ++G ARF++P + G + VAD NH IRK++
Sbjct: 320 ISTLAGYAGDTGTANGTGNGARFNQPVGITLHTSGYLLVADAYNHAIRKVS 370
>gi|153868997|ref|ZP_01998705.1| protein kinase-like protein [Beggiatoa sp. PS]
gi|152074439|gb|EDN71293.1| protein kinase-like protein [Beggiatoa sp. PS]
Length = 1432
Score = 55.8 bits (133), Expect = 8e-06, Method: Composition-based stats.
Identities = 27/58 (46%), Positives = 35/58 (60%), Gaps = 2/58 (3%)
Query: 75 PLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
PL+ S++ +AGDG GYS +G AR P S VD + N+Y+AD NH IRKI
Sbjct: 558 PLNANSLITTIAGDGRSGYSGDNGPAIQARLGNPSSLVVDNENNLYIADTDNHRIRKI 615
Score = 49.7 bits (117), Expect = 6e-04, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ AG G +GYS DG +AR + P AVD GNI++AD++NH IRKI
Sbjct: 621 ITTFAGSGYKGYSGDGNLAITARLNMPTGLAVDGTGNIFIADQNNHRIRKI 671
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 23/59 (38%), Positives = 33/59 (55%), Gaps = 2/59 (3%)
Query: 74 FPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
F L E + AG G +G+ +G A P+ A+D +GN+Y+AD +NH IRKI
Sbjct: 436 FKLDPEGNLTVFAGTGTKGFEGDNGPAIEANLSNPEGLAIDAQGNLYIADTNNHRIRKI 494
Score = 47.0 bits (110), Expect = 0.003, Method: Composition-based stats.
Identities = 24/59 (40%), Positives = 35/59 (59%), Gaps = 2/59 (3%)
Query: 79 ESVVKRLAGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E +++ G GV+G +DG S A ++P A+D GN+Y+A+K NH IRKI D
Sbjct: 674 EGIIRTFTGTGVRGTATDGILASVAEINQPTDIALDQYGNLYLAEKGNHFIRKIGEKDQ 732
Score = 38.1 bits (87), Expect = 2.0, Method: Composition-based stats.
Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 7/75 (9%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--- 130
+ + ++ + G G GY+ D E +A+ KP + D G+ Y+AD N+ IRKI
Sbjct: 494 IDSDGIITTVVGTGEAGYAGDNEFAIAAQLKKPTAIVFDHNGHFYIADSGNNSIRKINYQ 553
Query: 131 --TNLDHGNQLIRQI 143
T+ + N LI I
Sbjct: 554 PGTSPLNANSLITTI 568
>gi|111219919|ref|YP_710713.1| serine/threonine protein kinase [Frankia alni ACN14a]
gi|111147451|emb|CAJ59101.1| Putative serine/threonine protein kinase [Frankia alni ACN14a]
Length = 779
Score = 55.8 bits (133), Expect = 8e-06, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 8/92 (8%)
Query: 42 QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS 100
+L+ +V DR G DL + DS + + +V +AG G +G+S DG P +
Sbjct: 622 RLQATTAAVADRSG--DLYIADSEANRIRRID----SRGIVTTVAGTGAKGFSGDGGPAT 675
Query: 101 -ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
A + P AVD GN+Y+AD +NH IR++T
Sbjct: 676 RAMLNNPTGVAVDRAGNLYIADSANHRIRRVT 707
Score = 40.4 bits (93), Expect = 0.34, Method: Composition-based stats.
Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIR 128
V +AG G G S DG P A F + S AVD GNIY++D+SN V R
Sbjct: 486 VSTVAGSGETGTSGDGGPALQAAFQRLGSVAVDAAGNIYISDESNRVRR 534
>gi|359151030|ref|ZP_09183771.1| Redoxin domain-containing protein [Streptomyces sp. S4]
Length = 568
Score = 55.8 bits (133), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 46/76 (60%), Gaps = 3/76 (3%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S + +V D++R L+ ESVV+R+ G+G +G +DG P ARF +P+ + +G
Sbjct: 140 SGNFLVSDTTRHQLVELAE--DGESVVRRIGGEGERGLADGGPAEARFSEPQGLVLTPEG 197
Query: 116 N-IYVADKSNHVIRKI 130
+ + VAD NH +R++
Sbjct: 198 DAVIVADTVNHALRRV 213
>gi|329908044|ref|ZP_08274804.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
IMCC9480]
gi|327546777|gb|EGF31711.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
IMCC9480]
Length = 642
Score = 55.8 bits (133), Expect = 8e-06, Method: Composition-based stats.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 8/88 (9%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARF 103
+P S+ IDR G L V DS +S + VV LAG G G +DG +ARF
Sbjct: 78 SPESIAIDRAGM--LYVADSVNHTIRKVS----PQGVVTTLAGRAGEPGSADGRGSAARF 131
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKIT 131
PK AVD+ GN+ V+D +NH IRKI+
Sbjct: 132 FDPKGVAVDVAGNVVVSDNANHTIRKIS 159
Score = 49.7 bits (117), Expect = 7e-04, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 30/56 (53%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV +LAG G G DG A F PK D GN++VAD N IRKIT
Sbjct: 369 ITPAGVVSQLAGSGKWGSVDGTGAGASFSAPKGIVADPLGNVFVADTYNSTIRKIT 424
Score = 46.6 bits (109), Expect = 0.005, Method: Composition-based stats.
Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 82 VKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG G DG +ARF P+S A+D G +YVAD NH IRK++
Sbjct: 53 VSTLAGASASTPGAVDGRGAAARFYSPESIAIDRAGMLYVADSVNHTIRKVS 104
Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV +AG G +DG ARF P++ A+D + N+YV D NH IRKI+
Sbjct: 423 ITPAGVVTTIAGAPSSTGSTDGPGNLARFSGPEAIAIDAQRNLYVGDTGNHTIRKIS 479
Score = 43.1 bits (100), Expect = 0.058, Method: Composition-based stats.
Identities = 24/59 (40%), Positives = 33/59 (55%), Gaps = 1/59 (1%)
Query: 74 FPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ ++ S KRLAG + +G ARF+ P + D GN YVAD N++IRKIT
Sbjct: 312 YVVTNGSTFKRLAGLAPELALVNGNGAQARFNLPWAVTSDATGNWYVADAGNYMIRKIT 370
Score = 42.0 bits (97), Expect = 0.11, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 31/57 (54%), Gaps = 3/57 (5%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S VV LAG G G DG +AR P+S +VD GN+YV S +RKIT
Sbjct: 478 ISTSGVVSTLAGSPGRYGSEDGTGAAARLASPRSMSVDQAGNVYVI--SYRAVRKIT 532
Score = 38.9 bits (89), Expect = 0.98, Method: Composition-based stats.
Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S + V LAG + DG +ARF P++ D N+YV D+ N ++RKIT
Sbjct: 158 ISPDGNVTTLAGSARNASALDGIGSAARFAFPQALVTDAARNVYVVDRGNGLLRKIT 214
Score = 38.9 bits (89), Expect = 1.2, Method: Composition-based stats.
Identities = 19/42 (45%), Positives = 25/42 (59%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G G +G ARF P S VD GN Y+AD++N+ IRK+
Sbjct: 6 GGSGNLNGTGSLARFAYPGSMVVDSAGNRYIADQNNNQIRKV 47
>gi|325916326|ref|ZP_08178604.1| gluconolactonase [Xanthomonas vesicatoria ATCC 35937]
gi|325537497|gb|EGD09215.1| gluconolactonase [Xanthomonas vesicatoria ATCC 35937]
Length = 699
Score = 55.8 bits (133), Expect = 9e-06, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 32/54 (59%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + VV LAGDG G+ DG ARF+ P AVD +G +YVAD N IR I
Sbjct: 153 TPQGVVTTLAGDGTAGFGDGAASQARFNGPMGVAVDAQGRVYVADTYNDRIRVI 206
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 24/50 (48%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V+ LAG G +G++DG +A F+ P A+D GN+Y+AD NH IRK T
Sbjct: 105 VQTLAG-GREGFADGIGAAAAFNTPSGIALDTAGNLYIADTGNHAIRKRT 153
Score = 46.6 bits (109), Expect = 0.005, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 30/56 (53%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ + V+ LAG + G +DG AR D P VD G ++VAD N IR+IT
Sbjct: 206 IERDGQVRTLAGSALPGMADGVGEQARLDTPTDLKVDAHGVVWVADMRNDAIRRIT 261
>gi|182416423|ref|YP_001821489.1| NHL repeat-containing protein [Opitutus terrae PB90-1]
gi|177843637|gb|ACB77889.1| NHL repeat containing protein [Opitutus terrae PB90-1]
Length = 1026
Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats.
Identities = 29/56 (51%), Positives = 34/56 (60%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S + V LAG G G +DG+ SARF AVD GN+YVAD NH IRKIT
Sbjct: 187 VSPDGNVTILAGAGKAGSADGDSASARFSGSDGLAVDGTGNVYVADLFNHTIRKIT 242
Score = 50.4 bits (119), Expect = 3e-04, Method: Composition-based stats.
Identities = 29/57 (50%), Positives = 34/57 (59%), Gaps = 3/57 (5%)
Query: 78 EESVVKRLAG---DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
E V R AG D G +DG SARF P+S AVD G +YVAD ++ VIRKIT
Sbjct: 22 ESYVFTRYAGNPNDTPYGSTDGFGDSARFAAPRSIAVDASGTLYVADAASSVIRKIT 78
Score = 45.4 bits (106), Expect = 0.011, Method: Composition-based stats.
Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 5/94 (5%)
Query: 38 IDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE 97
+D + + + R G D+ V DS S ++ + + R G GY++G
Sbjct: 318 VDAARFSLPRGVAVSRTG--DIYVADSGNSTVRRIAVGGAVTTFAGRPGG---PGYANGS 372
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+A+F P A+D N++VAD N+VIRKIT
Sbjct: 373 SETAQFYFPTGIAIDQNRNVFVADSYNNVIRKIT 406
Score = 45.4 bits (106), Expect = 0.011, Method: Composition-based stats.
Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 6/86 (6%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
P S+ +D G+ L V D++ S ++ +E V + G +G +DG +ARF
Sbjct: 53 PRSIAVDASGT--LYVADAASSVIRKIT---AEGMVTTFVGTAGQRGSADGIGAAARFQG 107
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
A+D +GN+Y D ++H +RKIT
Sbjct: 108 IDGLAIDARGNLYAVDFTDHTVRKIT 133
Score = 43.9 bits (102), Expect = 0.037, Method: Composition-based stats.
Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 11/65 (16%)
Query: 78 EESVVKRLAGDGVQGYSDGEPG-----------SARFDKPKSFAVDMKGNIYVADKSNHV 126
+ V+ ++A DG + G PG ARF+ P AV G IYVAD NH
Sbjct: 453 QTHVIAKIAPDGAVTFFAGSPGLSGSTDGNARTEARFNGPTGIAVGPSGTIYVADFDNHT 512
Query: 127 IRKIT 131
IR+I+
Sbjct: 513 IRQIS 517
Score = 43.5 bits (101), Expect = 0.038, Method: Composition-based stats.
Identities = 27/74 (36%), Positives = 36/74 (48%), Gaps = 7/74 (9%)
Query: 65 SRSAFYTLSF------PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
+R Y + F ++ E VV LAG G G G G ARFD P + AVD N+
Sbjct: 115 ARGNLYAVDFTDHTVRKITPEGVVTTLAGSAGDHGTQVGHGGEARFDSPMAVAVDRWDNL 174
Query: 118 YVADKSNHVIRKIT 131
YV + IRK++
Sbjct: 175 YVGQMGDGAIRKVS 188
Score = 43.5 bits (101), Expect = 0.045, Method: Composition-based stats.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+G DG G ARF P AVD G++YVAD N+++RK
Sbjct: 635 EGKQDGRGGEARFLNPYGVAVDAAGHLYVADSGNNLVRK 673
Score = 43.1 bits (100), Expect = 0.050, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 36/68 (52%), Gaps = 1/68 (1%)
Query: 76 LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
+S +V LAG Q G +DG +ARF P + VD G IYVAD + +RKIT
Sbjct: 516 ISPAGMVSTLAGAAGQPGTADGTGSAARFYAPAAVTVDRAGMIYVADSWSSAVRKITPDG 575
Query: 135 HGNQLIRQ 142
++RQ
Sbjct: 576 VVTTVVRQ 583
Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S VV +AG G+ G DG +ARF P+ AV G+IYVAD N +R+I
Sbjct: 296 VSPSGVVSTVAGKTGLSGSDDG-VDAARFSLPRGVAVSRTGDIYVADSGNSTVRRI 350
Score = 40.8 bits (94), Expect = 0.29, Method: Composition-based stats.
Identities = 28/77 (36%), Positives = 43/77 (55%), Gaps = 6/77 (7%)
Query: 60 IVLDSSRSAFYTLSF-----PLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDM 113
I +D +R+ F S+ ++ VV +AG GV G ++G +ARF P + A+D
Sbjct: 384 IAIDQNRNVFVADSYNNVIRKITPGGVVTTVAGLGGVFGSAEGSGAAARFGVPAAVAIDA 443
Query: 114 KGNIYVADKSNHVIRKI 130
N+YVA++ HVI KI
Sbjct: 444 AANLYVANRQTHVIAKI 460
>gi|255589882|ref|XP_002535117.1| nhl repeat-containing protein, putative [Ricinus communis]
gi|223523999|gb|EEF27264.1| nhl repeat-containing protein, putative [Ricinus communis]
Length = 435
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V LAG G +GY++G P +A F+ P A+D GN+YVAD N+ IRKI
Sbjct: 96 DGVTTTLAG-GTEGYAEGAPTAAAFNTPSGLAIDAAGNLYVADTGNNAIRKI 146
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 39/75 (52%), Gaps = 4/75 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+ +L V D+ +A + + V LAGDG+ G DG+ A+F+ P AVD G
Sbjct: 130 AGNLYVADTGNNAIRKIG----PDGTVSTLAGDGLAGDKDGKGAGAQFNGPIGVAVDAAG 185
Query: 116 NIYVADKSNHVIRKI 130
+YV D N IR+I
Sbjct: 186 VVYVTDTYNDRIRRI 200
Score = 44.3 bits (103), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS-NHVIRKI 130
P + + +AGDG+ G S+G RF P A+D +GN+YVAD N+ IRKI
Sbjct: 37 PAGWHARITTMAGDGLPGSSNGSGKRTRFADPFGVALDRQGNLYVADGGDNNSIRKI 93
Score = 41.2 bits (95), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 28/55 (50%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + V +AG G +DG A FD P A+ G +Y+AD N IRKI
Sbjct: 200 IAPDGTVTTVAGGKRAGMADGAGAQALFDTPTGIALGASGALYIADTGNSAIRKI 254
>gi|194364186|ref|YP_002026796.1| SMP-30/gluconolaconase/LRE domain-containing protein
[Stenotrophomonas maltophilia R551-3]
gi|194346990|gb|ACF50113.1| SMP-30/Gluconolaconase/LRE domain protein [Stenotrophomonas
maltophilia R551-3]
Length = 693
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 46/95 (48%), Gaps = 7/95 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
IDG L+ N S I +L V D+ A + + V LAG G QGY+D
Sbjct: 114 IDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGYAD 168
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G ARFD P AVD +G +YVAD N IR I
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 203
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I
Sbjct: 103 IETVAGQG-EGRIDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRI 150
Score = 41.2 bits (95), Expect = 0.21, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 29/49 (59%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ LAG G +DG +ARFD P + A D G + VAD N+ +R++
Sbjct: 209 VRTLAGGERPGLADGLGVAARFDTPVALAFDAHGALLVADLFNNAVRRV 257
Score = 38.5 bits (88), Expect = 1.5, Method: Composition-based stats.
Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK-SNHVIRK 129
PL+ + ++ LAGDG G DG ARF P + G+IY D N+ IR+
Sbjct: 41 PLAWTAQIEMLAGDGHPGDRDGASAQARFADPYALLRSADGSIYFTDAGDNNRIRR 96
>gi|431798812|ref|YP_007225716.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
gi|430789577|gb|AGA79706.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
Length = 428
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 51/97 (52%), Gaps = 8/97 (8%)
Query: 37 VIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
+IDG ++ NP V P LIV D + ++ ++ + V + G G G+
Sbjct: 121 LIDGSGIQAAFRNPEGVTMHPDGY-LIVTDRANNSIRKVT----TDGAVSTVLGTGNSGF 175
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+G SA D P VDM+GNIYVAD+ NH+IRKI
Sbjct: 176 QNGPVASALLDYPWKSCVDMEGNIYVADRDNHMIRKI 212
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + +V +AG G G++DG A+FD+P AV +G +YV D NH IRKI
Sbjct: 212 IDPQGMVSTVAGTGEAGFADGPAEEAQFDQPLDIAVTAEGVLYVTDNRNHRIRKI 266
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 43/83 (51%), Gaps = 7/83 (8%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDC 150
QG DG A F P VD GNIYVAD+ NH+IRKI +L+ G Q++ D
Sbjct: 280 QGNQDGALEEATFRYPSGLDVDDMGNIYVADRINHLIRKI-DLNAG-----QVSTVAGDG 333
Query: 151 SKSSQSGSGMFWVTVFSPYLISC 173
S+ ++ G + +PY IS
Sbjct: 334 SQGTRDGQ-VMTAQFNNPYGISV 355
Score = 44.3 bits (103), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 30/49 (61%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V +AGDG QG DG+ +A+F+ P +V G + VAD SNH IR I
Sbjct: 326 VSTVAGDGSQGTRDGQVMTAQFNNPYGISVADNGQLVVADLSNHKIRLI 374
>gi|254522386|ref|ZP_05134441.1| NHL repeat containing protein [Stenotrophomonas sp. SKA14]
gi|219719977|gb|EED38502.1| NHL repeat containing protein [Stenotrophomonas sp. SKA14]
Length = 643
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 38/96 (39%), Positives = 47/96 (48%), Gaps = 7/96 (7%)
Query: 37 VIDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS 94
+IDG L+ N S I +L V D+ A + + V LAG GVQG+
Sbjct: 63 LIDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGI----DGQVTTLAG-GVQGHG 117
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
DG ARFD P AVD +G +YVAD N IR I
Sbjct: 118 DGPVAQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 153
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I
Sbjct: 53 VETVAGQG-EGLIDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRI 100
Score = 43.5 bits (101), Expect = 0.038, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 30/49 (61%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ LAG G +DG +ARFD P + A D +G + VAD N+ +R++
Sbjct: 159 VRTLAGGERPGLADGAGDAARFDTPVALAFDAQGALLVADLFNNAVRRV 207
>gi|408823744|ref|ZP_11208634.1| NHL repeat-containing protein [Pseudomonas geniculata N1]
Length = 681
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 37/95 (38%), Positives = 46/95 (48%), Gaps = 7/95 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
+DG L+ N S I +L V D+ A + + V LAG G QGY+D
Sbjct: 102 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGYAD 156
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G ARFD P AVD +G +YVAD N IR I
Sbjct: 157 GPAAQARFDAPMGIAVDAQGQVYVADTYNDRIRVI 191
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I
Sbjct: 91 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRI 138
Score = 44.3 bits (103), Expect = 0.025, Method: Composition-based stats.
Identities = 21/49 (42%), Positives = 30/49 (61%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ LAG G +DG +ARFD P + A D +G + VAD N+ +R+I
Sbjct: 197 VRTLAGGERPGMADGVGAAARFDTPVALAFDAQGALLVADLFNNAVRRI 245
>gi|374606289|ref|ZP_09679172.1| copper amine oxidase domain-containing protein [Paenibacillus
dendritiformis C454]
gi|374388103|gb|EHQ59542.1| copper amine oxidase domain-containing protein [Paenibacillus
dendritiformis C454]
Length = 564
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 36/55 (65%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
L + V +AG G+ G+ DG+ ARF++P+ AV G++YVAD NHVIR+I
Sbjct: 165 LDKSGRVTTVAGSGLAGWKDGKGAEARFNEPRDVAVAEDGSLYVADALNHVIRRI 219
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 29/38 (76%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++DG+ ARF++P A D +GN+Y+AD +NHVIRK+
Sbjct: 128 WADGKGTEARFNEPMGMAEDRQGNLYIADAANHVIRKL 165
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 31/61 (50%), Gaps = 8/61 (13%)
Query: 68 AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
A Y+ FP + RL G GY DG+ A F+ P A+ +G I VAD+ NH I
Sbjct: 303 ASYSYKFPDT------RLYAAG--GYRDGKASEALFNGPAGIALTGEGGIVVADRWNHAI 354
Query: 128 R 128
R
Sbjct: 355 R 355
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 28/63 (44%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINL 145
AG GV G +DG A+F P G + +AD NH+IR+ + L Q+
Sbjct: 58 AGSGVSGSADGRAEQAQFRYPAGLLAGKNGALLIADTYNHLIRRADAAGQVSTLAGQVRF 117
Query: 146 KPE 148
E
Sbjct: 118 AAE 120
>gi|455790313|gb|EMF42184.1| putative lipoprotein [Leptospira interrogans serovar Lora str. TE
1992]
Length = 294
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ N++IRKI GN NL +D S
Sbjct: 96 DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 38.9 bits (89), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259
Score = 37.0 bits (84), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+ G +G+ ++ FD P +D + N+YV + SNH IRKI NL+ G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209
>gi|383636240|ref|ZP_09950646.1| redoxin domain-containing protein [Streptomyces chartreusis NRRL
12338]
Length = 605
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 33/99 (33%), Positives = 55/99 (55%), Gaps = 10/99 (10%)
Query: 39 DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
DG + P + R PG S +V D++R L+ ESVV+R+ G G +
Sbjct: 165 DGPYVAPEPEPTVLRFPGKALLLPSGTFLVSDTTRHQLVELA--EDGESVVRRI-GSGTR 221
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G++DG G+A F++P+ A+ +G++ VAD NH +R++
Sbjct: 222 GFADGPTGTAAFNEPQGLALLDEGSVVVADTVNHALRRL 260
>gi|124006033|ref|ZP_01690870.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
gi|123988440|gb|EAY28086.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
Length = 789
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 45/127 (35%), Positives = 66/127 (51%), Gaps = 15/127 (11%)
Query: 12 ITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFY 70
ITLL ++ SL +SG ++ Y + TV+ + N V +D +++ V D+ R+ Y
Sbjct: 9 ITLLCIV-SLEASG---QNQYKINTVLSNYD---NIWGVAVD--AQNNVYVTDAERNRIY 59
Query: 71 TLSFPLSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIR 128
L +++ AG G +GYS DG P +A D P VD GN+Y AD SN V+R
Sbjct: 60 RLDAITGVRTII---AGTGTEGYSGDGGPATAAMLDYPTGITVDAAGNVYFADGSNDVVR 116
Query: 129 KITNLDH 135
KI H
Sbjct: 117 KIDATTH 123
Score = 40.0 bits (92), Expect = 0.45, Method: Composition-based stats.
Identities = 19/52 (36%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ +AG+G +G++ G+ +A+ P A+D GNIY+ D N IRK+
Sbjct: 124 VISTIAGNGNRGFAGDGGQATAAQLHFPSDVALDTAGNIYIVDHRNDRIRKV 175
Score = 37.7 bits (86), Expect = 2.3, Method: Composition-based stats.
Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 85 LAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AGDG +DG P A PK A+D GN+Y+AD + IRK+
Sbjct: 225 FAGDG-NALNDGGPAHLASLRNPKGLAIDAAGNVYIADVLDDRIRKV 270
>gi|386716891|ref|YP_006183217.1| NHL repeat protein [Stenotrophomonas maltophilia D457]
gi|384076453|emb|CCH11034.1| NHL repeat protein [Stenotrophomonas maltophilia D457]
Length = 652
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 37/95 (38%), Positives = 47/95 (49%), Gaps = 7/95 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
+DG L+ N S I +L V D+ A +S + V LAG G QG++D
Sbjct: 73 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIS----TDGQVTTLAG-GEQGHAD 127
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G ARFD P AVD +G +YVAD N IR I
Sbjct: 128 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 162
Score = 48.9 bits (115), Expect = 9e-04, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 31/50 (62%), Gaps = 1/50 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V+ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I+
Sbjct: 62 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIS 110
Score = 44.7 bits (104), Expect = 0.018, Method: Composition-based stats.
Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 5/102 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGD 88
VTT+ G Q + + R + I +D+ + +F + + V+ LAG
Sbjct: 115 VTTLAGGEQGHADGPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGG 174
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G +DG +ARFD P + A D +G + VAD N+ +R++
Sbjct: 175 DRPGLADGVGAAARFDTPVALAFDAQGALLVADLFNNAVRRV 216
>gi|456988165|gb|EMG23306.1| hypothetical protein LEP1GSC150_2908 [Leptospira interrogans
serovar Copenhageni str. LT2050]
Length = 231
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ N++IRKI GN NL +D S
Sbjct: 96 DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124
>gi|427400920|ref|ZP_18892158.1| hypothetical protein HMPREF9710_01754 [Massilia timonae CCUG 45783]
gi|425720099|gb|EKU83025.1| hypothetical protein HMPREF9710_01754 [Massilia timonae CCUG 45783]
Length = 717
Score = 54.3 bits (129), Expect = 3e-05, Method: Composition-based stats.
Identities = 34/93 (36%), Positives = 47/93 (50%), Gaps = 13/93 (13%)
Query: 50 VIDRPGSSDLI-----VLDSSRSAFYTLSFP-------LSEESVVKRLAGDGVQGYSDGE 97
VID PGS+ V +R A Y + + V LAG G +G++DG
Sbjct: 64 VIDGPGSASRFSDPYGVAIGARGAVYVADGGEANRIRLIQPDGAVSTLAG-GKEGFADGI 122
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+A F P + A+D +GN+YVAD NH IRK+
Sbjct: 123 GAAAAFHTPSALALDHEGNLYVADTGNHAIRKV 155
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 31/55 (56%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + +V LAG+G G DG A FD P + A G +YVAD NH +R+I
Sbjct: 209 IAPDGMVTTLAGNGKPGLLDGALLDAGFDTPSALAAGRDGTLYVADTGNHAVRRI 263
Score = 45.1 bits (105), Expect = 0.014, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 32/55 (58%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + V +AG G GY DG +A+F+ P AVD G +YVAD N IR+I
Sbjct: 155 VAPDGTVTTVAGSGSPGYLDGIGRAAQFNGPVGIAVDDAGIVYVADTYNDRIRRI 209
Score = 39.7 bits (91), Expect = 0.62, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVAD 121
V LAGDG G DG PGSA RF P A+ +G +YVAD
Sbjct: 53 VSLLAGDGRDGVIDG-PGSASRFSDPYGVAIGARGAVYVAD 92
>gi|290974765|ref|XP_002670115.1| predicted protein [Naegleria gruberi]
gi|284083670|gb|EFC37371.1| predicted protein [Naegleria gruberi]
Length = 784
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 48/91 (52%), Gaps = 5/91 (5%)
Query: 43 LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGS 100
LE P V P S+DL++ D L+ +++V + GD GYS +G
Sbjct: 95 LEFQPGGVALYPNSNDLLISDPVGGVIVRLN-SKGIQTIVAGMKGD--LGYSGDNGLATR 151
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
AR + P AV G +Y+ADKSNHVIRKI+
Sbjct: 152 ARLNTPTGVAVASNGEVYIADKSNHVIRKIS 182
Score = 43.1 bits (100), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG+G +G+S G +A+ + P ++ G +Y+AD NHVIRKI
Sbjct: 187 ISTIAGNGEEGFSGDGGNAKTAQLNSPIGLSISSTGELYIADSKNHVIRKI 237
Score = 39.7 bits (91), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADK 122
+ SA++ + +S ++ + AG G GYS DG A+F++P A G+I++AD
Sbjct: 405 AESAYHKIR-KISPNGIISQYAGTGDHGYSGDGSDAKLAKFNEPTRIACSKNGDIFIADL 463
Query: 123 SNHVIRKI 130
N IR+I
Sbjct: 464 FNGAIRRI 471
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 6/99 (6%)
Query: 33 TVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
TV TV+ G +P +VI S +L+V D S +S + L G G
Sbjct: 477 TVDTVVSGLG---SPQAVILTE-SGELLVADRSSHVIRKISTNGVMSIIAGVLEDGGFNG 532
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
DG +F P+ A+ G++Y+AD N++IRK++
Sbjct: 533 --DGLATKTKFSGPQDIALAPNGDLYIADYDNYLIRKLS 569
>gi|409099442|ref|ZP_11219466.1| NHL repeat containing protein [Pedobacter agri PB92]
Length = 754
Score = 54.3 bits (129), Expect = 3e-05, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 32/56 (57%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S V +AG G G DG SA F+ P AVD GNIYVAD N+ IRKIT
Sbjct: 193 ISSTGFVTTVAGSGATGSLDGNAASATFNNPAGVAVDRAGNIYVADAGNNKIRKIT 248
Score = 45.1 bits (105), Expect = 0.013, Method: Composition-based stats.
Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 7/87 (8%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
NP V +DR G ++ V D+ + ++ VV LAG+G G +DG +AR
Sbjct: 222 NPAGVAVDRAG--NIYVADAGNNKIRKIT----PAGVVSTLAGNGTAGSTDGTGTNARLY 275
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
P +D N+Y+AD N+ IRKI+
Sbjct: 276 YPTGITIDPLDNLYIADYRNNRIRKIS 302
Score = 43.9 bits (102), Expect = 0.034, Method: Composition-based stats.
Identities = 21/56 (37%), Positives = 35/56 (62%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S VV LAG+G G +G ++ F+ P A D++GN+++ D N++IRKI+
Sbjct: 301 ISPSGVVTTLAGNGSAGNLNGVGTNSSFNGPIMLAFDVQGNLFITDFQNNLIRKIS 356
Score = 35.8 bits (81), Expect = 9.8, Method: Composition-based stats.
Identities = 19/50 (38%), Positives = 24/50 (48%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V AG G +G S+ F+ P D + NIYV D + IRKIT
Sbjct: 63 VSTFAGSGAASSYNGTGLSSGFNMPSGIGSDAQSNIYVCDYGSGAIRKIT 112
>gi|418714767|ref|ZP_13275259.1| putative lipoprotein [Leptospira interrogans str. UI 08452]
gi|410788941|gb|EKR82646.1| putative lipoprotein [Leptospira interrogans str. UI 08452]
Length = 358
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ N++IRKI GN NL +D S
Sbjct: 96 DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 38.9 bits (89), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+ G +G+ ++ FD P +D + N+YV + SNH IRKI NL+ G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209
>gi|418707795|ref|ZP_13268612.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. UI 08368]
gi|410771897|gb|EKR47094.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. UI 08368]
gi|456968671|gb|EMG09839.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. LT2186]
Length = 358
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ N++IRKI GN NL +D S
Sbjct: 96 DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G+ GY DG+ SARF P A + K N + AD +H IRKI
Sbjct: 217 GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259
Score = 36.6 bits (83), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
G +G+ ++ FD P +D + N+YV + SNH IRKI NL+ G
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209
>gi|417772389|ref|ZP_12420278.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Pomona]
gi|418684105|ref|ZP_13245295.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Kennewicki LC82-25]
gi|400324210|gb|EJO76509.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Kennewicki LC82-25]
gi|409945760|gb|EKN95775.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Pomona]
gi|455670390|gb|EMF35375.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Fox 32256]
Length = 358
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ N++IRKI GN NL +D S
Sbjct: 96 DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259
Score = 37.0 bits (84), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
G +G+ ++ FD P +D + N+YV + SNH IRKI NL+ G
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209
>gi|418707100|ref|ZP_13267936.1| putative lipoprotein [Leptospira interrogans serovar Hebdomadis
str. R499]
gi|410763306|gb|EKR34037.1| putative lipoprotein [Leptospira interrogans serovar Hebdomadis
str. R499]
Length = 358
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ N++IRKI GN NL +D S
Sbjct: 96 DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 38.9 bits (89), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+ G +G+ ++ FD P +D + N+YV + SNH IRKI NL+ G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209
>gi|24215844|ref|NP_713325.1| hypothetical protein LA_3145 [Leptospira interrogans serovar Lai
str. 56601]
gi|45656856|ref|YP_000942.1| hypothetical protein LIC10968 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|386074980|ref|YP_005989298.1| hypothetical protein LIF_A2535 [Leptospira interrogans serovar Lai
str. IPAV]
gi|417765006|ref|ZP_12412972.1| putative lipoprotein [Leptospira interrogans serovar Bulgarica str.
Mallika]
gi|418666993|ref|ZP_13228411.1| putative lipoprotein [Leptospira interrogans serovar Pyrogenes str.
2006006960]
gi|418699338|ref|ZP_13260303.1| putative lipoprotein [Leptospira interrogans serovar Bataviae str.
L1111]
gi|418723579|ref|ZP_13282417.1| putative lipoprotein [Leptospira interrogans str. UI 12621]
gi|418727930|ref|ZP_13286513.1| putative lipoprotein [Leptospira interrogans str. UI 12758]
gi|421083673|ref|ZP_15544545.1| putative lipoprotein [Leptospira santarosai str. HAI1594]
gi|421102269|ref|ZP_15562874.1| putative lipoprotein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|421122244|ref|ZP_15582528.1| putative lipoprotein [Leptospira interrogans str. Brem 329]
gi|24197042|gb|AAN50343.1| hypothetical protein LA_3145 [Leptospira interrogans serovar Lai
str. 56601]
gi|45600092|gb|AAS69579.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|353458770|gb|AER03315.1| hypothetical protein LIF_A2535 [Leptospira interrogans serovar Lai
str. IPAV]
gi|400352606|gb|EJP04785.1| putative lipoprotein [Leptospira interrogans serovar Bulgarica str.
Mallika]
gi|409962948|gb|EKO26678.1| putative lipoprotein [Leptospira interrogans str. UI 12621]
gi|410344796|gb|EKO95953.1| putative lipoprotein [Leptospira interrogans str. Brem 329]
gi|410367934|gb|EKP23317.1| putative lipoprotein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|410433778|gb|EKP78117.1| putative lipoprotein [Leptospira santarosai str. HAI1594]
gi|410757315|gb|EKR18927.1| putative lipoprotein [Leptospira interrogans serovar Pyrogenes str.
2006006960]
gi|410761708|gb|EKR27881.1| putative lipoprotein [Leptospira interrogans serovar Bataviae str.
L1111]
gi|410777322|gb|EKR57287.1| putative lipoprotein [Leptospira interrogans str. UI 12758]
gi|456822102|gb|EMF70597.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
LT1962]
Length = 358
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ N++IRKI GN NL +D S
Sbjct: 96 DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+ G +G+ ++ FD P +D + N+YV + SNH IRKI NL+ G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209
>gi|417783289|ref|ZP_12431009.1| putative lipoprotein [Leptospira interrogans str. C10069]
gi|409953418|gb|EKO07917.1| putative lipoprotein [Leptospira interrogans str. C10069]
Length = 358
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ N++IRKI GN NL +D S
Sbjct: 96 DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 39.3 bits (90), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
+V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI
Sbjct: 210 MVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+ G +G+ ++ FD P +D + N+YV + SNH IRKI NL+ G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209
>gi|421861496|ref|ZP_16293498.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
gi|410828922|dbj|GAC43935.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
Length = 575
Score = 53.9 bits (128), Expect = 3e-05, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 35/55 (63%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
L + V +AG G+ G+ DG ARF++P+ AV G++YVAD NHV+R+I
Sbjct: 176 LDKSGRVTTVAGSGLAGWRDGTGSEARFNEPRDVAVAEDGSLYVADALNHVLRRI 230
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 19/38 (50%), Positives = 28/38 (73%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++DG+ ARF++P A D +GN+Y+AD NHVIRK+
Sbjct: 139 WTDGKGTEARFNQPMGMAEDRQGNLYIADAGNHVIRKL 176
Score = 42.0 bits (97), Expect = 0.12, Method: Composition-based stats.
Identities = 30/105 (28%), Positives = 43/105 (40%), Gaps = 19/105 (18%)
Query: 44 EINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-------------- 89
+ N S + S DL+V D+ ++ ++ V LAG G
Sbjct: 269 KFNEPSSLAFTSSGDLVVSDTGNQRLRLVNL---KQKYVTTLAGAGSVASYSCKFPDAQL 325
Query: 90 --VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
GY D E A F+ P A+ +G I VAD+ NH IR + N
Sbjct: 326 YAAGGYRDSEASEALFNGPAGIAITAEGGIIVADRWNHAIRYLFN 370
Score = 38.1 bits (87), Expect = 1.6, Method: Composition-based stats.
Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 13/66 (19%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSK 152
Y+DG+ G ++F++P S A G++ V+D GNQ +R +NLK + +
Sbjct: 260 YADGKLGESKFNEPSSLAFTSSGDLVVSDT-------------GNQRLRLVNLKQKYVTT 306
Query: 153 SSQSGS 158
+ +GS
Sbjct: 307 LAGAGS 312
Score = 37.7 bits (86), Expect = 2.4, Method: Composition-based stats.
Identities = 17/44 (38%), Positives = 21/44 (47%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
AG GV G DG A F P G + +AD NH+IR+
Sbjct: 72 AGSGVSGSVDGRAERAEFRYPSGLLAGKNGELLIADTYNHLIRR 115
>gi|417763451|ref|ZP_12411429.1| putative lipoprotein [Leptospira interrogans str. 2002000624]
gi|417776933|ref|ZP_12424763.1| putative lipoprotein [Leptospira interrogans str. 2002000621]
gi|418671130|ref|ZP_13232484.1| putative lipoprotein [Leptospira interrogans str. 2002000623]
gi|409940657|gb|EKN86296.1| putative lipoprotein [Leptospira interrogans str. 2002000624]
gi|410573292|gb|EKQ36344.1| putative lipoprotein [Leptospira interrogans str. 2002000621]
gi|410581799|gb|EKQ49606.1| putative lipoprotein [Leptospira interrogans str. 2002000623]
Length = 358
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ N++IRKI GN NL +D S
Sbjct: 96 DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259
Score = 35.8 bits (81), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+ G +G+ ++ FD P +D + N+YV + +NH IRKI NL+ G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELNNHTIRKI-NLNSG 209
>gi|445496654|ref|ZP_21463509.1| NHL repeat containing protein [Janthinobacterium sp. HH01]
gi|444786649|gb|ELX08197.1| NHL repeat containing protein [Janthinobacterium sp. HH01]
Length = 741
Score = 53.9 bits (128), Expect = 3e-05, Method: Composition-based stats.
Identities = 33/80 (41%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
ID G +L V D+ +A ++ E VV LAGDG+ G DG +A+F+ P A
Sbjct: 165 IDAAG--NLYVADTGNNAIRKVT----PEGVVSTLAGDGLPGDKDGRGAAAQFNGPVGIA 218
Query: 111 VDMKGNIYVADKSNHVIRKI 130
VD G +YVAD N IR+I
Sbjct: 219 VDAAGVVYVADTYNDRIRRI 238
Score = 53.1 bits (126), Expect = 5e-05, Method: Composition-based stats.
Identities = 24/47 (51%), Positives = 33/47 (70%), Gaps = 1/47 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LAG GV+GY++G +A F+ P A+D GN+YVAD N+ IRK+T
Sbjct: 140 LAG-GVEGYAEGAGKAAAFNTPSGLAIDAAGNLYVADTGNNAIRKVT 185
Score = 46.2 bits (108), Expect = 0.006, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 27/49 (55%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V +AG G +DG A FD P A+ G++Y+AD NH IRK+
Sbjct: 244 VTTIAGGSRAGKADGAAAQALFDTPTGLALSAAGDLYIADTGNHAIRKL 292
Score = 43.9 bits (102), Expect = 0.032, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD-KSNHVIRKI 130
V +AGDG+ G ++G S RF P +D GN+YVAD N+ IRKI
Sbjct: 82 VTSIAGDGLPGANNGHGRSTRFADPFGVVIDGAGNLYVADGGDNNSIRKI 131
>gi|443625938|ref|ZP_21110373.1| putative NHL repeat protein [Streptomyces viridochromogenes Tue57]
gi|443340614|gb|ELS54821.1| putative NHL repeat protein [Streptomyces viridochromogenes Tue57]
Length = 632
Score = 53.9 bits (128), Expect = 3e-05, Method: Composition-based stats.
Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 25/132 (18%)
Query: 39 DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
DG + P + R PG S D +V D++R L+ E+VV+R+ G G +
Sbjct: 169 DGPYVAPEPEPTVLRFPGKVLLLPSGDFLVSDTTRHQLVELAG--DGETVVRRI-GQGSR 225
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
G +DG ARF +P+ A+ G + VAD NH++R+ PE
Sbjct: 226 GSTDGYADRARFQEPQGLALLPDGTVAVADTVNHLVRR---------------FDPETGR 270
Query: 152 KSSQSGSGMFWV 163
++ +G+G+ W+
Sbjct: 271 TTTLAGTGIQWM 282
>gi|225872818|ref|YP_002754275.1| hypothetical protein ACP_1177 [Acidobacterium capsulatum ATCC
51196]
gi|225794459|gb|ACO34549.1| hypothetical protein ACP_1177 [Acidobacterium capsulatum ATCC
51196]
Length = 855
Score = 53.5 bits (127), Expect = 4e-05, Method: Composition-based stats.
Identities = 33/86 (38%), Positives = 48/86 (55%), Gaps = 6/86 (6%)
Query: 49 SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKP 106
S + GS +L + D++ + +S VV +AG+G +GYS DG SA D P
Sbjct: 82 SAVAYDGSGNLYIADTNNNVIREVS----TTGVVTTIAGNGEEGYSGDGAAATSAMLDTP 137
Query: 107 KSFAVDMKGNIYVADKSNHVIRKITN 132
AVD GNIY+AD N+ IR+++N
Sbjct: 138 TGIAVDSNGNIYIADSHNNRIREVSN 163
Score = 46.2 bits (108), Expect = 0.006, Method: Composition-based stats.
Identities = 24/52 (46%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ +AG+G GYS DG SA D P + AVD GNIY+AD N IR +
Sbjct: 165 IINTVAGNGTAGYSGDGAAATSAMLDDPTAVAVDASGNIYIADTGNQRIRNV 216
Score = 43.5 bits (101), Expect = 0.041, Method: Composition-based stats.
Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 85 LAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AG G GYS G SA F P + A D GN+Y+AD +N+VIR+++
Sbjct: 58 VAGTGSAGYSGNGGAATSAAFASPSAVAYDGSGNLYIADTNNNVIREVS 106
Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats.
Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG+G +GYS DG ++ D P AVD GNIY+AD N+ IR+++
Sbjct: 221 IHTVAGNGEEGYSGDGAAAASAELDTPTGIAVDSSGNIYIADSHNNRIREVS 272
Score = 37.4 bits (85), Expect = 2.9, Method: Composition-based stats.
Identities = 21/53 (39%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V+ +AG+G QGYS DG SA + P++ +V+ G++ VAD N +R +T
Sbjct: 334 VIATVAGNGQQGYSGDGAAATSAALNDPRNASVNASGSVAVADTLNERVRGLT 386
Score = 35.8 bits (81), Expect = 9.1, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 6/58 (10%)
Query: 81 VVKRLAGDGV---QGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
V+ +AG G G G+ GSA KP A+D G++Y+AD +N +R+I N
Sbjct: 275 VINTVAGSGAVTFPGSYSGDGGSATAATLAKPTGVALDAAGHVYIADTNNERLREIAN 332
>gi|330824371|ref|YP_004387674.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
gi|329309743|gb|AEB84158.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
Length = 362
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 36/55 (65%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
L V +AG G G++DG +ARF++P + AV+ G +YVAD++NH IRKI
Sbjct: 83 LRSGGTVATVAGTGKAGFADGPAHAARFNEPIAVAVERSGAVYVADRNNHRIRKI 137
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 36/50 (72%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG+G G++DG+ +ARF++P + +D +G ++V D+ NH +R++T
Sbjct: 199 VDTLAGNGAPGFADGQRATARFNQPYNVRIDGQGRLWVPDQLNHAVRRVT 248
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 7/58 (12%)
Query: 71 TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
TL FP V+ LAGDG G +DG PG R ++P + G++Y AD+ NH +R
Sbjct: 31 TLPFPE-----VRTLAGDGQPGAADGSPG--RVNRPHGLSYGNDGSLYFADRGNHQVR 81
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 29/50 (58%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V +AG G GY+DG + RFD P A G + VAD++N+ +R +T
Sbjct: 253 VTTVAGAGKAGYADGPAATVRFDNPTGVAPLPNGAVVVADRNNNRLRLVT 302
Score = 40.0 bits (92), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
+ + V LAG G+ DG+ SARF++P A+D + +YVAD NH IR+I
Sbjct: 137 IRPDGTVITLAGGDSAGFVDGDLKSARFNQPYGVALDAAQTTLYVADYLNHAIRRI 192
>gi|182414082|ref|YP_001819148.1| NHL repeat-containing protein [Opitutus terrae PB90-1]
gi|177841296|gb|ACB75548.1| NHL repeat containing protein [Opitutus terrae PB90-1]
Length = 963
Score = 53.5 bits (127), Expect = 4e-05, Method: Composition-based stats.
Identities = 28/60 (46%), Positives = 37/60 (61%), Gaps = 1/60 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
++ VV LAGD + G +DG G ARF P AVD GNI+VAD N+ IR++T L+
Sbjct: 253 ITPTGVVLTLAGDPDLVGSADGTGGDARFSSPAGLAVDRDGNIFVADSLNNTIRRVTPLN 312
Score = 51.6 bits (122), Expect = 2e-04, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 32/56 (57%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ V LAG+ G +DG ARF+ P AVD+ GN YVAD N IRKIT
Sbjct: 369 IAPSGAVTTLAGEASVGTADGPGPMARFNYPNGVAVDLAGNTYVADTFNATIRKIT 424
Score = 49.7 bits (117), Expect = 6e-04, Method: Composition-based stats.
Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 7/84 (8%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEES---VVKRLAGD-GVQGYSDGEPGSARFDKP 106
+DR G ++ V DS + ++ PL+ + VV +AG GV G +DG ARF+ P
Sbjct: 289 VDRDG--NIFVADSLNNTIRRVT-PLNGPAPLGVVTTVAGQAGVTGSADGVGSQARFNLP 345
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
AVD GNI+VAD N IRKI
Sbjct: 346 YGIAVDAAGNIFVADLGNTTIRKI 369
Score = 49.3 bits (116), Expect = 8e-04, Method: Composition-based stats.
Identities = 25/47 (53%), Positives = 30/47 (63%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LAG G DG+ +A F P S AVD GN++VAD +NH IRKIT
Sbjct: 44 LAGRPFFGGLDGQGRAAGFTTPSSAAVDQAGNLFVADTTNHTIRKIT 90
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 7/84 (8%)
Query: 49 SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPK 107
+ +D+ G +L V D++ ++ V AG G Q G DG +ARF P
Sbjct: 68 AAVDQAG--NLFVADTTNHTIRKIT----PSGTVSTFAGMGGQPGSVDGTGNAARFLSPH 121
Query: 108 SFAVDMKGNIYVADKSNHVIRKIT 131
A+D GN+YVAD N+ IRKIT
Sbjct: 122 GVALDEAGNLYVADSGNNTIRKIT 145
Score = 45.4 bits (106), Expect = 0.011, Method: Composition-based stats.
Identities = 22/57 (38%), Positives = 34/57 (59%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + +V LAG Q G DG +ARF +P +D GN+YV+D+ + +RKIT
Sbjct: 532 ITPDGMVSTLAGSPAQRGGIDGTGTAARFVQPAGLTIDAAGNLYVSDRGDFTVRKIT 588
Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV +AG G G +DG +ARF P AV G +YVAD+ N IR+IT
Sbjct: 477 ITPAGVVTTIAGVSGNFGSADGPGLAARFAFPNGLAVATDGTLYVADEENSTIRQIT 533
Score = 39.3 bits (90), Expect = 0.82, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV LAG Q G +DG +ARF+ P AVD GN+Y S +RKIT
Sbjct: 423 ITPAGVVSTLAGAAGQIGSADGTGSAARFEFPLGIAVDRAGNVYTTANSA-TVRKIT 478
Score = 38.1 bits (87), Expect = 1.7, Method: Composition-based stats.
Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V +AG G+ G +DG +A+F A+D +G +YVAD SN+ IR+IT
Sbjct: 593 VTTVAGQHGIAGGADGTGSAAQFAYAGGIAIDRRGTLYVAD-SNNRIRQIT 642
>gi|42522312|ref|NP_967692.1| hypothetical protein Bd0727 [Bdellovibrio bacteriovorus HD100]
gi|39574843|emb|CAE78685.1| hypothetical protein with NHL repeat [Bdellovibrio bacteriovorus
HD100]
Length = 709
Score = 53.5 bits (127), Expect = 4e-05, Method: Composition-based stats.
Identities = 30/78 (38%), Positives = 43/78 (55%), Gaps = 6/78 (7%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
+ D+ ++DS+ S +S VV AG GV ++DG SARFD P +D
Sbjct: 85 AAGDIFIVDSNASVIRKIS-----NGVVSTFAGKFGVFDHADGTGDSARFDYPTGITIDG 139
Query: 114 KGNIYVADKSNHVIRKIT 131
GN++V + +NH IRKIT
Sbjct: 140 SGNLFVTEGNNHTIRKIT 157
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 30/77 (38%), Positives = 39/77 (50%), Gaps = 1/77 (1%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
S +L V DS ++ ++ V LAGD G +DG +A F P+ A D
Sbjct: 358 ASGNLFVTDSDNASIRKVTPARVVTLVAGSLAGDS-DGSADGTGTAASFHSPEGVAADPA 416
Query: 115 GNIYVADKSNHVIRKIT 131
GN+YVAD N IRKIT
Sbjct: 417 GNLYVADTMNRTIRKIT 433
Score = 47.0 bits (110), Expect = 0.004, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 41/78 (52%), Gaps = 5/78 (6%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
G+ L V DS YT+ ++ VV LAG G+QG DG A F + V
Sbjct: 523 GAGSLYVSDSGN---YTVR-KVTLAGVVTTLAGQVGIQGSDDGTGTGATFSRVAGITVTP 578
Query: 114 KGNIYVADKSNHVIRKIT 131
GNI+VAD N+VIRKIT
Sbjct: 579 SGNIFVADTDNNVIRKIT 596
Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats.
Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 5/78 (6%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
GS +L V + + ++ +VV +AG G G +DG +ARF+ P+ +
Sbjct: 139 GSGNLFVTEGNNHTIRKIT----PAAVVTTVAGSPGNAGTADGTGSAARFNNPEDITLAA 194
Query: 114 KGNIYVADKSNHVIRKIT 131
GN Y+ DK+N++IRK+T
Sbjct: 195 DGNFYITDKNNNMIRKMT 212
Score = 45.4 bits (106), Expect = 0.011, Method: Composition-based stats.
Identities = 29/77 (37%), Positives = 38/77 (49%), Gaps = 5/77 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
S +L V D S SA ++ +VV AG G G DG +ARF P +D
Sbjct: 304 SDNLYVADYSNSAIRKVT----SSAVVSNFAGSYGDYGAVDGTGTAARFAGPAGVGIDAS 359
Query: 115 GNIYVADKSNHVIRKIT 131
GN++V D N IRK+T
Sbjct: 360 GNLFVTDSDNASIRKVT 376
Score = 43.1 bits (100), Expect = 0.054, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 29/56 (51%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV AGDG G +DG +A F+ P D GN++V S IRKIT
Sbjct: 211 MTPAGVVTTFAGDGTYGCTDGTGAAAHFNYPTGIVGDSAGNLFVVCSSCSTIRKIT 266
Score = 42.4 bits (98), Expect = 0.096, Method: Composition-based stats.
Identities = 27/75 (36%), Positives = 38/75 (50%), Gaps = 1/75 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
L+ VV LAGD G +DG SARF A D G++YV+D N+ +RK+T
Sbjct: 485 LTPGGVVTSLAGDYDNSGSADGTGTSARFGGVAGIASDGAGSLYVSDSGNYTVRKVTLAG 544
Query: 135 HGNQLIRQINLKPED 149
L Q+ ++ D
Sbjct: 545 VVTTLAGQVGIQGSD 559
Score = 42.0 bits (97), Expect = 0.11, Method: Composition-based stats.
Identities = 29/79 (36%), Positives = 38/79 (48%), Gaps = 6/79 (7%)
Query: 60 IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDM 113
IV DS+ + F S ++ VV AG G DG +ARF P +D
Sbjct: 244 IVGDSAGNLFVVCSSCSTIRKITPAGVVTTFAGQAYTTGAVDGTGTAARFSWPVGITIDS 303
Query: 114 KGNIYVADKSNHVIRKITN 132
N+YVAD SN IRK+T+
Sbjct: 304 SDNLYVADYSNSAIRKVTS 322
Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 30/53 (56%), Gaps = 1/53 (1%)
Query: 81 VVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
VV AG QG +D GS ARF +P A D GN+YVA+ IRKIT+
Sbjct: 600 VVTTFAGAAGQGGNDDGMGSNARFSQPHFVATDSSGNLYVAEWGEATIRKITS 652
Score = 36.6 bits (83), Expect = 4.6, Method: Composition-based stats.
Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 3/51 (5%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V +AG G G +DG +ARF P V GNIYVAD+ + IRK+T
Sbjct: 438 VTTIAGSPGQIGSADGTGAAARFSYPTKLTVAEDGNIYVADE--YRIRKLT 486
>gi|421117009|ref|ZP_15577380.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
Fiocruz LV133]
gi|410011441|gb|EKO69561.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
Fiocruz LV133]
Length = 358
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTISTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ N++IRKI GN NL +D S
Sbjct: 96 DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259
Score = 37.0 bits (84), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
G +G+ ++ FD P +D + N+YV + SNH IRKI NL+ G
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209
>gi|418688943|ref|ZP_13250072.1| putative lipoprotein [Leptospira interrogans str. FPW2026]
gi|421128059|ref|ZP_15588277.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. 2006006986]
gi|421134520|ref|ZP_15594655.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. Andaman]
gi|400361935|gb|EJP17894.1| putative lipoprotein [Leptospira interrogans str. FPW2026]
gi|410021314|gb|EKO88104.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. Andaman]
gi|410434526|gb|EKP83664.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. 2006006986]
Length = 358
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTISTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ N++IRKI GN NL +D S
Sbjct: 96 DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259
Score = 36.6 bits (83), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
G +G+ ++ FD P +D + N+YV + SNH IRKI NL+ G
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209
>gi|110639161|ref|YP_679370.1| hypothetical protein CHU_2779 [Cytophaga hutchinsonii ATCC 33406]
gi|110281842|gb|ABG60028.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 1051
Score = 53.5 bits (127), Expect = 4e-05, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 36/57 (63%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ VV AG G G +DG +A+F+ P A+D GN+YVAD N++IRKIT+
Sbjct: 135 ITPAGVVITFAGSGTAGSNDGTGTAAQFNNPYGMAIDASGNLYVADSGNNLIRKITS 191
Score = 50.8 bits (120), Expect = 2e-04, Method: Composition-based stats.
Identities = 23/51 (45%), Positives = 31/51 (60%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+V +AG+G G D +ARF+ P + +D NIY+AD NH IRKIT
Sbjct: 86 IVSTIAGNGTSGLIDATGSAARFNLPAAVVLDAAQNIYIADNGNHCIRKIT 136
Score = 50.1 bits (118), Expect = 4e-04, Method: Composition-based stats.
Identities = 32/88 (36%), Positives = 48/88 (54%), Gaps = 7/88 (7%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
NP+ + ID S +L V DS + L ++ VV +AG+ GY DG +ARF
Sbjct: 164 NPYGMAID--ASGNLYVADSGNN----LIRKITSSGVVTTIAGNTTPGYVDGTGTAARFY 217
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITN 132
P + +D+ GN ++ D NH IRK+T+
Sbjct: 218 LPVNITLDVSGNFFITDNRNHRIRKMTS 245
Score = 49.7 bits (117), Expect = 6e-04, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 35/57 (61%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ VV +AG G GY +G +A+F++P VD N+YV D +N VIRKIT+
Sbjct: 243 MTSAGVVSTVAGSGSAGYMNGTGVTAQFNRPYGIVVDAFSNLYVTDTNNGVIRKITS 299
Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats.
Identities = 23/75 (30%), Positives = 38/75 (50%), Gaps = 4/75 (5%)
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
S+L V D++ ++ VV AG G G++DG +A+F P ++ G+
Sbjct: 282 SNLYVTDTNNGVIRKIT----SSGVVSTYAGTGTPGFADGPAANAQFQWPTGLTINASGD 337
Query: 117 IYVADKSNHVIRKIT 131
+Y AD H +RK+T
Sbjct: 338 LYEADNETHRVRKVT 352
>gi|426402696|ref|YP_007021667.1| hypothetical protein Bdt_0693 [Bdellovibrio bacteriovorus str.
Tiberius]
gi|425859364|gb|AFY00400.1| hypothetical protein Bdt_0693 [Bdellovibrio bacteriovorus str.
Tiberius]
Length = 692
Score = 53.5 bits (127), Expect = 5e-05, Method: Composition-based stats.
Identities = 30/78 (38%), Positives = 43/78 (55%), Gaps = 6/78 (7%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
+ D+ ++DS+ S +S VV AG GV ++DG SARFD P +D
Sbjct: 68 AAGDIFIVDSNASVIRKIS-----NGVVSTFAGKFGVFDHADGTGDSARFDYPTGITIDG 122
Query: 114 KGNIYVADKSNHVIRKIT 131
GN++V + +NH IRKIT
Sbjct: 123 SGNLFVTEGNNHTIRKIT 140
Score = 50.1 bits (118), Expect = 5e-04, Method: Composition-based stats.
Identities = 31/77 (40%), Positives = 39/77 (50%), Gaps = 1/77 (1%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
S DL V DS ++ ++ V LAGD G +DG +A F P+ A D
Sbjct: 341 ASGDLFVTDSDNASIRKITPARVVTLVAGSLAGDS-DGSADGTGTAASFFSPEGVAADPA 399
Query: 115 GNIYVADKSNHVIRKIT 131
GN+YVAD N IRKIT
Sbjct: 400 GNLYVADTMNRTIRKIT 416
Score = 46.6 bits (109), Expect = 0.005, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 41/78 (52%), Gaps = 5/78 (6%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
G+ L V DS YT+ ++ VV LAG G+QG DG A F + V
Sbjct: 506 GAGSLYVSDSGN---YTVR-KVTLAGVVTTLAGQVGIQGSDDGTGTGATFSRVAGITVTP 561
Query: 114 KGNIYVADKSNHVIRKIT 131
GNI+VAD N+VIRKIT
Sbjct: 562 SGNIFVADTDNNVIRKIT 579
Score = 45.8 bits (107), Expect = 0.008, Method: Composition-based stats.
Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 5/78 (6%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
GS +L V + + ++ +VV +AG G G +DG +ARF+ P+ +
Sbjct: 122 GSGNLFVTEGNNHTIRKIT----PAAVVTTVAGSPGNAGTADGTGSAARFNNPEDITLAA 177
Query: 114 KGNIYVADKSNHVIRKIT 131
GN Y+ DK+N++IRK+T
Sbjct: 178 DGNFYITDKNNNMIRKMT 195
Score = 43.1 bits (100), Expect = 0.060, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 29/56 (51%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV AGDG G +DG +A F+ P D GN++V S IRKIT
Sbjct: 194 MTPAGVVTTFAGDGTYGCTDGTGAAAHFNYPTGIVGDSAGNLFVVCSSCSTIRKIT 249
Score = 42.7 bits (99), Expect = 0.071, Method: Composition-based stats.
Identities = 28/77 (36%), Positives = 37/77 (48%), Gaps = 5/77 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
S +L V D SA ++ +VV AG G G DG +ARF P +D
Sbjct: 287 SDNLYVADYGNSAIRKVT----SSAVVSNFAGSYGDYGAVDGTGTAARFAGPAGIGIDAS 342
Query: 115 GNIYVADKSNHVIRKIT 131
G+++V D N IRKIT
Sbjct: 343 GDLFVTDSDNASIRKIT 359
Score = 42.0 bits (97), Expect = 0.11, Method: Composition-based stats.
Identities = 27/75 (36%), Positives = 38/75 (50%), Gaps = 1/75 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
L+ VV LAGD G +DG SARF A D G++YV+D N+ +RK+T
Sbjct: 468 LTPGGVVTSLAGDYDNSGSADGTGTSARFGGVAGIASDGAGSLYVSDSGNYTVRKVTLAG 527
Query: 135 HGNQLIRQINLKPED 149
L Q+ ++ D
Sbjct: 528 VVTTLAGQVGIQGSD 542
Score = 41.6 bits (96), Expect = 0.18, Method: Composition-based stats.
Identities = 25/52 (48%), Positives = 29/52 (55%), Gaps = 1/52 (1%)
Query: 81 VVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
VV AG QG +D GS ARF +P A D GN+YVA+ IRKIT
Sbjct: 583 VVTTFAGAAGQGGNDDGMGSNARFSQPHFVATDSSGNLYVAEWGEATIRKIT 634
Score = 38.9 bits (89), Expect = 1.0, Method: Composition-based stats.
Identities = 28/79 (35%), Positives = 37/79 (46%), Gaps = 6/79 (7%)
Query: 60 IVLDSSRSAFYTLSF-----PLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDM 113
IV DS+ + F S ++ VV AG G DG +ARF P +D
Sbjct: 227 IVGDSAGNLFVVCSSCSTIRKITPAGVVTTFAGLANATGALDGTGTAARFSWPIGITIDS 286
Query: 114 KGNIYVADKSNHVIRKITN 132
N+YVAD N IRK+T+
Sbjct: 287 SDNLYVADYGNSAIRKVTS 305
>gi|411003633|ref|ZP_11379962.1| hypothetical protein SgloC_12561 [Streptomyces globisporus C-1027]
Length = 605
Score = 53.1 bits (126), Expect = 5e-05, Method: Composition-based stats.
Identities = 28/72 (38%), Positives = 39/72 (54%), Gaps = 3/72 (4%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
+V D++R L ++ V+R G G +G SDG PG ARF +P+ AV G +
Sbjct: 188 FLVSDTTRHRLVELD---ADGETVRRHFGTGERGLSDGGPGEARFSEPQGLAVLPDGRVA 244
Query: 119 VADKSNHVIRKI 130
VAD NH IR +
Sbjct: 245 VADTVNHAIRAL 256
>gi|182414669|ref|YP_001819735.1| SMP-30/gluconolaconase/LRE domain-containing protein [Opitutus
terrae PB90-1]
gi|177841883|gb|ACB76135.1| SMP-30/Gluconolaconase/LRE domain protein [Opitutus terrae PB90-1]
Length = 1292
Score = 52.8 bits (125), Expect = 6e-05, Method: Composition-based stats.
Identities = 27/50 (54%), Positives = 32/50 (64%), Gaps = 1/50 (2%)
Query: 82 VKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V +AG G+ G DGE +ARF+ P AVD GN+YVAD NH IRKI
Sbjct: 456 VSTVAGAHGLDGSLDGEKTAARFNAPHGLAVDQHGNLYVADTFNHSIRKI 505
Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats.
Identities = 22/37 (59%), Positives = 26/37 (70%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
DG ARF+ PK AVD G +YVAD SNH+IRK+T
Sbjct: 209 DGAGSDARFNGPKGIAVDANGTVYVADTSNHIIRKVT 245
Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats.
Identities = 33/81 (40%), Positives = 42/81 (51%), Gaps = 9/81 (11%)
Query: 52 DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFA 110
DR GS L V D R +S E VV +AG G DG +A F + K A
Sbjct: 542 DRDGS--LFVADGHRVR------RISPEGVVTTVAGTANATGSIDGVGAAATFGEIKGLA 593
Query: 111 VDMKGNIYVADKSNHVIRKIT 131
VD GN++VA+ + HVIR+IT
Sbjct: 594 VDRAGNVFVAENTTHVIRRIT 614
Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats.
Identities = 21/40 (52%), Positives = 25/40 (62%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GY DGE ARF +P A D GN+YV D ++IRKIT
Sbjct: 43 GYVDGEGTQARFHEPAGLACDADGNLYVVDPGTNLIRKIT 82
Score = 46.6 bits (109), Expect = 0.005, Method: Composition-based stats.
Identities = 27/57 (47%), Positives = 34/57 (59%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV LAG G+ G SDG +ARF+ P AVD G IYV D+S +RKIT
Sbjct: 244 VTPSGVVTTLAGSPGISGNSDGAGDAARFNAPTDIAVDDAGTIYVVDQSGS-LRKIT 299
Score = 40.0 bits (92), Expect = 0.50, Method: Composition-based stats.
Identities = 20/59 (33%), Positives = 32/59 (54%), Gaps = 10/59 (16%)
Query: 81 VVKRLAGDGVQ----------GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
V++R+ DG G +DG +ARF++P A+D G++Y+ D N+ IRK
Sbjct: 609 VIRRITPDGTVVTIGGLAGSIGTADGVGSAARFNEPWGLALDRFGHLYIVDSGNNTIRK 667
Score = 40.0 bits (92), Expect = 0.52, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 22/40 (55%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G DG SARF P+ AV G +Y+AD N IR IT
Sbjct: 98 GLVDGPAASARFGLPQGVAVGADGTVYIADTGNAAIRIIT 137
Score = 38.1 bits (87), Expect = 1.7, Method: Composition-based stats.
Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GV+G DG +ARF P + A D G+++VAD H +R+I+
Sbjct: 519 GVEGTVDGIGNAARFGSPTALAFDRDGSLFVAD--GHRVRRIS 559
>gi|373953102|ref|ZP_09613062.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
gi|373889702|gb|EHQ25599.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
Length = 3731
Score = 52.8 bits (125), Expect = 6e-05, Method: Composition-based stats.
Identities = 26/51 (50%), Positives = 32/51 (62%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
VV LAG G G +G +A F+ P S DM+GN+YVAD N +IRKIT
Sbjct: 168 VVTTLAGSGSAGSVNGTGTAASFNTPYSLTTDMQGNVYVADYGNQLIRKIT 218
Score = 52.8 bits (125), Expect = 7e-05, Method: Composition-based stats.
Identities = 27/56 (48%), Positives = 33/56 (58%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV AG G G +DG +A F+ P A D +GNIYVAD N VIRKIT
Sbjct: 1140 ITPSGVVTTFAGSGSSGAADGIGMAASFNSPYGLATDAQGNIYVADFGNQVIRKIT 1195
Score = 50.8 bits (120), Expect = 3e-04, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 37/57 (64%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ ++VV LAG+G G ++G +A F+ P D GN+YVAD N++IRK+T+
Sbjct: 1690 ITPDAVVTTLAGNGAAGSANGTGNAASFNHPWGIVADAAGNLYVADTYNNLIRKVTS 1746
Score = 50.8 bits (120), Expect = 3e-04, Method: Composition-based stats.
Identities = 26/57 (45%), Positives = 36/57 (63%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV LAG G G+ +G +A+F+ P+S A D GN+YVAD+ N IRKIT
Sbjct: 217 ITPAGVVTTLAGTVGSSGFVNGTGTAAKFNYPRSVATDAAGNVYVADQVNQAIRKIT 273
Score = 50.4 bits (119), Expect = 4e-04, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 32/56 (57%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV AG GV G +G +A F P +D +GN+YVAD NH IRKIT
Sbjct: 272 ITPAGVVTTFAGSGVPGALNGTGTAATFYNPTGVTMDAQGNVYVADSQNHSIRKIT 327
Score = 50.1 bits (118), Expect = 4e-04, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 31/56 (55%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV LAG G G +DG +A F P D +GN+YVAD N IRKIT
Sbjct: 657 ITPAGVVTTLAGSGDSGSADGTGTAASFHTPYDLTTDAQGNVYVADNFNQTIRKIT 712
Score = 50.1 bits (118), Expect = 5e-04, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ E VV AG G G+ +G +A+F P A D +GN+YVAD N IRKIT
Sbjct: 711 ITREGVVNTFAGTSGSSGFVNGTAAAAKFKNPIGIATDTQGNVYVADNGNLAIRKIT 767
Score = 49.7 bits (117), Expect = 5e-04, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 37/56 (66%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S VV AG GV G ++G +A F+ P A D++GN+YV+D ++++IRKIT
Sbjct: 109 ISPVGVVSTFAGSGVAGSANGTGTAASFNNPFGIATDVQGNLYVSDVNSNLIRKIT 164
Score = 49.3 bits (116), Expect = 8e-04, Method: Composition-based stats.
Identities = 27/51 (52%), Positives = 30/51 (58%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
VV LAG G G +DG SA F P + A D GNIYV D N +IRKIT
Sbjct: 1308 VVTTLAGTGSIGSADGVGTSASFYNPNAVATDAVGNIYVVDTYNQLIRKIT 1358
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 32/56 (57%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ +V AG G G +G +A F+ P D +GNIYV+D +N+ IRKIT
Sbjct: 1086 ITPSGLVTTFAGSGAAGSENGNGAAASFNNPFGLTTDAQGNIYVSDANNNTIRKIT 1141
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 33/56 (58%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV LAG G +DG +A F+ P AVD GN+YVAD N ++RKIT
Sbjct: 2130 ITAAGVVTTLAGKLTAGSADGVGAAASFNLPAGVAVDASGNVYVADLLNSMVRKIT 2185
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 25/50 (50%), Positives = 31/50 (62%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAGDG G ++G + F P + A D GNIYVAD N++IRKIT
Sbjct: 813 VTILAGDGSAGSANGAGTVSNFYVPNALATDALGNIYVADAGNNLIRKIT 862
Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 33/55 (60%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ VV LAG G G ++G +A F P + D GN+Y+AD +NH+IRKI
Sbjct: 326 ITPAGVVTTLAGTGSMGSANGAGTNASFYYPNAVVADALGNLYIADTNNHLIRKI 380
Score = 45.1 bits (105), Expect = 0.013, Method: Composition-based stats.
Identities = 23/52 (44%), Positives = 31/52 (59%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+VV AG G G + SA F+ P AVD +GN YV D+ N+++RKIT
Sbjct: 2026 AVVTTFAGSGAAGSVNSTGTSATFNGPLDVAVDAEGNTYVLDQLNNLVRKIT 2077
Score = 43.5 bits (101), Expect = 0.039, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 33/56 (58%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + V LAG G +DG +A F P + VD +GNI VAD+ N+ IRKI+
Sbjct: 2184 ITPDGTVTTLAGSTSAGSADGTGAAAGFHYPTNLQVDDQGNIIVADQLNNKIRKIS 2239
Score = 43.5 bits (101), Expect = 0.040, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 32/55 (58%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S VV +AG G G ++G +A F S A D G++YVAD+ ++IRKI
Sbjct: 1798 ISPAGVVTTIAGSGASGIANGIGKAATFGNLYSIATDASGDVYVADQYKYIIRKI 1852
Score = 43.1 bits (100), Expect = 0.060, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 30/56 (53%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ +V AG G G +G +A F P D +GN+YVAD N+ IRKIT
Sbjct: 1249 ITPAGLVTTFAGSGGIGALNGTGTAASFHNPTGITTDAQGNVYVADLYNNAIRKIT 1304
Score = 42.7 bits (99), Expect = 0.067, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 30/50 (60%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V AG G DG +A F+ P + ++D GN+YVA+ + +VIRKI+
Sbjct: 1750 VTTFAGSGAASSVDGTGTAASFNYPSAISIDASGNLYVAELNGNVIRKIS 1799
Score = 42.7 bits (99), Expect = 0.080, Method: Composition-based stats.
Identities = 19/32 (59%), Positives = 24/32 (75%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+A F+ P AVD GNIYVAD+ N++IRKIT
Sbjct: 2100 AATFNHPTGLAVDAAGNIYVADQGNNMIRKIT 2131
Score = 42.4 bits (98), Expect = 0.090, Method: Composition-based stats.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V AG+ G+ G ++ +A F P AVD GN+YVAD N++IRKI+
Sbjct: 60 VSTFAGNAGIAGNTNATGTAATFHSPFGVAVDASGNVYVADAGNNLIRKIS 110
Score = 41.6 bits (96), Expect = 0.14, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 32/59 (54%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ P S+V + G+G G +G +A + D+ GN++VAD NH+IRKIT
Sbjct: 1524 AVPAKTYSLVSTIVGNGSSGAVNGTGTAASLNLCDGLVFDLLGNMFVADFGNHMIRKIT 1582
Score = 41.6 bits (96), Expect = 0.14, Method: Composition-based stats.
Identities = 21/56 (37%), Positives = 33/56 (58%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ +VV G G G ++G+ +A F P A+D GN++VAD+ + IRKIT
Sbjct: 1581 ITPATVVSTFVGTGSPGSTNGKGTAASFYVPYGMAIDAAGNLFVADQFYNQIRKIT 1636
Score = 41.2 bits (95), Expect = 0.19, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V LAG G G ++G +A F P A D GNIYV+D +N++IRKI
Sbjct: 556 VSTLAGKAGSAGNANGTGTAATFSSPTGVATDPSGNIYVSDYNNNLIRKI 605
Score = 41.2 bits (95), Expect = 0.19, Method: Composition-based stats.
Identities = 18/30 (60%), Positives = 23/30 (76%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+F+ P AVD+ GN+YVAD+ N VIRKIT
Sbjct: 1221 KFNSPYDVAVDVTGNVYVADELNQVIRKIT 1250
Score = 40.8 bits (94), Expect = 0.25, Method: Composition-based stats.
Identities = 23/65 (35%), Positives = 36/65 (55%), Gaps = 3/65 (4%)
Query: 69 FYTLSFPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
FY ++ + +V AG G G +DG +A F P+ A+D GN++V + N++
Sbjct: 1628 FYNQIRKITPDGLVTTFAGSLTGAPGATDGTGAAATFRSPRGMAIDALGNLFVVE-DNYL 1686
Query: 127 IRKIT 131
IRKIT
Sbjct: 1687 IRKIT 1691
Score = 40.4 bits (93), Expect = 0.35, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 1/51 (1%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V +AG G+ G ++G +A F P D GN+Y+AD +N +IRKIT
Sbjct: 1037 VSTVAGSVGIAGKANGIGTAATFSGPSGVTTDASGNLYIADFNNRLIRKIT 1087
>gi|344205849|ref|YP_004790990.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
gi|343777211|gb|AEM49764.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
Length = 693
Score = 52.8 bits (125), Expect = 6e-05, Method: Composition-based stats.
Identities = 37/95 (38%), Positives = 47/95 (49%), Gaps = 7/95 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
+DG L+ N S I +L V D+ A +S + V LAG G QG++D
Sbjct: 114 VDGPALQASFNTPSGIAADVQGNLYVADTGNHAIRRIS----TDGQVTTLAG-GEQGHAD 168
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G ARFD P AVD +G +YVAD N IR I
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 203
Score = 49.7 bits (117), Expect = 5e-04, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 32/50 (64%), Gaps = 1/50 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V+ +AG G +G DG A F+ P A D++GN+YVAD NH IR+I+
Sbjct: 103 VETVAGQG-EGRVDGPALQASFNTPSGIAADVQGNLYVADTGNHAIRRIS 151
Score = 44.7 bits (104), Expect = 0.017, Method: Composition-based stats.
Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 5/102 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGD 88
VTT+ G Q + + R + I +D+ + +F + + V+ LAG
Sbjct: 156 VTTLAGGEQGHADGPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGG 215
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G++DG +ARFD P + A D G + VAD N+ +R++
Sbjct: 216 DRPGFADGVGTAARFDTPVALAFDAHGALLVADLFNNAVRRV 257
>gi|424666862|ref|ZP_18103887.1| hypothetical protein A1OC_00420 [Stenotrophomonas maltophilia
Ab55555]
gi|401069531|gb|EJP78052.1| hypothetical protein A1OC_00420 [Stenotrophomonas maltophilia
Ab55555]
Length = 693
Score = 52.8 bits (125), Expect = 7e-05, Method: Composition-based stats.
Identities = 36/95 (37%), Positives = 46/95 (48%), Gaps = 7/95 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
+DG L+ N S I +L V D+ A + + V LAG G QG++D
Sbjct: 114 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGI----DGQVTTLAG-GEQGHAD 168
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G ARFD P AVD +G +YVAD N IR I
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 203
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I
Sbjct: 103 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRI 150
Score = 44.3 bits (103), Expect = 0.026, Method: Composition-based stats.
Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 5/102 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGD 88
VTT+ G Q + + R + I +D+ + +F + + V+ LAG
Sbjct: 156 VTTLAGGEQGHADGPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGG 215
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G +DG +ARFD P + A D +G + VAD N+ +R++
Sbjct: 216 DRPGLADGVGAAARFDTPVALAFDAQGALLVADLFNNAVRRV 257
>gi|223934698|ref|ZP_03626618.1| NHL repeat containing protein [bacterium Ellin514]
gi|223896653|gb|EEF63094.1| NHL repeat containing protein [bacterium Ellin514]
Length = 847
Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 35/56 (62%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV LAG +G +G SARF P++ AVD + NIYVAD N VIRKIT
Sbjct: 358 MTAAGVVTTLAGSPSEGSINGVTSSARFYSPQNVAVDGQNNIYVADTQNSVIRKIT 413
Score = 50.8 bits (120), Expect = 2e-04, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 43/78 (55%), Gaps = 5/78 (6%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
G +++ V D+ S ++ P VV LAG GV G +DG +A F P+ AVD
Sbjct: 395 GQNNIYVADTQNSVIRKIT-PFG---VVSVLAGTTGVFGSADGSGANALFSGPQGIAVDG 450
Query: 114 KGNIYVADKSNHVIRKIT 131
GNIYVAD N IRKIT
Sbjct: 451 GGNIYVADTGNSTIRKIT 468
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + + LAG GVQG +DG A F++P A+D GNIYV+D + IRK+T
Sbjct: 85 VTASGLCRTLAGTAGVQGSADGMGAQASFNQPSGIALDSDGNIYVSDYGSSTIRKVT 141
Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S VV LAG G G +DG SARF P+ AVD G +++AD +N IR +T
Sbjct: 303 ISPAGVVTTLAGLSGTAGSADGTNSSARFLGPQGVAVDSTGTVFIADTANSTIRVMT 359
Score = 45.4 bits (106), Expect = 0.011, Method: Composition-based stats.
Identities = 22/51 (43%), Positives = 33/51 (64%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ +AG +G +DG GSA+ P+ AVD GN+YVAD N++IR +T
Sbjct: 36 LIGTVAGYAGKGSADGVGGSAQLFGPQGVAVDGAGNVYVADTGNNIIRVVT 86
Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats.
Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 3/77 (3%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
G ++ V D+ S ++ P S + AG+ G +DG +A+F +P+ AVD
Sbjct: 450 GGGNIYVADTGNSTIRKIT-PSGSTSTLAGSAGN--PGNADGAGITAQFYQPQGVAVDSA 506
Query: 115 GNIYVADKSNHVIRKIT 131
N+YVAD NH +R +T
Sbjct: 507 NNVYVADTGNHTVRMVT 523
Score = 43.9 bits (102), Expect = 0.029, Method: Composition-based stats.
Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Query: 85 LAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
LAG G G DG ARF+ P AVD GN+YV D +N IRK+T+
Sbjct: 531 LAGLAGTFGTFDGTNAGARFNGPTGIAVDGAGNLYVTDYNNDTIRKVTS 579
Score = 42.4 bits (98), Expect = 0.100, Method: Composition-based stats.
Identities = 33/93 (35%), Positives = 42/93 (45%), Gaps = 5/93 (5%)
Query: 40 GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEP 98
G Q N S I ++ V D S ++ + V LAG GV G +
Sbjct: 108 GAQASFNQPSGIALDSDGNIYVSDYGSSTIRKVT----QSGQVTTLAGMTGVTGSVNNTG 163
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+A F P AVD N+YVAD NH+IRKIT
Sbjct: 164 TNALFFHPMGLAVDNATNLYVADYGNHLIRKIT 196
Score = 40.0 bits (92), Expect = 0.53, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 30/57 (52%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ V LAG G+ G DG SA F P +VD GN+YV D N +RK+T
Sbjct: 577 VTSAGAVTTLAGWTGMWGSIDGAGNSALFFGPSGISVDALGNLYVIDSGNSTLRKLT 633
Score = 38.1 bits (87), Expect = 1.6, Method: Composition-based stats.
Identities = 24/71 (33%), Positives = 30/71 (42%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
+L V+DS S L+ +V GV G DG A F P V G +
Sbjct: 618 NLYVIDSGNSTLRKLTLSGGTWTVSTVAGMPGVNGGIDGSGAGAEFYYPAGVTVSAAGYV 677
Query: 118 YVADKSNHVIR 128
YVAD N+ IR
Sbjct: 678 YVADAGNNTIR 688
Score = 36.6 bits (83), Expect = 5.2, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG G G +D +A F +P ++ GN+YVAD N+ IR+I+
Sbjct: 254 VTTLAGAAGFVGSADASGTNALFHQPAGIGINSAGNLYVADYFNNTIRQIS 304
>gi|329908045|ref|ZP_08274805.1| hypothetical protein IMCC9480_3485 [Oxalobacteraceae bacterium
IMCC9480]
gi|327546778|gb|EGF31712.1| hypothetical protein IMCC9480_3485 [Oxalobacteraceae bacterium
IMCC9480]
Length = 685
Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats.
Identities = 32/87 (36%), Positives = 48/87 (55%), Gaps = 8/87 (9%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFD 104
P S+ IDR G+ L V D++ ++ + VV +AG GV G +DG +ARF
Sbjct: 120 PQSIAIDRAGT--LYVADTNNQTIRKIT----PQGVVTTIAGRVGVDGSTDGRGNAARFS 173
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
P+ AVD+ G + V+D NH +R I+
Sbjct: 174 YPQGIAVDVAGTVLVSDTYNHTVRTIS 200
Score = 50.4 bits (119), Expect = 4e-04, Method: Composition-based stats.
Identities = 26/57 (45%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S + V LAG G +DG +ARF+ P+S A+D G +YVAD +N IRKIT
Sbjct: 89 VSADGSVSTLAGAMSTPGAADGTGTAARFNYPQSIAIDRAGTLYVADTNNQTIRKIT 145
Score = 50.4 bits (119), Expect = 4e-04, Method: Composition-based stats.
Identities = 30/78 (38%), Positives = 43/78 (55%), Gaps = 6/78 (7%)
Query: 60 IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDM 113
IV D+ + F +F ++ VV +AG G G +DG ARF +P++ A+D
Sbjct: 443 IVADALGNLFVADTFNSRIRKITPAGVVTTIAGAGSSSGSTDGPGNIARFFEPEAIAIDA 502
Query: 114 KGNIYVADKSNHVIRKIT 131
N+YVAD NH +RKIT
Sbjct: 503 SRNLYVADTGNHTVRKIT 520
Score = 50.4 bits (119), Expect = 4e-04, Method: Composition-based stats.
Identities = 37/116 (31%), Positives = 57/116 (49%), Gaps = 10/116 (8%)
Query: 25 GLLLEDGYTVTTVIDGHQLE---INPHSVIDRPGSSDLIVLDSSRSAFYTLSFP------ 75
G+ + G T + G +LE ++ + R GS + D+S + +Y + P
Sbjct: 351 GVHVITGGTTLKTLAGKELERGMVDGNGAKARFGSLQGVTSDASGN-WYVVDTPHHMIRK 409
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV +LAG+G G DG A F+ P+ D GN++VAD N IRKIT
Sbjct: 410 ITPAGVVSQLAGNGKPGGVDGTGSGASFNYPRGIVADALGNLFVADTFNSRIRKIT 465
Score = 47.0 bits (110), Expect = 0.004, Method: Composition-based stats.
Identities = 28/68 (41%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
S+ A+ T+ ++ VV LAG ++ G DG +ARF P+ A D GN+YVAD
Sbjct: 564 SQPAYGTIR-KITPGGVVTTLAGRVLETGALDGAGDAARFFNPQGLAADNAGNVYVADTG 622
Query: 124 NHVIRKIT 131
N+ IR+IT
Sbjct: 623 NNTIRRIT 630
Score = 45.1 bits (105), Expect = 0.013, Method: Composition-based stats.
Identities = 26/57 (45%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S VV LAG G G DG +ARF P+ D NIYVAD N V+RK+T
Sbjct: 199 ISPGGVVGTLAGSAGNFGVLDGVRSAARFSNPQGLVTDAARNIYVADAGNGVLRKVT 255
Score = 42.7 bits (99), Expect = 0.069, Method: Composition-based stats.
Identities = 32/93 (34%), Positives = 48/93 (51%), Gaps = 14/93 (15%)
Query: 52 DRPGS------SDLIVLDSSRSAF------YTLSFPLSEESVVKRLAGD-GVQGYSDGEP 98
D PG+ + I +D+SR+ + +T+ ++ VV LAG G G DG
Sbjct: 484 DGPGNIARFFEPEAIAIDASRNLYVADTGNHTVR-KITVAGVVSTLAGSPGKYGSDDGTG 542
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ARF KP + AVD GN+ ++ + IRKIT
Sbjct: 543 AAARFLKPPAIAVDPSGNVVLSQPAYGTIRKIT 575
Score = 40.4 bits (93), Expect = 0.34, Method: Composition-based stats.
Identities = 19/31 (61%), Positives = 22/31 (70%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
ARF+ P AVD GN YVAD NHVIRK++
Sbjct: 60 ARFNFPIGIAVDGVGNRYVADWQNHVIRKVS 90
>gi|290992224|ref|XP_002678734.1| predicted protein [Naegleria gruberi]
gi|284092348|gb|EFC45990.1| predicted protein [Naegleria gruberi]
Length = 1366
Score = 52.4 bits (124), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 56/105 (53%), Gaps = 9/105 (8%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
L Y +TTV G + +P I + + DLIV D+ + +S+ VV R+AG
Sbjct: 399 LPSSYYITTVARGLK---DPQK-ISKLSNGDLIVSDTGDHSIKKISY---STGVVTRIAG 451
Query: 88 DGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
GV G+S DG + A+F+KP AV + IY+AD NH IR I
Sbjct: 452 TGVAGFSGDGGLATLAQFNKPYGIAVTINDEIYIADSLNHRIRFI 496
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 60/131 (45%), Gaps = 6/131 (4%)
Query: 2 ASHFVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIV 61
+++ V SLAI L+ + ++ G L + G E + R + D++V
Sbjct: 9 SANIVGFSLAIVLIMMAMVVMVKGTSLNPYSVKISTFAGGNGEFKDPQKLARLSNGDILV 68
Query: 62 LDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYV 119
D+ A ++ V+ +AG GV G++ +G +A+ +KP AV IY
Sbjct: 69 TDALGHAIK----KINGTGVITTIAGTGVAGFAGDNGPAINAQVNKPYGIAVSSNDEIYF 124
Query: 120 ADKSNHVIRKI 130
AD NH IRKI
Sbjct: 125 ADSMNHRIRKI 135
>gi|456737758|gb|EMF62435.1| Hypothetical protein EPM1_0109 [Stenotrophomonas maltophilia EPM1]
Length = 693
Score = 52.4 bits (124), Expect = 8e-05, Method: Composition-based stats.
Identities = 36/95 (37%), Positives = 46/95 (48%), Gaps = 7/95 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
+DG L+ N S I +L V D+ A + + V LAG G QG++D
Sbjct: 114 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGHAD 168
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G ARFD P AVD +G +YVAD N IR I
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 203
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I
Sbjct: 103 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRI 150
Score = 44.3 bits (103), Expect = 0.024, Method: Composition-based stats.
Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 5/102 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGD 88
VTT+ G Q + + R + I +D+ + +F + + V+ LAG
Sbjct: 156 VTTLAGGEQGHADGPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGG 215
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G +DG +ARFD P + A D +G + VAD N+ +R++
Sbjct: 216 DRPGLADGVGATARFDTPVALAFDAQGALLVADLFNNAVRRV 257
>gi|318059218|ref|ZP_07977941.1| hypothetical protein SSA3_14816 [Streptomyces sp. SA3_actG]
gi|318079124|ref|ZP_07986456.1| hypothetical protein SSA3_21010 [Streptomyces sp. SA3_actF]
Length = 610
Score = 52.4 bits (124), Expect = 9e-05, Method: Composition-based stats.
Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 4/84 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + +V D++R A L ++ V+R G G +G DG+ ARF++P
Sbjct: 183 PGKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGSGERGLRDGDAAGARFNEP 238
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
+ A+ G + VAD NH +R++
Sbjct: 239 QGLALLPDGAVVVADTVNHALRRV 262
>gi|302520584|ref|ZP_07272926.1| NHL repeat containing protein [Streptomyces sp. SPB78]
gi|302429479|gb|EFL01295.1| NHL repeat containing protein [Streptomyces sp. SPB78]
Length = 634
Score = 52.4 bits (124), Expect = 9e-05, Method: Composition-based stats.
Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 4/84 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + +V D++R A L ++ V+R G G +G DG+ ARF++P
Sbjct: 183 PGKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGSGERGLRDGDAAGARFNEP 238
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
+ A+ G + VAD NH +R++
Sbjct: 239 QGLALLPDGAVVVADTVNHALRRV 262
>gi|153806007|ref|ZP_01958675.1| hypothetical protein BACCAC_00252 [Bacteroides caccae ATCC 43185]
gi|149130684|gb|EDM21890.1| IPT/TIG domain protein [Bacteroides caccae ATCC 43185]
Length = 440
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 31/50 (62%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
VV+ +AG GY D GSA F P+ AVD G IYV DK N+VIRKI
Sbjct: 388 VVRLVAGGTQAGYVDANAGSAAFSGPQDLAVDKNGVIYVYDKKNNVIRKI 437
Score = 39.7 bits (91), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 43/92 (46%), Gaps = 15/92 (16%)
Query: 55 GSSDLIVLDSSRSAFYTLS-FPLSEESVVKRLAG--DGVQGYSDGEPGSARF--DKPK-- 107
G +L + ++R T+ F S +VV LAG +G GY DG A F D K
Sbjct: 98 GDGELTITIANREPVRTIEKFTYSFSAVVTTLAGSANGEPGYQDGVGSEALFFFDAAKAE 157
Query: 108 --------SFAVDMKGNIYVADKSNHVIRKIT 131
S VD GN+YV D N+ +RKIT
Sbjct: 158 PAEDWKKGSVCVDDDGNVYVGDCVNYCVRKIT 189
Score = 36.6 bits (83), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 39/85 (45%), Gaps = 5/85 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA-----DKSNHVIRKITNLDHG 136
+ L G G GY +G A F P A+D G+IYVA D ++ + I LD
Sbjct: 326 AENLIGKGTSGYLNGSFEDALFTFPSDLAIDSNGDIYVAGNGAWDGGENLDQSIRLLDMT 385
Query: 137 NQLIRQINLKPEDCSKSSQSGSGMF 161
N+++R + + + +GS F
Sbjct: 386 NRVVRLVAGGTQAGYVDANAGSAAF 410
>gi|375146530|ref|YP_005008971.1| NHL repeat containing protein [Niastella koreensis GR20-10]
gi|361060576|gb|AEV99567.1| NHL repeat containing protein [Niastella koreensis GR20-10]
Length = 434
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 36/56 (64%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + V +AG G GY++G A+F+ P + A+D GN+YVA+++ IRKIT
Sbjct: 174 IAPDGTVTTIAGTGENGYNEGPASGAKFNNPCATAMDANGNMYVAERNGRRIRKIT 229
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 32/50 (64%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V +AG G GY++G S RF+ P AVD GN+Y+ D +N+ +RKIT
Sbjct: 383 VTTIAGAGDSGYAEGTGSSVRFNGPTGIAVDKNGNMYILDMANNRVRKIT 432
Score = 43.1 bits (100), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 39/81 (48%), Gaps = 3/81 (3%)
Query: 53 RPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK---SF 109
+ G+ ++V + T F VV LAG G G++DG+ +A F+
Sbjct: 94 KTGTGPVVVTIGKNTGASTTDFKYLFSYVVTTLAGSGNAGFADGKGANASFNFAGVRCQL 153
Query: 110 AVDMKGNIYVADKSNHVIRKI 130
+VD GN+YV D N IRKI
Sbjct: 154 SVDNIGNVYVPDGGNQRIRKI 174
>gi|375146499|ref|YP_005008940.1| cell surface receptor IPT/TIG domain-containing protein [Niastella
koreensis GR20-10]
gi|361060545|gb|AEV99536.1| cell surface receptor IPT/TIG domain protein [Niastella koreensis
GR20-10]
Length = 500
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 42/76 (55%), Gaps = 4/76 (5%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
G ++ V D S + Y + P + S++ AG GY DG+P ARF P+ +D
Sbjct: 426 GVGNIYVTDDSTNGIYRVD-PNGKLSLI---AGGVRSGYIDGKPQDARFSGPRGIVIDAS 481
Query: 115 GNIYVADKSNHVIRKI 130
GN++VAD N+ IRKI
Sbjct: 482 GNLFVADIGNNCIRKI 497
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 4/75 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+ +LIV + S ++ PL VV +AG G GY DG +A+F P+ +D G
Sbjct: 232 AGNLIVSEIGNSKIRRIT-PLG---VVSTIAGSGTYGYLDGPGLTAQFHFPQGLTLDNAG 287
Query: 116 NIYVADKSNHVIRKI 130
NIYVA+ N IRKI
Sbjct: 288 NIYVAEYQNQSIRKI 302
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 3/75 (4%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+S++ V D S + +S + ++ L+G+G G DG+P A+F +P A+D
Sbjct: 330 ASNVYVTDQSNNRICKISL---QTGLLSVLSGNGNWGMVDGDPQQAQFYQPAKMALDNNN 386
Query: 116 NIYVADKSNHVIRKI 130
N+ +ADK N +RK+
Sbjct: 387 NLIIADKINGRVRKV 401
>gi|423219560|ref|ZP_17206056.1| hypothetical protein HMPREF1061_02829 [Bacteroides caccae
CL03T12C61]
gi|392624765|gb|EIY18843.1| hypothetical protein HMPREF1061_02829 [Bacteroides caccae
CL03T12C61]
Length = 436
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 31/50 (62%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
VV+ +AG GY D GSA F P+ AVD G IYV DK N+VIRKI
Sbjct: 384 VVRLVAGGTQAGYVDANAGSAAFSGPQDLAVDKNGVIYVYDKKNNVIRKI 433
Score = 39.7 bits (91), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 43/92 (46%), Gaps = 15/92 (16%)
Query: 55 GSSDLIVLDSSRSAFYTL-SFPLSEESVVKRLAG--DGVQGYSDGEPGSARF--DKPK-- 107
G +L + ++R T+ F S +VV LAG +G GY DG A F D K
Sbjct: 94 GDGELTITIANREPVRTIDKFTYSFSAVVTTLAGSANGEPGYQDGVGSEALFFFDAAKAE 153
Query: 108 --------SFAVDMKGNIYVADKSNHVIRKIT 131
S VD GN+YV D N+ +RKIT
Sbjct: 154 PAEDWKKGSVCVDDDGNVYVGDCVNYCVRKIT 185
Score = 36.6 bits (83), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 39/85 (45%), Gaps = 5/85 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA-----DKSNHVIRKITNLDHG 136
+ L G G GY +G A F P A+D G+IYVA D ++ + I LD
Sbjct: 322 AENLIGKGTSGYLNGSFEDALFTFPSDLAIDSNGDIYVAGNGAWDGGENLDQSIRLLDMT 381
Query: 137 NQLIRQINLKPEDCSKSSQSGSGMF 161
N+++R + + + +GS F
Sbjct: 382 NRVVRLVAGGTQAGYVDANAGSAAF 406
>gi|351729452|ref|ZP_08947143.1| NHL repeat-containing protein [Acidovorax radicis N35]
Length = 659
Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats.
Identities = 21/43 (48%), Positives = 32/43 (74%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V ++DG+ +ARF +PK+ AVD GN+YVAD NH++R+I+
Sbjct: 43 AVSNHADGDGPAARFYQPKAVAVDTAGNMYVADTENHLVRRIS 85
Score = 42.0 bits (97), Expect = 0.12, Method: Composition-based stats.
Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 11/67 (16%)
Query: 76 LSEESVVKRLAGD--GVQGYSDGEPGSARF---------DKPKSFAVDMKGNIYVADKSN 124
++ +V LAG+ GY+DG +A F + + AVD +GN+Y+ D ++
Sbjct: 538 ITPSGMVSTLAGNPAAAPGYADGTGAAAVFTVDLGFRYPQQTAALAVDAQGNVYITDGAH 597
Query: 125 HVIRKIT 131
HV+R+IT
Sbjct: 598 HVVRRIT 604
Score = 39.3 bits (90), Expect = 0.76, Method: Composition-based stats.
Identities = 21/56 (37%), Positives = 28/56 (50%), Gaps = 2/56 (3%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
P ++V LA G DG+ GSARFD P + A G +YV D N +R +
Sbjct: 321 PQGTSTLVAGLAS--APGADDGQGGSARFDAPGALAWSPSGTLYVRDDGNRSLRTV 374
Score = 37.4 bits (85), Expect = 2.8, Method: Composition-based stats.
Identities = 21/47 (44%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
+S V LAG GV G DG +A F +P S AVD G +YVA+
Sbjct: 84 ISPTGAVTTLAGKPGVCGNQDGTSDTATFCRPSSIAVDKAGTVYVAE 130
Score = 37.4 bits (85), Expect = 2.9, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 1/50 (2%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V LAG G G++DG+ +A+F + A D GN+YV D H + KI
Sbjct: 482 VTLLAGSLGTVGHADGQGAAAQFSALGNMATDAAGNVYVVDGLFHEVNKI 531
>gi|344924044|ref|ZP_08777505.1| NHL repeat containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 383
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 30/47 (63%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LAG G GY +G +A+F P S AVD G +YV D +N+ IRKIT
Sbjct: 201 LAGSGATGYVEGTGSAAQFSTPTSVAVDSSGTVYVIDANNYRIRKIT 247
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 49/88 (55%), Gaps = 8/88 (9%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
NP+++ ID G+ L V D+ Y + ++ LAG G GY++G SA+F+
Sbjct: 59 NPYAITIDSSGT--LYVADNGN---YRIR-KITSSGTTSLLAGSGTTGYAEGTGASAQFN 112
Query: 105 KPK-SFAVDMKGNIYVADKSNHVIRKIT 131
+ A D GN+YV+D +N+ +RKIT
Sbjct: 113 TFQWGIAADNSGNVYVSDTTNNRVRKIT 140
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 27/41 (65%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+GY +G +ARF+ P + +D G +YVAD N+ IRKIT
Sbjct: 45 RGYQEGTGSAARFENPYAITIDSSGTLYVADNGNYRIRKIT 85
Score = 40.4 bits (93), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 26/46 (56%), Gaps = 4/46 (8%)
Query: 90 VQGYSDGEPGSARFD----KPKSFAVDMKGNIYVADKSNHVIRKIT 131
QGY++G +ARF+ P VD G +YVAD N+ IR IT
Sbjct: 259 TQGYAEGTGSAARFNFFNLIPSGITVDNAGTVYVADTFNYRIRTIT 304
Score = 39.3 bits (90), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LAG GY +G AR P+ AV+ G +YVA ++ IR IT
Sbjct: 148 LAGS-TSGYQEGTGAGARLSSPRGLAVNSAGTVYVATATSERIRAIT 193
>gi|299473428|emb|CBN77825.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 2120
Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats.
Identities = 22/46 (47%), Positives = 29/46 (63%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G G GY+DG +A F+ P+ AVD N+YVAD NH IR+I+
Sbjct: 205 GGSGEAGYADGSGTAALFNDPQDVAVDAHANVYVADTGNHRIRRIS 250
Score = 44.7 bits (104), Expect = 0.018, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 34/60 (56%), Gaps = 4/60 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN----IYVADKSNHVIRKIT 131
+S E VV +AGDG +G DG+ A F P A+ + +YVAD +NH +RKI+
Sbjct: 249 ISPEGVVTTVAGDGEEGSDDGDAMEASFSFPGGIALYYDSSEGLVLYVADTNNHRLRKIS 308
Score = 43.9 bits (102), Expect = 0.036, Method: Composition-based stats.
Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 84 RLAGDGV--QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
R+A + G++DG+ ARFD P A G ++VAD +NH+IR +
Sbjct: 334 RMAAEATPEAGFADGDGSYARFDGPSGLAAAEDGTLFVADTNNHLIRMV 382
>gi|182412380|ref|YP_001817446.1| immunoglobulin I-set domain-containing protein [Opitutus terrae
PB90-1]
gi|177839594|gb|ACB73846.1| Immunoglobulin I-set domain protein [Opitutus terrae PB90-1]
Length = 1130
Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats.
Identities = 33/89 (37%), Positives = 49/89 (55%), Gaps = 6/89 (6%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFD 104
NP +++ P S DL V DS+ A ++ VV LAG ++ GY DG A+F
Sbjct: 126 NPSALVVGP-SGDLYVADSNGHAIRKVT----PAGVVTTLAGGPLRYGYMDGPGTEAQFS 180
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNL 133
P+ AV+ G I+V+D+S H IR++ L
Sbjct: 181 YPRGIAVNATGVIFVSDRSAHTIRRVDQL 209
Score = 50.1 bits (118), Expect = 4e-04, Method: Composition-based stats.
Identities = 22/42 (52%), Positives = 27/42 (64%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G G +DG ARF P+ A+D GN+YVAD +NH IRKI
Sbjct: 220 GSAGSADGPGDQARFRDPEGLAIDAAGNVYVADINNHTIRKI 261
Score = 44.7 bits (104), Expect = 0.018, Method: Composition-based stats.
Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 6/77 (7%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMK 114
S ++ V+D S +S + +V AG G G DG +ARF P +D
Sbjct: 499 SGNIFVVDRGASTIRKIS-----QGIVTTFAGMPGETGQDDGAGAAARFRDPMGIVIDGA 553
Query: 115 GNIYVADKSNHVIRKIT 131
N+YVAD +N IRK+T
Sbjct: 554 DNLYVADTNNWKIRKVT 570
Score = 44.3 bits (103), Expect = 0.022, Method: Composition-based stats.
Identities = 24/51 (47%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 82 VKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V +AG DG G +DG SARF AVD GNI+V D+ IRKI+
Sbjct: 466 VTTIAGADGEDGDTDGPAASARFRAVDGLAVDSSGNIFVVDRGASTIRKIS 516
Score = 43.5 bits (101), Expect = 0.040, Method: Composition-based stats.
Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 1/51 (1%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG G G++DG +ARF P S A+D G I+V D N IRKI+
Sbjct: 267 VTTLAGAAGESGFADGPAANARFFCPTSLAIDPAGAIWVNDAINRAIRKIS 317
Score = 42.0 bits (97), Expect = 0.12, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV LAG V G DG +A F P + V G++YVAD + H IRK+T
Sbjct: 96 ITPAGVVTTLAGVANVHGCIDGVGSNALFHNPSALVVGPSGDLYVADSNGHAIRKVT 152
Score = 38.9 bits (89), Expect = 1.0, Method: Composition-based stats.
Identities = 20/47 (42%), Positives = 26/47 (55%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LAG+ +G DG ARF+ A+ G +YVAD +N IR IT
Sbjct: 51 LAGEWNKGSRDGAAAVARFNGANGVAIAPNGLVYVADLANSTIRAIT 97
Score = 37.0 bits (84), Expect = 4.2, Method: Composition-based stats.
Identities = 31/101 (30%), Positives = 48/101 (47%), Gaps = 15/101 (14%)
Query: 33 TVTTVID--GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
TVTTV D G + I+P+ V+ + D I S S ++ P +S R
Sbjct: 321 TVTTVADTAGEGITIDPNGVL-YIAADDRIKRLESGSVLSVVAGP--TDSYTSR------ 371
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+G +ARF +P A+ + GN+YV D + IR++T
Sbjct: 372 ----NGVGANARFVQPIGSALAVDGNLYVTDSGGYAIRRVT 408
>gi|357516309|ref|XP_003628443.1| hypothetical protein MTR_8g058300 [Medicago truncatula]
gi|355522465|gb|AET02919.1| hypothetical protein MTR_8g058300 [Medicago truncatula]
Length = 160
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 48/122 (39%), Gaps = 46/122 (37%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------------- 132
G G+ DG+ ARF PK A+D KGN+YVAD N IRKI +
Sbjct: 24 GRTGHVDGKLSDARFHYPKGIALDDKGNVYVADTQNMAIRKIGDAGVTTIAGGKSNVAGY 83
Query: 133 ---------------------------LDHGNQLIRQINLKPEDCSKSSQSGS---GMFW 162
+D GN +RQI L EDC+ S S S FW
Sbjct: 84 RDGPGEDAKLSNDFDVVYIRPTCSLLVIDRGNAALRQIFLNQEDCNYQSSSISLTGSEFW 143
Query: 163 VT 164
++
Sbjct: 144 IS 145
>gi|301122083|ref|XP_002908768.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099530|gb|EEY57582.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1981
Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats.
Identities = 22/46 (47%), Positives = 31/46 (67%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AGDG G+ DG +++F+ P+ AVD G +YVAD +NH IR I
Sbjct: 208 VAGDGSAGFQDGLAATSQFNHPRGVAVDSNGVVYVADTANHRIRII 253
Score = 50.4 bits (119), Expect = 3e-04, Method: Composition-based stats.
Identities = 21/39 (53%), Positives = 27/39 (69%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G +DG+P ARFD P AVD G ++VAD NH+IR+I
Sbjct: 344 GMADGDPLGARFDSPMGVAVDADGVVFVADTGNHLIRRI 382
Score = 44.3 bits (103), Expect = 0.022, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 32/56 (57%), Gaps = 5/56 (8%)
Query: 82 VKRLAGDGVQGYSDGEP-GSARFDKPKSFAVDMKGN----IYVADKSNHVIRKITN 132
V LAGDGV+G+ DG +ARF P AV + ++VAD NH IR+I N
Sbjct: 260 VSTLAGDGVEGFVDGPALTAARFSYPSDVAVRESNSGTITVFVADTGNHRIRQIRN 315
Score = 36.6 bits (83), Expect = 5.1, Method: Composition-based stats.
Identities = 17/50 (34%), Positives = 25/50 (50%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG V G+ DGE A F+ P G +Y A ++ +R++T
Sbjct: 471 VVTLAGSNVPGHIDGEGNEATFNAPAGVTFAADGRVYAASSTDCSVRQVT 520
>gi|153868931|ref|ZP_01998653.1| receptor protein kinase [Beggiatoa sp. PS]
gi|152074497|gb|EDN71345.1| receptor protein kinase [Beggiatoa sp. PS]
Length = 3115
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 24/51 (47%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG+G +GYS DG+P +A + + P VD GN+Y+ADK+NH IRK+
Sbjct: 2336 ITTVAGNGNKGYSGDGDPATAAQINTPTGLEVDSTGNLYIADKNNHRIRKV 2386
Score = 49.7 bits (117), Expect = 7e-04, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 77 SEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
S S++ +AG+G GY +G AR P AVD + N+Y+AD NH IRK+
Sbjct: 2275 SANSIITTVAGNGRSGYQGDNGPATGARLSNPTGLAVDSQNNLYIADTDNHRIRKV 2330
Score = 43.1 bits (100), Expect = 0.051, Method: Composition-based stats.
Identities = 20/48 (41%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG G +G + DG A+ P+ A+D +GN+Y+AD NH IRK+
Sbjct: 2163 VAGTGAKGSTGDDGPAIEAKLKNPQGTAIDHEGNLYIADTLNHRIRKV 2210
Score = 40.4 bits (93), Expect = 0.38, Method: Composition-based stats.
Identities = 23/58 (39%), Positives = 31/58 (53%), Gaps = 2/58 (3%)
Query: 79 ESVVKRLAGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
E ++ G G G +DG S A+ +P A+D GN+Y+ADK N IRKI D
Sbjct: 2389 EGIITTFTGTGKPGTATDGIIASVAQISQPTDVALDQYGNLYIADKGNDTIRKIGEKD 2446
Score = 36.6 bits (83), Expect = 4.8, Method: Composition-based stats.
Identities = 17/53 (32%), Positives = 30/53 (56%), Gaps = 2/53 (3%)
Query: 81 VVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ +AG G G + +G +A+ P + D G++Y+AD NH IRK++
Sbjct: 2215 IITTVAGIGKAGNTGDNGLATAAKLRNPTAIVFDNNGHLYIADSGNHRIRKVS 2267
>gi|86605024|ref|YP_473787.1| NHL repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86553566|gb|ABC98524.1| NHL repeat domain protein [Synechococcus sp. JA-3-3Ab]
Length = 637
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 35/56 (62%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S + V AG V GY DGE A+F +P A+ + G++YVAD++N IR+IT
Sbjct: 408 ISRQRQVTTFAGTCVAGYRDGERDEAQFREPFGLALGLDGSLYVADRANRRIRRIT 463
Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats.
Identities = 24/58 (41%), Positives = 33/58 (56%)
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
F LS E ++ AG G G+ DG A+FD P+ D KG ++VAD NH +R I+
Sbjct: 352 FRLSPEGELEVWAGSGKAGHRDGAADQAQFDSPQGLLWDPKGGLWVADSGNHCLRHIS 409
Score = 38.9 bits (89), Expect = 0.91, Method: Composition-based stats.
Identities = 18/50 (36%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V AG G G +DG A+ +P + AVD +GN+++AD+ H +R+++
Sbjct: 468 VTTAAGTGQPGSADGPADQAQLLQPTALAVDREGNLWIADR--HRLRRLS 515
Score = 38.5 bits (88), Expect = 1.3, Method: Composition-based stats.
Identities = 19/63 (30%), Positives = 32/63 (50%), Gaps = 8/63 (12%)
Query: 76 LSEESVVKRLAGDGV--------QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
+++ ++RL+ DG GY DG ARF A D G +++AD+ NH +
Sbjct: 505 IADRHRLRRLSADGQLTTLSRAEAGYRDGPLAEARFQTLAGLAFDSAGILWLADRDNHRL 564
Query: 128 RKI 130
R++
Sbjct: 565 RRL 567
>gi|116624994|ref|YP_827150.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116228156|gb|ABJ86865.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 912
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 27/51 (52%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ LAG GV GYS DG P SA+ + P++ AVD GN+YVAD +N+ +RKI
Sbjct: 511 ISTLAGTGVSGYSGDGGPATSAQLNGPQAVAVDGSGNVYVADTANNRVRKI 561
Score = 41.6 bits (96), Expect = 0.15, Method: Composition-based stats.
Identities = 21/52 (40%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ +AG G G++ G SA+ + P S A+D GN+Y+A+ SN+ IRK+
Sbjct: 454 VMSTVAGSGTSGFAGDGGAATSAQLNAPFSVALDAAGNLYIAEFSNNRIRKV 505
Score = 41.6 bits (96), Expect = 0.17, Method: Composition-based stats.
Identities = 29/103 (28%), Positives = 50/103 (48%), Gaps = 4/103 (3%)
Query: 32 YTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS--FPLSEESVVKRLAGDG 89
YT++TV G ++ G I L F + + F + + +AG+
Sbjct: 11 YTISTVAGGAPPPTPVAALSTSIGQPRKIALSGGNMYFSSGNSVFKIDGSGTLTLVAGNS 70
Query: 90 VQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G+S DG P +A+ + P+ A+D GN+Y+AD N+ +RK+
Sbjct: 71 RAGFSGDGGPAVNAQLNSPQGVALDSAGNLYIADSQNNRVRKV 113
Score = 40.8 bits (94), Expect = 0.24, Method: Composition-based stats.
Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 87 GDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GDG QG++ G P P S VD GN+Y AD N+ IRK++
Sbjct: 294 GDGTQGFAGDGGAPNKVEMSLPTSVQVDSSGNLYFADSLNNRIRKLS 340
Score = 37.4 bits (85), Expect = 3.0, Method: Composition-based stats.
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ + P+ AVD GN+Y+AD NH +RK++
Sbjct: 422 QLNNPQGLAVDSAGNLYIADTQNHRVRKVS 451
Score = 36.2 bits (82), Expect = 6.2, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 36/69 (52%), Gaps = 10/69 (14%)
Query: 72 LSFPLSEESVVKRLAGDGVQGYSD----------GEPGSARFDKPKSFAVDMKGNIYVAD 121
L F S + +++L+G V Y+ G +A+ + P AVD GN+YV+D
Sbjct: 326 LYFADSLNNRIRKLSGGNVNTYAGNGIVARSGDGGAATNAQLNTPLGVAVDAAGNLYVSD 385
Query: 122 KSNHVIRKI 130
N+++R++
Sbjct: 386 TLNNLVRRV 394
>gi|325104254|ref|YP_004273908.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
gi|324973102|gb|ADY52086.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
Length = 439
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 58/114 (50%), Gaps = 20/114 (17%)
Query: 29 EDGYTVTTV--------IDGHQLEI---NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPL 76
E Y V+T+ IDG+ L+ NP + +D G D+I+ D + + +
Sbjct: 117 ESNYIVSTIAGSTTFGFIDGNGLDARFRNPDGILVDMNG--DIIITDRTNHSIR----KM 170
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ VV LAG GV GY++G+PG +F+ P +D GNI V +K IRKI
Sbjct: 171 TTAGVVSTLAGTGVSGYANGKPG--QFNTPWQSTMDAAGNIIVIEKDGGRIRKI 222
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 33/52 (63%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ V +AG+G G DG +A F++P +D KG+IYVAD NH IRKI
Sbjct: 385 KATVSTIAGNGTAGRGDGLGHAATFNQPYDVVMDAKGDIYVADNVNHSIRKI 436
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 45/84 (53%), Gaps = 6/84 (7%)
Query: 48 HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK 107
S +D G+ +I D R ++ + V +AG G G++DG ARF+
Sbjct: 200 QSTMDAAGNIIVIEKDGGRIR------KIAPDGAVSLIAGTGSLGFTDGNVSVARFNHAL 253
Query: 108 SFAVDMKGNIYVADKSNHVIRKIT 131
VD +GNI+VAD++N+ IRKIT
Sbjct: 254 DGVVDSEGNIFVADRNNYRIRKIT 277
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 46/91 (50%), Gaps = 13/91 (14%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFP-----------LSEES--VVKRLAGDGVQGYSDGEPGS 100
P ++IV D++ + T++ + +ES +V +AG G+ DG
Sbjct: 81 PNRLEVIVPDNATTGNVTIAVKGQQIYGPRFTVIKKESNYIVSTIAGSTTFGFIDGNGLD 140
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
ARF P VDM G+I + D++NH IRK+T
Sbjct: 141 ARFRNPDGILVDMNGDIIITDRTNHSIRKMT 171
Score = 35.8 bits (81), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 32/59 (54%), Gaps = 4/59 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+++ V+ + G +GY DG PG + + D +GNI +AD SN+ IRKIT
Sbjct: 319 ITKSGVMTTIVGS-TKGYDDGTPGNPLTVKLGDIFGLNFDNEGNILLADASNNRIRKIT 376
>gi|387219837|gb|AFJ69627.1| nhl repeat containing 2 [Nannochloropsis gaditana CCMP526]
Length = 295
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKI 130
V RLAG G +GY DG A FD+P+ VD K G +V D +NH IR +
Sbjct: 229 VSRLAGSGKRGYRDGAGAEAEFDEPRGLCVDGKGGQAFVCDSNNHCIRTV 278
>gi|441163236|ref|ZP_20968263.1| hypothetical protein SRIM_29923 [Streptomyces rimosus subsp.
rimosus ATCC 10970]
gi|440616375|gb|ELQ79517.1| hypothetical protein SRIM_29923 [Streptomyces rimosus subsp.
rimosus ATCC 10970]
Length = 238
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 37/57 (64%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + ++ +AG+G GY SDG P +A + + P S AVD GN+Y+AD NH +RK+
Sbjct: 127 VDTKGIITTVAGNGTGGYVSDGGPATATQLNGPHSVAVDRDGNVYIADYHNHRVRKV 183
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 36/57 (63%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + ++ +AG+G+ GY SDG P +A + P A D GN+Y+AD N+ +RK+
Sbjct: 71 VTPQGIITTVAGNGIAGYVSDGGPATATQLSSPCGVAADGAGNLYIADLGNNRVRKV 127
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Query: 79 ESVVKRLAGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ + +AG+G GY SDG P + + + P A+D GN+Y+ +SN+ +RK+T
Sbjct: 18 QGCIATVAGNGTAGYLSDGGPATLTQLNWPHDVALDEHGNLYIVCRSNNRVRKVT 72
>gi|347754243|ref|YP_004861807.1| gluconolactonase [Candidatus Chloracidobacterium thermophilum B]
gi|347586761|gb|AEP11291.1| Gluconolactonase [Candidatus Chloracidobacterium thermophilum B]
Length = 1222
Score = 51.6 bits (122), Expect = 2e-04, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 35/56 (62%), Gaps = 2/56 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
L+ + V LAGDGV+G DG ARF P++ AVD G +YVAD + +R+I+
Sbjct: 333 LTLDGAVTTLAGDGVEGTRDGVGKQARFKAPRAVAVDASGTVYVADDAR--VRRIS 386
Score = 50.4 bits (119), Expect = 4e-04, Method: Composition-based stats.
Identities = 24/52 (46%), Positives = 31/52 (59%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
E VV AGD G+ DG SAR + P A+D +G +Y+AD NH IRK+
Sbjct: 616 ELVVTVTAGDSTPGFLDGVGTSARLNHPVGLALDAQGVLYIADHFNHAIRKL 667
Score = 45.4 bits (106), Expect = 0.011, Method: Composition-based stats.
Identities = 29/75 (38%), Positives = 39/75 (52%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQIN 144
LAG G +G+ DG A+F+ P AV G +YVAD N IRK+T + + L
Sbjct: 676 LAGGGQRGFQDGYGPEAQFNGPLGLAVGRDGELYVADHLNMRIRKVTPDGYVSTLAGTGI 735
Query: 145 LKPEDCSKSSQSGSG 159
K ED S ++ S G
Sbjct: 736 SKIEDGSVATASFEG 750
Score = 43.9 bits (102), Expect = 0.036, Method: Composition-based stats.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
SV A +G GY+DG A+F +P A D +GN+YVAD H IR+I+
Sbjct: 232 SVSTLTASEG--GYADGPLTVAKFLRPNGIACDPQGNLYVADFGGHRIRQIS 281
Score = 42.7 bits (99), Expect = 0.064, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + V LAG G+ DG +A F+ PK AVDM G +YV D +R IT
Sbjct: 721 VTPDGYVSTLAGTGISKIEDGSVATASFEGPKGVAVDMHGVVYVTDGV--TVRTIT 774
Score = 40.4 bits (93), Expect = 0.33, Method: Composition-based stats.
Identities = 24/80 (30%), Positives = 45/80 (56%), Gaps = 14/80 (17%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPG-------SARFDKPKSFAV 111
+ +D+S + + +++++ V+R++ G V + GEPG +ARFD A+
Sbjct: 366 VAVDASGTVY------VADDARVRRISPGGMVVTIAGGEPGCVDGPAEAARFDTLSGLAL 419
Query: 112 DMKGNIYVADKSNHVIRKIT 131
D GN+Y+AD N +RK++
Sbjct: 420 DRVGNLYLADAGNRRLRKLS 439
Score = 36.2 bits (82), Expect = 6.0, Method: Composition-based stats.
Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + V+ LAG V+G+ DG A F + AVD+ G +V D +NH +R I
Sbjct: 773 ITPDGEVRTLAGQ-VRGFRDGIGTRAMFGWAYAIAVDVSGLCFVTDAANHAVRCI 826
>gi|408828763|ref|ZP_11213653.1| hypothetical protein SsomD4_16351 [Streptomyces somaliensis DSM
40738]
Length = 614
Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats.
Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 18/108 (16%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +L+V D++R ++ P E+VV+R GDG +G DG SARF +P+ G
Sbjct: 197 SGNLLVSDTTRHRLVEVA-P-DGETVVRRY-GDGRRGLVDGPAESARFSEPQGMCALPDG 253
Query: 116 NIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWV 163
I VAD NH +R L PE + S+ +G+G W+
Sbjct: 254 RIVVADTVNHALRA---------------LDPETGAVSTLAGTGAQWM 286
>gi|386841352|ref|YP_006246410.1| hypothetical protein SHJG_5269 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374101653|gb|AEY90537.1| hypothetical protein SHJG_5269 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451794647|gb|AGF64696.1| hypothetical protein SHJGH_5033 [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 611
Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats.
Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 3/75 (4%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +V D++R L+ E+VV+R+ G G +G++DG P A F +P+ A+ G
Sbjct: 189 SGTFLVSDTTRHQLVELA--EDAETVVRRI-GSGTRGFADGGPREASFSEPQGLALLDDG 245
Query: 116 NIYVADKSNHVIRKI 130
++ VAD NH +R++
Sbjct: 246 SVVVADTVNHALRRL 260
>gi|393778566|ref|ZP_10366837.1| Ig family protein [Ralstonia sp. PBA]
gi|392714468|gb|EIZ02071.1| Ig family protein [Ralstonia sp. PBA]
Length = 342
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 34/56 (60%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ +V AG G G +DG +A F+ P A+D GN+YVAD N+++R+IT
Sbjct: 227 ITPGGLVTTFAGSGTTGSADGPGNTASFNSPGGVALDGDGNVYVADTGNNLLRRIT 282
Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ VV LAG GV G +G +ARF +P VD G +YVAD N++IRK+
Sbjct: 281 ITRAGVVSTLAGQAGVTGAQNGIGSAARFKQPYGVVVDADGTVYVADTFNNLIRKV 336
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 43/87 (49%), Gaps = 8/87 (9%)
Query: 47 PHSVI-DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFD 104
P+ +I D+ G +L V D ++ P E + +AG G+ G DG +A FD
Sbjct: 148 PYGIIFDKNG--NLFVSDVQTGILRKIT-PAGETT---NIAGGGISSGPVDGNGAAAYFD 201
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
A+D GNIY + S + IRKIT
Sbjct: 202 GLGGIALDAAGNIYATELSGNRIRKIT 228
>gi|223937279|ref|ZP_03629185.1| NHL repeat containing protein [bacterium Ellin514]
gi|223894064|gb|EEF60519.1| NHL repeat containing protein [bacterium Ellin514]
Length = 429
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 29/55 (52%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + AG G DG SARF P S A D GNIYVAD N IRKIT
Sbjct: 25 AQSLTINTFAGHDAPGSKDGLSSSARFRHPNSIAADSAGNIYVADTENSTIRKIT 79
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 34/57 (59%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV LAG S DG +A F +P+S AVD GN+YVAD NH IRKIT
Sbjct: 133 ITPAGVVSTLAGSAGNINSFDGTGINANFYQPRSLAVDNGGNVYVADTWNHTIRKIT 189
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ +V LAG G G +DG ARF++P AVD N++V D NH IRKIT
Sbjct: 188 ITPAGLVSTLAGLAGNPGSADGTNSKARFNRPSGIAVDNATNLFVTDFHNHTIRKIT 244
Score = 45.8 bits (107), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ V +AG GV G +DG ARF +P+ D GN++VAD N IRKI+
Sbjct: 243 ITPGGTVTTIAGLPGVWGNADGTNNVARFFQPQGIVADNAGNLFVADSGNQTIRKIS 299
Score = 43.1 bits (100), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 40/80 (50%), Gaps = 4/80 (5%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKS 108
V D G +L V DS +S P +V +AG G+ G ++G +ARF P
Sbjct: 278 VADNAG--NLFVADSGNQTIRKIS-PSGTNWIVSTVAGLSGIAGNANGTNNTARFYFPAD 334
Query: 109 FAVDMKGNIYVADKSNHVIR 128
A D+ G IYVAD N+ IR
Sbjct: 335 VAQDIAGYIYVADLGNNAIR 354
Score = 42.4 bits (98), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 56/139 (40%), Gaps = 31/139 (22%)
Query: 15 LSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD------------LIVL 62
LS V GLL V L IN + D PGS D I
Sbjct: 5 LSFKNLFVGGGLLAS-----MAVAHAQSLTINTFAGHDAPGSKDGLSSSARFRHPNSIAA 59
Query: 63 DSSRSAFYTLSFPLSEESVVKRLAGDG----------VQGYSDGEPGSARFDKPKSFAVD 112
DS+ + + + E S ++++ +G G +DG +A F P+ AVD
Sbjct: 60 DSAGNIYVADT----ENSTIRKITPNGSVSTFAGFAGTFGSADGVGTNALFYAPQGIAVD 115
Query: 113 MKGNIYVADKSNHVIRKIT 131
G IYVAD +N IRKIT
Sbjct: 116 SAGFIYVADTANATIRKIT 134
>gi|223993873|ref|XP_002286620.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977935|gb|EED96261.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1937
Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats.
Identities = 20/42 (47%), Positives = 29/42 (69%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
GY+DGE ARFD P+ V G ++VAD +NH++R+I+ L
Sbjct: 392 GYADGEKDEARFDSPQGITVASDGRVFVADTNNHLVREISRL 433
Score = 49.3 bits (116), Expect = 7e-04, Method: Composition-based stats.
Identities = 25/61 (40%), Positives = 35/61 (57%), Gaps = 2/61 (3%)
Query: 73 SFPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ L+ + V +AG G +G+ DG ARF P+ AVD G +YVAD NH IR I
Sbjct: 218 NYTLNPQGFVFTMAGSTSGEEGFVDGVESDARFRHPEGVAVDHDGYVYVADTGNHAIRMI 277
Query: 131 T 131
+
Sbjct: 278 S 278
>gi|444306537|ref|ZP_21142300.1| NHL repeat containing protein [Arthrobacter sp. SJCon]
gi|443481173|gb|ELT44105.1| NHL repeat containing protein [Arthrobacter sp. SJCon]
Length = 673
Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats.
Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 4/111 (3%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDL-IVLDSSRSAFYTLSF-PLSEESVVKRLA 86
ED T + GH LE++ S D S L V+ + SF P+S E V +A
Sbjct: 324 EDAAGFTGRLSGHPLEVSLSSPWDLVWSRKLNAVVIAMAGTHQIFSFDPVSGE--VDIIA 381
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
G+G++G DG A F +P A D GNI+VAD +RK+ D G+
Sbjct: 382 GNGLEGLLDGPAHEAWFAQPSGLAEDGDGNIWVADSETSALRKLVIDDAGS 432
>gi|333025645|ref|ZP_08453709.1| hypothetical protein STTU_3149 [Streptomyces sp. Tu6071]
gi|332745497|gb|EGJ75938.1| hypothetical protein STTU_3149 [Streptomyces sp. Tu6071]
Length = 610
Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats.
Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 4/84 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + +V D++R A L ++ V+R G G +G DG+ ARF++P
Sbjct: 183 PAKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGSGERGLLDGDATGARFNEP 238
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
+ A+ G + VAD NH +R++
Sbjct: 239 QGLALLPDGAVVVADTVNHALRRV 262
>gi|190572588|ref|YP_001970433.1| NHL repeat-containing protein [Stenotrophomonas maltophilia K279a]
gi|190010510|emb|CAQ44119.1| putative NHL repeat protein [Stenotrophomonas maltophilia K279a]
Length = 1267
Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats.
Identities = 36/95 (37%), Positives = 46/95 (48%), Gaps = 7/95 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
+DG L+ N S I +L V D+ A + + V LAG G QG++D
Sbjct: 688 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGHAD 742
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G ARFD P AVD +G +YVAD N IR I
Sbjct: 743 GPAVQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 777
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I
Sbjct: 677 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRI 724
Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats.
Identities = 30/102 (29%), Positives = 51/102 (50%), Gaps = 5/102 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGD 88
VTT+ G Q + +V R + I +D+ + +F + + +V+ LAG
Sbjct: 730 VTTLAGGEQGHADGPAVQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGMVRTLAGG 789
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G +DG +ARFD P + A D +G + VAD N+ +R++
Sbjct: 790 DRPGLADGVGAAARFDTPVALAFDAQGALLVADLFNNAVRRV 831
>gi|111219921|ref|YP_710715.1| ascorbate-dependent monooxygenase [Frankia alni ACN14a]
gi|111147453|emb|CAJ59103.1| Putative ascorbate-dependent monooxygenase [Frankia alni ACN14a]
Length = 865
Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 39/57 (68%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ V+ +AG GV G+S DG P +A+ D+P A+D +G++Y+AD+ NH IR++
Sbjct: 558 INAAGVITTIAGTGVAGFSGDGGPAVAAQLDQPAGIALDNRGDLYIADRLNHRIRRV 614
Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ +AG+ ++G++ DG P A P+ AVD GN+Y+ D+ N IRK+
Sbjct: 742 VITTVAGNRLRGFAGDGGPAVKASLQDPRGIAVDAVGNLYITDRGNSRIRKV 793
Score = 37.0 bits (84), Expect = 3.9, Method: Composition-based stats.
Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQG-YSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ AG +G + DG P + A D P A D G++Y+AD N IRK+
Sbjct: 686 IISTFAGSSGRGSFGDGGPATDALLDVPFGVAADAAGDVYIADTDNSRIRKV 737
>gi|290990564|ref|XP_002677906.1| NHL domain-containing protein kinase [Naegleria gruberi]
gi|284091516|gb|EFC45162.1| NHL domain-containing protein kinase [Naegleria gruberi]
Length = 2733
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 34/53 (64%), Gaps = 4/53 (7%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG G GYS G+ G SA+F +P+ AV G IY+AD NHV+RKI+
Sbjct: 548 IATVAGSGASGYS-GDGGLLTSAKFQQPQGVAVSSNGEIYIADTENHVVRKIS 599
Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 9/76 (11%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMK 114
D+ + DS + +S + + +AG+G+ GYS G+ G SA+ P+S A+++
Sbjct: 272 DVYIADSGNNRIRKIS-----KGYISTVAGNGIAGYS-GDGGLSTSAQLATPQSVAINLN 325
Query: 115 GNIYVADKSNHVIRKI 130
G I +AD +N+ IRKI
Sbjct: 326 GEIIIADSNNNRIRKI 341
Score = 43.5 bits (101), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 40/75 (53%), Gaps = 5/75 (6%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKG 115
D+ + D+ + +SF ++ +AG G+ G + DG SA+ + P AV G
Sbjct: 471 DVYIADTYNNRIRKVSF---STGIISTVAGTGIAGQNGDGNLATSAQLNFPSGVAVSSNG 527
Query: 116 NIYVADKSNHVIRKI 130
+IY+AD NH IRKI
Sbjct: 528 DIYIADTENHRIRKI 542
Score = 42.7 bits (99), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 4/53 (7%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG G GYS G+ G +A+ P AV G +++AD+ NH IRK+T
Sbjct: 821 ISTIAGTGAMGYS-GDGGLAITAKLSSPVGVAVSSTGEVFIADRDNHRIRKVT 872
Score = 41.6 bits (96), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 4/57 (7%)
Query: 81 VVKRLAGDGVQ----GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
V++R+ + G DG P + + + +SF + G IY+AD NH IRK++ L
Sbjct: 762 VIRRVISGNISTIAGGVGDGGPATGGYIQAQSFDISSTGEIYIADTENHRIRKVSTL 818
Score = 41.6 bits (96), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 34/54 (62%), Gaps = 3/54 (5%)
Query: 81 VVKRLAGDGVQGY-SDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ +AG+G G+ SDG E A+ + P S + G IY+AD +NH IR+I+N
Sbjct: 876 IITTIAGNGTSGFNSDGIEAKFAQLNSPSSVTIS-GGEIYIADTNNHRIRQISN 928
Score = 39.7 bits (91), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 4/58 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S +V+ +AG G GY +G+ G +A+ P A+ G +++AD +NH IRKI
Sbjct: 598 ISTSNVLSTIAGTGSYGY-NGDGGLAITAKLFSPIGVAISSSGEVFIADNNNHRIRKI 654
Score = 39.3 bits (90), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 33/59 (55%), Gaps = 4/59 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ ++ +AG G GY DG+ +A+ P V+ G++Y+AD N+ IRKI+
Sbjct: 230 IATNGIITTIAGTGTHGY-DGDGALAINAQLYSPTGVVVNSNGDVYIADSGNNRIRKIS 287
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/57 (28%), Positives = 34/57 (59%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++++ + + G G+ GY DG+ +A+ P+ + G +++AD++NH IRK+
Sbjct: 654 IAKDGYISTIVGTGLNGYDGDGDLATNAKLGNPQGVTLSPTGELFIADQNNHRIRKV 710
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S ++K +AG+G GY DG P +A+ + P A+ IY+A+ N IRK+T
Sbjct: 926 ISNTGIIKTIAGNGFGGYIGDGVLPPNAQLNSPSGVAISSTAEIYIAE--NKRIRKLT 981
>gi|29347802|ref|NP_811305.1| NHL repeat-containing protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|383120108|ref|ZP_09940841.1| hypothetical protein BSIG_4606 [Bacteroides sp. 1_1_6]
gi|29339703|gb|AAO77499.1| NHL repeat-containing protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251838367|gb|EES66454.1| hypothetical protein BSIG_4606 [Bacteroides sp. 1_1_6]
Length = 440
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 45/82 (54%), Gaps = 5/82 (6%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
ID+ G+ L + D + + L+ VV +AG G GY DG+P A+F++P
Sbjct: 362 AIDKEGN--LYIADVGNNRIRKVDTKLN---VVTTIAGSGAAGYKDGDPLEAQFNQPWGV 416
Query: 110 AVDMKGNIYVADKSNHVIRKIT 131
+D +Y+AD++NH IRK+
Sbjct: 417 YLDKNEFLYIADQNNHCIRKLA 438
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 8/73 (10%)
Query: 65 SRSAFYTLS-------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
S+SA Y +P + + + + G G G+ DG+ A+F +P+ A+D +GN+
Sbjct: 311 SKSAVYRFKLTRDADGWPALDGDIDEYIPGAGA-GFRDGDVQEAQFKEPRGIAIDKEGNL 369
Query: 118 YVADKSNHVIRKI 130
Y+AD N+ IRK+
Sbjct: 370 YIADVGNNRIRKV 382
>gi|223934991|ref|ZP_03626910.1| NHL repeat containing protein [bacterium Ellin514]
gi|223896444|gb|EEF62886.1| NHL repeat containing protein [bacterium Ellin514]
Length = 1064
Score = 50.8 bits (120), Expect = 3e-04, Method: Composition-based stats.
Identities = 28/58 (48%), Positives = 34/58 (58%), Gaps = 1/58 (1%)
Query: 75 PLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
P +V +AG G G DG SARF +P+S +VD G +YVAD NH IRKIT
Sbjct: 593 PSGTNWIVNTVAGLAGFWGSIDGTNTSARFFQPRSLSVDASGALYVADSGNHAIRKIT 650
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 21/47 (44%), Positives = 32/47 (68%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LAG +G ++G S +F P++ A+D+ N++VAD NHVIRKI+
Sbjct: 52 LAGYAGRGSANGISSSVQFKAPQAIAIDISNNVFVADTENHVIRKIS 98
Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S ++ LAG G G DG +A F +P AVD GN+ VAD N+ +RKIT
Sbjct: 97 ISCTGIITTLAGSLGTHGSRDGSGTNALFFRPAGIAVDASGNVLVADTGNNTVRKIT 153
Score = 44.7 bits (104), Expect = 0.020, Method: Composition-based stats.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 5/86 (5%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP I + +L + D+ + T++ V AG G +DG ARF
Sbjct: 347 NPQG-ITSDATGNLYIADTGNNTIRTIT----PGGSVTTFAGLPSIGSADGLSSDARFRF 401
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
P++ AVD N+YVAD +N IRKI+
Sbjct: 402 PQAVAVDAATNVYVADTANQTIRKIS 427
Score = 44.7 bits (104), Expect = 0.021, Method: Composition-based stats.
Identities = 21/43 (48%), Positives = 29/43 (67%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GV G ++G A+F P+ AVD GN++VAD N++IRKIT
Sbjct: 495 GVSGTANGTNTDAQFYAPQGLAVDGTGNVFVADTFNNLIRKIT 537
Score = 44.3 bits (103), Expect = 0.027, Method: Composition-based stats.
Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 5/77 (6%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDM 113
G+ ++ V D+ F L ++ V LAG+ G SDG +ARF P AVD
Sbjct: 519 GTGNVFVADT----FNNLIRKITPGGAVTTLAGNFENFGSSDGTNSNARFYWPSGVAVDN 574
Query: 114 KGNIYVADKSNHVIRKI 130
GN++VAD NH IR++
Sbjct: 575 AGNVFVADYMNHTIREL 591
Score = 43.5 bits (101), Expect = 0.048, Method: Composition-based stats.
Identities = 27/78 (34%), Positives = 40/78 (51%), Gaps = 5/78 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
S +L V+D+ ++ VV AG G G ++G +A F P+ +D+
Sbjct: 246 SGNLYVVDTGNGTIRKIT----SSGVVTTFAGSAGNYGATNGIGANALFYAPQGITIDLF 301
Query: 115 GNIYVADKSNHVIRKITN 132
G +YVAD NH IRKIT+
Sbjct: 302 GCVYVADTGNHTIRKITS 319
Score = 42.4 bits (98), Expect = 0.092, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 33/63 (52%), Gaps = 13/63 (20%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS-------ARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+S +V LAG S G PGS A F P+ VD GNIYVAD NH+IR
Sbjct: 426 ISPSGLVCTLAG------SIGHPGSVNNIGTNALFSGPQGITVDGVGNIYVADTLNHIIR 479
Query: 129 KIT 131
+IT
Sbjct: 480 RIT 482
Score = 41.6 bits (96), Expect = 0.16, Method: Composition-based stats.
Identities = 23/52 (44%), Positives = 28/52 (53%), Gaps = 1/52 (1%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
V +AG GV G D +A F P+ VD GN+YV D N IRKIT+
Sbjct: 213 VNTMAGSAGVYGNLDNSGANALFSGPQGLTVDSSGNLYVVDTGNGTIRKITS 264
Score = 39.3 bits (90), Expect = 0.73, Method: Composition-based stats.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + V LAG G G +D SA F P+ D GN+Y+AD N+ IR IT
Sbjct: 317 ITSDGTVTTLAGLAGNYGSADSVNSSASFWNPQGITSDATGNLYIADTGNNTIRTIT 373
Score = 36.6 bits (83), Expect = 4.9, Method: Composition-based stats.
Identities = 22/51 (43%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V AG G G +D +A F +P A+D NI+VAD N+ IRKIT
Sbjct: 158 VTTFAGSAGNYGSTDNLGTNALFYRPTGIAIDNFNNIFVADTGNNTIRKIT 208
>gi|223937730|ref|ZP_03629631.1| NHL repeat containing protein [bacterium Ellin514]
gi|223893523|gb|EEF59983.1| NHL repeat containing protein [bacterium Ellin514]
Length = 1474
Score = 50.8 bits (120), Expect = 3e-04, Method: Composition-based stats.
Identities = 34/95 (35%), Positives = 54/95 (56%), Gaps = 9/95 (9%)
Query: 43 LEINPHSVIDRPGSSDLIVL----DSSRSAFYTLSFPL--SEESVVKRLAGDGVQGYSDG 96
L NP + G+S L+V + + SA T ++ L E +VV+ AG GV G+++G
Sbjct: 1002 LYTNPFPI---SGNSTLVVKAWLSNYAPSALATATYSLLDYEPTVVQTYAGTGVPGFTNG 1058
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
A+F+ P+ VD G ++V+D N+VIRKI+
Sbjct: 1059 PNLKAQFNAPQGICVDGLGTLFVSDTGNNVIRKIS 1093
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 31/56 (55%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S VV AG GV G DG +A F P A+D N+YVAD N +IRK+T
Sbjct: 1092 ISTNGVVTTFAGSGVAGTHDGVGTNASFLAPTGIALDSSNNLYVADSGNSLIRKVT 1147
Score = 42.0 bits (97), Expect = 0.12, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 29/50 (58%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG G G +G A F++ + VD GNIY AD +NH IRKI+
Sbjct: 1299 VTTLAGAGDLGLRNGSGLFAAFNQLGAVTVDSAGNIYAADAANHSIRKIS 1348
>gi|29349195|ref|NP_812698.1| hypothetical protein BT_3787 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341103|gb|AAO78892.1| conserved hypothetical protein with NHL repeat [Bacteroides
thetaiotaomicron VPI-5482]
Length = 452
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 36/57 (63%), Gaps = 2/57 (3%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
P E S+ L G G++DG P ARF+KP+ V + ++YVAD+ NH+IR++T
Sbjct: 396 PTGEVSLYAGLPGQA--GFTDGLPEKARFNKPECVTVYLDNSLYVADRDNHLIRRVT 450
Score = 43.5 bits (101), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 25/40 (62%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GY +G +ARFD P+ A D GN++V + H IRKIT
Sbjct: 356 GYVNGSGVTARFDNPRQPAFDQDGNMFVPEYGRHTIRKIT 395
>gi|325186506|emb|CCA21046.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1881
Score = 50.8 bits (120), Expect = 3e-04, Method: Composition-based stats.
Identities = 27/65 (41%), Positives = 36/65 (55%), Gaps = 6/65 (9%)
Query: 71 TLSFPLSE-----ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH 125
T SFP S + V+ AG G +G++DGE ++ FD P A+D G YV D NH
Sbjct: 214 TWSFPKSNSKTKPQGAVETFAG-GSRGHNDGEAVASLFDNPNDLAIDSTGVTYVVDTGNH 272
Query: 126 VIRKI 130
IR+I
Sbjct: 273 CIRRI 277
Score = 46.6 bits (109), Expect = 0.005, Method: Composition-based stats.
Identities = 19/39 (48%), Positives = 26/39 (66%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G+SDG +RFD P A+D+ GN+ VAD N++IR I
Sbjct: 353 GFSDGNASESRFDTPLGLAIDVDGNLIVADSGNNLIRLI 391
Score = 40.0 bits (92), Expect = 0.52, Method: Composition-based stats.
Identities = 32/107 (29%), Positives = 51/107 (47%), Gaps = 16/107 (14%)
Query: 30 DGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD 88
DG V ++ D NP+ + ID G + V+D+ + + S V AG+
Sbjct: 242 DGEAVASLFD------NPNDLAIDSTGVT--YVVDTGNHCIRRID----QRSRVTTFAGN 289
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGN---IYVADKSNHVIRKITN 132
+G+ DG A+++ P AV + N +YVAD NH IR+I +
Sbjct: 290 RTRGFRDGPLDIAQYNFPSGIAVVNEQNRVTVYVADTGNHRIRRIRD 336
>gi|312200909|ref|YP_004020970.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311232245|gb|ADP85100.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 847
Score = 50.8 bits (120), Expect = 3e-04, Method: Composition-based stats.
Identities = 27/58 (46%), Positives = 37/58 (63%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S++ V +AG+G G+S DG P SA + P + VD GNIYV D +N+ IRKIT
Sbjct: 554 ISKDGTVTPIAGNGTAGFSGDGGPATSAELNGPGTAVVDKNGNIYVPDTANNRIRKIT 611
Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats.
Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + V+ +AG G +G + DG P +A + P S VD G +Y+AD N IR++
Sbjct: 777 VTPDGVITTIAGTGAEGNTGDGGPATAAQLKSPSSVVVDASGAVYIADNGNKEIRRV 833
Score = 37.4 bits (85), Expect = 2.7, Method: Composition-based stats.
Identities = 20/58 (34%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + + + G+G G+S DG P + A + + AV G++Y+AD SN IRK+T
Sbjct: 610 ITPDGKITTVVGNGTAGFSGDGGPATQAEINSVEGIAVGPDGSLYLADYSNERIRKVT 667
>gi|452992643|emb|CCQ95895.1| hypothetical protein CULT_320031 [Clostridium ultunense Esp]
Length = 329
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 15/91 (16%)
Query: 53 RPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSAR---------- 102
R ++ +LDS A ++ ++ V+ +AG G GY+ G+ G A+
Sbjct: 182 RGPEGNIYILDSKNYAVRKINI---DKETVELIAGTGKPGYT-GDGGDAKDATFGGNKES 237
Query: 103 -FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
FD P S ++D GNIYV D NHV+R ITN
Sbjct: 238 YFDGPWSLSLDEVGNIYVGDTQNHVVRMITN 268
>gi|357516379|ref|XP_003628478.1| hypothetical protein MTR_8g058630 [Medicago truncatula]
gi|355522500|gb|AET02954.1| hypothetical protein MTR_8g058630 [Medicago truncatula]
Length = 150
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 43/111 (38%), Gaps = 43/111 (38%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------------- 132
G G+ DG+ ARF PK A+D KGN+YVAD N IRKI +
Sbjct: 24 GRTGHVDGKLSDARFHYPKGIALDDKGNVYVADTQNMAIRKIGDAGVTTIAGGKSNVAGY 83
Query: 133 ---------------------------LDHGNQLIRQINLKPEDCSKSSQS 156
+D GN +RQI L EDC+ S S
Sbjct: 84 RDGPGEDAKLSNDFDVVYIRPTCSLLVIDRGNAALRQIFLNQEDCNYQSSS 134
>gi|298387997|ref|ZP_06997545.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
gi|298259263|gb|EFI02139.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
Length = 439
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 43/73 (58%), Gaps = 8/73 (10%)
Query: 65 SRSAFYTLS-------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
S+SA Y +P + + + + G G G+ DG+ A+F++P+ A+D +GN+
Sbjct: 311 SKSAVYRFKLTRDADGWPALDGDIEEYIPGAGA-GFRDGDVQEAQFNEPRGIAIDKEGNL 369
Query: 118 YVADKSNHVIRKI 130
Y+AD +NH IRK+
Sbjct: 370 YIADVNNHRIRKV 382
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 35/52 (67%), Gaps = 1/52 (1%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++V +AG G +GY DG+P A+FD+P +D +Y+AD++NH IRK+
Sbjct: 387 NIVTTIAGSG-KGYKDGDPLEAQFDQPWGVYLDKNEFLYIADQNNHCIRKLA 437
>gi|380692433|ref|ZP_09857292.1| NHL repeat-containing protein [Bacteroides faecis MAJ27]
Length = 439
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 43/73 (58%), Gaps = 8/73 (10%)
Query: 65 SRSAFYTLS-------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
S+SA Y +P + + + + G G G+ DG+ A+F++P+ A+D +GN+
Sbjct: 311 SKSAVYRFKLTRDADGWPALDGDIEEYIPGAGA-GFRDGDVQEAQFNEPRGIAIDKEGNL 369
Query: 118 YVADKSNHVIRKI 130
Y+AD +NH IRK+
Sbjct: 370 YIADVNNHRIRKV 382
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 35/52 (67%), Gaps = 1/52 (1%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++V +AG G +GY DG+P A+FD+P +D +Y+AD++NH IRK+
Sbjct: 387 NIVTTIAGSG-KGYKDGDPLEAQFDQPWGVYLDKNEFLYIADQNNHCIRKLA 437
>gi|406836139|ref|ZP_11095733.1| NHL repeat containing protein [Schlesneria paludicola DSM 18645]
Length = 365
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 5/78 (6%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKG 115
D+I++ +A Y + +E V+ LAG G +GY+ DG G A+ + PK A+D +G
Sbjct: 236 DVILVLREGNAVYRID---RKEKSVRHLAGTGKKGYAGDGGDGKLAQVNGPKGIAIDHQG 292
Query: 116 NIYVADKSNHVIRKITNL 133
NI + D NHVIR I L
Sbjct: 293 NILLCDTENHVIRIIERL 310
Score = 42.0 bits (97), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 82 VKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ L GDG G DG P R ++P V + G +Y+ D NH IRK+T
Sbjct: 314 IDTLVGDGTIGDGPDGNPRHCRLNRPHGVFVALDGTVYIGDSGNHKIRKLT 364
>gi|29348037|ref|NP_811540.1| cell surface protein [Bacteroides thetaiotaomicron VPI-5482]
gi|29339939|gb|AAO77734.1| putative cell surface protein, have conserved domain [Bacteroides
thetaiotaomicron VPI-5482]
Length = 434
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 40/73 (54%), Gaps = 4/73 (5%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
L + +++ YT + E K AG G G++DG A F+ P+ ++DM+GNI
Sbjct: 307 LYIAYAAKHVIYTYNIETGEH---KLFAGAFGQSGWNDGIATDAEFNSPRQMSLDMEGNI 363
Query: 118 YVADKSNHVIRKI 130
Y+AD NH IR I
Sbjct: 364 YIADSGNHCIRMI 376
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V + G GY+DG P A + P+ AV+ +G++Y+AD N IRK+T
Sbjct: 382 VTTPIGQPGEAGYADGSPDMALLNDPRGVAVNSEGDVYIADLGNRCIRKLT 432
>gi|29348305|ref|NP_811808.1| hypothetical protein BT_2896 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383121762|ref|ZP_09942466.1| hypothetical protein BSIG_1754 [Bacteroides sp. 1_1_6]
gi|29340208|gb|AAO78002.1| NHL repeat protein [Bacteroides thetaiotaomicron VPI-5482]
gi|251841348|gb|EES69429.1| hypothetical protein BSIG_1754 [Bacteroides sp. 1_1_6]
Length = 452
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 52/106 (49%), Gaps = 13/106 (12%)
Query: 31 GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF-----PLSEESVVKRL 85
G V +IDG + S +D+P + I +D+ + T + ++ E V
Sbjct: 351 GNGVWGIIDGQGVS----SRMDQP---NQIAVDAEGNLLVTTVYGRTVRKITPEGYVSTY 403
Query: 86 AGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AG G Q GY DG A+F+KP A+D +GN+YV D N IR I
Sbjct: 404 AGIGYQTGYVDGLAAEAKFNKPYGIAIDAQGNVYVGDCENWRIRVI 449
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 29/48 (60%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
R AG+GV G DG+ S+R D+P AVD +GN+ V +RKIT
Sbjct: 348 RFAGNGVWGIIDGQGVSSRMDQPNQIAVDAEGNLLVTTVYGRTVRKIT 395
>gi|380692993|ref|ZP_09857852.1| cell surface protein [Bacteroides faecis MAJ27]
Length = 457
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 77 SEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ E + AGD G G++DG+ A+F+ P+ +D + N+Y+AD NH IRK+T
Sbjct: 345 TREGNFRIYAGDKGRAGHADGQETEAQFNSPRQLVLDEEENLYIADSGNHCIRKVT 400
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ E VV + G+ GY DG P A F +P A+D +G IY+ DK N +RK++
Sbjct: 399 VTPEGVVSTVIGNPNSSGYKDGTPEIALFTEPWGLAIDSEGTIYIGDKDNRCVRKLS 455
>gi|333383538|ref|ZP_08475196.1| hypothetical protein HMPREF9455_03362 [Dysgonomonas gadei ATCC
BAA-286]
gi|332827477|gb|EGK00223.1| hypothetical protein HMPREF9455_03362 [Dysgonomonas gadei ATCC
BAA-286]
Length = 434
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 30/51 (58%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V + G +GY DG P A F KP+ AVD G++Y+AD N +RK+T
Sbjct: 382 VTTPIGQPGQEGYMDGSPEIALFKKPRGVAVDKNGDVYIADYGNRCLRKLT 432
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 2/72 (2%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
L + + R YT + +E + G G+SDG A F+ P+ +D++GN+Y
Sbjct: 307 LYIAYTQRHCIYTYNLATNEHLLFA--GAPGQPGWSDGRVSDAEFNLPRQMTLDLEGNLY 364
Query: 119 VADKSNHVIRKI 130
+AD+ NH IR I
Sbjct: 365 IADEGNHCIRMI 376
>gi|118161380|gb|ABK64062.1| conserved hypothetical protein [Janthinobacterium lividum]
Length = 779
Score = 50.1 bits (118), Expect = 4e-04, Method: Composition-based stats.
Identities = 31/85 (36%), Positives = 46/85 (54%), Gaps = 6/85 (7%)
Query: 53 RPGSSDLIVLDSSRSAFYTLSF-----PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKP 106
R G I D+S + + T S ++ VV LAG GV G +DG +A F+ P
Sbjct: 642 RFGYLRAIAFDASGNLYLTDSLNHNVRKITPAGVVTTLAGTTGVAGDADGRGSAASFNGP 701
Query: 107 KSFAVDMKGNIYVADKSNHVIRKIT 131
A+D GN+YVAD N+++R+I+
Sbjct: 702 HGIALDKAGNVYVADTENNLVRRIS 726
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 31/56 (55%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
L+ +V LAG G +DG A F +D +GN++VAD+ NH IRKIT
Sbjct: 331 LTPAGIVSTLAGGSAYGSADGAGAVATFSSLNGLTIDKQGNLFVADEGNHTIRKIT 386
Score = 38.9 bits (89), Expect = 0.99, Method: Composition-based stats.
Identities = 17/38 (44%), Positives = 23/38 (60%)
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+DG ARF ++ A D GN+Y+ D NH +RKIT
Sbjct: 634 ADGSLAQARFGYLRAIAFDASGNLYLTDSLNHNVRKIT 671
Score = 38.1 bits (87), Expect = 1.8, Method: Composition-based stats.
Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 7/63 (11%)
Query: 76 LSEESVVKRLAGDGVQ---GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV----IR 128
++ VV LAG Q +DG +ARF+ P AVD GN+YV+D + + +R
Sbjct: 385 ITPAGVVSTLAGTASQLRPAVADGIGAAARFNLPYGLAVDGAGNVYVSDSNPGLQLNGVR 444
Query: 129 KIT 131
KIT
Sbjct: 445 KIT 447
Score = 37.7 bits (86), Expect = 2.5, Method: Composition-based stats.
Identities = 15/40 (37%), Positives = 25/40 (62%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G +DG ARF P++ A+ G ++VAD N +IR+++
Sbjct: 463 GVTDGLASEARFAGPQAIALHSDGTLFVADTGNQLIRRVS 502
Score = 37.4 bits (85), Expect = 3.1, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 33/64 (51%), Gaps = 8/64 (12%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV-------I 127
++ + V LAG G SDG+ SARF AVD G IYVAD +++ I
Sbjct: 269 ITPDGTVTTLAGISRTYGTSDGDAKSARFGGISGIAVDTAGAIYVADAGSNINGIVGSRI 328
Query: 128 RKIT 131
RK+T
Sbjct: 329 RKLT 332
>gi|329935738|ref|ZP_08285543.1| hypothetical protein SGM_1035 [Streptomyces griseoaurantiacus M045]
gi|329304829|gb|EGG48702.1| hypothetical protein SGM_1035 [Streptomyces griseoaurantiacus M045]
Length = 617
Score = 50.1 bits (118), Expect = 5e-04, Method: Composition-based stats.
Identities = 26/75 (34%), Positives = 45/75 (60%), Gaps = 3/75 (4%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +L+V D++R L+ ES V+R+ G G +G++DG +A F +P+ A+ G
Sbjct: 198 SGNLLVSDTTRHQLVELA--ADGESEVRRI-GSGARGFADGTAEAAAFSEPQGLALLEDG 254
Query: 116 NIYVADKSNHVIRKI 130
++ VAD NH +R++
Sbjct: 255 SVVVADTVNHALRRL 269
>gi|196230675|ref|ZP_03129536.1| Peptidase S53 propeptide [Chthoniobacter flavus Ellin428]
gi|196225016|gb|EDY19525.1| Peptidase S53 propeptide [Chthoniobacter flavus Ellin428]
Length = 1120
Score = 50.1 bits (118), Expect = 5e-04, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 44/77 (57%), Gaps = 6/77 (7%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
++++ V DS SA ++ VV LAG G+ G +DG A F+ P+ AVD
Sbjct: 753 ANNIYVADSGNSAIRKIT----PGRVVSTLAGQPGLSGSADGTT-KALFNNPQGVAVDSA 807
Query: 115 GNIYVADKSNHVIRKIT 131
GN+YVAD +N IRKIT
Sbjct: 808 GNVYVADTTNETIRKIT 824
Score = 50.1 bits (118), Expect = 5e-04, Method: Composition-based stats.
Identities = 32/71 (45%), Positives = 41/71 (57%), Gaps = 5/71 (7%)
Query: 62 LDSSRSAFYTLSFPLSEESVVKRLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
L S+ S+ + PL V+ LAG + QG +DG +A+F P AVD GNIYVA
Sbjct: 649 LTSAPSSVLIVDSPL----VISTLAGQVLTQGTADGTGTAAQFGYPSGVAVDSSGNIYVA 704
Query: 121 DKSNHVIRKIT 131
D +N IRKIT
Sbjct: 705 DFNNDTIRKIT 715
Score = 43.5 bits (101), Expect = 0.037, Method: Composition-based stats.
Identities = 21/43 (48%), Positives = 26/43 (60%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GV G ++G +A F+ P A+D NIYVAD N IRKIT
Sbjct: 728 GVIGATNGTGTNATFNTPNGVAIDSANNIYVADSGNSAIRKIT 770
Score = 39.7 bits (91), Expect = 0.59, Method: Composition-based stats.
Identities = 18/29 (62%), Positives = 21/29 (72%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
F+ P S AVD GN+YVAD N V+RKIT
Sbjct: 851 FNGPSSVAVDSAGNVYVADLYNFVVRKIT 879
Score = 37.0 bits (84), Expect = 4.2, Method: Composition-based stats.
Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 2/56 (3%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQIN 144
G+ G DG +A F+ P AVD N+Y+ D + + +T+ GN L+R++N
Sbjct: 892 GMPGRLDGIGTAALFNAPIGVAVDANNNLYITD--SQIPPDLTSTSTGNNLVRRVN 945
>gi|116620673|ref|YP_822829.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223835|gb|ABJ82544.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 912
Score = 50.1 bits (118), Expect = 5e-04, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 38/60 (63%), Gaps = 2/60 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + V+ +AG G G+S G +A+ ++P+ AVD +GN+YVAD N+ IRKI L
Sbjct: 524 VTTDGVIHTIAGFGTPGFSGDGGAATAAKLNRPRGVAVDAQGNVYVADTGNNRIRKIDPL 583
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 40/58 (68%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ +++ +AG+G GY DG P + A+ + P + AVD +GN+++AD +N V+R++T
Sbjct: 124 IATTGIIQTVAGNGTPGYQGDGGPATTAQLNAPFAVAVDGQGNVFIADTNNVVVRRVT 181
Score = 45.8 bits (107), Expect = 0.007, Method: Composition-based stats.
Identities = 30/115 (26%), Positives = 54/115 (46%), Gaps = 5/115 (4%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTL---SFPLS 77
L+ G + Y ++T+ G + G + +D++ + +++ F +
Sbjct: 10 LLFCGSVWAQQYVISTIAGGAAPPTPVAAAKASVGDPTRVAVDAAGNVYFSSLHSIFKVD 69
Query: 78 EESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ R AG+G G S G+ SA+ P AVD GNI+VAD+ +V+R+I
Sbjct: 70 SSGTMTRFAGNGRPGNSGDGGQASSAQLLFPMGLAVDAAGNIFVADRDANVVRRI 124
Score = 43.1 bits (100), Expect = 0.055, Method: Composition-based stats.
Identities = 35/118 (29%), Positives = 55/118 (46%), Gaps = 17/118 (14%)
Query: 17 LLFSLVSSGL--LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF 74
LL +L +GL DG TT Q + ID G +++++DS +S
Sbjct: 362 LLTTLAGNGLASFGGDGAAATTA----QFDTPTGVAIDAGG--NVLIVDSQNQRLRKIS- 414
Query: 75 PLSEESVVKRLAGDGVQGYSDGE---PGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
V+ +AG G G+ +GE P +A+ + P+ A D GN +VAD N +R+
Sbjct: 415 ----RGVITTIAGTGTAGF-NGEVVLPATAQLNTPRGVAADAYGNYFVADTGNRRVRE 467
Score = 42.0 bits (97), Expect = 0.11, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 35/58 (60%), Gaps = 4/58 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + + AG G +G++ G+ G+AR FD P+ AVD G +Y+AD N IR++
Sbjct: 180 VTPDGTISTYAGSGARGFA-GDGGAARNAWFDGPEGVAVDANGVLYIADTFNGRIRRV 236
Score = 40.0 bits (92), Expect = 0.44, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIR 128
+S + +AG+G Y+ DG G+A + ++P AVD GNIYVAD N+ IR
Sbjct: 636 VSTGGTITTIAGNGQCCYTGDGGLGTAAQLNQPWGIAVDSAGNIYVADSGNNAIR 690
Score = 40.0 bits (92), Expect = 0.53, Method: Composition-based stats.
Identities = 19/44 (43%), Positives = 23/44 (52%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
GV G +G P SA P AVD GN Y+AD N +R + N
Sbjct: 252 GVFGGDNGPPASAALSLPTDVAVDRSGNPYIADFGNSRVRMVAN 295
Score = 39.7 bits (91), Expect = 0.56, Method: Composition-based stats.
Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 85 LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LAG+G+ + DG + A+FD P A+D GN+ + D N +RKI+
Sbjct: 366 LAGNGLASFGGDGAAATTAQFDTPTGVAIDAGGNVLIVDSQNQRLRKIS 414
Score = 38.9 bits (89), Expect = 0.94, Method: Composition-based stats.
Identities = 20/48 (41%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 86 AGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
AG+G Y DG P + A ++P+ A D GN Y+AD ++V+RK+T
Sbjct: 478 AGNGNASYFGDGLPATQASVNQPEGVAGDAAGNTYIADTFDNVVRKVT 525
>gi|158315873|ref|YP_001508381.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158111278|gb|ABW13475.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 850
Score = 50.1 bits (118), Expect = 5e-04, Method: Composition-based stats.
Identities = 31/87 (35%), Positives = 49/87 (56%), Gaps = 9/87 (10%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARF 103
P SV +DRPG L + DS + + + V++ +AG G+ GYS DG P + A
Sbjct: 711 PISVAMDRPGR--LYIADSGNNRIRRIGL----DGVIETVAGTGLPGYSGDGGPATRATL 764
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI 130
P+ AVD +G I++ D++N IR++
Sbjct: 765 RSPRGVAVDARGAIFITDRTNRRIRRV 791
Score = 42.7 bits (99), Expect = 0.076, Method: Composition-based stats.
Identities = 21/57 (36%), Positives = 34/57 (59%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + + +AG G G+S DG P + AR +P + A+D GNI +AD N IR++
Sbjct: 567 ITADGTITTIAGTGEAGFSGDGGPATQARLRQPAAVALDSAGNILIADTFNQRIRRV 623
Score = 41.2 bits (95), Expect = 0.20, Method: Composition-based stats.
Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 82 VKRLAGDGVQGYS-DGEP-GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V +AG G +S DG P GSA + P D GNIYVAD N+ +R+IT
Sbjct: 517 VVTVAGTGEAAFSGDGGPAGSAALNGPFGMVADWAGNIYVADFDNNRVRRIT 568
Score = 38.1 bits (87), Expect = 1.6, Method: Composition-based stats.
Identities = 20/54 (37%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 79 ESVVKRLAG-DGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +++ +AG DG + DG P + A P S A+D G +Y+AD N+ IR+I
Sbjct: 682 DGIIQTVAGGDGEGAFGDGGPAADALLAFPISVAMDRPGRLYIADSGNNRIRRI 735
>gi|302552781|ref|ZP_07305123.1| NHL repeat containing protein [Streptomyces viridochromogenes DSM
40736]
gi|302470399|gb|EFL33492.1| NHL repeat containing protein [Streptomyces viridochromogenes DSM
40736]
Length = 605
Score = 50.1 bits (118), Expect = 5e-04, Method: Composition-based stats.
Identities = 31/99 (31%), Positives = 52/99 (52%), Gaps = 10/99 (10%)
Query: 39 DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
DG + P + R PG S + +V D++R L+ ESVV+R+ G G +
Sbjct: 165 DGPYVAPEPEPTVLRFPGKALLLPSGNFLVSDTTRHQLVELAG--DGESVVRRI-GSGAR 221
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G++DG A F++P+ + G++ VAD NH +R++
Sbjct: 222 GFADGSADEAAFNEPQGLTLLDDGSVVVADTVNHALRRL 260
>gi|326778226|ref|ZP_08237491.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces griseus XylebKG-1]
gi|326658559|gb|EGE43405.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces griseus XylebKG-1]
Length = 615
Score = 49.7 bits (117), Expect = 5e-04, Method: Composition-based stats.
Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
L+V D++R L ++ V+R G G +G DG P ARF +P+ AV G +
Sbjct: 198 LLVSDTTRHRLVELD---ADGETVRRHFGTGERGLRDGGPDEARFSEPQGLAVLPDGRVA 254
Query: 119 VADKSNHVIRKI 130
VAD NH IR +
Sbjct: 255 VADTVNHAIRAL 266
>gi|182437571|ref|YP_001825290.1| hypothetical protein SGR_3778 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|178466087|dbj|BAG20607.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
NBRC 13350]
Length = 605
Score = 49.7 bits (117), Expect = 5e-04, Method: Composition-based stats.
Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
L+V D++R L ++ V+R G G +G DG P ARF +P+ AV G +
Sbjct: 188 LLVSDTTRHRLVELD---ADGETVRRHFGTGERGLRDGGPDEARFSEPQGLAVLPDGRVA 244
Query: 119 VADKSNHVIRKI 130
VAD NH IR +
Sbjct: 245 VADTVNHAIRAL 256
>gi|116619867|ref|YP_822023.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223029|gb|ABJ81738.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 592
Score = 49.7 bits (117), Expect = 5e-04, Method: Composition-based stats.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 5/77 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDM 113
S +L+++D + + +++ AG+G QG+S G+ A F+ P + AVD
Sbjct: 166 SGNLLIVDQGAHKIRKVE---AASNIISTFAGNGSQGFSGDGGQAAQASFNNPTALAVDA 222
Query: 114 KGNIYVADKSNHVIRKI 130
G +YV D+SN IR+I
Sbjct: 223 AGTVYVTDQSNQRIRRI 239
Score = 42.4 bits (98), Expect = 0.091, Method: Composition-based stats.
Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ LAG G G++ DG P + A+ + P +D GNIYV+D+ N +RKI
Sbjct: 75 VITTLAGIGTAGFAGDGGPAAQAQLNGPLGLCIDGSGNIYVSDQGNKRVRKI 126
Score = 41.2 bits (95), Expect = 0.21, Method: Composition-based stats.
Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 11/88 (12%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESV----VKRLAGDGVQGYSDGEPGSA---RF 103
++ PG IVLDSS + + S V + +AG G G+S G+ G+A +
Sbjct: 268 LNYPGG---IVLDSSGTLYIVDSVNQRVRKVSGTTISTVAGTGTAGFS-GDGGAALQAQL 323
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ P D GN+YV D SN+ +RKIT
Sbjct: 324 NNPFPITADGSGNLYVGDVSNNRVRKIT 351
Score = 35.8 bits (81), Expect = 8.8, Method: Composition-based stats.
Identities = 17/68 (25%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADK 122
SR L+ ++ + +AG+G G++ G+ A+ ++ D GN+Y+A++
Sbjct: 3 SRFVLPLLAAAIASGQTITTVAGNGTAGFAGDTGQATQAQINRAVGLVTDANGNLYLAEE 62
Query: 123 SNHVIRKI 130
N+ +RK+
Sbjct: 63 LNNRVRKV 70
>gi|380693812|ref|ZP_09858671.1| hypothetical protein BfaeM_07498 [Bacteroides faecis MAJ27]
Length = 456
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 27/44 (61%)
Query: 88 DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+G GY DG G A F+KP VD +GNI + D NH IRKIT
Sbjct: 355 EGKSGYLDGPIGQAMFNKPGQMCVDSEGNIILTDTENHCIRKIT 398
>gi|29349488|ref|NP_812991.1| hypothetical protein BT_4080 [Bacteroides thetaiotaomicron
VPI-5482]
gi|298384984|ref|ZP_06994543.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|383120427|ref|ZP_09941155.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
gi|29341397|gb|AAO79185.1| putative cell surface protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251840523|gb|EES68605.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
gi|298262128|gb|EFI04993.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 456
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 27/44 (61%)
Query: 88 DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+G GY DG G A F+KP VD +GNI + D NH IRKIT
Sbjct: 355 EGKSGYLDGPIGQAMFNKPGQMCVDSEGNIILTDTENHCIRKIT 398
>gi|116749588|ref|YP_846275.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
gi|116698652|gb|ABK17840.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
Length = 1750
Score = 49.7 bits (117), Expect = 6e-04, Method: Composition-based stats.
Identities = 24/48 (50%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG GV GYS DG AR + P + AVD +IY+AD +NH IRK+
Sbjct: 465 VAGKGVAGYSGDDGPAAEARLNNPSAIAVDGSESIYIADTNNHRIRKV 512
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 41/127 (32%), Positives = 56/127 (44%), Gaps = 15/127 (11%)
Query: 12 ITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPG------SSDLIVLDSS 65
I L LLF SS + D DG LE N +D P S ++ + D++
Sbjct: 2 IFLAILLFPGPSSAERVVDTVAGLQSDDGPALEAN----LDSPSGVAVDTSGNVYLSDTN 57
Query: 66 RSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKS 123
L + ++ LAG GYS G A ++P+ AVD GN+Y AD +
Sbjct: 58 NHRIRKLDVAAGQ---IQTLAGGQSPGYSGDGGTAAKAGLNRPRGIAVDAAGNVYFADSN 114
Query: 124 NHVIRKI 130
NH IRKI
Sbjct: 115 NHCIRKI 121
Score = 44.3 bits (103), Expect = 0.023, Method: Composition-based stats.
Identities = 25/52 (48%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ +AG G G + DG P SAR P AVD GNIYVAD NH +R+I
Sbjct: 126 IITTVAGTGSAGSNGDGGPAASARLAYPFGIAVDPSGNIYVADLGNHKVRRI 177
Score = 42.7 bits (99), Expect = 0.067, Method: Composition-based stats.
Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 6/91 (6%)
Query: 42 QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS 100
+ +N S I GS + + D++ + + +AG+G GYS DG +
Sbjct: 482 EARLNNPSAIAVDGSESIYIADTNNHRIRKVD----GGGTITTVAGNGTPGYSGDGASAT 537
Query: 101 A-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
A + P AVD GN+++AD SNH +R +
Sbjct: 538 AASLNFPNGVAVDADGNVFIADTSNHRVRMV 568
Score = 42.0 bits (97), Expect = 0.11, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 79 ESVVKRLAGDGVQG-YSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V+ +AG G G DG P + AR P + AVD + NIY+AD +H IR++
Sbjct: 291 DGVINTVAGTGYGGSLEDGIPATGARLKSPVALAVDSQNNIYIADTYSHRIRRV 344
Score = 39.3 bits (90), Expect = 0.71, Method: Composition-based stats.
Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + V+ +AG+G GYS G SA P VD G +Y+AD N+ +RK+
Sbjct: 567 MVDSGVITTVAGNGTPGYSGDGGAAVSASLKAPHGVWVDATGALYIADAHNYRVRKV 623
>gi|325965074|ref|YP_004242980.1| thiol-disulfide isomerase-like thioredoxin [Arthrobacter
phenanthrenivorans Sphe3]
gi|323471161|gb|ADX74846.1| thiol-disulfide isomerase-like thioredoxin [Arthrobacter
phenanthrenivorans Sphe3]
Length = 674
Score = 49.7 bits (117), Expect = 6e-04, Method: Composition-based stats.
Identities = 36/113 (31%), Positives = 57/113 (50%), Gaps = 10/113 (8%)
Query: 29 EDGYTVTTVIDGHQLEI---NPHSVI-DRPGSSDLIVLDSSRSAFYTLSF-PLSEESVVK 83
ED + + GH L++ +P V+ R ++ +I + + F SF PL+ +
Sbjct: 325 EDAAGFSGQLVGHPLDVALSSPWDVVWSRKLNAVVIAMAGTHQIF---SFDPLTGSVAI- 380
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+AG+G++G DG P A F +P A D GNI+VAD +RK+ D G
Sbjct: 381 -VAGNGLEGLLDGAPHEAWFAQPSGLAEDADGNIWVADSETSALRKLVIDDDG 432
>gi|451980854|ref|ZP_21929239.1| conserved hypothetical protein, contains NHL repeats [Nitrospina
gracilis 3/211]
gi|451761976|emb|CCQ90480.1| conserved hypothetical protein, contains NHL repeats [Nitrospina
gracilis 3/211]
Length = 729
Score = 49.3 bits (116), Expect = 7e-04, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + ++ + G GV GYS DG P + A P D +GN+YVAD+ NHV+RK+
Sbjct: 68 IDQNGMLTNVVGTGVSGYSGDGGPATEATLKVPAGLTFDKEGNLYVADRENHVVRKV 124
Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats.
Identities = 21/52 (40%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ AG G GYS G A+ + P VD KGN++++D+SN+VIRK+
Sbjct: 129 IITTFAGTGKAGYSGDKGPATQAKLNLPSDMTVDHKGNLFISDRSNNVIRKV 180
Score = 47.0 bits (110), Expect = 0.004, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 36/57 (63%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ E + +AG+G Y+ DG P A F P+S +D +GN+Y+ D +N+VIRKI
Sbjct: 528 VTPEGKIITIAGNGSSDYTGDGGPAKDASFRDPQSLTMDKEGNLYIGDTANNVIRKI 584
Score = 43.9 bits (102), Expect = 0.035, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 5/57 (8%)
Query: 86 AGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
AG G +GY+ D P DKP AVD GN+Y+AD+ N+ IRK+ D G+ L+
Sbjct: 190 AGTGNEGYNGDNMPALRTNLDKPFGLAVDKHGNLYIADRGNNRIRKV---DAGSGLM 243
Score = 37.7 bits (86), Expect = 2.2, Method: Composition-based stats.
Identities = 18/39 (46%), Positives = 24/39 (61%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+G A P AVD +GN+YVAD++N+ IRKI L
Sbjct: 258 NGPAYRASIAGPTDVAVDDEGNVYVADRNNNRIRKINTL 296
Score = 37.7 bits (86), Expect = 2.5, Method: Composition-based stats.
Identities = 22/58 (37%), Positives = 34/58 (58%), Gaps = 3/58 (5%)
Query: 76 LSEESVVKRLAG-DGVQG-YSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + V+ +AG GVQG + DG + A +P A D KGN+Y+ D N+ IRK+
Sbjct: 640 ITRDGKVELVAGRPGVQGLFGDGGKATEAMLKQPACIAFDSKGNMYITDMGNNRIRKV 697
Score = 35.8 bits (81), Expect = 7.8, Method: Composition-based stats.
Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 11/78 (14%)
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQGYSD-----GEPGSARFDKP------KSFAVDM 113
S S F +L + ++ L G Q Y + G G + + A D
Sbjct: 452 SPSGFVSLIRKVDNNGIISLLISTGDQDYQEAVMTAGYTGRSSHSEITIITQFSGLAFDK 511
Query: 114 KGNIYVADKSNHVIRKIT 131
KGN+YV+D+ NH IRK+T
Sbjct: 512 KGNLYVSDRINHQIRKVT 529
>gi|428180908|gb|EKX49774.1| hypothetical protein GUITHDRAFT_67798, partial [Guillardia theta
CCMP2712]
Length = 247
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
V +AG G G+ DGE A F++P A D +GNI+VAD N IR IT HG
Sbjct: 182 VTTIAGTGECGHRDGEASQATFNEPCFIACDQQGNIFVADSRNDSIRCITR--HG 234
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 30/50 (60%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
+AG G +G DG S R + P+ D G++Y+AD SNH IR++ + D
Sbjct: 74 VAGCGRRGCQDGSHESGRLNGPRGLTFDPHGDLYIADSSNHRIRRLRSQD 123
>gi|158319004|ref|YP_001511512.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158114409|gb|ABW16606.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 892
Score = 49.3 bits (116), Expect = 7e-04, Method: Composition-based stats.
Identities = 34/90 (37%), Positives = 48/90 (53%), Gaps = 9/90 (10%)
Query: 45 INPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSA 101
++P+SV ID G+ + L S R T + +S+ LAG G GYS +G +A
Sbjct: 576 LSPYSVAIDPQGTLFITSLSSDRIQKVTRTGEVSD------LAGTGADGYSGDNGPATAA 629
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ + P S D GNIY+ D N+ IRKIT
Sbjct: 630 KLNGPGSAVPDKNGNIYIPDAQNYRIRKIT 659
Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats.
Identities = 22/58 (37%), Positives = 36/58 (62%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + ++ +AG G G+S DG P +A + + + A+ G+IY+AD NH IRKIT
Sbjct: 658 ITPDGIITTIAGTGTAGFSGDGGPATAAQINSAEKVAIGPDGSIYIADYDNHRIRKIT 715
Score = 42.4 bits (98), Expect = 0.096, Method: Composition-based stats.
Identities = 20/58 (34%), Positives = 36/58 (62%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + ++ +AG G+QGYS DG P +A + D P + G +Y+A+ ++ I+KIT
Sbjct: 714 ITPDGIINTIAGTGLQGYSGDGGPATAAKLDGPNDVELGDDGTLYIANLGSNTIQKIT 771
Score = 38.9 bits (89), Expect = 0.93, Method: Composition-based stats.
Identities = 19/56 (33%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++++ +V +AG+G +G+S DG P +A S ++ G IY+AD N+ +RK+
Sbjct: 770 ITKDGIVTTVAGNGQKGFSGDGGPATAAQLSVPSVSLGNGGEIYIADYGNNRVRKV 825
>gi|153808835|ref|ZP_01961503.1| hypothetical protein BACCAC_03135 [Bacteroides caccae ATCC 43185]
gi|149128661|gb|EDM19879.1| NHL repeat protein [Bacteroides caccae ATCC 43185]
Length = 426
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ E V + G G GY DG P A FD+P AVD +G IY+AD N IRK+
Sbjct: 368 IDREGAVSTVIGVPGRAGYVDGTPDDALFDEPWGVAVDEEGTIYIADTKNKCIRKL 423
Score = 43.9 bits (102), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
GY DG+ +ARF+ P D+ G +Y+AD SNH IR I
Sbjct: 330 GYEDGKRLNARFNFPSQICFDLDGIMYIADSSNHCIRSI 368
>gi|423220897|ref|ZP_17207391.1| hypothetical protein HMPREF1061_04164 [Bacteroides caccae
CL03T12C61]
gi|392622375|gb|EIY16503.1| hypothetical protein HMPREF1061_04164 [Bacteroides caccae
CL03T12C61]
Length = 426
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ E V + G G GY DG P A FD+P AVD +G IY+AD N IRK+
Sbjct: 368 IDREGAVSTVIGVPGRAGYVDGTPDDALFDEPWGVAVDEEGTIYIADTKNKCIRKL 423
Score = 43.9 bits (102), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
GY DG+ +ARF+ P D+ G +Y+AD SNH IR I
Sbjct: 330 GYEDGKRLNARFNFPSQICFDLDGIMYIADSSNHCIRSI 368
>gi|428179226|gb|EKX48098.1| hypothetical protein GUITHDRAFT_69001 [Guillardia theta CCMP2712]
Length = 334
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 50/117 (42%), Gaps = 13/117 (11%)
Query: 15 LSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF 74
++LL GL EDG G Q N I DL + D ++
Sbjct: 26 ITLLAGCEEDGL--EDGV-------GDQARFNSPVGIVESKEGDLYLCDQENHCIRKVT- 75
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ V LAG+G +GY DG+ AR + P +D +GNI AD N IRK++
Sbjct: 76 ---RKGEVTTLAGNGEEGYHDGKGKDARLNIPTGLCMDAQGNIIFADSGNQRIRKVS 129
Score = 48.9 bits (115), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 34/53 (64%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
E +V +AG+G G++DG+ ARF+ P+ +D + +Y+AD N IRK++
Sbjct: 184 EGMVTTVAGNGKMGWADGKGAKARFNSPQGMCIDKEDTVYIADYGNQRIRKMS 236
Score = 45.1 bits (105), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 34/55 (61%), Gaps = 1/55 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S + V +AG +G+ DG G A F+ P AVD KG+I+V+D NH IRKI
Sbjct: 128 VSPDGTVTTIAGS-KKGFKDGPAGKALFNYPAYVAVDSKGSIFVSDFGNHCIRKI 181
Score = 42.7 bits (99), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 33/56 (58%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S+E V +AG G G++ G ARF P+S +V G +YV D+ N +RKI+
Sbjct: 235 MSKEGEVVTVAGSGEPGFAHGHGQLARFRGPRSVSVSQDGIVYVGDRENFRVRKIS 290
Score = 40.4 bits (93), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 30/56 (53%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +E + LAG G DG ARF+ P +G++Y+ D+ NH IRK+T
Sbjct: 20 IDKEGWITLLAGCEEDGLEDGVGDQARFNSPVGIVESKEGDLYLCDQENHCIRKVT 75
>gi|344923600|ref|ZP_08777061.1| NHL repeat containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 1226
Score = 49.3 bits (116), Expect = 8e-04, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 43/77 (55%), Gaps = 5/77 (6%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
GS + V DS+ + ++ P SV LAG GV Y++G +ARF P+ VD+
Sbjct: 35 GSGTVYVADSNNNVIKKIT-PAGTVSV---LAGSGVSDYAEGTGTAARFSYPEGVGVDVA 90
Query: 115 G-NIYVADKSNHVIRKI 130
G +YVAD N VIRKI
Sbjct: 91 GTTVYVADSQNGVIRKI 107
Score = 41.6 bits (96), Expect = 0.16, Method: Composition-based stats.
Identities = 19/42 (45%), Positives = 24/42 (57%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + DG +ARF P S AVD N+Y+ D NH IRKI
Sbjct: 123 ALNAHVDGTYTTARFAYPTSVAVDSSSNLYIGDTLNHCIRKI 164
>gi|294630411|ref|ZP_06708971.1| NHL repeat protein [Streptomyces sp. e14]
gi|292833744|gb|EFF92093.1| NHL repeat protein [Streptomyces sp. e14]
Length = 609
Score = 49.3 bits (116), Expect = 8e-04, Method: Composition-based stats.
Identities = 26/74 (35%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S + +V D++R L+ ESVV+R+ G G +G++DG A F++P+ A+ G
Sbjct: 193 SGNFLVSDTTRHQLVELA--ADGESVVRRI-GSGARGFADGAAEDAAFNEPQGLALLDDG 249
Query: 116 NIYVADKSNHVIRK 129
+ VAD NH +R+
Sbjct: 250 AVVVADTVNHALRR 263
>gi|398339828|ref|ZP_10524531.1| hypothetical protein LkirsB1_10395 [Leptospira kirschneri serovar
Bim str. 1051]
Length = 358
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++++ VV AG GV DG +A F P VD GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ ++IRKI GN NL +D S
Sbjct: 96 DQMTNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 30/47 (63%), Gaps = 1/47 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+ G +G+ ++ FD P +D++ N+YV + SNH IRKI NLD G
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSG 209
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G+ GY DG+ SARF P A D K + + AD +H IRKI
Sbjct: 217 GISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKI 259
>gi|403382146|ref|ZP_10924203.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
JC66]
Length = 531
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 52/115 (45%), Gaps = 12/115 (10%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEINPHS-----VIDRPGSSDLIVLDSSRSAFYTLSFP 75
L S L EDG V + DG + E++ S +D G +L V D A +
Sbjct: 98 LAGSSFLQEDGQVVDALGDG-KGELSSFSEPAGLALDHNG--NLFVADKGNHAVRKVD-- 152
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
E V AG GV G+ DG + F P+ V G +YVAD NHVIRKI
Sbjct: 153 --AEGNVTTYAGQGVLGHKDGTAEESLFYAPEDVVVASDGTVYVADTLNHVIRKI 205
Score = 42.7 bits (99), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 4/44 (9%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
LAGD Y DG A+F++P A+D GN+Y++D N VIR
Sbjct: 231 LAGD----YKDGPLQEAKFNEPTGLAIDHLGNLYISDTGNRVIR 270
Score = 41.2 bits (95), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 42/89 (47%), Gaps = 4/89 (4%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH-GNQ 138
S V+ +AG G G DG+ A F P A G +YV+D +H++R+ LDH G
Sbjct: 40 SSVQTIAGTGSFGARDGDKAEASFRHPSGLAAAPDGTLYVSDTKSHLLRR---LDHSGVS 96
Query: 139 LIRQINLKPEDCSKSSQSGSGMFWVTVFS 167
L+ + ED G G ++ FS
Sbjct: 97 LLAGSSFLQEDGQVVDALGDGKGELSSFS 125
>gi|225420274|ref|ZP_03762577.1| hypothetical protein CLOSTASPAR_06617 [Clostridium asparagiforme
DSM 15981]
gi|225041091|gb|EEG51337.1| hypothetical protein CLOSTASPAR_06617 [Clostridium asparagiforme
DSM 15981]
Length = 336
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 28/48 (58%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ LAG G +GY D G A F P A D GN+Y+AD N VIR++
Sbjct: 150 RTLAGRGREGYEDNRGGRALFSGPTGLAADDAGNLYIADTGNDVIRRL 197
Score = 44.7 bits (104), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
G +G +DG G ARF P AV +G +YV+D N +RKI N
Sbjct: 247 GAEGQADGGFGQARFSSPTYLAVSEQGTLYVSDTGNAAVRKIEN 290
>gi|298386946|ref|ZP_06996500.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
gi|298260096|gb|EFI02966.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
Length = 457
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 77 SEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ E + AGD G G++DG+ A+F P+ +D + N+Y+AD NH IRK+T
Sbjct: 345 TREGNFRIYAGDKGRAGHADGQETEAQFSSPRQLVLDEEENLYIADSGNHCIRKVT 400
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ E VV + G+ GY DG P A F +P A+D +G IY+ DK N +RK++
Sbjct: 399 VTPEGVVSTVIGNPNTSGYKDGTPEIALFTEPWGLAIDSEGTIYIGDKDNRCVRKLS 455
>gi|421091013|ref|ZP_15551797.1| putative lipoprotein [Leptospira kirschneri str. 200802841]
gi|410000210|gb|EKO50873.1| putative lipoprotein [Leptospira kirschneri str. 200802841]
Length = 358
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++++ VV AG GV DG +A F P VD GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ ++IRKI GN NL +D S
Sbjct: 96 DQMTNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 30/47 (63%), Gaps = 1/47 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+ G +G+ ++ FD P +D++ N+YV + SNH IRKI NLD G
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSG 209
Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G+ GY DG+ SARF P A D K + + AD +H IRKI
Sbjct: 217 GISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKI 259
>gi|410098158|ref|ZP_11293137.1| hypothetical protein HMPREF1076_02315 [Parabacteroides goldsteinii
CL02T12C30]
gi|409222431|gb|EKN15373.1| hypothetical protein HMPREF1076_02315 [Parabacteroides goldsteinii
CL02T12C30]
Length = 446
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 29/46 (63%), Gaps = 3/46 (6%)
Query: 89 GVQ---GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GVQ G+ DGE A F++PK D G +YVAD NHVIRKIT
Sbjct: 344 GVQNETGWQDGEREEALFNEPKQICFDQDGVMYVADAGNHVIRKIT 389
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + VV + G GV+GY DG P A F P A+D +G IYV D N+ +RK+
Sbjct: 388 ITRDGVVSTVIGIAGVKGYVDGSPEDALFQYPTGVAIDKEGTIYVGDARNNCVRKL 443
Score = 36.2 bits (82), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 28/121 (23%), Positives = 54/121 (44%), Gaps = 10/121 (8%)
Query: 53 RPGSSDLIVLDSSRSAF-YTLSFPLSEESVVKRLAGDGVQGYS------DGEPGSARFDK 105
+PG+ I + + + + +F + ++ V +AG V G DG A FD+
Sbjct: 106 QPGAECQISVAIGKDSLTFENTFSYTTQTTVSTIAGKPVGGDDGHGEMIDGTLAEASFDQ 165
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTV 165
P VD + NI+V+++ H +R+I + ++ + +C + S + TV
Sbjct: 166 PWFLCVDAEKNIFVSERLGHAVRQI---NEEKNVVSTLVKGEGNCQNPNASATDAEGKTV 222
Query: 166 F 166
F
Sbjct: 223 F 223
>gi|357485931|ref|XP_003613253.1| hypothetical protein MTR_5g034550 [Medicago truncatula]
gi|355514588|gb|AES96211.1| hypothetical protein MTR_5g034550 [Medicago truncatula]
Length = 154
Score = 49.3 bits (116), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 44/107 (41%), Gaps = 36/107 (33%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-------------- 131
+ +G G+ D + AR + PK VD +GNIYVAD N IRKI+
Sbjct: 11 SAEGYSGHVDEKLREARMNHPKGITVDDRGNIYVADIINMAIRKISLGNNMTYLSFLYEE 70
Query: 132 ----------------------NLDHGNQLIRQINLKPEDCSKSSQS 156
+D G Q IR+I L+ +DC+ +S
Sbjct: 71 SLILFYLLLFDVIYVGSSYSLLVIDRGKQAIREIQLRFDDCAYQYES 117
>gi|429202583|ref|ZP_19193964.1| antioxidant, AhpC/TSA family [Streptomyces ipomoeae 91-03]
gi|428661888|gb|EKX61363.1| antioxidant, AhpC/TSA family [Streptomyces ipomoeae 91-03]
Length = 619
Score = 48.9 bits (115), Expect = 9e-04, Method: Composition-based stats.
Identities = 33/99 (33%), Positives = 52/99 (52%), Gaps = 10/99 (10%)
Query: 39 DGHQLEINPHSVIDR-PGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
DG + P + R PG + D +V D++R L+ P E+VV+R G G +
Sbjct: 170 DGPYVPPEPDPTVLRFPGKALLLPDGDFLVSDTTRHRLVRLA-P-DGETVVRRY-GTGER 226
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G+ DG ARF +P+ A+ G++ VAD NH +R++
Sbjct: 227 GFVDGPADHARFSEPQGLALLDSGDVVVADTVNHALRRL 265
>gi|365861024|ref|ZP_09400809.1| alkyl hydroperoxide reductase [Streptomyces sp. W007]
gi|364009547|gb|EHM30502.1| alkyl hydroperoxide reductase [Streptomyces sp. W007]
Length = 560
Score = 48.9 bits (115), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 3/71 (4%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
+V D++R L ++ V+R G G +G SDG P ARF +P+ AV G + V
Sbjct: 144 LVSDTTRHRLVELD---ADGETVRRHFGTGERGLSDGGPDEARFSEPQGLAVLPDGRVAV 200
Query: 120 ADKSNHVIRKI 130
AD NH +R +
Sbjct: 201 ADTVNHALRAL 211
>gi|296136309|ref|YP_003643551.1| NHL repeat containing protein [Thiomonas intermedia K12]
gi|295796431|gb|ADG31221.1| NHL repeat containing protein [Thiomonas intermedia K12]
Length = 366
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S + VV LAG G G+ DG ARF+ P +D +G IYV++ N+VIRKIT
Sbjct: 196 ISAKGVVSTLAGSPGDTGWRDGRGAQARFNTPVGLTLDAQGQIYVSEYFNNVIRKIT 252
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 34/57 (59%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S E V LAG V+G++DG ARF+ P V+ KG +YVAD N +R+I+
Sbjct: 141 ISREGRVSTLAGKLEVEGFADGTGQQARFNHPVGLGVNAKGVVYVADAYNSTVRRIS 197
Score = 43.5 bits (101), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 26/38 (68%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
Y+ G+ ARF P+ A+D +GNI+VAD N V+RK+
Sbjct: 49 YTGGDFAQARFHDPRGMALDAQGNIFVADYVNSVVRKL 86
>gi|358461043|ref|ZP_09171215.1| serine/threonine protein kinase [Frankia sp. CN3]
gi|357074242|gb|EHI83734.1| serine/threonine protein kinase [Frankia sp. CN3]
Length = 866
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + ++ +AG G +GY+ DG P +A+ P S VD G IYVAD N VIR+I
Sbjct: 797 VTSDGIITTIAGTGAEGYTGDGGPANAAQLSDPTSVVVDAGGAIYVADNGNKVIRRI 853
Score = 40.0 bits (92), Expect = 0.42, Method: Composition-based stats.
Identities = 21/57 (36%), Positives = 34/57 (59%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++++ VV +AG+ G+S DG P +A + P A D GN+Y+ D N+ +RKI
Sbjct: 574 IAKDGVVTPIAGNAQDGFSGDGGPATAAQLYGPGRVAWDKAGNLYIPDTQNYRVRKI 630
>gi|239989050|ref|ZP_04709714.1| hypothetical protein SrosN1_17240 [Streptomyces roseosporus NRRL
11379]
gi|291446053|ref|ZP_06585443.1| alkyl hydroperoxide reductase/Thiol specific antioxidant/Mal
allergen [Streptomyces roseosporus NRRL 15998]
gi|291349000|gb|EFE75904.1| alkyl hydroperoxide reductase/Thiol specific antioxidant/Mal
allergen [Streptomyces roseosporus NRRL 15998]
Length = 605
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
+V D++R L ++ V+R G G +G +DG P ARF +P+ AV G +
Sbjct: 188 FLVSDTTRHRLVELD---ADGETVRRHFGTGERGLNDGGPDEARFSEPQGLAVLPDGRVA 244
Query: 119 VADKSNHVIRKI 130
VAD NH IR +
Sbjct: 245 VADTVNHAIRAL 256
>gi|223939666|ref|ZP_03631540.1| NHL repeat containing protein [bacterium Ellin514]
gi|223891717|gb|EEF58204.1| NHL repeat containing protein [bacterium Ellin514]
Length = 452
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 75 PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
P VV +AG G+ G+ DG A F P A+D GN+YVAD N+ IRK+T L
Sbjct: 203 PDGTNWVVTTIAGAAGIHGFKDGTNNDALFYSPIGLALDSAGNMYVADTGNNAIRKLT-L 261
Query: 134 DHGNQLIRQI 143
+ N ++ I
Sbjct: 262 EGTNWVVTTI 271
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 41/78 (52%), Gaps = 3/78 (3%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDM 113
+ ++ V D+ +A L+ VV +AG Q G DG A F P+S AVD
Sbjct: 243 AGNMYVADTGNNAIRKLTLE-GTNWVVTTIAGSTNQQNGSLDGTNNQALFTWPESPAVDS 301
Query: 114 KGNIYVADKSNHVIRKIT 131
GN+YVAD N+ IRK+T
Sbjct: 302 AGNVYVADSYNYTIRKVT 319
Score = 42.0 bits (97), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ ++ VV +AG + G DG ARF P AVD GN++VA+ + IRK++
Sbjct: 88 IGKDWVVTTIAGTNQAYGTKDGTNAEARFTGPTGLAVDASGNVFVAEGYANTIRKLS 144
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
DG +A F P+S A D GN+YVAD + VIRK+
Sbjct: 50 DGTNTAAHFYSPQSVACDASGNVYVADYNGRVIRKV 85
>gi|86610239|ref|YP_479001.1| NHL repeat-containing protein [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86558781|gb|ABD03738.1| NHL repeat protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 649
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 31/57 (54%)
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
F LS E ++ AG G G DG A+FD P+ D KG ++VAD NH +R I
Sbjct: 364 FRLSPEGELEVWAGTGQAGRRDGAANQAQFDSPQGLLWDPKGGLWVADSGNHCLRYI 420
Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats.
Identities = 21/50 (42%), Positives = 30/50 (60%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V AG + GY DG A+F +P A+ G++YVAD++N IR+IT
Sbjct: 426 VSTFAGTCIAGYRDGGLDRAQFREPSGLALGSDGSLYVADRANRRIRRIT 475
Score = 42.4 bits (98), Expect = 0.091, Method: Composition-based stats.
Identities = 18/39 (46%), Positives = 23/39 (58%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
GY DG ARF A D GN++VAD+ NH IR++
Sbjct: 541 GYRDGPLAEARFQTLGGLAFDAAGNLWVADRDNHRIRRL 579
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 20/55 (36%), Positives = 28/55 (50%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
V LAG G+ DG ARFD+P AV G++ V D +R+I+ + G
Sbjct: 585 VSTLAGQNQPGWQDGPTSVARFDQPAGLAVLPDGSVVVVDAGLPGLRRISRVASG 639
Score = 37.7 bits (86), Expect = 2.6, Method: Composition-based stats.
Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V +AG G G DG A+ +P + AVD +GN+++AD H +R+++
Sbjct: 480 VTTVAGTGQPGSVDGPAEQAQLLQPTALAVDKEGNLWIAD--GHRLRRLS 527
>gi|410938688|ref|ZP_11370531.1| putative lipoprotein [Leptospira noguchii str. 2006001870]
gi|410786234|gb|EKR75182.1| putative lipoprotein [Leptospira noguchii str. 2006001870]
Length = 358
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 43/91 (47%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + +V AG G DG +A F P +D GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITNKPIVSLFAGTGAAVSVDGTTSTASFKTPFGLEIDTSGNIFVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D++ ++IRKI GN NL ED S
Sbjct: 96 DQTANLIRKID--PSGNVKTLSTNLPLEDPS 124
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G G+ GY DG+ SARF P A + K + + AD +H IRKI
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNHKTDSLLAADIQDHRIRKI 259
>gi|410694120|ref|YP_003624742.1| putative Soluble quinoprotein glucose dehydrogenase [Thiomonas sp.
3As]
gi|294340545|emb|CAZ88930.1| putative Soluble quinoprotein glucose dehydrogenase [Thiomonas sp.
3As]
Length = 366
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S + VV LAG G G+ DG ARF+ P +D +G IYV++ N+VIRKIT
Sbjct: 196 ISAKGVVSTLAGSPGDTGWRDGRGAQARFNTPVGLTLDAQGQIYVSEYFNNVIRKIT 252
Score = 43.1 bits (100), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 26/38 (68%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
Y+ G+ ARF P+ A+D +GNI+VAD N V+RK+
Sbjct: 49 YNGGDFAQARFHDPRGMALDAQGNIFVADYVNSVVRKL 86
>gi|421129432|ref|ZP_15589632.1| putative lipoprotein [Leptospira kirschneri str. 2008720114]
gi|410358807|gb|EKP05916.1| putative lipoprotein [Leptospira kirschneri str. 2008720114]
Length = 358
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++++ VV AG GV DG +A F P VD GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ ++IRKI GN NL +D S
Sbjct: 96 DQMTNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 40.0 bits (92), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 30/47 (63%), Gaps = 1/47 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+ G +G+ ++ FD P +D++ N+YV + SNH IRKI NLD G
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSG 209
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G+ GY DG+ SARF P A D K + + AD +H IRKI
Sbjct: 217 GISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKI 259
>gi|380692925|ref|ZP_09857784.1| hypothetical protein BfaeM_02978 [Bacteroides faecis MAJ27]
Length = 454
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 44/76 (57%), Gaps = 3/76 (3%)
Query: 59 LIVLDSSRSAFYTLSFPLSEES--VVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKG 115
L ++ + Y +++ + + + + AGD G GY+ G+ ARF++P + +D +G
Sbjct: 323 LYIIARKKHCIYKVAYNAATHTFGIPELFAGDYGESGYASGKGTGARFNQPSTPCLDPEG 382
Query: 116 NIYVADKSNHVIRKIT 131
N+ + DK NH IRKIT
Sbjct: 383 NLLIPDKMNHCIRKIT 398
>gi|171913873|ref|ZP_02929343.1| NHL repeat containing protein [Verrucomicrobium spinosum DSM 4136]
Length = 370
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 36/57 (63%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSAR-FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + ++ +AG+G +GY+ DG P +A FD P D GN Y+AD SNH IRK+
Sbjct: 84 VTPDGKIETMAGNGKKGYTGDGGPATAASFDLPHEIRFDKAGNYYIADMSNHAIRKV 140
Score = 40.4 bits (93), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 8/87 (9%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARF 103
PH + D+ G + + D S A + ++ ++ LAG G GY+ DG A
Sbjct: 116 PHEIRFDKAG--NYYIADMSNHAIRKVD---AKTGIITTLAGTGRGGYTGDDGPSEKAEL 170
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI 130
+P S G++Y+ D NHVIRKI
Sbjct: 171 KQPHSIQFGPDGDLYICDIGNHVIRKI 197
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 2/55 (3%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ ++ +AG G G++ +G P A PK A+D +GN ++AD +H +R I
Sbjct: 257 KAGIIHHIAGTGASGFTGNGGPAKLATLKGPKGIALDAEGNAWLADTESHTVRVI 311
>gi|196233467|ref|ZP_03132310.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
gi|196222463|gb|EDY16990.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
Length = 509
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 45/87 (51%), Gaps = 6/87 (6%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
NP + ID G+ D V D + +S P E + AG+ G +DG +ARF
Sbjct: 224 NPSGLAIDEHGNVD--VADRTNQTLRRIS-PSGETETIAGTAGE--HGTADGAGKAARFW 278
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
P+ A+D GN+YVAD S +RK+T
Sbjct: 279 YPQGLAIDEHGNVYVADTSAKSVRKVT 305
Score = 42.0 bits (97), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 8/68 (11%)
Query: 71 TLSFPLSEESVVKRL-------AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
T+ SE VV L AG+ +G +DG +A+F P A+D GN+ VAD++
Sbjct: 184 TVKGASSEPRVVNELPCTFHVFAGE-TEGAADGAGVTAQFKNPSGLAIDEHGNVDVADRT 242
Query: 124 NHVIRKIT 131
N +R+I+
Sbjct: 243 NQTLRRIS 250
>gi|290972390|ref|XP_002668936.1| predicted protein [Naegleria gruberi]
gi|284082474|gb|EFC36192.1| predicted protein [Naegleria gruberi]
Length = 727
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 40/126 (31%), Positives = 60/126 (47%), Gaps = 23/126 (18%)
Query: 27 LLEDGYTVTTVIDGHQ-------LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF----- 74
+LE+G VT +G Q L +N V S V SS++ Y +
Sbjct: 35 ILENGNIVTIAGNGQQGYSGDNGLAVNAQLV------SPSYVFVSSKNEVYIADYQNNRI 88
Query: 75 -PLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ E + +AG+G QGY+ +G +A+ P S V +K +Y+AD NH IRKI
Sbjct: 89 RKILENGNIITIAGNGQQGYNGDNGLAVNAQLASPSSVCVSLKNEVYIADSGNHSIRKI- 147
Query: 132 NLDHGN 137
L++GN
Sbjct: 148 -LENGN 152
Score = 39.3 bits (90), Expect = 0.85, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 32/55 (58%), Gaps = 4/55 (7%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
+AG+G QGYS +G +A+ P V K +Y+AD N IRKI L++GN
Sbjct: 212 IAGNGQQGYSGDNGPAVNAKLSSPVDVFVSSKNEVYIADYQNTRIRKI--LENGN 264
Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 33/62 (53%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G QGY+ +G +A P S V +Y+AD NH IRKI L +
Sbjct: 149 ENGNIVTIAGNGQQGYNGDNGPAVNAPLFDPSSVFVTPNDEVYIADTGNHRIRKI--LRN 206
Query: 136 GN 137
GN
Sbjct: 207 GN 208
>gi|375148602|ref|YP_005011043.1| NHL repeat containing protein [Niastella koreensis GR20-10]
gi|361062648|gb|AEW01640.1| NHL repeat containing protein [Niastella koreensis GR20-10]
Length = 716
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 30/81 (37%), Positives = 43/81 (53%), Gaps = 8/81 (9%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFD--KPKSFAV 111
S ++ + D S S ++ V+ AG+G GYS DG P A+F P+ AV
Sbjct: 229 SGNIYIADPSNSVVRKIN----PAGVITTFAGNGTAGYSGDGGPAIKAQFQMGSPQGLAV 284
Query: 112 DMKGNIYVADKSNHVIRKITN 132
D GN+Y +D NH IRKI++
Sbjct: 285 DPAGNVYASDYQNHAIRKISS 305
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 21/57 (36%), Positives = 36/57 (63%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ ++ AG+G +G+S G+ A+ +P + AVD GN+Y++D SN VIRK+
Sbjct: 132 VTTSGIMSIFAGNGTEGFSGDGGQAAQAKLYRPTAVAVDKDGNLYISDASNKVIRKV 188
Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats.
Identities = 26/58 (44%), Positives = 37/58 (63%), Gaps = 3/58 (5%)
Query: 76 LSEESVVKRLAG-DGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++++ V+ +AG G GY+ DG P + A +P AVD GNIY+AD SN V+RKI
Sbjct: 188 VNKQGVISTIAGVPGRAGYAGDGGPATKALLTQPAGIAVDYSGNIYIADPSNSVVRKI 245
Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 40/73 (54%), Gaps = 5/73 (6%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
++L +S S Y+ + + V + G G G +ARF++P++ A D N+Y+
Sbjct: 11 LLLFASSSVAYSQTI-----TTVAGVVGSPGYGGDAGPANNARFNQPRAVATDNDNNLYI 65
Query: 120 ADKSNHVIRKITN 132
AD NHVIRK+ +
Sbjct: 66 ADMRNHVIRKVNS 78
>gi|290985802|ref|XP_002675614.1| predicted protein [Naegleria gruberi]
gi|284089211|gb|EFC42870.1| predicted protein [Naegleria gruberi]
Length = 848
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 55/102 (53%), Gaps = 8/102 (7%)
Query: 32 YTVTTVIDG-HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
+T+ T+I +Q++ P +I P S DL + + + + S S E V +AG+G
Sbjct: 521 FTIQTIIQAPNQVQFLPQQLIYGP-SCDLYIACTLENQIFKYS---SSEGFV-LIAGNGK 575
Query: 91 QGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
GY+ G AR + PK+ AV+ G +Y+AD N V+RKI
Sbjct: 576 VGYTGDYGPAIYARLNNPKAIAVNKFGEVYIADSFNKVVRKI 617
>gi|418677567|ref|ZP_13238841.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
str. RM52]
gi|418687786|ref|ZP_13248945.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
str. Moskva]
gi|418742122|ref|ZP_13298495.1| putative lipoprotein [Leptospira kirschneri serovar Valbuzzi str.
200702274]
gi|400320757|gb|EJO68617.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
str. RM52]
gi|410738110|gb|EKQ82849.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
str. Moskva]
gi|410750480|gb|EKR07460.1| putative lipoprotein [Leptospira kirschneri serovar Valbuzzi str.
200702274]
Length = 358
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++++ VV AG GV DG +A F P VD GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGIISTASFKTPFGLEVDTSGNIFVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ ++IRKI GN NL +D S
Sbjct: 96 DQMTNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 30/47 (63%), Gaps = 1/47 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+ G +G+ ++ FD P +D++ N+YV + SNH IRKI NLD G
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSG 209
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G+ GY DG+ SARF P A D K + + AD +H IRKI
Sbjct: 217 GISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKI 259
>gi|336178839|ref|YP_004584214.1| serine/threonine protein kinase [Frankia symbiont of Datisca
glomerata]
gi|334859819|gb|AEH10293.1| serine/threonine protein kinase [Frankia symbiont of Datisca
glomerata]
Length = 776
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 35/54 (64%), Gaps = 2/54 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
V+ +AG G GYS DG P +A + +P A D GN+Y+AD +N+ IR++++
Sbjct: 608 VITTVAGSGTSGYSGDGGPATAAQLARPGGLAADTAGNLYIADNANNRIRRVSS 661
Score = 43.9 bits (102), Expect = 0.036, Method: Composition-based stats.
Identities = 22/57 (38%), Positives = 34/57 (59%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S + V+ +AG G GYS DG P + A+ P S AV G +Y+AD N+ +R++
Sbjct: 659 VSSDGVIITVAGSGTSGYSGDGGPATAAQLANPGSVAVTDDGRVYIADTDNNRVRRV 715
Score = 43.1 bits (100), Expect = 0.055, Method: Composition-based stats.
Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + ++ +AG G G+S +G A F P A D GN+Y+AD N+ IR+I
Sbjct: 491 IGADGIITTVAGTGTYGFSGDNGPAAQAHFATPTDVARDRAGNLYIADTDNNRIRRI 547
Score = 41.6 bits (96), Expect = 0.14, Method: Composition-based stats.
Identities = 22/45 (48%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 92 GYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
GYS DG P +A + KP S VD G +Y+AD NH IR+I D
Sbjct: 563 GYSGDGGPATAAQLAKPTSVLVDADGTLYIADTGNHRIRRIGTND 607
>gi|418693423|ref|ZP_13254475.1| putative lipoprotein [Leptospira kirschneri str. H1]
gi|409958780|gb|EKO17669.1| putative lipoprotein [Leptospira kirschneri str. H1]
Length = 358
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 43/91 (47%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ ++IRKI GN NL +D S
Sbjct: 96 DQITNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+ G +G+ + FD P +D + N+YV + SNH IRKI NLD G
Sbjct: 164 LSGLQNGDRLDSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLDSG 209
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G+ GY DG+ SARF P A D K + + AD +H IRKI
Sbjct: 217 GISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKI 259
>gi|329927682|ref|ZP_08281814.1| copper amine oxidase N-terminal domain protein [Paenibacillus sp.
HGF5]
gi|328938318|gb|EGG34710.1| copper amine oxidase N-terminal domain protein [Paenibacillus sp.
HGF5]
Length = 1251
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 30/87 (34%), Positives = 43/87 (49%), Gaps = 4/87 (4%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
+N S I G + + DS A + + V +AG+G G DG+ A F
Sbjct: 111 LNRPSGIAVDGKGQVYIADSGNHAIRKID----QAGRVNTIAGNGRIGLKDGKAQDALFH 166
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
+P+ AV G +YVAD NHVIR+I+
Sbjct: 167 EPQDVAVTEDGILYVADTLNHVIRRIS 193
Score = 44.3 bits (103), Expect = 0.025, Method: Composition-based stats.
Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 4/46 (8%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LAGD Y +G A+F++P A+D KGN+YV+D N IR I
Sbjct: 218 LAGD----YKNGSLAEAQFNEPAGLALDGKGNLYVSDSGNQAIRYI 259
Score = 40.0 bits (92), Expect = 0.50, Method: Composition-based stats.
Identities = 16/30 (53%), Positives = 20/30 (66%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
A ++P AVD KG +Y+AD NH IRKI
Sbjct: 109 AFLNRPSGIAVDGKGQVYIADSGNHAIRKI 138
Score = 39.7 bits (91), Expect = 0.53, Method: Composition-based stats.
Identities = 25/90 (27%), Positives = 39/90 (43%), Gaps = 14/90 (15%)
Query: 55 GSSDLIVLDSSRSAFYTLSF------------PLSEESVVKRLAGDGVQGYSDGEPGSAR 102
G +L V DS A + F P S+ L D GY DG +AR
Sbjct: 242 GKGNLYVSDSGNQAIRYIDFGKGTVSTAAGSVPSSDGYAKDALYAD--PGYLDGAADAAR 299
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
F+ P+ A ++ + +AD NH +R++ +
Sbjct: 300 FNSPRGLAWSVEDGLLIADSHNHAVRQLKD 329
>gi|357400623|ref|YP_004912548.1| hypothetical protein SCAT_3039 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|386356676|ref|YP_006054922.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|337767032|emb|CCB75743.1| conserved protein of unknown function [Streptomyces cattleya NRRL
8057 = DSM 46488]
gi|365807185|gb|AEW95401.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
Length = 613
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 29/76 (38%), Positives = 47/76 (61%), Gaps = 4/76 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+ +L+V DS+R A L+ P E+VV+R+ G G +G +DG ARF +P+ A+ G
Sbjct: 192 TGNLLVSDSTRHALVELA-P-DGETVVRRI-GSGERGLADGPADRARFSEPQGLALLPDG 248
Query: 116 -NIYVADKSNHVIRKI 130
++ VAD NH +R++
Sbjct: 249 ESVIVADTVNHALRRV 264
>gi|421106984|ref|ZP_15567544.1| putative lipoprotein [Leptospira kirschneri str. H2]
gi|410007898|gb|EKO61577.1| putative lipoprotein [Leptospira kirschneri str. H2]
Length = 358
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 43/91 (47%), Gaps = 2/91 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
D+ ++IRKI GN NL +D S
Sbjct: 96 DQITNLIRKID--PSGNVKTLSTNLPLQDPS 124
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+ G +G+ + FD P +D + N+YV + SNH IRKI NLD G
Sbjct: 164 LSGLQNGDRLDSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLDSG 209
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G+ GY DG+ SARF P A D K + + AD +H IRKI
Sbjct: 217 GISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKI 259
>gi|332027687|gb|EGI67755.1| NHL repeat-containing protein 2 [Acromyrmex echinatior]
Length = 686
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 35/126 (27%), Positives = 59/126 (46%), Gaps = 5/126 (3%)
Query: 5 FVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDS 64
+Y+ +AI L + + GL L+ + + L +V+ + L++ DS
Sbjct: 180 LLYTKIAIAYFKSLKQMSNHGLPLKPAHHLLPSSKDGLLFPGKLTVLQLEQETKLVISDS 239
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
+ + E V+ + G QG+ DG+ +ARF+ P+ V + IYVAD +N
Sbjct: 240 GNNRIVITN----EHGRVEHVIGGCNQGFKDGDFKNARFNSPQGVCV-LNNIIYVADNNN 294
Query: 125 HVIRKI 130
H IRKI
Sbjct: 295 HAIRKI 300
>gi|261404592|ref|YP_003240833.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
Y412MC10]
gi|261281055|gb|ACX63026.1| copper amine oxidase domain protein [Paenibacillus sp. Y412MC10]
Length = 1280
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 30/87 (34%), Positives = 43/87 (49%), Gaps = 4/87 (4%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
+N S I G + + DS A + + V +AG+G G DG+ A F
Sbjct: 140 LNRPSGIAVDGKGQVYIADSGNHAIRKID----QAGRVTTIAGNGRIGLKDGKAQDALFH 195
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
+P+ AV G +YVAD NHVIR+I+
Sbjct: 196 EPQDVAVTEDGILYVADTLNHVIRRIS 222
Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats.
Identities = 22/46 (47%), Positives = 29/46 (63%), Gaps = 4/46 (8%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LAGD Y +G A+F++P A+D KGN+YV+D NH IR I
Sbjct: 247 LAGD----YKNGSLAEAQFNEPAGLALDGKGNLYVSDSGNHAIRYI 288
Score = 40.0 bits (92), Expect = 0.49, Method: Composition-based stats.
Identities = 16/30 (53%), Positives = 20/30 (66%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
A ++P AVD KG +Y+AD NH IRKI
Sbjct: 138 AFLNRPSGIAVDGKGQVYIADSGNHAIRKI 167
Score = 39.7 bits (91), Expect = 0.64, Method: Composition-based stats.
Identities = 25/90 (27%), Positives = 39/90 (43%), Gaps = 14/90 (15%)
Query: 55 GSSDLIVLDSSRSAFYTLSF------------PLSEESVVKRLAGDGVQGYSDGEPGSAR 102
G +L V DS A + F P S+ L D GY DG +AR
Sbjct: 271 GKGNLYVSDSGNHAIRYIDFGKGTVSTAAGSVPSSDGYAKDALYAD--PGYLDGAADAAR 328
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
F+ P+ A ++ + +AD NH +R++ +
Sbjct: 329 FNSPRGLAWSVEDGLLIADSHNHAVRQLKD 358
Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats.
Identities = 23/71 (32%), Positives = 36/71 (50%), Gaps = 7/71 (9%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM-KGNI 117
L++ DS A L ++ V +AG G +GY+DG RF+ P AV G +
Sbjct: 343 LLIADSHNHAVRQL-----KDERVSTMAG-GTRGYADGIESEVRFNAPADIAVASDSGEL 396
Query: 118 YVADKSNHVIR 128
++AD+ N +R
Sbjct: 397 FIADQRNGAVR 407
>gi|289667480|ref|ZP_06488555.1| putative NHL repeat protein [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 310
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
L + V LAG G +G++DG +A F+ P +D GN+Y+AD NH IRK+T
Sbjct: 42 LGADGHVHTLAG-GREGFADGIGVAAAFNTPSGIMLDTAGNLYIADTGNHAIRKLT 96
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 29/52 (55%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
L+ + V LAGDGV G +G RF+ P AVD +G +YVAD N I
Sbjct: 95 LTPQGKVTTLAGDGVAGDRNGAAAQVRFNGPVGVAVDAQGRVYVADTYNDRI 146
>gi|345000781|ref|YP_004803635.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces sp. SirexAA-E]
gi|344316407|gb|AEN11095.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces sp. SirexAA-E]
Length = 608
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
+V D++R L ++ V+ G G +G +DG PG ARF +P+ A+ G I
Sbjct: 193 FLVSDTTRHRLVELD---ADGETVRGHFGTGERGLTDGGPGEARFSEPQGLALLPDGRIA 249
Query: 119 VADKSNHVIRKI 130
VAD NH IR +
Sbjct: 250 VADTVNHAIRAL 261
>gi|427400231|ref|ZP_18891469.1| hypothetical protein HMPREF9710_01065 [Massilia timonae CCUG 45783]
gi|425720971|gb|EKU83886.1| hypothetical protein HMPREF9710_01065 [Massilia timonae CCUG 45783]
Length = 367
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 31/48 (64%), Gaps = 1/48 (2%)
Query: 85 LAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LAGD ++G DG +ARF P+ A+D N+YVAD N VIRKIT
Sbjct: 55 LAGDPTIEGTMDGAGTAARFKWPQGIAIDADDNLYVADTGNFVIRKIT 102
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 35/57 (61%), Gaps = 3/57 (5%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV +AG G G+ DG+ G+ARF P + AV+ +G IYVAD N IR IT
Sbjct: 101 ITPAGVVTTVAGAAGTSGFVDGDAGNARFGNPVAVAVNRRGTIYVAD--NLRIRSIT 155
Score = 40.0 bits (92), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 29/41 (70%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+G ++G +ARF++ + VD +GN+Y AD N+++R+IT
Sbjct: 273 RGSNNGVGTAARFEQVVALTVDPRGNVYAADAINNLVRRIT 313
>gi|290982010|ref|XP_002673724.1| NHL repeat-containing protein [Naegleria gruberi]
gi|284087309|gb|EFC40980.1| NHL repeat-containing protein [Naegleria gruberi]
Length = 456
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 33/55 (60%), Gaps = 4/55 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNL 133
+K + G+G+ GY G+ G+A P S AVD GNIY+AD NH IRK T L
Sbjct: 115 IKTIVGNGMPGYG-GDSGAATSTSLLFPTSIAVDSSGNIYIADTQNHRIRKFTAL 168
Score = 40.4 bits (93), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 85 LAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AG +GY+ +A+ P A+D GN+Y+AD NH IRKIT
Sbjct: 175 IAGTEEEGYNGDNIIATTAKLSSPTGVALDSIGNVYIADSQNHRIRKIT 223
>gi|114568006|ref|YP_755160.1| leucine-rich repeat-containing protein [Syntrophomonas wolfei
subsp. wolfei str. Goettingen]
gi|114338941|gb|ABI69789.1| Leucine-rich repeat (LRR) protein-like protein [Syntrophomonas
wolfei subsp. wolfei str. Goettingen]
Length = 1351
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 33/54 (61%), Gaps = 2/54 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ +AG+G GYS DG P SA+ P A D GN+Y+ DK N+ IRK+ N
Sbjct: 506 IISTVAGNGTAGYSGDGGPATSAQLKDPVGVACDKNGNLYIVDKDNNRIRKVDN 559
Score = 41.2 bits (95), Expect = 0.23, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ +AG+G GYS DG P SA+ + P+ D N+Y+ D N IRK+
Sbjct: 335 IISTVAGNGTAGYSGDGGPATSAQLNTPRGVVCDGADNLYIVDGDNQRIRKV 386
Score = 40.4 bits (93), Expect = 0.32, Method: Composition-based stats.
Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 5/51 (9%)
Query: 85 LAGDGVQGYS--DGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG QG++ +G+ G SAR + P A D GN+Y+AD+ NH IRK+
Sbjct: 451 VAGIVTQGWATYEGDGGLATSARLNYPFGVACDGSGNLYIADRGNHRIRKV 501
Score = 39.3 bits (90), Expect = 0.78, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG+G GY+ DG P SA+ P D GN+Y+AD +N IRK+
Sbjct: 563 ISTVAGNGTGGYAGDGSPATSAQIWVPYGVTFDNNGNMYIADMNNKRIRKV 613
>gi|386849777|ref|YP_006267790.1| Teneurin-4 [Actinoplanes sp. SE50/110]
gi|359837281|gb|AEV85722.1| Teneurin-4 [Actinoplanes sp. SE50/110]
Length = 631
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Query: 80 SVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+ V +AG G G S DG S A+ + P+ VD G+IYVAD NH++RKI+N
Sbjct: 172 AAVSVIAGTGTAGTSPDGTAASLAKLNAPRDVKVDASGDIYVADTGNHLVRKISN 226
>gi|351729451|ref|ZP_08947142.1| NHL repeat-containing protein [Acidovorax radicis N35]
Length = 663
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 20/38 (52%), Positives = 26/38 (68%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
Y D +ARF+ P++ AVD GN YVAD NH++RKI
Sbjct: 48 YLDAAGTAARFNAPQAVAVDTAGNTYVADTGNHLVRKI 85
Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats.
Identities = 17/39 (43%), Positives = 25/39 (64%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G++DG +ARFD P + A+ G +YV D++N IR I
Sbjct: 341 GWADGSGSTARFDGPGALAMGKSGTLYVRDRNNAAIRTI 379
Score = 40.4 bits (93), Expect = 0.33, Method: Composition-based stats.
Identities = 29/89 (32%), Positives = 42/89 (47%), Gaps = 8/89 (8%)
Query: 67 SAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH 125
S+ + F ++ V AG+ +G++DG GSA F S A+D GN+YV D H
Sbjct: 473 SSLQGVIFKINPTGAVSLFAGNPNARGHADGPAGSALFAGLGSMALDPAGNLYVVDGVYH 532
Query: 126 -------VIRKITNLDHGNQLIRQINLKP 147
IRKIT + L + N+ P
Sbjct: 533 FITGVGPTIRKITPAGMVSTLAGRANVPP 561
>gi|322435234|ref|YP_004217446.1| hypothetical protein AciX9_1613 [Granulicella tundricola MP5ACTX9]
gi|321162961|gb|ADW68666.1| NHL repeat containing protein [Granulicella tundricola MP5ACTX9]
Length = 781
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 34/57 (59%), Gaps = 2/57 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
++ LAG G QG+S D P +A+ D P A+D N+Y+AD NH IR+I + H
Sbjct: 168 LITTLAGTGTQGFSGDAGPALAAQIDTPTGLALDASNNLYLADTHNHRIRRIDAVTH 224
Score = 44.7 bits (104), Expect = 0.017, Method: Composition-based stats.
Identities = 37/134 (27%), Positives = 60/134 (44%), Gaps = 12/134 (8%)
Query: 10 LAITLLSLLFSLVSSGLLL------EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLD 63
LA T L+ + SG L GY + L +P ++ P + DL + D
Sbjct: 38 LAETAAHLILRVSPSGALTIVAGTGTQGYAGDGTLPTQALLDSPTALAITP-TGDLYLAD 96
Query: 64 SSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVAD 121
+ A + + ++ +AG G G S DG + A+ D P + A+D N+Y+AD
Sbjct: 97 THNHAIRRID---AATQIITTVAGTGTPGRSPDGTLATKAQLDTPTAIALDSSQNLYIAD 153
Query: 122 KSNHVIRKITNLDH 135
NH+IR++ H
Sbjct: 154 TRNHIIRRVDATTH 167
>gi|421614357|ref|ZP_16055418.1| NHL repeat containing protein [Rhodopirellula baltica SH28]
gi|408494877|gb|EKJ99474.1| NHL repeat containing protein [Rhodopirellula baltica SH28]
Length = 351
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
+ T + G + ++ +D G+ L+VL+ + + L + LAG G +G
Sbjct: 211 LVTTLAGRKSKLKDPRAVDLDGNGRLLVLERNGNRLRRLE----SNGDITTLAGSGKKGT 266
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+DG+ A F+ PK V G +Y+AD NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301
Score = 38.9 bits (89), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 29/49 (59%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
GY DG SARF+K + +D + +Y++D +NH +R++ G ++
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHCVRRLIQTSDGEWMV 154
>gi|189462806|ref|ZP_03011591.1| hypothetical protein BACCOP_03504 [Bacteroides coprocola DSM 17136]
gi|189430422|gb|EDU99406.1| NHL repeat protein [Bacteroides coprocola DSM 17136]
Length = 500
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 36/69 (52%), Gaps = 9/69 (13%)
Query: 76 LSEESVVKRLAGDG-------VQGYSDGE-PGSARFDKPKSFAVDMKGNI-YVADKSNHV 126
L+ E VV AG G +GY DG G ARF+ P+ A D + N+ YV D NH+
Sbjct: 412 LTPEGVVTTYAGRGSANLNNEARGYVDGTLRGEARFNNPRGLAYDEQNNVFYVGDMDNHI 471
Query: 127 IRKITNLDH 135
IRKI D
Sbjct: 472 IRKIAMEDE 480
>gi|345008031|ref|YP_004810385.1| redoxin domain-containing protein [Streptomyces violaceusniger Tu
4113]
gi|344034380|gb|AEM80105.1| Redoxin domain protein [Streptomyces violaceusniger Tu 4113]
Length = 615
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + +V D++R L+ ESVV+R+ G G +G +DG ARF +P
Sbjct: 188 PGKALLLPGGT-FLVSDTTRHQLVELA--ADGESVVRRI-GAGERGLTDGTGERARFSEP 243
Query: 107 KSFAVDMKGNIYVADKSNHVIRK 129
+ A+ G + VAD NH +R+
Sbjct: 244 QGLALLPDGTVAVADTVNHALRR 266
>gi|323343289|ref|ZP_08083516.1| hypothetical protein HMPREF0663_10051 [Prevotella oralis ATCC
33269]
gi|323095108|gb|EFZ37682.1| hypothetical protein HMPREF0663_10051 [Prevotella oralis ATCC
33269]
Length = 436
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 1/57 (1%)
Query: 79 ESVVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
E+ +RL+ + G + DG+ A+F P D GNIYVAD++NH IR+I+ D
Sbjct: 326 ENTYRRLSSSNISGGHRDGKLEKAQFHNPTQIYCDADGNIYVADRNNHCIRRISPDD 382
Score = 36.6 bits (83), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S + +V+ + G +G+ DG A F++P + G +YVAD N +RK+T
Sbjct: 378 ISPDDMVETVLGMPETKGWKDGAKSEALFNEPTGIGIGKDGAVYVADWGNGRVRKLT 434
>gi|222054715|ref|YP_002537077.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
gi|221564004|gb|ACM19976.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
Length = 2393
Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats.
Identities = 24/61 (39%), Positives = 41/61 (67%), Gaps = 4/61 (6%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRK 129
S L+E+ ++ +AG+GV G++ G+ G A + P+ AVD+ GNIY+AD +N+ IR+
Sbjct: 1554 SITLAEQGIINTVAGNGVAGFT-GDGGIAVGSSLNYPEDAAVDLDGNIYIADSANNRIRR 1612
Query: 130 I 130
+
Sbjct: 1613 V 1613
Score = 38.5 bits (88), Expect = 1.2, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG+G+ +S DG P A ++P + A+D GNIY++D N+ +RKI
Sbjct: 1620 IATIAGNGLAEFSGDGGPAYMAGLNRPFAVALDTSGNIYISDNLNYRVRKI 1670
>gi|302543897|ref|ZP_07296239.1| putative NHL repeat protein [Streptomyces hygroscopicus ATCC 53653]
gi|302461515|gb|EFL24608.1| putative NHL repeat protein [Streptomyces himastatinicus ATCC
53653]
Length = 608
Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats.
Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 4/83 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + +V D++R L+ ESV++R+ G G +G +DG ARF +P
Sbjct: 177 PGKALLLPGGT-FLVSDTTRHQLVELA--ADGESVLRRI-GTGERGLADGRADRARFSEP 232
Query: 107 KSFAVDMKGNIYVADKSNHVIRK 129
+ A G + VAD NH +R+
Sbjct: 233 QGLAALPDGTVAVADTVNHALRR 255
>gi|418747486|ref|ZP_13303786.1| hypothetical protein LEP1GSC163_2583 [Leptospira santarosai str.
CBC379]
gi|410791609|gb|EKR89564.1| hypothetical protein LEP1GSC163_2583 [Leptospira santarosai str.
CBC379]
Length = 356
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 56/129 (43%), Gaps = 10/129 (7%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I+ D Y ++ VV AG G++ DG +A F
Sbjct: 20 NPSIFMSRAWAENTII-DCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTV 165
P VD GNI+V+D+ ++IRKI GN NL ED SG+ + +
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKIDR--SGNVTTLSTNLLLED-------PSGIKFDPI 129
Query: 166 FSPYLISCK 174
+SCK
Sbjct: 130 TGDKYVSCK 138
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
GV G+ +G+ ++ F P +D + N+YV + NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207
>gi|260821256|ref|XP_002605949.1| hypothetical protein BRAFLDRAFT_132236 [Branchiostoma floridae]
gi|229291286|gb|EEN61959.1| hypothetical protein BRAFLDRAFT_132236 [Branchiostoma floridae]
Length = 446
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 7/83 (8%)
Query: 60 IVLDSSRSAFYTLSFP------LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
I D++ Y + F ++ +VK LAG GY DG +ARF P +D
Sbjct: 308 IAFDTTHRVLYVVEFGNNCIRMVTPAGIVKTLAGGPEPGYKDGSGSAARFFHPTGLTLDP 367
Query: 114 KGN-IYVADKSNHVIRKITNLDH 135
IYV D+ NH++R +T + H
Sbjct: 368 TNKVIYVTDQYNHLVRSVTAVGH 390
>gi|359777541|ref|ZP_09280822.1| hypothetical protein ARGLB_073_01730 [Arthrobacter globiformis NBRC
12137]
gi|359305319|dbj|GAB14651.1| hypothetical protein ARGLB_073_01730 [Arthrobacter globiformis NBRC
12137]
Length = 666
Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats.
Identities = 37/110 (33%), Positives = 52/110 (47%), Gaps = 4/110 (3%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDL-IVLDSSRSAFYTLSF-PLSEESVVKRLA 86
ED T + H L++ S D SS L V+ + S+ PL+ + V LA
Sbjct: 317 EDAAGFTGRLGDHPLDVALSSPWDVVWSSKLNAVVIAMAGVHQIFSYEPLTGD--VSILA 374
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
G+G++G DG A F +P A D GNI+VAD +RK+ D G
Sbjct: 375 GNGLEGLLDGPAHEAWFAQPSGLAEDADGNIWVADSETSALRKLVISDDG 424
>gi|365121900|ref|ZP_09338810.1| hypothetical protein HMPREF1033_02156 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643613|gb|EHL82920.1| hypothetical protein HMPREF1033_02156 [Tannerella sp.
6_1_58FAA_CT1]
Length = 443
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 63/134 (47%), Gaps = 22/134 (16%)
Query: 3 SHFVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS------ 56
SH +Y+ +T LF+ G++ GY +DG I +++ ++P
Sbjct: 323 SHCIYTYDVLTKKHELFA----GIVNNAGY-----LDG----IGSYAMFNQPRQLILDED 369
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
+L + D+ ++ P + S V + GV GY DG+P A FD+P ++ +G
Sbjct: 370 DNLYIADTENHVIRKIT-PQGQVSTV--IGQAGVAGYQDGDPEVALFDRPHGVCINKEGI 426
Query: 117 IYVADKSNHVIRKI 130
IY+ D N IR++
Sbjct: 427 IYIGDYENQCIRRL 440
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 26/40 (65%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GY DG A F++P+ +D N+Y+AD NHVIRKIT
Sbjct: 347 GYLDGIGSYAMFNQPRQLILDEDDNLYIADTENHVIRKIT 386
>gi|284038675|ref|YP_003388605.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
gi|283817968|gb|ADB39806.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
Length = 359
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 31/58 (53%)
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
F ++ + + AG G GY DG A+F P A D GN+YVAD N IRKI+
Sbjct: 73 FKVTPQGNLSLYAGSGGTGYQDGSLDKAKFLWPYGLAFDRAGNLYVADSGNQAIRKIS 130
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 9/60 (15%)
Query: 81 VVKRLAGDG---------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+V+++A DG + GY DG +A F + A G++YVAD N IRKIT
Sbjct: 298 IVRKIATDGQVSDVTGSRLGGYKDGPLQAAEFGSVEDLAFSPSGSLYVADNRNGAIRKIT 357
>gi|428186137|gb|EKX54988.1| hypothetical protein GUITHDRAFT_59145, partial [Guillardia theta
CCMP2712]
Length = 231
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V AG G+QG+ DG G ARF +P + + +YVAD +NH IR I
Sbjct: 20 VSYFAGSGIQGFRDGAAGDARFSEPLGI-IHFRNKLYVADSNNHAIRSI 67
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 41/79 (51%), Gaps = 10/79 (12%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD-------KPKSFAV 111
L V DS+ A ++ P+S E V +AG+ +G DG ++ D +P+
Sbjct: 54 LYVADSNNHAIRSID-PISGE--VATVAGNQQRGNLDGAASQSQLDTPTSLLNRPEGVVF 110
Query: 112 DMKGNIYVADKSNHVIRKI 130
D G++YVAD NHVI+ +
Sbjct: 111 DSHGHLYVADTGNHVIKML 129
>gi|289664052|ref|ZP_06485633.1| putative NHL repeat protein [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 263
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 31/47 (65%), Gaps = 1/47 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LAG G +G++DG +A F+ P +D GN+Y+AD NH IRK+T
Sbjct: 4 LAG-GREGFADGIGVAAAFNTPSGIMLDTAGNLYIADTGNHAIRKLT 49
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 29/52 (55%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
L+ + V LAGDGV G +G RF+ P AVD +G +YVAD N I
Sbjct: 48 LTPQGKVTTLAGDGVAGDRNGAAAQVRFNGPVGVAVDAQGRVYVADTYNDRI 99
>gi|290977087|ref|XP_002671270.1| predicted protein [Naegleria gruberi]
gi|284084837|gb|EFC38526.1| predicted protein [Naegleria gruberi]
Length = 1095
Score = 47.0 bits (110), Expect = 0.003, Method: Composition-based stats.
Identities = 36/107 (33%), Positives = 51/107 (47%), Gaps = 10/107 (9%)
Query: 31 GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
G+T T ++ L +P + +SDL + D +S + +AG G
Sbjct: 319 GFTDNTTVNLATLA-SPELMNFAKNTSDLYIADKGNHRIRKIS-----NGFITTIAGQGS 372
Query: 91 QGYSDGEPGSARFD---KPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
Y GE +R KPK A+D GNIY+AD SNH +RKI+ LD
Sbjct: 373 PSYC-GENVDSRLSALSKPKGAALDSLGNIYIADTSNHRVRKISYLD 418
Score = 40.8 bits (94), Expect = 0.29, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 79 ESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + +AG G GY+ DG SA+ +KP A D GNIY+AD N+ IRKI
Sbjct: 418 DGTITTIAGTGSFGYNGDGILATSAQVNKPTGIAFDSIGNIYIADSGNNRIRKI 471
>gi|255530479|ref|YP_003090851.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
gi|255343463|gb|ACU02789.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
Length = 453
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S +++V +AG G GY DG P + F++P AV+ +G+IY+AD SN IRK+
Sbjct: 395 VSADNIVTTVAGQPGKSGYKDGGPVESLFNQPWGVAVNEQGDIYIADWSNARIRKL 450
Score = 42.7 bits (99), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 26/49 (53%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
KRL G + DG A F+ PK D GN++VAD NH IR ++
Sbjct: 348 KRLNAPGGSAFRDGPLADALFNDPKEIKFDNSGNMFVADYGNHCIRMVS 396
>gi|307182571|gb|EFN69764.1| NHL repeat-containing protein 2 [Camponotus floridanus]
Length = 687
Score = 47.0 bits (110), Expect = 0.004, Method: Composition-based stats.
Identities = 36/133 (27%), Positives = 58/133 (43%), Gaps = 6/133 (4%)
Query: 5 FVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDS 64
+Y+ + I L + + L L Y + + L +++ + LI+ DS
Sbjct: 182 LLYARVMIAYFKSLNQISNHSLPLRPAYHLLSSFKNGLLFPGKLAILSSEHGTKLIISDS 241
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
+ + + V+ G QG+ DG +ARF+ P+ V + IYVAD +N
Sbjct: 242 GNNRIVIAT----KHGEVEHFIGGCNQGFKDGSFKNARFNSPQGVCV-LNNTIYVADNNN 296
Query: 125 HVIRKITNLDHGN 137
H IRKI NL N
Sbjct: 297 HAIRKI-NLTENN 308
>gi|302786076|ref|XP_002974809.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
gi|300157704|gb|EFJ24329.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
Length = 1049
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 32/48 (66%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
V+ +AG GV GY DG+ A+F +P A+ G+++VAD +N+VIR
Sbjct: 835 TVRTVAGTGVAGYEDGKGVKAQFSEPAGLALGPNGSLFVADTNNNVIR 882
Score = 41.2 bits (95), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
G G +G +DG A F++P+ A D + NI YVAD NH +R++
Sbjct: 607 GSGAEGLTDGSFSLASFNRPQGVAYDTEKNILYVADTENHALREV 651
>gi|294817139|ref|ZP_06775781.1| Receptor protein kinase [Streptomyces clavuligerus ATCC 27064]
gi|294321954|gb|EFG04089.1| Receptor protein kinase [Streptomyces clavuligerus ATCC 27064]
Length = 510
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + + +AG G G DG SA+ ++P AVD G +Y+AD NH IRKIT
Sbjct: 182 VAADGTISTVAGTGATGNRGDDGPAASAQLNRPYGVAVDGAGAVYIADTDNHRIRKIT 239
Score = 42.7 bits (99), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + V +AG G GY D P SA+ + P+ AVD G +YV D NH +RKIT
Sbjct: 70 ITTDGKVSTVAGIGSAGYRGDNGPAVSAQLNCPREVAVDGAGAVYVTDAENHRVRKIT 127
Score = 41.2 bits (95), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AG GV G++ +G +A+ ++P AVD G +Y +D +NH +RKIT
Sbjct: 23 VAGTGVAGFTGDNGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRKIT 71
Score = 39.7 bits (91), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + + + G GV G+S DG P +A R ++P AVD G +YV + +N+ IRK+
Sbjct: 126 ITADGTISTVVGTGVAGFSGDGGPATAARLNRPFGVAVDSTGVLYVTEYNNNRIRKV 182
Score = 39.7 bits (91), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AR ++P +D G +YVAD NH IRKIT
Sbjct: 264 AARLNRPMGVVIDSTGTLYVADYGNHRIRKIT 295
>gi|326445969|ref|ZP_08220703.1| hypothetical protein SclaA2_33112 [Streptomyces clavuligerus ATCC
27064]
Length = 508
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + + +AG G G DG SA+ ++P AVD G +Y+AD NH IRKIT
Sbjct: 180 VAADGTISTVAGTGATGNRGDDGPAASAQLNRPYGVAVDGAGAVYIADTDNHRIRKIT 237
Score = 42.7 bits (99), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + V +AG G GY D P SA+ + P+ AVD G +YV D NH +RKIT
Sbjct: 68 ITTDGKVSTVAGIGSAGYRGDNGPAVSAQLNCPREVAVDGAGAVYVTDAENHRVRKIT 125
Score = 40.8 bits (94), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AG GV G++ +G +A+ ++P AVD G +Y +D +NH +RKIT
Sbjct: 21 VAGTGVAGFTGDNGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRKIT 69
Score = 39.7 bits (91), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + + + G GV G+S DG P +A R ++P AVD G +YV + +N+ IRK+
Sbjct: 124 ITADGTISTVVGTGVAGFSGDGGPATAARLNRPFGVAVDSTGVLYVTEYNNNRIRKV 180
Score = 39.7 bits (91), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AR ++P +D G +YVAD NH IRKIT
Sbjct: 262 AARLNRPMGVVIDSTGTLYVADYGNHRIRKIT 293
>gi|449137215|ref|ZP_21772546.1| NHL repeat containing protein [Rhodopirellula europaea 6C]
gi|448884292|gb|EMB14794.1| NHL repeat containing protein [Rhodopirellula europaea 6C]
Length = 351
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
+ ++I G + ++ +D G L+VL+ + + + + LAG G +G
Sbjct: 211 MVSIIAGQKSKLKDPRAVDLDGERRLLVLERNGNRLRRVE----SNGDITTLAGSGKKGT 266
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+DG+ A F+ PK V G +Y+AD NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 29/49 (59%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
GY DG SARF+K + +D +Y++D +NH +R++ + G ++
Sbjct: 106 GYVDGPARSARFNKLHNLMIDSNDVLYLSDHANHSVRRLMESEDGKWVV 154
>gi|359685014|ref|ZP_09255015.1| hypothetical protein Lsan2_10264 [Leptospira santarosai str.
2000030832]
Length = 356
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 41/85 (48%), Gaps = 1/85 (1%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I LD Y ++ VV AG G+Q DG +A F
Sbjct: 20 NPSIFMSRAWAENTI-LDCILKECYLCKLKVTNNPVVSLFAGTGIQESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
P VD GNI+V+D+ ++IRKI
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKI 103
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
GV G+ +G+ ++ F P +D + N+YV + NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207
>gi|254390149|ref|ZP_05005369.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
27064]
gi|197703856|gb|EDY49668.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
27064]
Length = 527
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + + +AG G G DG SA+ ++P AVD G +Y+AD NH IRKIT
Sbjct: 199 VAADGTISTVAGTGATGNRGDDGPAASAQLNRPYGVAVDGAGAVYIADTDNHRIRKIT 256
Score = 42.7 bits (99), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + V +AG G GY D P SA+ + P+ AVD G +YV D NH +RKIT
Sbjct: 87 ITTDGKVSTVAGIGSAGYRGDNGPAVSAQLNCPREVAVDGAGAVYVTDAENHRVRKIT 144
Score = 40.8 bits (94), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AG GV G++ +G +A+ ++P AVD G +Y +D +NH +RKIT
Sbjct: 40 VAGTGVAGFTGDNGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRKIT 88
Score = 39.7 bits (91), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + + + G GV G+S DG P +A R ++P AVD G +YV + +N+ IRK+
Sbjct: 143 ITADGTISTVVGTGVAGFSGDGGPATAARLNRPFGVAVDSTGVLYVTEYNNNRIRKV 199
Score = 39.7 bits (91), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AR ++P +D G +YVAD NH IRKIT
Sbjct: 281 AARLNRPMGVVIDSTGTLYVADYGNHRIRKIT 312
>gi|256393018|ref|YP_003114582.1| NHL repeat containing protein [Catenulispora acidiphila DSM 44928]
gi|256359244|gb|ACU72741.1| NHL repeat containing protein [Catenulispora acidiphila DSM 44928]
Length = 1046
Score = 47.0 bits (110), Expect = 0.004, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 38/57 (66%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
L+ ++ RL G G GYS DG P A+ ++P A+D +G++Y+AD +N+VIR++
Sbjct: 849 LTPRGLLLRLIGTGRAGYSGDGRPSPLAQLNQPIGLALDAQGDLYIADSANNVIRRV 905
>gi|418721636|ref|ZP_13280812.1| hypothetical protein LEP1GSC101_0633 [Leptospira borgpetersenii
str. UI 09149]
gi|410741989|gb|EKQ90740.1| hypothetical protein LEP1GSC101_0633 [Leptospira borgpetersenii
str. UI 09149]
Length = 246
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 38/71 (53%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
++LD +T ++ VV AG G++ DG +A F P S +D GNI+V
Sbjct: 35 MILDCILKECHTCKLKVTNNPVVSPFAGTGIRESIDGTTQTASFKTPFSLELDTFGNIFV 94
Query: 120 ADKSNHVIRKI 130
+D +++IRKI
Sbjct: 95 SDSDSNLIRKI 105
>gi|75909875|ref|YP_324171.1| NHL repeat-containing protein [Anabaena variabilis ATCC 29413]
gi|75703600|gb|ABA23276.1| NHL repeat protein [Anabaena variabilis ATCC 29413]
Length = 503
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 39/84 (46%), Gaps = 7/84 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ PG L V DS F + L G+G G +DG A+F P
Sbjct: 175 PGKVLATPGG--LFVADSGHHRIVVSDF----NGEILHLIGNGKSGLTDGNFQEAQFSAP 228
Query: 107 KSFAVDMKGNI-YVADKSNHVIRK 129
+ A DM+ I YVAD NHV+R+
Sbjct: 229 QGMAFDMENQILYVADTDNHVVRR 252
Score = 37.0 bits (84), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + GDG G +G+ + RF +P + M +Y++D +NHVIR++
Sbjct: 431 QTVLGDGSAGLQNGQGKNTRFFEPSGLSA-MDSYLYISDTNNHVIRRV 477
>gi|441163237|ref|ZP_20968264.1| NHL repeat-containing protein [Streptomyces rimosus subsp. rimosus
ATCC 10970]
gi|440616376|gb|ELQ79518.1| NHL repeat-containing protein [Streptomyces rimosus subsp. rimosus
ATCC 10970]
Length = 551
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + ++ +AG+G GY DG P +A R P+ A+D GN+Y+AD N+ +R +T
Sbjct: 78 VTTDGIITTVAGNGTAGYVDDGGPATATRLYGPRGVALDRAGNLYIADGDNNRVRGVT 135
Score = 40.4 bits (93), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + + +AG G GY SDG P S++ + P +D GN+Y+A+ + IRK+T
Sbjct: 22 VTPQGTITTVAGTGTAGYTSDGGPATSSQLNTPAYVTLDDAGNLYIAESGSQRIRKVT 79
>gi|29347940|ref|NP_811443.1| hypothetical protein BT_2530 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383121619|ref|ZP_09942326.1| hypothetical protein BSIG_4948 [Bacteroides sp. 1_1_6]
gi|29339842|gb|AAO77637.1| conserved hypothetical protein, with a conserved domain
[Bacteroides thetaiotaomicron VPI-5482]
gi|251837946|gb|EES66035.1| hypothetical protein BSIG_4948 [Bacteroides sp. 1_1_6]
Length = 441
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S +++V +AG GV GY DG P + F P AV+ +G+IY+AD N IRK+
Sbjct: 383 ISADNIVTTVAGQPGVAGYKDGGPVESLFKNPWGVAVNEQGDIYIADWGNARIRKL 438
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 26/49 (53%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
KRL G + DG A F+ PK D GN++VAD NH IR I+
Sbjct: 336 KRLNAPGGSAFRDGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMIS 384
>gi|442323754|ref|YP_007363775.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
gi|441491396|gb|AGC48091.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
Length = 906
Score = 47.0 bits (110), Expect = 0.004, Method: Composition-based stats.
Identities = 24/50 (48%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 82 VKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V LAG Q G+ DG AR +P AVD GNI+VAD NH +R+I
Sbjct: 479 VSTLAGVAFQDGFVDGPVARARLRRPVGVAVDGLGNIFVADTGNHAVRRI 528
Score = 38.1 bits (87), Expect = 1.6, Method: Composition-based stats.
Identities = 24/76 (31%), Positives = 37/76 (48%), Gaps = 2/76 (2%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
G ++ V D+ A ++ P + +V +AG G G +G + P+S AV
Sbjct: 511 GLGNIFVADTGNHAVRRIA-PDAARTVTT-IAGLGTPGVGEGPGATTALRSPQSIAVAPD 568
Query: 115 GNIYVADKSNHVIRKI 130
G +YVAD NH I +I
Sbjct: 569 GTLYVADTGNHRIVRI 584
>gi|380693545|ref|ZP_09858404.1| hypothetical protein BfaeM_06134 [Bacteroides faecis MAJ27]
Length = 454
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 28/40 (70%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GY +G + R +KP+ A D GN++V +++NH+IRKIT
Sbjct: 358 GYVNGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKIT 397
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 28/40 (70%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G+ DG P A+F++P+ V +IYVAD+ NHVIR++T
Sbjct: 413 GFGDGLPEEAKFNQPECVTVYPDNSIYVADRDNHVIRRVT 452
>gi|298386893|ref|ZP_06996448.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
gi|298260567|gb|EFI03436.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
Length = 454
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 28/40 (70%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GY +G + R +KP+ A D GN++V +++NH+IRKIT
Sbjct: 358 GYVNGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKIT 397
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 28/40 (70%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G+ DG P A+F++P+ V +IYVAD+ NHVIR++T
Sbjct: 413 GFGDGLPEEAKFNQPECVTVYPDNSIYVADRDNHVIRRVT 452
>gi|359728310|ref|ZP_09267006.1| hypothetical protein Lwei2_15904 [Leptospira weilii str.
2006001855]
gi|417778551|ref|ZP_12426356.1| putative lipoprotein [Leptospira weilii str. 2006001853]
gi|410781344|gb|EKR65918.1| putative lipoprotein [Leptospira weilii str. 2006001853]
Length = 357
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/114 (25%), Positives = 53/114 (46%), Gaps = 9/114 (7%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD + ++ + V+ AG G+ DG +A F+ P +D GNI+V+
Sbjct: 36 ILDCILKECHLCKLKVTNKPVISLFAGTGINQSVDGTTQTASFNTPFGLELDTFGNIFVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFSPYLISCK 174
D++ ++IRKI + + L + L Q+ SG+ + + +SCK
Sbjct: 96 DQAANLIRKIDHFGNVTTLFTSLTL---------QAPSGIKFDPITGDKYVSCK 140
Score = 36.6 bits (83), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G GV GY DG+ SA+F P D K + + VAD +H IRKI
Sbjct: 210 VSTLSG-GVSGYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKI 258
>gi|312198348|ref|YP_004018409.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311229684|gb|ADP82539.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 782
Score = 46.6 bits (109), Expect = 0.005, Method: Composition-based stats.
Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 8/79 (10%)
Query: 60 IVLDSSRSAFYTLSF------PLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAV 111
I LD R FY + + ++ +AG+G+ G++ G+ SA+ D P AV
Sbjct: 400 ITLDLKRGYFYIADYYNERVRRVDHNGIITTVAGNGIAGFAGDGGQATSAQLDSPTDVAV 459
Query: 112 DMKGNIYVADKSNHVIRKI 130
G+IY+AD +N+ +R +
Sbjct: 460 AADGSIYIADSANNRVRHV 478
Score = 38.1 bits (87), Expect = 1.8, Method: Composition-based stats.
Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 85 LAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ DG+ DG P + A P AVD +GN+Y+AD N +R++
Sbjct: 502 IGDDGLTYAGDGVPATKAMLRSPNDLAVDGRGNLYIADGENERVRRV 548
>gi|418737091|ref|ZP_13293489.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
str. 200801910]
gi|410747250|gb|EKR00156.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
str. 200801910]
Length = 358
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 38/71 (53%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
++LD +T ++ VV AG G++ DG +A F P S +D GNI+V
Sbjct: 35 MILDCILKECHTCKLKVTNNPVVSPFAGTGIRESIDGTTQTASFKTPFSLELDTFGNIFV 94
Query: 120 ADKSNHVIRKI 130
+D +++IRKI
Sbjct: 95 SDSDSNLIRKI 105
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G GV GY DG+ SA+F P D K + + VAD NH IRKI
Sbjct: 210 TVSTLSG-GVSGYLDGDLTSAQFKFPSGIIYDQKTDSLLVADLQNHRIRKI 259
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 31/51 (60%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+ S V L G+G++ DG+ +A FD P ++D G ++V+D S++ IR
Sbjct: 262 KTSTVSTLLGNGIEASIDGKGLNASFDGPAFISLDNSGYMFVSDTSSNKIR 312
>gi|298387816|ref|ZP_06997366.1| hypothetical protein HMPREF9007_04616 [Bacteroides sp. 1_1_14]
gi|298259421|gb|EFI02295.1| hypothetical protein HMPREF9007_04616 [Bacteroides sp. 1_1_14]
Length = 440
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S +++V +AG GV GY DG P + F P AV+ +G+IY+AD N IRK+
Sbjct: 382 VSADNIVTTVAGQPGVAGYKDGGPVESLFKNPWGVAVNEQGDIYIADWGNARIRKL 437
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 26/49 (53%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
KRL G + DG A F+ PK D GN++VAD NH IR ++
Sbjct: 335 KRLNAPGGSAFRDGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMVS 383
>gi|345851882|ref|ZP_08804843.1| hypothetical protein SZN_18986 [Streptomyces zinciresistens K42]
gi|345636654|gb|EGX58200.1| hypothetical protein SZN_18986 [Streptomyces zinciresistens K42]
Length = 468
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 36/58 (62%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + + +AG G G+ DG P +A+ + P+ AVD G +Y+AD +NH IRK+T
Sbjct: 28 ITTDGRITTVAGTGAAGFRGDGGPAVAAQLNGPREVAVDGAGAVYIADSNNHRIRKVT 85
>gi|421094122|ref|ZP_15554843.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
gi|410363263|gb|EKP14295.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
gi|456886734|gb|EMF97860.1| putative lipoprotein [Leptospira borgpetersenii str. 200701203]
Length = 358
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 38/71 (53%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
++LD +T ++ VV AG G++ DG +A F P S +D GNI+V
Sbjct: 35 MILDCILKECHTCKLKVTNNPVVSPFAGTGIRESIDGTTQTASFKTPFSLELDTFGNIFV 94
Query: 120 ADKSNHVIRKI 130
+D +++IRKI
Sbjct: 95 SDSDSNLIRKI 105
Score = 39.3 bits (90), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G GV GY DG+ SA+F P D K + + VAD NH IRKI
Sbjct: 210 TVSTLSG-GVSGYLDGDLTSAQFKSPSGIIYDQKTDSLLVADLQNHRIRKI 259
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 31/51 (60%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+ S V L G+G++ DG+ +A FD P ++D G ++V+D S++ IR
Sbjct: 262 KTSTVSTLLGNGIEASIDGKGLNASFDGPAFISLDNSGYMFVSDTSSNKIR 312
>gi|393782357|ref|ZP_10370541.1| hypothetical protein HMPREF1071_01409 [Bacteroides salyersiae
CL02T12C01]
gi|392673185|gb|EIY66648.1| hypothetical protein HMPREF1071_01409 [Bacteroides salyersiae
CL02T12C01]
Length = 446
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 32/56 (57%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S++ +V + G G G+ DG P A FDKP A+D G IY+ D N +R++
Sbjct: 388 ISQDGIVSTVIGLAGQSGFMDGTPEEALFDKPFGVALDTDGTIYIGDSENQCVRRL 443
Score = 42.4 bits (98), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 26/40 (65%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GY+DG A F++P+ V +Y+AD +NHVIRKI+
Sbjct: 350 GYADGPCEYAMFNEPRQMIVSADNELYLADTNNHVIRKIS 389
>gi|320106842|ref|YP_004182432.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319925363|gb|ADV82438.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 382
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKI 130
L+ + V LAG+G GY+DG+ SARF+KP + G Y+AD N+ IR++
Sbjct: 261 LNGANQVTTLAGNGTAGYADGQGNSARFNKPTQVVYNSGDGAYYIADTFNNCIRRM 316
Score = 40.4 bits (93), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 82 VKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V AG G Q G DG SA+FDKP A+ G +YVAD N+ IR++
Sbjct: 322 VTTYAGIGGQSGLVDGASTSAKFDKPTGIAI-ANGYLYVADSGNNAIRRV 370
>gi|417302109|ref|ZP_12089222.1| NHL repeat containing protein [Rhodopirellula baltica WH47]
gi|327541551|gb|EGF28082.1| NHL repeat containing protein [Rhodopirellula baltica WH47]
Length = 351
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
+ T + G + ++ +D G+ L+VL+ + + + + LAG G +G
Sbjct: 211 LVTTLAGRKSKLKDPRAVDLDGNGRLLVLERNGNRLRRVE----SNGDITTLAGSGKKGT 266
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+DG+ A F+ PK V G +Y+AD NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 29/49 (59%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
GY DG SARF+K + +D + +Y++D +NH +R++ G ++
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTSDGEWMV 154
>gi|383124185|ref|ZP_09944852.1| hypothetical protein BSIG_3780 [Bacteroides sp. 1_1_6]
gi|251839311|gb|EES67395.1| hypothetical protein BSIG_3780 [Bacteroides sp. 1_1_6]
Length = 454
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 28/40 (70%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GY +G + R +KP+ A D GN++V +++NH+IRKIT
Sbjct: 358 GYINGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKIT 397
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 28/40 (70%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G+ DG P A+F++P+ V ++YVAD+ NHVIR++T
Sbjct: 413 GFGDGLPEEAKFNQPECVTVYPDNSVYVADRDNHVIRRVT 452
>gi|322434821|ref|YP_004217033.1| hypothetical protein AciX9_1190 [Granulicella tundricola MP5ACTX9]
gi|321162548|gb|ADW68253.1| NHL repeat containing protein [Granulicella tundricola MP5ACTX9]
Length = 1631
Score = 46.6 bits (109), Expect = 0.005, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 81 VVKRLAGDGVQ--GYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
V+ AG+GV GY DG P +A + P AV KG IY+AD+ N +IRK+ N
Sbjct: 313 VITTFAGNGVTTGGYGDGGPAAAAMLNAPWGIAVSSKGEIYIADQGNSLIRKVVN 367
Score = 37.0 bits (84), Expect = 4.0, Method: Composition-based stats.
Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 3/55 (5%)
Query: 79 ESVVKRLAGD-GVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + +AG G GY DG P S A+ + P D +GN+Y D +N+V+R+I
Sbjct: 253 DGTISTIAGQMGKNGYVGDGGPASSAKLNGPNGLVFDAQGNLYFCDTNNNVVRRI 307
Score = 36.2 bits (82), Expect = 6.7, Method: Composition-based stats.
Identities = 15/35 (42%), Positives = 22/35 (62%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
SA P S A+D GN++++D N+VIRK+ D
Sbjct: 219 SATLSSPSSVALDGAGNVFISDTGNNVIRKVNVFD 253
>gi|430744164|ref|YP_007203293.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
gi|430015884|gb|AGA27598.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
Length = 358
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 5/68 (7%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
+ ++ AG G +GYS DG P +A F+ PK FA+D G++ + D N IR+I D
Sbjct: 245 QTGLITTRAGTGAKGYSGDGGPATAATFNGPKEFAIDRAGDLLIVDTENQAIRRI---DA 301
Query: 136 GNQLIRQI 143
LIR +
Sbjct: 302 RTGLIRTL 309
Score = 40.0 bits (92), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 52/110 (47%), Gaps = 11/110 (10%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ ++ +AG G +G+S G+ G A+ D+P +D +GN+Y AD+ N +R++ +
Sbjct: 72 GKSGIITTVAGSGTKGFS-GDGGVALKAKLDEPYGIVLDSRGNLYFADRLNRRVRRV-DA 129
Query: 134 DHGNQLIRQINLKPEDCSKS-SQSGSGMFWVTVFSPYLISCKSFWVHLYL 182
D G I D SK+ S G + P ++ S L++
Sbjct: 130 DSG-----MITTIAGDGSKTYSGDGGPGARAGLVEPNGVALDSQEARLFI 174
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 7/83 (8%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG--SARFDKPK 107
IDR G DL+++D+ A + + +++ LAG+G +G SA D+P
Sbjct: 279 AIDRAG--DLLIVDTENQAIRRID---ARTGLIRTLAGNGQRGGEGDGGAATSALLDRPH 333
Query: 108 SFAVDMKGNIYVADKSNHVIRKI 130
AV G +Y+ D NH IRK+
Sbjct: 334 GVAVGPDGAVYIGDTGNHRIRKV 356
>gi|374988208|ref|YP_004963703.1| hypothetical protein SBI_05452 [Streptomyces bingchenggensis BCW-1]
gi|297158860|gb|ADI08572.1| hypothetical protein SBI_05452 [Streptomyces bingchenggensis BCW-1]
Length = 609
Score = 46.6 bits (109), Expect = 0.005, Method: Composition-based stats.
Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 4/84 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + +V D++R L+ ESV++R+ G G +G DG ARF +P
Sbjct: 176 PGKALLLPGGT-FLVSDTTRHQLVELA--ADGESVLRRI-GSGERGLVDGPADRARFSEP 231
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
+ A+ G + VAD NH +R +
Sbjct: 232 QGLALLPDGTVIVADTVNHALRSL 255
>gi|290975887|ref|XP_002670673.1| predicted protein [Naegleria gruberi]
gi|284084234|gb|EFC37929.1| predicted protein [Naegleria gruberi]
Length = 1074
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 49/88 (55%), Gaps = 7/88 (7%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFD 104
P+ + S ++I++D+S + +S + +AG G GYS +G+ +A+F+
Sbjct: 152 PYGLALNSSSGEVIIVDTSNNVIR----KVSSIGNITTIAGTGAAGYSGDNGQATNAKFN 207
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITN 132
P++ A G ++VAD NH IRKI+N
Sbjct: 208 APRA-AFYSNGELFVADSRNHRIRKISN 234
Score = 43.5 bits (101), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 29/52 (55%), Gaps = 2/52 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG G GYS DG SA+ + PK V G IY +D NH IRKI+
Sbjct: 492 ITTIAGTGTSGYSGDGSSATSAKLNNPKGVVVSSSGEIYFSDSENHRIRKIS 543
Score = 40.0 bits (92), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
L+ + AG+G+ G S DG P SA+ + P AV G +Y++D SN+ IRK++
Sbjct: 934 LNVNGTISTSAGNGIAGLSGDGSLPTSAQLNFPTGLAVSSVGELYISDDSNNRIRKVS 991
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 4/53 (7%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG GV G+S G+ G SA + P AV+ G + ++D +NH IRK++
Sbjct: 828 ISTVAGSGVAGFS-GDGGLATSATLNNPSYVAVNSNGELLISDTNNHRIRKVS 879
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 44/86 (51%), Gaps = 5/86 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFD 104
P +V ++D+ + DS +S S ++ +AG G G+S +G +A+ +
Sbjct: 634 PFAVAVNLTNNDIYIADSGNHRIRKVS---SSSGIITTVAGTGTSGFSGDNGLATNAKLN 690
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKI 130
P S ++ G IY++D+ N +RK+
Sbjct: 691 FPFSISIGNSGEIYISDQYNQRVRKV 716
>gi|429737840|ref|ZP_19271682.1| IPT/TIG domain protein [Prevotella saccharolytica F0055]
gi|429161715|gb|EKY04093.1| IPT/TIG domain protein [Prevotella saccharolytica F0055]
Length = 438
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 83 KRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
KRL+ + G + DG A+F P D GNIYVAD+ NH IR+IT
Sbjct: 332 KRLSSSNISGGHRDGPLSEAQFRNPSQIFCDNDGNIYVADRGNHCIRRIT 381
Score = 41.2 bits (95), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ E++V+ + G G +G+ DG+ A F++P + G++YVAD N +RK+T
Sbjct: 380 ITPENMVETVLGMPGTKGWKDGKKEDALFNEPTGIGIAQDGSVYVADFKNGRVRKLT 436
>gi|375146761|ref|YP_005009202.1| NHL repeat containing protein [Niastella koreensis GR20-10]
gi|361060807|gb|AEV99798.1| NHL repeat containing protein [Niastella koreensis GR20-10]
Length = 439
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 28/49 (57%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V AG G GY+D +A F P AVD G +YV DK+N+ IRKI
Sbjct: 388 VSTFAGSGSAGYADAVGEAAAFSAPTGIAVDKNGTLYVLDKNNNRIRKI 436
Score = 42.4 bits (98), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 52/124 (41%), Gaps = 14/124 (11%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEI---NPHSVI----DRPGSSDLIVLDSSRSAFYTLS 73
L S + D +T ++G L++ N ++ + GS +V + SA T
Sbjct: 55 LFSGSNYINDTNQITVTLNGKPLKVVGVNGSQIMAVVPKKAGSGHFVVKIGADSAVSTGI 114
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFD-------KPKSFAVDMKGNIYVADKSNHV 126
F V AG G G+++G A FD + VD N+YVAD NH
Sbjct: 115 FNYVYTRTVSTFAGSGKAGFANGMGTDAMFDFGGQSWYRSMGIIVDDNLNLYVADPGNHC 174
Query: 127 IRKI 130
IRKI
Sbjct: 175 IRKI 178
Score = 41.6 bits (96), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 4/73 (5%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPE 148
GV GY +G A+FD P A+D GN+YVA + N + +Q IR I
Sbjct: 331 GVPGYVNGAAAVAKFDHPWGLAIDTAGNLYVAGNGTY----DGNTGNSDQAIRYIEAASF 386
Query: 149 DCSKSSQSGSGMF 161
+ S + SGS +
Sbjct: 387 NVSTFAGSGSAGY 399
Score = 40.0 bits (92), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
GY+DG+ +A+F P A D +GNI+ D +N IRKI
Sbjct: 195 GYADGKGTAAKFSLPYDVAFDAQGNIWCVDPANWDIRKI 233
>gi|227537797|ref|ZP_03967846.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
gi|227242411|gb|EEI92426.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
Length = 441
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LS + +V +AG + GY DG P ++F++P AV+ +G+IY+AD +N IRK+
Sbjct: 383 LSADGIVSTVAGQPTKAGYKDGGPVESQFNQPWGVAVNDQGDIYIADWNNARIRKL 438
Score = 42.4 bits (98), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 27/49 (55%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
KR+ G G+ DG A F+ PK D GN+++AD NH IR ++
Sbjct: 336 KRINAPGGSGFRDGPVADAVFNFPKDIKFDRAGNMFIADYGNHCIRMLS 384
>gi|116619658|ref|YP_821814.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116222820|gb|ABJ81529.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 585
Score = 46.2 bits (108), Expect = 0.006, Method: Composition-based stats.
Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 7/88 (7%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG--SARF 103
+P + P + +L + DS+ +S S + +AG+G G++ + SA
Sbjct: 52 SPTGLALDPKTGNLYIADSANHRIRMIS-----GSTISTVAGNGTAGFAGDKAAATSANL 106
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ P A+D GN Y+AD N VIRK+T
Sbjct: 107 NTPSGVALDSSGNFYIADSLNSVIRKVT 134
Score = 37.4 bits (85), Expect = 3.0, Method: Composition-based stats.
Identities = 19/48 (39%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 85 LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AG+ G++ DG + ++ +KP AVD GN+Y+AD +N IRK+
Sbjct: 250 FAGNLTAGFAGDGNTATLSQLNKPVGIAVDSAGNVYIADSNNGRIRKV 297
>gi|300772471|ref|ZP_07082341.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
gi|300760774|gb|EFK57600.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
Length = 441
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LS + +V +AG + GY DG P ++F++P AV+ +G+IY+AD +N IRK+
Sbjct: 383 LSADGIVSTVAGQPTKAGYKDGGPVESQFNQPWGVAVNDQGDIYIADWNNARIRKL 438
Score = 42.4 bits (98), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 27/49 (55%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
KR+ G G+ DG A F+ PK D GN+++AD NH IR ++
Sbjct: 336 KRINAPGGSGFRDGPVADAVFNFPKDIKFDRAGNMFIADYGNHCIRMLS 384
>gi|456865078|gb|EMF83443.1| putative lipoprotein [Leptospira weilii serovar Topaz str. LT2116]
Length = 357
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/114 (25%), Positives = 52/114 (45%), Gaps = 9/114 (7%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD + ++ V+ AG G+ DG +A F+ P +D GNI+V+
Sbjct: 36 ILDCILKECHLCKLKVTNNPVISLFAGTGINQSVDGTTQTASFNTPFGLELDTFGNIFVS 95
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFSPYLISCK 174
D++ ++IRKI + + L + L Q+ SG+ + + +SCK
Sbjct: 96 DQAANLIRKIDHFGNVTTLFTSLTL---------QAPSGIKFDPITGDKYVSCK 140
Score = 36.6 bits (83), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G GV GY DG+ SA+F P D K + + VAD +H IRKI
Sbjct: 210 VSTLSG-GVSGYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKI 258
>gi|322788140|gb|EFZ13922.1| hypothetical protein SINV_02735 [Solenopsis invicta]
Length = 717
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 5/83 (6%)
Query: 49 SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKS 108
+V+ + L++ DS + + E V+ + G QG+ DG+ +ARF+ P+
Sbjct: 255 TVLQSEQGTKLVISDSGNNRIVITN----EHGRVEHVIGGCSQGFKDGDFKNARFNSPQG 310
Query: 109 FAVDMKGNIYVADKSNHVIRKIT 131
+ IY+AD +NH IRKI
Sbjct: 311 VCA-LNNTIYIADNNNHAIRKIN 332
>gi|290999745|ref|XP_002682440.1| predicted protein [Naegleria gruberi]
gi|284096067|gb|EFC49696.1| predicted protein [Naegleria gruberi]
Length = 731
Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats.
Identities = 23/59 (38%), Positives = 35/59 (59%), Gaps = 2/59 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+S ++ +AG GV GYS DG P + + + P +D NI +AD++NH IR I+N
Sbjct: 228 VSNSGIISTVAGTGVSGYSGDGIPANTTKLNTPNGITIDSNDNIIIADRNNHRIRLISN 286
Score = 44.3 bits (103), Expect = 0.022, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 32/57 (56%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDG--EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S ++ +AG G GY+ A+ P +VD KGN+Y+ADK NH IRKI
Sbjct: 508 VSNNGIISTIAGTGSAGYNGDVIMATEAKLYLPHGVSVDNKGNVYIADKQNHRIRKI 564
Score = 40.8 bits (94), Expect = 0.28, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+S++ VK +AG+G+ GY+ DG A+ + P A D GNIY++D NH +R
Sbjct: 396 ISKDGSVKTIAGNGIGGYNGDGMLAIDAQLNYPHGVASDSIGNIYISDSYNHRVR 450
Score = 38.5 bits (88), Expect = 1.4, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 33/60 (55%), Gaps = 3/60 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGN-IYVADKSNHVIRKITN 132
L+ ++ +AG G G++D + +R + P VD G IY+AD +NH IR+I N
Sbjct: 565 LASTGMISTIAGTGQAGFNDDNMSALESRVNSPYDVTVDESGQVIYIADTNNHKIRRIQN 624
>gi|380692527|ref|ZP_09857386.1| hypothetical protein BfaeM_00899 [Bacteroides faecis MAJ27]
Length = 441
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S +++V +AG GV GY DG P + F P AV+ +G+IY+AD N IRK+
Sbjct: 383 VSADNIVTTVAGQPGVAGYKDGGPLESLFKNPWGVAVNEQGDIYIADWGNARIRKL 438
Score = 41.2 bits (95), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 26/49 (53%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
KRL G + DG A F+ PK D GN++VAD NH IR ++
Sbjct: 336 KRLNAPGGSAFRDGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMVS 384
>gi|374311754|ref|YP_005058184.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
gi|358753764|gb|AEU37154.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
Length = 1456
Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats.
Identities = 26/93 (27%), Positives = 51/93 (54%), Gaps = 7/93 (7%)
Query: 40 GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GE 97
G + EI ++DR G D+ ++D + ++ + ++ +AG+G +G+S G
Sbjct: 82 GAKFEIPFGILVDRAG--DIFIVDPGNNNVRKIA---ASTGIITTVAGNGTRGFSGDGGA 136
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
SA ++P++ A+D GN+Y+ D N +RK+
Sbjct: 137 ATSAELNQPQAVALDAAGNMYITDTINSRVRKV 169
Score = 45.8 bits (107), Expect = 0.008, Method: Composition-based stats.
Identities = 30/82 (36%), Positives = 49/82 (59%), Gaps = 9/82 (10%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVD 112
GS ++ ++D+ R A + L+ SVV AG+G +G+S G SA P++ A+D
Sbjct: 258 GSGNVFIVDTDRYAVFKLTTGTGTVSVV---AGNGTRGFSGDGGAATSAELAYPQAVALD 314
Query: 113 MKGNIYVAD---KSNHVIRKIT 131
GN+Y++D +SN+ IRK+T
Sbjct: 315 TAGNLYISDGDYRSNN-IRKVT 335
Score = 42.7 bits (99), Expect = 0.075, Method: Composition-based stats.
Identities = 21/56 (37%), Positives = 37/56 (66%), Gaps = 4/56 (7%)
Query: 81 VVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ +AGDG +S G+ G+A R ++P A+D GN+Y+AD +N V+R+++ L
Sbjct: 340 IISTVAGDGTAEFS-GDGGAAVDARLNEPGGVALDTAGNLYIADVNNSVVREVSPL 394
Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats.
Identities = 21/52 (40%), Positives = 34/52 (65%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ +AG+G GY+ DG P SA +P+ A+D G++Y AD +HV+R+I
Sbjct: 503 IITTIAGNGTSGYTGDGGPATSAGMYEPQGVALDNAGDVYFADFYHHVVRRI 554
Score = 40.8 bits (94), Expect = 0.26, Method: Composition-based stats.
Identities = 17/52 (32%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ +AG+G +G+S G SA + P+ ++D N+Y+AD N+ +RK+
Sbjct: 560 IITTVAGNGTEGFSGDGGSATSAELNYPEGISLDSAANLYIADSGNYRVRKV 611
>gi|290995346|ref|XP_002680256.1| predicted protein [Naegleria gruberi]
gi|284093876|gb|EFC47512.1| predicted protein [Naegleria gruberi]
Length = 1226
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 37/72 (51%), Gaps = 4/72 (5%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
DL DS+ A + + V+ L G+G G+ DG+ SA F+ P S A+ G+
Sbjct: 398 DLYFCDSNNHAIR----KIDTQGVITTLTGNGTAGFQDGDAASALFNSPASIALLPNGDF 453
Query: 118 YVADKSNHVIRK 129
VAD +N IRK
Sbjct: 454 LVADFTNSRIRK 465
>gi|168701257|ref|ZP_02733534.1| NHL repeat containing protein [Gemmata obscuriglobus UQM 2246]
Length = 358
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++R AG G +GY+ DG A FD PK +D GN++V D N VIRKI
Sbjct: 248 IERFAGTGKKGYTGDGTKALDATFDGPKEIDIDKDGNVFVVDTENEVIRKI 298
Score = 37.4 bits (85), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ G G +G++ DG P + A D+P A D GN+Y +D NH++RK+
Sbjct: 18 VITSPVGSGKKGFAGDGGPAAKAELDQPFDVAFDKAGNLYFSDTFNHLVRKV 69
>gi|302760605|ref|XP_002963725.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
gi|300168993|gb|EFJ35596.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
Length = 1052
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 32/48 (66%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
V+ +AG GV GY DG+ A+F +P A+ G+++VAD +N++IR
Sbjct: 838 TVRTVAGTGVAGYEDGKSVKAQFSEPAGLALGPNGSLFVADTNNNLIR 885
Score = 40.0 bits (92), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
G G +G +DG A F++P+ A D + NI YVAD NH +R++
Sbjct: 610 GSGSEGLTDGSFSLASFNRPQGVAYDTEKNILYVADTENHALREV 654
>gi|428297525|ref|YP_007135831.1| NHL repeat containing protein [Calothrix sp. PCC 6303]
gi|428234069|gb|AFY99858.1| NHL repeat containing protein [Calothrix sp. PCC 6303]
Length = 511
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 48/91 (52%), Gaps = 15/91 (16%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN--IYVADKSNHVIRKITNLDHGNQL 139
V ++ G G G+ DG+ +A F P+ D +GN +Y+AD NH +R++ D GNQL
Sbjct: 206 VLQIIGVGKAGFVDGDFEAATFSTPQGMTFD-EGNQILYIADTGNHALRQV---DLGNQL 261
Query: 140 IRQI--------NLKPEDCSKSSQSGSGMFW 162
+R I N++P C K+ ++ W
Sbjct: 262 VRTIAGTGIQSRNIRPH-CGKALETPLNSPW 291
>gi|159471860|ref|XP_001694074.1| predicted protein [Chlamydomonas reinhardtii]
gi|158277241|gb|EDP03010.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1591
Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 33/59 (55%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
++ + V +AG + G+ DG +ARF + AVD +GN Y +D SNH +R + D
Sbjct: 1233 VASDGFVTTIAGSSMPGHLDGPAATARFYNLRGVAVDGEGNCYCSDSSNHCVRLLHAAD 1291
>gi|116625316|ref|YP_827472.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116228478|gb|ABJ87187.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 981
Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 36/57 (63%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + ++ +AG+G +G+S DG P +A F P AVD GN+YV D SN +RKI
Sbjct: 226 VTPDGIITTIAGNGTRGFSGDGGPATAATFRGPIGLAVDAFGNVYVTDNSNGRVRKI 282
Score = 44.3 bits (103), Expect = 0.023, Method: Composition-based stats.
Identities = 20/58 (34%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S ++ AG+G G++ G SA+ + P A+D GN+Y+ D SN+ +RK+T
Sbjct: 116 ISAAGIISTYAGNGTAGFAGDSGAATSAQLNGPTDVAIDGNGNLYICDSSNNRVRKVT 173
>gi|158315874|ref|YP_001508382.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158111279|gb|ABW13476.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 772
Score = 45.8 bits (107), Expect = 0.008, Method: Composition-based stats.
Identities = 23/51 (45%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG G G+S DG P + A+ D+P S AV G +YVAD NH +R+I
Sbjct: 442 ITTVAGTGADGFSGDGGPATEAQLDEPTSVAVAPDGTLYVADTRNHRVRRI 492
Score = 36.6 bits (83), Expect = 4.6, Method: Composition-based stats.
Identities = 15/31 (48%), Positives = 21/31 (67%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
SARF P + A G++YVAD+ NH +R+I
Sbjct: 585 SARFSYPSALARGPDGSLYVADQDNHRVRRI 615
>gi|413923862|gb|AFW63794.1| hypothetical protein ZEAMMB73_706161, partial [Zea mays]
Length = 139
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 1/50 (2%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSE 78
E GYTV TV DG +L I P+SV + S +L+V+DS S Y ++ PLS
Sbjct: 75 EGGYTVETVFDGSKLGIEPYSV-EVTQSGELLVMDSMNSNIYRMALPLSR 123
>gi|348690525|gb|EGZ30339.1| hypothetical protein PHYSODRAFT_295167 [Phytophthora sojae]
Length = 383
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 60 IVLDSSRSAFYTLS---FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
IV DSS + + T + + + LAG G DG SARF P + A+ G+
Sbjct: 261 IVTDSSDNIYVTDNNRVMKFTSGGAMTVLAGSTSSGLVDGTGTSARFYHPDALAIGSDGD 320
Query: 117 IYVADKSNHVIRKI 130
+YVADK+N+ IR++
Sbjct: 321 LYVADKANYCIRRL 334
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V GY+DG + F KP+S AV ++YVAD N+ IRK+
Sbjct: 11 VSGYADGYGDAVFFKKPRSLAVGSDADLYVADVGNYYIRKV 51
>gi|395768739|ref|ZP_10449254.1| hypothetical protein Saci8_03116 [Streptomyces acidiscabies 84-104]
Length = 596
Score = 45.8 bits (107), Expect = 0.008, Method: Composition-based stats.
Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 4/75 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S + +V D++R L+ ESVV+R+ G G +G++DG A F +P+ A+ G
Sbjct: 181 SGNFLVSDTTRHQLVELA--QDGESVVRRI-GTGTRGFADGT-AEAAFSEPQGLALLDDG 236
Query: 116 NIYVADKSNHVIRKI 130
+ VAD NH +R++
Sbjct: 237 AVVVADTVNHALRRV 251
>gi|289774196|ref|ZP_06533574.1| NHL repeat containing protein [Streptomyces lividans TK24]
gi|289704395|gb|EFD71824.1| NHL repeat containing protein [Streptomyces lividans TK24]
Length = 605
Score = 45.8 bits (107), Expect = 0.008, Method: Composition-based stats.
Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 4/83 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + P S +V D++R L+ ES V+R+ G G +G++DG A F +P
Sbjct: 181 PGKAVSLP-SGTFLVSDTTRHQLVELA--EDGESPVRRI-GSGRRGFTDGPADRAEFSEP 236
Query: 107 KSFAVDMKGNIYVADKSNHVIRK 129
+ A+ G++ VAD NH +R+
Sbjct: 237 QGLALLGDGSVVVADTVNHALRR 259
>gi|21218701|ref|NP_624480.1| hypothetical protein SCO0142 [Streptomyces coelicolor A3(2)]
gi|6066645|emb|CAB58305.1| conserved hypothetical protein SCJ33.06c [Streptomyces coelicolor
A3(2)]
Length = 601
Score = 45.8 bits (107), Expect = 0.008, Method: Composition-based stats.
Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 4/83 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + P S +V D++R L+ ES V+R+ G G +G++DG A F +P
Sbjct: 177 PGKAVSLP-SGTFLVSDTTRHQLVELA--EDGESPVRRI-GSGRRGFTDGPADRAEFSEP 232
Query: 107 KSFAVDMKGNIYVADKSNHVIRK 129
+ A+ G++ VAD NH +R+
Sbjct: 233 QGLALLGDGSVVVADTVNHALRR 255
>gi|262195555|ref|YP_003266764.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
gi|262078902|gb|ACY14871.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
Length = 404
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G++DG ARFD+P AV + IYVAD +NH IRKIT
Sbjct: 241 GFADGNGAEARFDRPLGVAV-VDDEIYVADSANHRIRKIT 279
Score = 37.4 bits (85), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 24/47 (51%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LAG G +DG A P AV GN+Y+A+ N IRKIT
Sbjct: 86 LAGGEEAGTTDGTRAVATLRNPAGLAVGPNGNVYIAEFDNDRIRKIT 132
Score = 35.8 bits (81), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 26/50 (52%), Gaps = 1/50 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV-IRKI 130
V LAG GV SDG A FD P + D GN+YV + + +RKI
Sbjct: 284 VSTLAGTGVAASSDGSLAEATFDTPYAMTRDSDGNLYVTELGDSFRVRKI 333
>gi|288921012|ref|ZP_06415304.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288347587|gb|EFC81872.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 882
Score = 45.8 bits (107), Expect = 0.009, Method: Composition-based stats.
Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + + +AG GV GYS DG P +A + + AV G+IY+AD NH IRKI+
Sbjct: 648 IAADGTISTIAGTGVAGYSGDGGPATAAQLKSAEKLAVAPDGSIYIADYENHRIRKIS 705
Score = 40.0 bits (92), Expect = 0.49, Method: Composition-based stats.
Identities = 34/112 (30%), Positives = 54/112 (48%), Gaps = 14/112 (12%)
Query: 30 DGYTVT---TVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF------PLSEES 80
DG T T +DG+ E P + G +D+ V +S Y + ++ +
Sbjct: 707 DGIITTIAGTGVDGYSGEGGPATAATLNGPNDVEV--ASDGTLYIANLGSDTIQKINTDG 764
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG G QGYS DG P + A+ P S ++ G +Y+AD N+ IRK+
Sbjct: 765 TIHTVAGTGEQGYSGDGGPATKAQLSIP-SVSLGRNGTLYIADYGNNRIRKV 815
Score = 36.6 bits (83), Expect = 4.7, Method: Composition-based stats.
Identities = 16/31 (51%), Positives = 20/31 (64%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+A+ D P S D KGNIY+ D N+ IRKI
Sbjct: 618 AAKLDGPGSAVTDGKGNIYIPDARNYRIRKI 648
Score = 35.8 bits (81), Expect = 8.4, Method: Composition-based stats.
Identities = 19/57 (33%), Positives = 32/57 (56%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S + ++ +AG GV GYS +G P + A + P V G +Y+A+ + I+KI
Sbjct: 704 ISTDGIITTIAGTGVDGYSGEGGPATAATLNGPNDVEVASDGTLYIANLGSDTIQKI 760
>gi|428183007|gb|EKX51866.1| hypothetical protein GUITHDRAFT_84912 [Guillardia theta CCMP2712]
Length = 351
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 31/54 (57%), Gaps = 1/54 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNLD 134
+ LAG G Q +SDG P + F+ P + AV G +YVAD +N +RKI D
Sbjct: 173 TRTLAGSGNQSFSDGPPSESSFNTPAAIAVSPDGLTVYVADLNNERVRKIAVRD 226
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
LAG G G++DG+ S+ F++P + G +YVAD N+ +RK+
Sbjct: 64 LAGTGTPGFADGDWNSSSFNQPCGVSTSPDGQWLYVADSGNNRVRKV 110
>gi|440714668|ref|ZP_20895245.1| NHL repeat containing protein [Rhodopirellula baltica SWK14]
gi|436440433|gb|ELP33756.1| NHL repeat containing protein [Rhodopirellula baltica SWK14]
Length = 283
Score = 45.8 bits (107), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
+ T + G ++ +D G+ L+VL+ + + + + LAG G +G
Sbjct: 143 LVTTLAGRTSKLKDPRAVDLDGNRRLLVLERNGNRLRRVE----SNGDITTLAGSGKKGT 198
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+DG+ A F+ PK V G +Y+AD NH++R
Sbjct: 199 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 233
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 29/49 (59%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
GY DG SARF+K + +D + +Y++D +NH +R++ G ++
Sbjct: 38 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTTDGEWMV 86
>gi|410448598|ref|ZP_11302672.1| hypothetical protein LEP1GSC068_3151 [Leptospira sp. Fiocruz
LV3954]
gi|410017668|gb|EKO79726.1| hypothetical protein LEP1GSC068_3151 [Leptospira sp. Fiocruz
LV3954]
Length = 356
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 41/85 (48%), Gaps = 1/85 (1%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I LD Y ++ VV AG G++ DG +A F
Sbjct: 20 NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
P VD GNI+V+D+ ++IRKI
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKI 103
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
GV G+ +G+ ++ F P +D + N+YV + NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207
>gi|32472743|ref|NP_865737.1| hypothetical protein RB3815 [Rhodopirellula baltica SH 1]
gi|32443980|emb|CAD73422.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length = 351
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
+ T + G ++ +D G+ L+VL+ + + + + LAG G +G
Sbjct: 211 LVTTLAGRTSKLKDPRAVDLDGNRRLLVLERNGNRLRRVK----SNGDITTLAGSGKKGK 266
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+DG+ A F+ PK V G +Y+AD NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 29/49 (59%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
GY DG SARF+K + +D + +Y++D +NH +R++ G ++
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTTDGEWMV 154
>gi|421111981|ref|ZP_15572448.1| hypothetical protein LEP1GSC071_4063 [Leptospira santarosai str.
JET]
gi|410802632|gb|EKS08783.1| hypothetical protein LEP1GSC071_4063 [Leptospira santarosai str.
JET]
Length = 356
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 41/85 (48%), Gaps = 1/85 (1%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I LD Y ++ VV AG G++ DG +A F
Sbjct: 20 NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
P VD GNI+V+D+ ++IRKI
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKI 103
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
GV G+ +G+ ++ F P +D + N+YV + NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207
>gi|451982091|ref|ZP_21930422.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
gi|451760645|emb|CCQ91702.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
Length = 724
Score = 45.4 bits (106), Expect = 0.010, Method: Composition-based stats.
Identities = 18/57 (31%), Positives = 36/57 (63%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + ++ +AG+G G+S +G A + P +D+KGN+Y+AD++NH +R++
Sbjct: 81 IGKNGIITTVAGNGNAGFSGDEGPALEAALNFPAGLCLDLKGNLYIADRNNHRVRRV 137
Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats.
Identities = 20/57 (35%), Positives = 34/57 (59%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + ++ +AG GVQG+S DG P A+ P A D +GN+Y D+ N+ +R++
Sbjct: 636 VDKNGIITTVAGTGVQGFSGDGGPAVQAQLKSPYRMAFDSEGNLYFTDRDNNRVRRV 692
Score = 36.6 bits (83), Expect = 4.7, Method: Composition-based stats.
Identities = 30/95 (31%), Positives = 43/95 (45%), Gaps = 4/95 (4%)
Query: 39 DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--G 96
+G EI H D S V S RS L ++ + + +AG G+ GY G
Sbjct: 158 EGPAEEIPLHFPSDVACDSQGQVYISDRSNNRVLK--MNPQGQIVTVAGLGMAGYGGDFG 215
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
A P VD GN+Y+AD+ N+ +RK+T
Sbjct: 216 PAIDALLKYPFGIHVDESGNLYIADRGNNRVRKVT 250
>gi|456874551|gb|EMF89837.1| hypothetical protein LEP1GSC005_3835 [Leptospira santarosai str.
ST188]
Length = 356
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 41/85 (48%), Gaps = 1/85 (1%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I LD Y ++ VV AG G++ DG +A F
Sbjct: 20 NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
P VD GNI+V+D+ ++IRKI
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKI 103
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
GV G+ +G+ ++ F P +D + N+YV + NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207
>gi|422003150|ref|ZP_16350382.1| hypothetical protein LSS_06614 [Leptospira santarosai serovar
Shermani str. LT 821]
gi|417258114|gb|EKT87507.1| hypothetical protein LSS_06614 [Leptospira santarosai serovar
Shermani str. LT 821]
Length = 356
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 41/85 (48%), Gaps = 1/85 (1%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I LD Y ++ VV AG G++ DG +A F
Sbjct: 20 NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
P VD GNI+V+D+ ++IRKI
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKI 103
Score = 37.4 bits (85), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
GV G+ +G+ ++ F P +D + N+YV + NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207
>gi|398331208|ref|ZP_10515913.1| hypothetical protein LalesM3_04029 [Leptospira alexanderi serovar
Manhao 3 str. L 60]
Length = 357
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 36/70 (51%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y ++ V+ AG G+ DG +A F+ P +D GNI+V+
Sbjct: 36 ILDCILKECYLCKLKVTNNPVISLFAGTGINVSIDGTTQTASFNTPFGLELDTFGNIFVS 95
Query: 121 DKSNHVIRKI 130
D+S ++IRKI
Sbjct: 96 DQSANLIRKI 105
Score = 40.0 bits (92), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G GV GY DG+ SA+F P D K + + VAD NH IRKI
Sbjct: 210 VSTLSG-GVSGYLDGDLASAQFKSPLGITYDQKTDSLLVADLQNHRIRKI 258
>gi|290975027|ref|XP_002670245.1| predicted protein [Naegleria gruberi]
gi|284083802|gb|EFC37501.1| predicted protein [Naegleria gruberi]
Length = 1363
Score = 45.4 bits (106), Expect = 0.011, Method: Composition-based stats.
Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 9/103 (8%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
L Y +TTV + +P I + + D+IV D+ + +S+ V+ R+AG
Sbjct: 393 LPSKYYITTVATDFK---DPQK-IAKLSNGDIIVSDTGDHSIKKISY---STGVISRIAG 445
Query: 88 DGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
GV G+S G A+ +KP A+ IY+AD NH IR
Sbjct: 446 TGVAGFSGDGGLASQAQLNKPYGIAITANDEIYIADNLNHRIR 488
Score = 38.5 bits (88), Expect = 1.5, Method: Composition-based stats.
Identities = 31/121 (25%), Positives = 57/121 (47%), Gaps = 6/121 (4%)
Query: 12 ITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYT 71
+ ++++L +V+ L ++T G+ +P + R + D++V D+ A
Sbjct: 13 VVMMAMLVYMVNGAPLNTYSVKISTFAGGNGEFKDPQK-LARLSNGDILVSDALGHAIKK 71
Query: 72 LSFPLSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
++ V+ +AG+G G+ +G +A +KP AV IY AD N+ IRK
Sbjct: 72 INV---TSGVITTIAGNGTAGFGGDNGPAVNALVNKPFGIAVSQTDEIYFADSGNNRIRK 128
Query: 130 I 130
I
Sbjct: 129 I 129
>gi|290969880|ref|XP_002667977.1| predicted protein [Naegleria gruberi]
gi|284080923|gb|EFC35233.1| predicted protein [Naegleria gruberi]
Length = 366
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 29/43 (67%), Gaps = 1/43 (2%)
Query: 89 GVQGYSDGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G G ++G P +ARF+ P A+ G ++VADK+NH+IR+I
Sbjct: 125 GTSGSTNGNTPTAARFNNPYGMALAADGTMFVADKANHIIRRI 167
>gi|393789630|ref|ZP_10377750.1| hypothetical protein HMPREF1068_04030 [Bacteroides nordii
CL02T12C05]
gi|392650346|gb|EIY44015.1| hypothetical protein HMPREF1068_04030 [Bacteroides nordii
CL02T12C05]
Length = 435
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 51/122 (41%), Gaps = 22/122 (18%)
Query: 31 GYTVTTVIDGHQLEINPHS----VIDRPGSSD----LIVLDSSRSAFYTLSFPL----SE 78
YT I + L+ H +++ PG +D + D+ R T L S
Sbjct: 311 AYTNAHCIYTYNLKTGAHKLFAGMVNTPGYADGPCEYALFDTPRQIILTAENDLFLADSN 370
Query: 79 ESVVKRLAGDGV----------QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
V++++ DGV G DG P A F++P AVD G IY+ D N IR
Sbjct: 371 NHVIRKITQDGVVSTVIGQAGMTGNQDGTPEVALFNEPFGVAVDTDGTIYIGDSKNQSIR 430
Query: 129 KI 130
++
Sbjct: 431 RL 432
Score = 42.4 bits (98), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GY+DG A FD P+ + + ++++AD +NHVIRKIT
Sbjct: 339 GYADGPCEYALFDTPRQIILTAENDLFLADSNNHVIRKIT 378
>gi|441163235|ref|ZP_20968262.1| leucine-rich repeat-containing protein [Streptomyces rimosus subsp.
rimosus ATCC 10970]
gi|440616374|gb|ELQ79516.1| leucine-rich repeat-containing protein [Streptomyces rimosus subsp.
rimosus ATCC 10970]
Length = 553
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 17/98 (17%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + ++ +AG+G GY DG P + R P A+D GN+Y+AD +NH +R +T
Sbjct: 80 VTSDGIITTVAGNGTAGYVDDGGPAAGTRLYYPYGIALDRGGNLYIADCNNHRVRGVT-- 137
Query: 134 DHGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFSPYLI 171
+ Q+ P + + V SPY +
Sbjct: 138 -----AVAQMTPPPPPTAD--------LYGEVISPYRV 162
Score = 44.3 bits (103), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ ++ +AG+G GY SDG P R P A+D G++Y+ D NH IRK+T
Sbjct: 24 VTPNGLITTVAGNGTAGYVSDGGPALGTRLHYPWGLALDEAGSLYIGDGHNHRIRKVT 81
>gi|116622401|ref|YP_824557.1| Ig domain-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116225563|gb|ABJ84272.1| Ig domain protein, group 1 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 2770
Score = 45.4 bits (106), Expect = 0.011, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 35/52 (67%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ +AG G +GYS DG+P + A+ + P A+D GN+++AD NH +R++
Sbjct: 141 IITTVAGTGERGYSGDGQPATLAKINSPYHIALDGHGNLFIADDGNHRVRRV 192
Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG G+QG+S G +AR P A+ G++Y+AD NH IRK+
Sbjct: 25 ISTVAGTGIQGFSGDSGPATAARLFDPTDVAIHPNGDLYIADTYNHRIRKV 75
Score = 38.5 bits (88), Expect = 1.5, Method: Composition-based stats.
Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ +AG G GY+ D + A P+ +D GN+Y+AD NH +R +
Sbjct: 198 VITTVAGTGNAGYNGDDQQATHADLQNPRGVLIDASGNLYIADYGNHRVRVV 249
>gi|290982604|ref|XP_002674020.1| predicted protein [Naegleria gruberi]
gi|284087607|gb|EFC41276.1| predicted protein [Naegleria gruberi]
Length = 499
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 43/90 (47%), Gaps = 13/90 (14%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
+V+D++ Y F + VV GD ++ + G +FD PK D+ GN+ V
Sbjct: 296 LVIDTTNEDIYICDFD-NHRIVVLSKNGDFLRDFGSKGSGDTQFDGPKDIDFDLDGNLVV 354
Query: 120 ADKSNH-----------VIRKITNLDHGNQ 138
+D+ NH +IRKI N HGNQ
Sbjct: 355 SDRENHRILVLSKLNGSLIRKIEN-GHGNQ 383
>gi|290970060|ref|XP_002668023.1| predicted protein [Naegleria gruberi]
gi|284081057|gb|EFC35279.1| predicted protein [Naegleria gruberi]
Length = 318
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 51/100 (51%), Gaps = 14/100 (14%)
Query: 79 ESVVKRLAGDGVQGYSDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
+S++K +AG QG DG SA + P S A++ G +Y+ADKS+H IRK+ N N
Sbjct: 30 KSIIKTIAG---QGVCDGISATSASLNGPSSTAINSLGEVYIADKSHHRIRKVYN----N 82
Query: 138 QLIRQINLKPEDCSKSSQSGSGMFWVT--VFSPYLISCKS 175
I I + +G G+ T +FSPY I S
Sbjct: 83 GTIVTI----AGTGIAGYNGDGIDATTAQLFSPYGIGIDS 118
Score = 36.6 bits (83), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 30/55 (54%), Gaps = 2/55 (3%)
Query: 78 EESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG G+ GY+ DG + +A+ P +D K +Y +D SNH IRKI
Sbjct: 81 NNGTIVTIAGTGIAGYNGDGIDATTAQLFSPYGIGIDSKDLVYFSDSSNHRIRKI 135
>gi|223939922|ref|ZP_03631790.1| NHL repeat containing protein [bacterium Ellin514]
gi|223891413|gb|EEF57906.1| NHL repeat containing protein [bacterium Ellin514]
Length = 359
Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 44/79 (55%), Gaps = 8/79 (10%)
Query: 60 IVLDSSRSAFY------TLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAV 111
+ LD + + F+ T+ ++ +V +AG+G G+S DG P + A+ ++P S
Sbjct: 113 VRLDQAENLFWVERLSHTVRKCDAKTGIVTTIAGNGTAGFSGDGGPATKAQMNEPHSIGF 172
Query: 112 DMKGNIYVADKSNHVIRKI 130
D GN+Y+ D NH IRK+
Sbjct: 173 DKAGNLYICDVRNHRIRKV 191
Score = 40.4 bits (93), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
E+ + +AG G G++ +G P +A F PK +V GN++VAD NH IR I
Sbjct: 251 EKGRIFHVAGTGKNGFTGNGGPAKAATFKGPKGLSVASNGNVFVADTENHAIRMI 305
>gi|29830929|ref|NP_825563.1| hypothetical protein SAV_4386 [Streptomyces avermitilis MA-4680]
gi|29608042|dbj|BAC72098.1| hypothetical protein SAV_4386 [Streptomyces avermitilis MA-4680]
Length = 341
Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 10/99 (10%)
Query: 39 DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
DG + P + R PG S + +V D++R L+ ESVV+R+ G G++
Sbjct: 165 DGPYVAPEPEPTVLRFPGKALLLPSGNFLVSDTTRHQLVELA--ADGESVVRRI-GSGIR 221
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G++DG +A F +P+ + + + VAD NH +R++
Sbjct: 222 GFADGPADAASFSEPQGMTLLDEDVVVVADTVNHALRRL 260
>gi|302821467|ref|XP_002992396.1| hypothetical protein SELMODRAFT_135182 [Selaginella moellendorffii]
gi|300139812|gb|EFJ06546.1| hypothetical protein SELMODRAFT_135182 [Selaginella moellendorffii]
Length = 161
Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 53/131 (40%), Gaps = 45/131 (34%)
Query: 74 FPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
FPL +S ++ AG G GY DG + F++P+S ++ G ++VAD N IRKI+
Sbjct: 14 FPL--DSSLEAFAGSFVGESGYVDGPAAKSLFNRPQSLSICDNGAVFVADTRNLAIRKIS 71
Query: 132 NL----------------------------------------DHGNQLIRQINL-KPEDC 150
D GN+LIR+I + P+ C
Sbjct: 72 KFLPTVTTIAGGSSRKPGFAGDTARFSSEFSLACSCGSLLIADCGNRLIREIQIDDPKSC 131
Query: 151 SKSSQSGSGMF 161
S + SG F
Sbjct: 132 DSSDSAVSGDF 142
>gi|114566902|ref|YP_754056.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
gi|114337837|gb|ABI68685.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
Length = 1030
Score = 45.4 bits (106), Expect = 0.012, Method: Composition-based stats.
Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ AG+G GYS DG P SA+ + P + D +GN+Y+AD NH IR +
Sbjct: 656 IISTAAGNGTYGYSGDGGPATSAQLNNPNGLSFDNRGNMYIADTYNHRIRMV 707
Score = 43.9 bits (102), Expect = 0.030, Method: Composition-based stats.
Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 8/61 (13%)
Query: 81 VVKRLAGDGV------QGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ +AG+GV GYS G SA+ + P A D GN+Y+AD +NH IRK+
Sbjct: 594 IISTVAGNGVLTGSYKSGYSGDGGSATSAQLNYPYGVAFDASGNMYIADSNNHCIRKVDT 653
Query: 133 L 133
L
Sbjct: 654 L 654
Score = 43.9 bits (102), Expect = 0.036, Method: Composition-based stats.
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 3/63 (4%)
Query: 71 TLSFPLSEES-VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
T L++ + ++ +AG+G GYS G SA + P A D GN+Y+AD SN +
Sbjct: 471 TFRLTLTQNTGIISTVAGNGTAGYSGDGGLAASALLNYPHGLAFDGNGNLYIADASNRRV 530
Query: 128 RKI 130
RKI
Sbjct: 531 RKI 533
Score = 42.4 bits (98), Expect = 0.086, Method: Composition-based stats.
Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 8/79 (10%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAV 111
G+ +L + D+S + ++ +AG+G GYS G+ GSA + P A
Sbjct: 516 GNGNLYIADASNRRVRKID----SAGIITTVAGNGTSGYS-GDGGSAIAAKITCPYGVAF 570
Query: 112 DMKGNIYVADKSNHVIRKI 130
D GN+Y+AD NH IRK+
Sbjct: 571 DSNGNMYIADIFNHRIRKV 589
Score = 40.4 bits (93), Expect = 0.40, Method: Composition-based stats.
Identities = 34/146 (23%), Positives = 67/146 (45%), Gaps = 14/146 (9%)
Query: 44 EINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG-YSDGEPG-SA 101
++N + I S ++ + DS+ + + ++ AG+G G + DG P SA
Sbjct: 743 QLNNPNGITFDSSGNMYIADSNNNCIRKVD----HSGMISTFAGNGTSGHFGDGGPATSA 798
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI--------RQINLKPEDCSKS 153
+ P A+D GN+++AD +H IRK+ N I ++I+ + + +
Sbjct: 799 QLRNPVGVALDNSGNLFIADYFDHSIRKVVLAAQQNSTISPTTGNFDKRISAQADVSTTM 858
Query: 154 SQSGSGMFWVTVFSPYLISCKSFWVH 179
+ +G+ + +T LI K + V+
Sbjct: 859 TLNGNELLRITNGETALIQGKDYNVN 884
Score = 37.4 bits (85), Expect = 3.1, Method: Composition-based stats.
Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 13/69 (18%)
Query: 81 VVKRLAGDGVQGYSDGEPG----------SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ +AG+G G G G SA+ + P D GN+Y+AD +N+ IRK+
Sbjct: 712 VISTVAGNGNSGDRYGNDGGYSGDGGLATSAQLNNPNGITFDSSGNMYIADSNNNCIRKV 771
Query: 131 TNLDHGNQL 139
DH +
Sbjct: 772 ---DHSGMI 777
>gi|312198347|ref|YP_004018408.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311229683|gb|ADP82538.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 814
Score = 45.1 bits (105), Expect = 0.013, Method: Composition-based stats.
Identities = 20/54 (37%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 79 ESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V++ +AG GV GYS G+ +A P+ VD GN+Y+ D N+ +R+I
Sbjct: 703 DGVIETIAGTGVAGYSGDGGKATAATLRDPRGVTVDAAGNVYITDSGNNRVRRI 756
Score = 42.4 bits (98), Expect = 0.088, Method: Composition-based stats.
Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 2/54 (3%)
Query: 79 ESVVKRLAGD-GVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ ++ +AG G + DG+P + A KP + A+D +G IY+AD NH IR+I
Sbjct: 647 DGIITTVAGRFGYGSWGDGKPATQAMISKPFNVALDRQGRIYIADSYNHKIRRI 700
Score = 38.5 bits (88), Expect = 1.4, Method: Composition-based stats.
Identities = 24/61 (39%), Positives = 35/61 (57%), Gaps = 8/61 (13%)
Query: 73 SFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
SFP V +AG G G++ DG P + A+ D P A+D GN+Y AD N+ +R+I
Sbjct: 477 SFP------VVAIAGTGQAGFAGDGGPAAQAQLDHPYGPAMDGFGNLYFADFDNNRVRRI 530
Query: 131 T 131
+
Sbjct: 531 S 531
Score = 37.0 bits (84), Expect = 3.8, Method: Composition-based stats.
Identities = 18/58 (31%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S + + +AG+G G+S DG P +A +KP + A+ G +Y+ D N +R+++
Sbjct: 530 ISPDGTITTVAGNGQPGFSGDGGPATAAMLNKPVAVAIGPGGTLYIVDTFNMRVRQVS 587
Score = 36.6 bits (83), Expect = 4.7, Method: Composition-based stats.
Identities = 23/88 (26%), Positives = 47/88 (53%), Gaps = 9/88 (10%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS---DGEPGS-AR 102
P +V PG + L ++D+ F +S + +++ +AG G + ++ DG P + A
Sbjct: 562 PVAVAIGPGGT-LYIVDT----FNMRVRQVSPDGIIQTIAGSGERPWNPADDGGPATNAA 616
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
P A+D G++++AD N ++R++
Sbjct: 617 LWYPSGIAIDSAGDLFIADNGNDIVRRV 644
>gi|373955343|ref|ZP_09615303.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
gi|373891943|gb|EHQ27840.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
Length = 929
Score = 45.1 bits (105), Expect = 0.014, Method: Composition-based stats.
Identities = 20/47 (42%), Positives = 31/47 (65%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
AG G G ++G ++ F P A D+ GN+YVAD+ N++IRKI++
Sbjct: 142 AGTGNSGSTNGSALTSTFYSPTRVAADLSGNLYVADRDNNLIRKISS 188
Score = 43.1 bits (100), Expect = 0.052, Method: Composition-based stats.
Identities = 21/52 (40%), Positives = 31/52 (59%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
V LAG G+ ++DG +A F P +D GN+Y+AD N+ +RK+T L
Sbjct: 286 VTTLAGSGMAAFADGTGTAASFYGPCGGTLDAMGNLYIADGVNNRVRKVTPL 337
Score = 42.7 bits (99), Expect = 0.075, Method: Composition-based stats.
Identities = 22/49 (44%), Positives = 31/49 (63%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V +AG+G G ++G +A F P S VD GN+YV+D N++IRKI
Sbjct: 232 VTVVAGNGSMGSNNGIGSAASFYYPFSVTVDGAGNLYVSDNGNNLIRKI 280
Score = 40.8 bits (94), Expect = 0.26, Method: Composition-based stats.
Identities = 22/49 (44%), Positives = 27/49 (55%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V LAG V G ++G +A F +P D G IYV D N+VIRKI
Sbjct: 394 VTILAGSNVAGSANGIGTAASFRRPNDVQADQSGFIYVTDYGNNVIRKI 442
Score = 36.2 bits (82), Expect = 7.4, Method: Composition-based stats.
Identities = 14/29 (48%), Positives = 21/29 (72%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
F++P VD+ GN+YVAD + + I+KIT
Sbjct: 199 FNQPNGVTVDLNGNVYVADAATNSIKKIT 227
Score = 35.8 bits (81), Expect = 8.8, Method: Composition-based stats.
Identities = 17/50 (34%), Positives = 28/50 (56%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
VV +AG+G + +G SA + P +D+ G +YVA+ + IRK+
Sbjct: 339 VVTTVAGNGTRATINGNGTSASLNTPTGATIDIAGIVYVAELDGNCIRKV 388
>gi|418754100|ref|ZP_13310334.1| hypothetical protein LEP1GSC179_2637 [Leptospira santarosai str.
MOR084]
gi|409965522|gb|EKO33385.1| hypothetical protein LEP1GSC179_2637 [Leptospira santarosai str.
MOR084]
Length = 356
Score = 45.1 bits (105), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 1/85 (1%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I+ D Y ++ VV AG G++ DG +A F
Sbjct: 20 NPSIFMSRAWAENTII-DCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
P VD GNI+V+D+ ++IRKI
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKI 103
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
GV G+ +G+ ++ F P +D + N+YV + NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207
>gi|332878080|ref|ZP_08445810.1| NHL repeat protein [Capnocytophaga sp. oral taxon 329 str. F0087]
gi|357047673|ref|ZP_09109271.1| NHL repeat protein [Paraprevotella clara YIT 11840]
gi|332684042|gb|EGJ56909.1| NHL repeat protein [Capnocytophaga sp. oral taxon 329 str. F0087]
gi|355529361|gb|EHG98795.1| NHL repeat protein [Paraprevotella clara YIT 11840]
Length = 461
Score = 45.1 bits (105), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 46/101 (45%), Gaps = 26/101 (25%)
Query: 53 RPGSSD------------LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV---------- 90
RPG +D +VLD + L SE + ++++ +GV
Sbjct: 363 RPGHADGLASDAQFHSPRQLVLDEEEN----LYIADSENNCIRKITPEGVVSTVIGIPGK 418
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GY DG P A F +P A+D +G IYV DK N +R+++
Sbjct: 419 SGYKDGTPDVALFTQPWGLAIDSEGIIYVGDKDNLCVRQLS 459
Score = 43.5 bits (101), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 33/63 (52%)
Query: 69 FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
F ++ P E + G G++DG A+F P+ +D + N+Y+AD N+ IR
Sbjct: 342 FEDITDPTREGNFKIYAGMQGRPGHADGLASDAQFHSPRQLVLDEEENLYIADSENNCIR 401
Query: 129 KIT 131
KIT
Sbjct: 402 KIT 404
>gi|320105416|ref|YP_004181006.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319923937|gb|ADV81012.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 822
Score = 45.1 bits (105), Expect = 0.014, Method: Composition-based stats.
Identities = 25/53 (47%), Positives = 32/53 (60%), Gaps = 4/53 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG GV G+S G+ G+A P AVD GN+YVAD NH IRKI+
Sbjct: 149 IATIAGTGVAGFS-GDGGAAVLATLSHPTGLAVDTGGNLYVADTDNHRIRKIS 200
Score = 43.5 bits (101), Expect = 0.045, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 33/54 (61%), Gaps = 2/54 (3%)
Query: 80 SVVKRLAGDGVQGYS-DGEPGSARF-DKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ + +AG G QG++ DG P +A + D P AVD GN+Y+AD N IR ++
Sbjct: 202 TTITTVAGSGEQGFAGDGGPATAAWLDSPDGVAVDATGNLYIADTHNQRIRVVS 255
Score = 42.4 bits (98), Expect = 0.086, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S ++ +AG G QG++ G SA D P AVD GN+Y++D N IR+++
Sbjct: 88 VSAAGIITTVAGTGEQGFAGDGGSATSAWLDSPVGVAVDRVGNLYISDSHNQRIRRVS 145
Score = 40.8 bits (94), Expect = 0.28, Method: Composition-based stats.
Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 4/58 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S E + +AG+G + Y+ G+ GSA +P+ +VD GNIY AD N+ IR I
Sbjct: 254 VSAEGTISTIAGNGSRAYA-GDGGSAVAASLARPRGLSVDALGNIYFADSDNNRIRLI 310
Score = 36.6 bits (83), Expect = 5.7, Method: Composition-based stats.
Identities = 16/38 (42%), Positives = 23/38 (60%)
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
S+ E +A P A D GN+Y AD +NHV+RK++
Sbjct: 52 SNVEATTAPLASPAGIAYDASGNMYFADLNNHVVRKVS 89
>gi|383120609|ref|ZP_09941337.1| hypothetical protein BSIG_2379 [Bacteroides sp. 1_1_6]
gi|382985045|gb|EES68418.2| hypothetical protein BSIG_2379 [Bacteroides sp. 1_1_6]
Length = 483
Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 40/89 (44%), Gaps = 4/89 (4%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARF 103
NPH + L + D + + PL V + G++GY DG P A F
Sbjct: 394 NPHQICFTE-DGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMKGYKDGGPDIALF 452
Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
+ P AV G I YVAD N VIRK++
Sbjct: 453 NHPFGVAVSADGQIVYVADTGNKVIRKLS 481
>gi|357412450|ref|YP_004924186.1| alkyl hydroperoxide reductase [Streptomyces flavogriseus ATCC
33331]
gi|320009819|gb|ADW04669.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces flavogriseus ATCC 33331]
Length = 603
Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats.
Identities = 26/72 (36%), Positives = 37/72 (51%), Gaps = 3/72 (4%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
+V DS+R L ++ V+ G G +G++DG ARF +P+ AV G I
Sbjct: 188 FLVSDSTRHRLVELE---ADGETVRGHFGTGERGFADGGREEARFSEPQGLAVLPDGRIA 244
Query: 119 VADKSNHVIRKI 130
VAD NH IR +
Sbjct: 245 VADTVNHAIRAL 256
>gi|126179662|ref|YP_001047627.1| NHL repeat-containing protein [Methanoculleus marisnigri JR1]
gi|125862456|gb|ABN57645.1| NHL repeat containing protein [Methanoculleus marisnigri JR1]
Length = 487
Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
AGDG +GY DG G AR ++P VD G Y+AD NH++R + H
Sbjct: 430 AGDGDRGYRDGLSGEARLNEPGGL-VDFGGLWYIADTGNHIVRVYDPVRH 478
Score = 35.8 bits (81), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 1/46 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
G G G +DG A F P+ A D + I YVAD NH IR+++
Sbjct: 210 GSGAAGNADGPFDEAAFYLPEGLAFDEEAGILYVADTGNHTIRRVS 255
>gi|88601128|gb|ABD46562.1| NHL repeat-containing protein [Vermamoeba vermiformis]
Length = 309
Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + VV LAG G G DG ARF+ P VD GN+YVAD N+ +RK+T
Sbjct: 29 ITPDGVVVTLAGHPGHWGSRDGVGSKARFNGPSGLDVDTDGNVYVADYYNNTMRKVT 85
Score = 44.7 bits (104), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 36/53 (67%), Gaps = 1/53 (1%)
Query: 82 VKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
V +AG +Q G +DG+ +ARF++P +VD GN++V+D NH IRKI+ L
Sbjct: 202 VTTIAGRVMQPGSADGKLTAARFNQPSGISVDKIGNLFVSDYYNHTIRKISPL 254
Score = 36.2 bits (82), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 38/60 (63%), Gaps = 1/60 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
++ E +V +AG G G +DG ARF+ P +D +GNI ++D +N+ +RKI+N+D
Sbjct: 84 VTPEGIVTTIAGHVGQWGSTDGSGEKARFNGPSGVRIDTEGNIIISDNNNNTVRKISNID 143
Score = 35.8 bits (81), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G DG ARF P AV G I+V D+ NH IR I+
Sbjct: 158 GSEDGNGQQARFFGPSGIAVAPDGTIFVCDRYNHTIRSIS 197
>gi|255070387|ref|XP_002507275.1| predicted protein [Micromonas sp. RCC299]
gi|226522550|gb|ACO68533.1| predicted protein [Micromonas sp. RCC299]
Length = 131
Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 32/54 (59%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
V ++G G+ G+ DG+ + F P AV G++YVAD N+ IR++++ H
Sbjct: 6 VTSISGSGIPGFRDGQGEKSHFSSPAGVAVSSDGSVYVADSGNNRIRRVSSKGH 59
Score = 37.4 bits (85), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 32/56 (57%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S + V +AG G+ G+SD + A F++P+ G I+VAD NH IR ++
Sbjct: 54 VSSKGHVTWIAGSGLAGFSDDKLLRAEFNRPQGIVTIPTGLIFVADTLNHRIRLVS 109
>gi|111219920|ref|YP_710714.1| serine/threonine protein kinase [Frankia alni ACN14a]
gi|111147452|emb|CAJ59102.1| Putative serine/threonine protein kinase [Frankia alni ACN14a]
Length = 725
Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats.
Identities = 20/57 (35%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + VV +AG+G++G++ DG P A P AVD G+IY++D+ N +R++
Sbjct: 422 VDRDGVVTTVAGNGIRGFTGDGGPAIRAELADPAGIAVDAAGDIYISDQGNQRVRRV 478
Score = 40.0 bits (92), Expect = 0.47, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 31/64 (48%), Gaps = 15/64 (23%)
Query: 81 VVKRLAGDGVQGYSDGEPG--------------SARFDKPKSFAVDMKGNIYVADKSNHV 126
V+ AG GV G+S GE G A D+P + VD GN+Y+ D SN
Sbjct: 483 VITTFAGTGVFGFS-GENGPKIGGFSGDGVLARQAMLDEPSALWVDRAGNVYICDGSNDR 541
Query: 127 IRKI 130
IRK+
Sbjct: 542 IRKV 545
Score = 37.4 bits (85), Expect = 2.7, Method: Composition-based stats.
Identities = 18/46 (39%), Positives = 27/46 (58%), Gaps = 2/46 (4%)
Query: 87 GDGVQGYSDGEPGSAR--FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G G G+S +A +KP + AVD GN+Y+ DKSN +R++
Sbjct: 377 GTGNDGFSGDGGRAAGAGLNKPDAMAVDTAGNLYIVDKSNQRVRRV 422
Score = 36.2 bits (82), Expect = 6.8, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+AR P+S AVD+ G +YV D+ N+ +R+I HG
Sbjct: 571 AARLQWPESLAVDVAGTMYVTDQGNNRVRRIDT--HG 605
>gi|357008076|ref|ZP_09073075.1| copper amine oxidase domain-containing protein [Paenibacillus elgii
B69]
Length = 533
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 29/50 (58%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V +AG+G+ G DGE A F +P AV G +YVAD NH IR I+
Sbjct: 160 VSTVAGNGLLGRKDGEGKDALFYRPTDVAVAADGTLYVADSLNHAIRSIS 209
Score = 41.6 bits (96), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 34/60 (56%), Gaps = 11/60 (18%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR-------KITNLDHGNQ 138
AGD ++DG+ SA+F++P + +D KGN+ V+D N IR K+T L G Q
Sbjct: 235 AGD----FADGDLKSAKFNEPTALVLDAKGNLIVSDSGNQRIRYIDLQQGKVTTLAGGGQ 290
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 41/89 (46%), Gaps = 15/89 (16%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG---------VQG-YSDGEPG 99
V+D G +LIV DS + ++ V LAG G VQG ++DG
Sbjct: 255 VLDAKG--NLIVSDSGNQRIRYIDL---QQGKVTTLAGGGQAATNKELHVQGGFADGSAS 309
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIR 128
ARF P A+ +G + +AD NH IR
Sbjct: 310 DARFSFPMGLALTEEGGLVIADSQNHAIR 338
Score = 37.0 bits (84), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
PL+E + LAG+G G +DG SA F P V G + V+D N +IRK++
Sbjct: 40 PLTEMAT---LAGNGGLGSADGAGVSASFRMPGGLTVLKDGTVLVSDSRNQLIRKLSQ 94
>gi|399157177|ref|ZP_10757244.1| NHL repeat containing protein, partial [SAR324 cluster bacterium
SCGC AAA001-C10]
Length = 352
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V LAG G +D SA+F+ PK D N+YVAD SNH IRKI
Sbjct: 238 TAVTTLAGQSDNGSTDATGTSAQFNYPKGITTDGT-NLYVADYSNHRIRKI 287
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 27/50 (54%), Gaps = 2/50 (4%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
VV LAG QG +D SA F P D N+YVAD+ NH IRKI
Sbjct: 293 VVTTLAGSS-QGSTDATGTSASFYYPSGITTDGT-NLYVADRYNHRIRKI 340
>gi|441185716|ref|ZP_20970504.1| hypothetical protein SRIM_41417, partial [Streptomyces rimosus
subsp. rimosus ATCC 10970]
gi|440613960|gb|ELQ77297.1| hypothetical protein SRIM_41417, partial [Streptomyces rimosus
subsp. rimosus ATCC 10970]
Length = 108
Score = 44.7 bits (104), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 2/55 (3%)
Query: 79 ESVVKRLAGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ + +AG+GV G+ SDG G+ R P+ VD GN+++AD+ NH IRK+T
Sbjct: 18 DGSIATVAGNGVAGFISDGGHGALTRVYYPQGVTVDKNGNLFIADRYNHRIRKVT 72
>gi|423215200|ref|ZP_17201728.1| hypothetical protein HMPREF1074_03260 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692463|gb|EIY85701.1| hypothetical protein HMPREF1074_03260 [Bacteroides xylanisolvens
CL03T12C04]
Length = 451
Score = 44.7 bits (104), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 28/43 (65%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G G+ DG P A+F+ P+ V ++YVAD+ NHVIR++T
Sbjct: 407 GQSGFGDGLPELAKFNSPECVTVYPDNSVYVADRENHVIRRVT 449
Score = 43.5 bits (101), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GY +G + R ++P+ A D GN++V +KS H+IRKIT
Sbjct: 355 GYVNGSGATVRLNQPRQPAFDEDGNMFVPEKSAHIIRKIT 394
>gi|320107007|ref|YP_004182597.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319925528|gb|ADV82603.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 710
Score = 44.7 bits (104), Expect = 0.019, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ E + +AG G QGYS E + AR D P S A D G + AD+ NH IR++
Sbjct: 175 VDENGSISTVAGTGDQGYSGDEGPAILARIDSPSSLAFDAAGRLLFADRRNHRIRRV 231
Score = 43.5 bits (101), Expect = 0.045, Method: Composition-based stats.
Identities = 23/58 (39%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LS +AGDG QG++ DG P SA D P AV +G +++AD NH +R ++
Sbjct: 73 LSTTGTWTTVAGDGRQGFAGDGGPAISAELDTPMGIAVSAEGTLWIADAHNHRVRLVS 130
>gi|298245110|ref|ZP_06968916.1| NHL repeat containing protein [Ktedonobacter racemifer DSM 44963]
gi|297552591|gb|EFH86456.1| NHL repeat containing protein [Ktedonobacter racemifer DSM 44963]
Length = 496
Score = 44.7 bits (104), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 57/115 (49%), Gaps = 17/115 (14%)
Query: 27 LLEDGYTVTTVIDGHQLEIN----PHSVIDRPG-------SSDLIVLDSSRSAFYTLSFP 75
+LE+ + ++D QL + P S + PG + LI+ DS+ +
Sbjct: 148 MLEE-FEARGLLDHRQLRFSAATAPSSFLAFPGKLAVDAQADRLIISDSAHHRLVETNL- 205
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V+ + G GVQG +DG A+F+ P+ A+ + +YVAD NH+IR++
Sbjct: 206 ---QGRVRTIIGSGVQGQADGSFAEAQFNHPQGVAL-VNDLLYVADTDNHLIRRV 256
Score = 41.2 bits (95), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 6/67 (8%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQLI 140
V+ LAG G G +G A+F++P A+ GN +Y+AD +NH IR LD + I
Sbjct: 435 VRTLAGTGEAGIHNGPAEQAQFNEPGDLAI--TGNTLYIADTNNHAIRL---LDMATRNI 489
Query: 141 RQINLKP 147
++++P
Sbjct: 490 TTLDIQP 496
>gi|153867703|ref|ZP_01997987.1| NHL repeat protein [Beggiatoa sp. SS]
gi|152144978|gb|EDN72014.1| NHL repeat protein [Beggiatoa sp. SS]
Length = 263
Score = 44.7 bits (104), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 86 AGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AGDG G++ DG P +A + + P +D GN+Y+AD NH IRKI
Sbjct: 11 AGDGNLGFAGDGGPATAAQLNSPLGMGIDSAGNLYIADTENHRIRKI 57
Score = 44.3 bits (103), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AGDG +GYS DG SA+ + P ++D GN+Y+AD N+V+RK+
Sbjct: 63 ISTVAGDGNRGYSGDGAAAVSAKLNNPMWVSLDSAGNLYIADTGNNVVRKL 113
Score = 43.9 bits (102), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 82 VKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AGDG G++ DG+ +A+ + P VD G +Y+AD SNH IRK++
Sbjct: 176 ITTIAGDGTPGFAGDGKIATAAQLNAPTQVMVDSTGQVYIADTSNHRIRKVS 227
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 6/53 (11%)
Query: 82 VKRLAGD---GVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AGD GV G DG P +A + P +D GN+Y+AD SNH IRK+
Sbjct: 120 ISTVAGDSSVGVAG--DGGPATAAQLSYPTGIDIDTAGNLYIADSSNHRIRKV 170
>gi|283779991|ref|YP_003370746.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
gi|283438444|gb|ADB16886.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
Length = 364
Score = 44.7 bits (104), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 49/96 (51%), Gaps = 9/96 (9%)
Query: 38 IDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
IDG + P +V ID+ + I L S + L+ SE + +AG G +G+S D
Sbjct: 214 IDGTMSLLGPRAVCIDK--NVMWIALREGHS-VWKLNLATSE---LTHIAGTGKKGFSGD 267
Query: 96 GEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G P A FD PK AV G + V D NHVIRKI
Sbjct: 268 GGPAKDATFDGPKGVAVCPDGGVVVVDTENHVIRKI 303
Score = 43.1 bits (100), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 77 SEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ ++ +AG G +G++ DG P A F P S A+D NIYV D NH IRKI
Sbjct: 135 AKTGIISTIAGSGKEGFAGDGGPAKEAVFSIPHSIALDADDNIYVCDLGNHRIRKI 190
>gi|29349707|ref|NP_813210.1| hypothetical protein BT_4299 [Bacteroides thetaiotaomicron
VPI-5482]
gi|298383965|ref|ZP_06993526.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|383120644|ref|ZP_09941370.1| hypothetical protein BSIG_2353 [Bacteroides sp. 1_1_6]
gi|29341617|gb|AAO79404.1| putative cell surface protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251840310|gb|EES68392.1| hypothetical protein BSIG_2353 [Bacteroides sp. 1_1_6]
gi|298263569|gb|EFI06432.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 495
Score = 44.7 bits (104), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 3/76 (3%)
Query: 59 LIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
+ + DS S + + P +V + G +GY DG P A+F P AV+ G
Sbjct: 418 MYIADSGNSCIRVIDTTMPKERAAVTTPIGLPGAEGYKDGGPEIAKFHFPCGVAVNSDGT 477
Query: 117 I-YVADKSNHVIRKIT 131
I YVAD N VIRK++
Sbjct: 478 IVYVADTQNKVIRKLS 493
>gi|297568314|ref|YP_003689658.1| hypothetical protein [Desulfurivibrio alkaliphilus AHT2]
gi|296924229|gb|ADH85039.1| YD repeat protein [Desulfurivibrio alkaliphilus AHT2]
Length = 2439
Score = 44.7 bits (104), Expect = 0.020, Method: Composition-based stats.
Identities = 19/50 (38%), Positives = 30/50 (60%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+++ +AG G G+S G +AR +P ++D G IY+AD NH IR +
Sbjct: 1068 IIETIAGTGSWGHSGGLAVNARLYRPSCVSIDPDGTIYIADTGNHRIRSV 1117
Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats.
Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 2/54 (3%)
Query: 79 ESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ ++ +AG G +G+S DG P AR P + A+ G+IY+AD NH IR++
Sbjct: 1244 DGIITTIAGTGYRGFSGDGGPADEARLFGPIAVAIGPDGSIYIADTYNHRIRRV 1297
Score = 41.2 bits (95), Expect = 0.22, Method: Composition-based stats.
Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 84 RLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
R G G GYS DG +AR + P A+ G+IY+AD NH IR++
Sbjct: 1132 RAGGLGDSGYSGDDGPAVNARLNSPTDIALGPDGSIYIADSINHRIRRV 1180
Score = 40.8 bits (94), Expect = 0.28, Method: Composition-based stats.
Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 79 ESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ ++ +AG G +G+S DG P P+ ++ G+IYVAD +NH IR I
Sbjct: 1354 DGIITTIAGTGYRGFSGDGGPADEARLGPRGVSLGPDGSIYVADSNNHRIRHI 1406
Score = 37.0 bits (84), Expect = 3.8, Method: Composition-based stats.
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AR D+P + A+ G+IY+AD NH IR++
Sbjct: 1212 ARLDRPHAVAIGPDGSIYIADTYNHRIRRV 1241
>gi|380694052|ref|ZP_09858911.1| hypothetical protein BfaeM_08738 [Bacteroides faecis MAJ27]
Length = 495
Score = 44.7 bits (104), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 3/76 (3%)
Query: 59 LIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
+ + DS S + + P +V + G +GY DG P A+F P AV+ G
Sbjct: 418 MYIADSGNSCIRVVDTTMPKERAAVTTPIGLPGAEGYKDGGPEIAKFHFPCGVAVNSDGT 477
Query: 117 I-YVADKSNHVIRKIT 131
I YVAD N VIRK++
Sbjct: 478 IVYVADTQNKVIRKLS 493
>gi|290983166|ref|XP_002674300.1| predicted protein [Naegleria gruberi]
gi|284087889|gb|EFC41556.1| predicted protein [Naegleria gruberi]
Length = 1435
Score = 44.7 bits (104), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 53/115 (46%), Gaps = 16/115 (13%)
Query: 31 GYTVTTVIDGHQLEINPHSVIDRP----------GSSDLIVLDSSRSAFYTLSFPLSEES 80
G ++T + +G+ L+ + ++RP +L D S S L +
Sbjct: 140 GTSITGLYNGNNLK-GTLTALNRPIGLFYTEIFSSQGELYFADESNSLIRKL---VIANG 195
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
V +AG+ V GYS GE SA P S V G IY+AD N VIRK++NL
Sbjct: 196 TVSNVAGNVVAGYSGDGGEATSASLRYPSSVYVSNSGEIYIADTFNCVIRKVSNL 250
>gi|298383938|ref|ZP_06993499.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|298263542|gb|EFI06405.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 483
Score = 44.7 bits (104), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 40/89 (44%), Gaps = 4/89 (4%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARF 103
NPH + L + D + + PL V + G++GY DG P A F
Sbjct: 394 NPHQICFTE-DGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMKGYKDGGPDIALF 452
Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
+ P AV G I Y+AD N VIRK++
Sbjct: 453 NHPFGVAVSADGQIVYIADTGNKVIRKLS 481
>gi|395226993|ref|ZP_10405365.1| hypothetical protein ThvES_00021150, partial [Thiovulum sp. ES]
gi|394444658|gb|EJF05823.1| hypothetical protein ThvES_00021150, partial [Thiovulum sp. ES]
Length = 240
Score = 44.7 bits (104), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 32/56 (57%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S V+ G G+ GY DG +++ +KP+ D KGN+ V D N +IR+I+
Sbjct: 162 ISPSGEVQNFVGSGIAGYQDGVGFNSQLNKPRGMTFDEKGNLLVVDSYNEMIREIS 217
>gi|303248818|ref|ZP_07335068.1| YD repeat protein [Desulfovibrio fructosovorans JJ]
gi|302489759|gb|EFL49690.1| YD repeat protein [Desulfovibrio fructosovorans JJ]
Length = 2115
Score = 44.3 bits (103), Expect = 0.025, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 37/62 (59%), Gaps = 2/62 (3%)
Query: 71 TLSFPLSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIR 128
T F L + ++ +AG+G Y+ DG P +A P AVD +GN+Y+AD+ N+ IR
Sbjct: 846 TKVFRLDTDGTIRVVAGNGSTHYNGDGIPATAASLYHPNGLAVDAQGNLYIADQYNNRIR 905
Query: 129 KI 130
K+
Sbjct: 906 KV 907
>gi|302551068|ref|ZP_07303410.1| NHL repeat-containing protein [Streptomyces viridochromogenes DSM
40736]
gi|302468686|gb|EFL31779.1| NHL repeat-containing protein [Streptomyces viridochromogenes DSM
40736]
Length = 498
Score = 44.3 bits (103), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 35/51 (68%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG G+ G++ D +P +A+ + P + AVD GN+++AD SNH +R++
Sbjct: 244 ITTVAGTGIAGFNGDNQPAVNAQLNAPHTVAVDSGGNVFIADTSNHRVRRV 294
Score = 40.0 bits (92), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG+G G++ D +P +R P+ AVD GN+++AD +NH +R++
Sbjct: 73 ITTVAGNGTAGFNGDNQPAVYSRLHYPRGVAVDAGGNLFIADCNNHRVRRV 123
Score = 39.3 bits (90), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 34/57 (59%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ E + +AG+G G+S G A+ + P+ AVD GN++ +D SNH +R++
Sbjct: 10 VTREPAIYIVAGNGTAGFSSDYGPAVCAQLNYPRKVAVDSSGNLFFSDYSNHRVRRV 66
Score = 36.6 bits (83), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 48/90 (53%), Gaps = 8/90 (8%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARF 103
PH+V +D G ++ + D+S + + + +AG G G++ DG P +
Sbjct: 270 PHTVAVDSGG--NVFIADTSNHRVRRVDAATRQ---ITTVAGIGTAGFNGDGGPAIGVQL 324
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ P AVD G +++AD SN+ +RK++++
Sbjct: 325 NSPVGVAVDSGGGLFIADASNYRVRKVSDV 354
Score = 35.8 bits (81), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG G+ G++ D +P +A+ + P AVD GN+++ D +N +R++
Sbjct: 130 ITTVAGTGIAGFNGDNQPAVNAQLNSPIGVAVDAGGNLFITDFNNQRVRRV 180
>gi|221633398|ref|YP_002522623.1| hypothetical protein trd_1418 [Thermomicrobium roseum DSM 5159]
gi|221156519|gb|ACM05646.1| conserved hypothetical protein [Thermomicrobium roseum DSM 5159]
Length = 443
Score = 44.3 bits (103), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 48/108 (44%), Gaps = 17/108 (15%)
Query: 52 DRPGSSDLIVLDSSRSAFYTLSFPLSEESV---VKRL----------AGDGVQGYSDGEP 98
DR G+ D ++L +T L ++ +KRL G G G+ DG P
Sbjct: 333 DRDGTGDTVLLQHPLDVAWTGEELLVADTYNHKIKRLDPVARRCSSWLGTGQPGHEDGPP 392
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLK 146
ARF +P A +YVAD +NH +R I D L+R + L+
Sbjct: 393 ERARFWEPSGLATTFD-RVYVADTNNHAVRVI---DRTTGLVRTLELQ 436
Score = 43.1 bits (100), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 43/177 (24%), Positives = 72/177 (40%), Gaps = 27/177 (15%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEE 79
L +G+ + + + +V++ Q + P + P L+V D+ +
Sbjct: 94 LARTGVTTSEPFALLSVLERPQGPLAFPGKLAVDPSRDRLVVSDTGHHRLVIARL----D 149
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQL 139
V + GDG G DG ARF +P+ A+ + +VAD+ NH IR+I +L G
Sbjct: 150 GTVTAVIGDGRPGLVDGTFAEARFREPQGIAL-VGETCFVADRGNHAIRRI-DLAAGTV- 206
Query: 140 IRQINLKPEDCSKSSQSGSGMF------WVTVFSPYLISCKSFWVHLYLYFCSFKGS 190
E + + + G GM V + SP+ + W L F + GS
Sbjct: 207 --------ETLAGTGRLGQGMLSAGPAHQVDLRSPW-----ALWHRRGLLFVAMAGS 250
>gi|125561542|gb|EAZ06990.1| hypothetical protein OsI_29235 [Oryza sativa Indica Group]
Length = 498
Score = 44.3 bits (103), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 25/34 (73%)
Query: 134 DHGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFS 167
D GN+LIRQINLK EDC++ +Q G G V++ +
Sbjct: 264 DRGNRLIRQINLKREDCARETQPGLGTTSVSIIA 297
>gi|116620348|ref|YP_822504.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223510|gb|ABJ82219.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 915
Score = 44.3 bits (103), Expect = 0.027, Method: Composition-based stats.
Identities = 22/79 (27%), Positives = 40/79 (50%), Gaps = 6/79 (7%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVD 112
G +L + D+ + ++ + V+ + G+G G S +G SAR P+ VD
Sbjct: 549 GKGNLYIADTGNHSIRKVT----ADGVISTVVGNGTVGASGDEGAAASARLASPRGLTVD 604
Query: 113 MKGNIYVADKSNHVIRKIT 131
GN+Y+ D N+ +R++T
Sbjct: 605 DNGNLYIGDTGNNRVRQVT 623
Score = 41.2 bits (95), Expect = 0.22, Method: Composition-based stats.
Identities = 23/52 (44%), Positives = 32/52 (61%), Gaps = 4/52 (7%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V +AG G+ G+S G+ G +AR + P A+D GNIYVAD N +R+I
Sbjct: 82 VTTIAGTGIAGFS-GDGGLAVNARLNLPYGLALDDFGNIYVADLGNERVRRI 132
Score = 40.0 bits (92), Expect = 0.53, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 3/51 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG G G DG P +A +F A D GN+YVAD +NH +RK++
Sbjct: 29 IDTVAGSGRIG--DGGPATAAQFSDMSGIAADRLGNLYVADTNNHRVRKVS 77
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 8/83 (9%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKS 108
+DR G+ L ++D+S L+ + V+ +AG+G GY+ + A+ P +
Sbjct: 491 LDRGGT--LYIVDTSNHRV----LRLAADGTVQTVAGNGSGGYAGDGGSARFAQLRVPGA 544
Query: 109 FAVDMKGNIYVADKSNHVIRKIT 131
A D KGN+Y+AD NH IRK+T
Sbjct: 545 CAFDGKGNLYIADTGNHSIRKVT 567
Score = 38.1 bits (87), Expect = 1.7, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 28/56 (50%), Gaps = 3/56 (5%)
Query: 81 VVKRLAGDGVQGYSDGEP---GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
V+ LAG GV G G+ G A + P AVD GN+ VAD NH + I L
Sbjct: 402 VMTTLAGTGVVGRDTGDAVPAGIAPLNSPMGLAVDGAGNVMVADAFNHRVVAINAL 457
Score = 36.2 bits (82), Expect = 5.9, Method: Composition-based stats.
Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LS + + +AG G+ G DG A+ + P A D G +Y+AD N+V+RKI
Sbjct: 188 LSADGRLVTVAGTGIAGLGGDGFASVKAQVNYPAGLAFDRAGALYIADSGNNVVRKI 244
>gi|290995070|ref|XP_002680154.1| predicted protein [Naegleria gruberi]
gi|284093774|gb|EFC47410.1| predicted protein [Naegleria gruberi]
Length = 636
Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats.
Identities = 34/102 (33%), Positives = 51/102 (50%), Gaps = 8/102 (7%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ ++ +AG G QGYS DG SA+ P A+ +G IYVAD +N+ IRKIT
Sbjct: 457 ITTSGIISTIAGTGTQGYSGDGSSATSAQLYNPYGVAISSRGEIYVADYNNNRIRKITT- 515
Query: 134 DHGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFSPYLISCKS 175
+ +I I S S GS +++PY ++ S
Sbjct: 516 ---SGIISTI--AGTGTSGYSGDGSSAISAQLYNPYGVAISS 552
Score = 42.7 bits (99), Expect = 0.076, Method: Composition-based stats.
Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ ++ +AG G GYS DG SA+ + P A+ G IY+AD +N+ IRKIT
Sbjct: 401 ITTSGIISTIAGTGTSGYSGDGSSATSAQLNSPYGIAISSSGEIYIADTNNNRIRKIT 458
Score = 42.4 bits (98), Expect = 0.086, Method: Composition-based stats.
Identities = 20/56 (35%), Positives = 30/56 (53%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ ++ +AG+G GYS SA+ P A+ Y+AD +NH IRKIT
Sbjct: 129 ITTSGIISTIAGNGTAGYSGDVATSAKLYYPSGIAISSSDETYIADTNNHRIRKIT 184
Score = 39.7 bits (91), Expect = 0.55, Method: Composition-based stats.
Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ ++ +AG+G GYS DG SA+ + P A+ IY+AD N+ IRKIT
Sbjct: 239 ITTSGIISTIAGNGTAGYSGDGSSATSAQLNSPSGIAISSSDEIYIADMFNNRIRKIT 296
Score = 39.3 bits (90), Expect = 0.71, Method: Composition-based stats.
Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ ++ +AG+G GYS DG SA+ P A+ IY+ D+SN+ IRKIT
Sbjct: 73 ITTSGIISTIAGNGTAGYSGDGSSAKSAQLYYPSGVAISSSDEIYIVDRSNNRIRKIT 130
Score = 39.3 bits (90), Expect = 0.71, Method: Composition-based stats.
Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ ++ +AG+G GYS DG SA+ P A+ IY+ D+SN+ IRKIT
Sbjct: 183 ITTSGIISTIAGNGTAGYSGDGSSAKSAQLYYPSGVAISSSDEIYIVDRSNNRIRKIT 240
Score = 37.0 bits (84), Expect = 4.3, Method: Composition-based stats.
Identities = 25/85 (29%), Positives = 40/85 (47%), Gaps = 8/85 (9%)
Query: 64 SSRSAFYTLSF------PLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKG 115
SSR Y + ++ ++ +AG G GYS DG SA+ P A+
Sbjct: 495 SSRGEIYVADYNNNRIRKITTSGIISTIAGTGTSGYSGDGSSAISAQLYNPYGVAISSSD 554
Query: 116 NIYVADKSNHVIRKITNLDHGNQLI 140
IY+ D +N+ IRK++ GN ++
Sbjct: 555 EIYITDTNNNRIRKLSPWCTGNAIL 579
>gi|374849371|dbj|BAL52388.1| PKD domain protein [uncultured candidate division OP1 bacterium]
gi|374857216|dbj|BAL60069.1| hypothetical protein HGMM_OP4C705 [uncultured candidate division
OP1 bacterium]
Length = 635
Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats.
Identities = 31/85 (36%), Positives = 43/85 (50%), Gaps = 16/85 (18%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV----QGYSDG-EPGSARFD 104
++DR G+ + RS F E + R DG+ Q DG EPG+ F
Sbjct: 271 LVDRQGNRSEQSVLRQRSTF---------EFIAGRKVADGLFFLDQFGQDGSEPGA--FR 319
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRK 129
+P+ AVD KG+IYVAD NH I++
Sbjct: 320 RPQGIAVDSKGSIYVADTENHRIQR 344
Score = 41.2 bits (95), Expect = 0.21, Method: Composition-based stats.
Identities = 16/32 (50%), Positives = 20/32 (62%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
PG +F+ P AVD GN+YV D NH I+K
Sbjct: 384 PGDGQFNGPTDIAVDAAGNVYVVDSGNHRIQK 415
Score = 38.5 bits (88), Expect = 1.5, Method: Composition-based stats.
Identities = 15/33 (45%), Positives = 23/33 (69%)
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+ G +F +P++ AVD GN+YV+D NH I+K
Sbjct: 502 QTGDGQFFEPQAIAVDGAGNVYVSDTGNHRIQK 534
>gi|291612976|ref|YP_003523133.1| NHL repeat-containing protein [Sideroxydans lithotrophicus ES-1]
gi|291583088|gb|ADE10746.1| NHL repeat-containing protein [Sideroxydans lithotrophicus ES-1]
Length = 385
Score = 44.3 bits (103), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LA GV G SDG +ARF++P D N+YVAD +NH IRKI
Sbjct: 285 LASSGV-GSSDGIGANARFNQPMGITTDGS-NLYVADTNNHTIRKI 328
>gi|296128041|ref|YP_003635291.1| alkyl hydroperoxide reductase [Cellulomonas flavigena DSM 20109]
gi|296019856|gb|ADG73092.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Cellulomonas flavigena DSM 20109]
Length = 639
Score = 43.9 bits (102), Expect = 0.028, Method: Composition-based stats.
Identities = 33/131 (25%), Positives = 61/131 (46%), Gaps = 21/131 (16%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P ++ PG + L+ + + +TL+ ++ + R G G +G DG P ARF +P
Sbjct: 181 PAKAVELPGGTFLV----ADAGHHTLTEVAADGETLVRRIGSGERGLVDGGPDDARFSEP 236
Query: 107 KSFAV---DMKG----NIYVADKSNHVIRKITNLD--------HGNQLIRQI--NLKPED 149
+ +++G ++ VAD NH +R + D G Q + N++P D
Sbjct: 237 NGLGLVPDELRGRVGYDVLVADTVNHALRGVRLSDGHVTTLAGTGEQYMVGAVDNVRPAD 296
Query: 150 CSKSSQSGSGM 160
++ +GSG+
Sbjct: 297 GDTANPAGSGV 307
Score = 40.8 bits (94), Expect = 0.30, Method: Composition-based stats.
Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 15/80 (18%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
V LAG +G DG P A F +P +VD G +++AD +R +
Sbjct: 353 VTHLAGTMNEGLVDGTPQEAWFAQPSGLSVDAAGRLWLADAETSALRWV----------- 401
Query: 142 QINLKPEDCSKSSQSGSGMF 161
P D S S G+G+F
Sbjct: 402 ----DPADASVHSVVGTGLF 417
>gi|423301816|ref|ZP_17279839.1| hypothetical protein HMPREF1057_02980 [Bacteroides finegoldii
CL09T03C10]
gi|408470907|gb|EKJ89439.1| hypothetical protein HMPREF1057_02980 [Bacteroides finegoldii
CL09T03C10]
Length = 469
Score = 43.9 bits (102), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 24/41 (58%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+GY DG A FD P AVD G IYVAD N+ IR+I
Sbjct: 426 TEGYQDGLISEALFDDPMGIAVDKNGVIYVADSDNYAIRRI 466
Score = 36.6 bits (83), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
DG+ G + + P +D + NIYVAD+ N IRKI+ LD G
Sbjct: 375 DGDLGGSTLNHPCQICMDSQNNIYVADRYNCCIRKIS-LDAG 415
>gi|399034109|ref|ZP_10732477.1| gluconolactonase [Flavobacterium sp. CF136]
gi|398067543|gb|EJL59038.1| gluconolactonase [Flavobacterium sp. CF136]
Length = 418
Score = 43.9 bits (102), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 28/108 (25%), Positives = 48/108 (44%), Gaps = 15/108 (13%)
Query: 38 IDGHQLEINPHSVIDRP----------GSSDLIVLDSSRSAFYTLS-----FPLSEESVV 82
++G+ + P V+ RP G++ + LDS + + + L+ ++
Sbjct: 212 VNGYIYKFTPSGVLTRPRNNEGALIRVGAASGMCLDSKNNLYISEDATSKIMKLTPTGIL 271
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AG G+ +G A+F KP D N+YV D+ N+ IRKI
Sbjct: 272 TIFAGANGTGFQNGNIKEAKFSKPSGIVFDKMDNLYVVDRYNNRIRKI 319
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 31/56 (55%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + V +AG G++G DG A F+ P+ +D N+Y+ + N+ IR IT
Sbjct: 319 IAVDGTVSTVAGSGIRGNKDGITSEATFNFPQEITIDKYNNLYIVEPDNNRIRMIT 374
>gi|116672456|ref|YP_833389.1| NHL repeat-containing protein [Arthrobacter sp. FB24]
gi|116612565|gb|ABK05289.1| NHL repeat containing protein [Arthrobacter sp. FB24]
Length = 672
Score = 43.9 bits (102), Expect = 0.030, Method: Composition-based stats.
Identities = 35/110 (31%), Positives = 52/110 (47%), Gaps = 4/110 (3%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTL-SF-PLSEESVVKRLA 86
ED T + H L++ S D S+ L + + + + + SF P+S V LA
Sbjct: 323 EDAAGFTGRLSDHPLDVALSSPWDVVWSAKLNAVVVAMAGVHQIFSFDPIS--GAVSILA 380
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
G+G++G DG A F + A D GNI+VAD +RK+ D G
Sbjct: 381 GNGLEGLLDGAAHEAWFAQSSGLAEDADGNIWVADSETSALRKLVIDDAG 430
>gi|308178347|ref|YP_003917753.1| NHL repeat-containing protein [Arthrobacter arilaitensis Re117]
gi|307745810|emb|CBT76782.1| NHL repeat-containing protein [Arthrobacter arilaitensis Re117]
Length = 635
Score = 43.9 bits (102), Expect = 0.031, Method: Composition-based stats.
Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 11/89 (12%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK-------PK 107
+ + +V DS L+ S+ S V R G GV+GY+DG+ +A+F++ P
Sbjct: 189 ANGNFLVGDSGHHRLVELA---SDLSTVVRTIGAGVKGYADGDAQTAQFNELQGLTALPA 245
Query: 108 SFAVDMKGNIYVADKSNHVIRKITNLDHG 136
A ++ ++ VAD NH +R + NL+ G
Sbjct: 246 ELAAEVGYDVIVADTVNHRLRSV-NLETG 273
Score = 37.7 bits (86), Expect = 2.6, Method: Composition-based stats.
Identities = 24/85 (28%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
+P V+ P +I + + F +F S V AG G++G +DG P A F +
Sbjct: 312 SPWDVLYHPSGQVIIAMAGTHQIF---AFN-PRTSAVSVFAGTGLEGLADGAPDEAWFAQ 367
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
A+D + N+++AD +R I
Sbjct: 368 SSGLALDGQ-NVWIADSETSALRWI 391
>gi|384248465|gb|EIE21949.1| hypothetical protein COCSUDRAFT_16964 [Coccomyxa subellipsoidea
C-169]
Length = 1021
Score = 43.9 bits (102), Expect = 0.032, Method: Composition-based stats.
Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+ D+IV DS L P ++ ++ +AG G G +DG G+AR +P +G
Sbjct: 773 NGDVIVADSYNHRLKLLD-PATDS--IRTVAGSGAAGLTDGTGGNARVSEPGGLCAGPRG 829
Query: 116 NIYVADKSNHVIR 128
+++AD +N IR
Sbjct: 830 TVFIADTNNSAIR 842
>gi|290975165|ref|XP_002670314.1| predicted protein [Naegleria gruberi]
gi|284083871|gb|EFC37570.1| predicted protein [Naegleria gruberi]
Length = 1504
Score = 43.9 bits (102), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 1/58 (1%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++++ V +AG G+ GYS G+P + P V G+IY++D NH IR+I N
Sbjct: 471 ITQDGFVVNVAGTGISGYSLIGDPLLSYLSNPNGIVVLKNGDIYISDTDNHCIRQIKN 528
>gi|421097867|ref|ZP_15558546.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
str. 200901122]
gi|410799150|gb|EKS01231.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
str. 200901122]
Length = 357
Score = 43.9 bits (102), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 52/114 (45%), Gaps = 9/114 (7%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD + ++ V+ AG G++ DG +A F+ P +D GNI+V+
Sbjct: 35 ILDCILKECHLCKLKVTNNPVISLFAGTGIKESIDGTTQTASFNTPFGLELDTFGNIFVS 94
Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFSPYLISCK 174
D++ ++IRKI R N+ S Q+ SG+ + + +SCK
Sbjct: 95 DQTANLIRKID---------RFGNVTTLSTSLVFQNPSGIKFDPITGDKYVSCK 139
Score = 39.3 bits (90), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 30/48 (62%), Gaps = 1/48 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
GV G+ +G+ ++ F P +D + N+YV + SNH IRKI NL+ G
Sbjct: 162 GVDGFQNGDRLNSLFKSPFFMDLDRERNLYVGELSNHAIRKI-NLNSG 208
Score = 39.3 bits (90), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
V L+G GV GY DG+ SA+F P D K + + VAD NH IRKI
Sbjct: 210 VSTLSG-GVLGYLDGDLASAQFKSPLGITYDQKTDSLLVADIQNHNIRKI 258
>gi|332668794|ref|YP_004451801.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Cellulomonas fimi ATCC 484]
gi|332337831|gb|AEE44414.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Cellulomonas fimi ATCC 484]
Length = 639
Score = 43.9 bits (102), Expect = 0.034, Method: Composition-based stats.
Identities = 28/91 (30%), Positives = 45/91 (49%), Gaps = 11/91 (12%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + L+V D+ L+ E++V+R+ G G +G+ DG P ARF +P
Sbjct: 181 PAKAVALPGGT-LLVADAGHHGLAELA--ADGETLVRRI-GSGERGFVDGGPDEARFSEP 236
Query: 107 KSFAVDMKG-------NIYVADKSNHVIRKI 130
A+ G ++ VAD NH +R +
Sbjct: 237 NGLALVPDGLRATLGYDVLVADTVNHALRGV 267
>gi|449684055|ref|XP_002166330.2| PREDICTED: NHL repeat-containing protein 2-like [Hydra
magnipapillata]
Length = 193
Score = 43.5 bits (101), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 60/126 (47%), Gaps = 20/126 (15%)
Query: 28 LEDGYTVTTVIDGHQLEINPHS-----VIDRPGSSDLIVLDSSRSAFYTLSFP------L 76
L+ GY T G EI ++ +I P + +V D + Y F +
Sbjct: 16 LKTGYVTTLSSQGFGAEIGLNNQPVNLIIHYP---EGLVYDPDYNLLYVCEFENHNIRVV 72
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT---- 131
S + +VK LAG+ +G DG +ARF P A D K +YV+D+ NH++R +T
Sbjct: 73 SMQGLVKTLAGE-FKGKKDGVGRTARFYYPSGLAFDRKNRMLYVSDQYNHLVRTVTAIGS 131
Query: 132 NLDHGN 137
N+++ N
Sbjct: 132 NVEYSN 137
>gi|427719807|ref|YP_007067801.1| NHL repeat containing protein [Calothrix sp. PCC 7507]
gi|427352243|gb|AFY34967.1| NHL repeat containing protein [Calothrix sp. PCC 7507]
Length = 505
Score = 43.5 bits (101), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
GDG G DGE ++RF +P +V + ++YVAD +NHVIR++
Sbjct: 437 GDGSAGLQDGEGKNSRFFEPSGLSV-INSHLYVADTNNHVIRRV 479
Score = 41.2 bits (95), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 44/89 (49%), Gaps = 8/89 (8%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+S L + DS SF + + + G G G+ DG+ A+F P+ A D +
Sbjct: 184 ASGLFIADSGHHRLVRSSF----DGEILHVIGTGKPGFIDGDFSEAQFFAPQGMAFDEEN 239
Query: 116 NI-YVADKSNHVIRKITNLDHGNQLIRQI 143
I Y+AD NH +R++ D +QL++ I
Sbjct: 240 QILYIADTENHALRRV---DLRHQLVKTI 265
>gi|427731644|ref|YP_007077881.1| thiol-disulfide isomerase-like thioredoxin [Nostoc sp. PCC 7524]
gi|427367563|gb|AFY50284.1| thiol-disulfide isomerase-like thioredoxin [Nostoc sp. PCC 7524]
Length = 505
Score = 43.5 bits (101), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + GDG G DG+ SARF +P + M N+Y+ D +NH IR++
Sbjct: 433 QTILGDGNAGLQDGQGKSARFFEPSGLSA-MDSNLYICDTNNHAIRRV 479
Score = 40.0 bits (92), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
V L G G G++DG A+F P+ D I YVAD NH +R+I
Sbjct: 206 VLHLIGTGKSGFTDGAFDEAQFSAPQGMTFDAANQILYVADTDNHAVRQIN 256
>gi|17228508|ref|NP_485056.1| hypothetical protein alr1013 [Nostoc sp. PCC 7120]
gi|17130359|dbj|BAB72970.1| alr1013 [Nostoc sp. PCC 7120]
Length = 503
Score = 43.5 bits (101), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 39/85 (45%), Gaps = 7/85 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ P S L V DS F + L G+G G +DG A+F P
Sbjct: 175 PGKVLATP--SGLFVADSGHHRIVVSDF----NGEILHLIGNGKSGLTDGNFQEAQFSAP 228
Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI 130
+ A DM+ I Y+AD NH +R++
Sbjct: 229 QGMAFDMENQILYLADTENHALRRV 253
Score = 36.6 bits (83), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 15/48 (31%), Positives = 30/48 (62%), Gaps = 1/48 (2%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + GDG+ G +G+ + RF +P + + +Y++D +NHVIR++
Sbjct: 431 QTVLGDGLAGLQNGQGKNTRFFEPSGLSA-IDSYLYISDTNNHVIRRV 477
>gi|326432343|gb|EGD77913.1| NHL repeat protein [Salpingoeca sp. ATCC 50818]
Length = 1505
Score = 43.5 bits (101), Expect = 0.041, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
V+ +AG GV G+S G P AR D P+ AV G++ +AD NH +R+
Sbjct: 1391 VITTVAGTGVAGFSGDGGLPTDARLDTPRGVAVHSSGSLAIADSGNHRVRE 1441
>gi|222056335|ref|YP_002538697.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
gi|221565624|gb|ACM21596.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
Length = 372
Score = 43.5 bits (101), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 2/56 (3%)
Query: 76 LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
L+ V LAG Q G +DG +ARF+ P D N+YVAD N++IRK+
Sbjct: 44 LTLTGTVSVLAGQAPQMGTADGTGSAARFNAPSGITTDGT-NLYVADTGNNLIRKV 98
Score = 39.3 bits (90), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 44/95 (46%), Gaps = 7/95 (7%)
Query: 40 GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ---GYSDG 96
G N S I G+ +L V D+ + + + + G G G +DG
Sbjct: 67 GSAARFNAPSGITTDGT-NLYVADTGNNLIRKVVITTGAVTTLAGTVGTGTAQTSGSTDG 125
Query: 97 EPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
+A+F+ P FA+ G N+YVAD +N+ IRK+
Sbjct: 126 TGSAAKFNAP--FAITTDGTNLYVADTNNNTIRKV 158
>gi|302821969|ref|XP_002992645.1| hypothetical protein SELMODRAFT_6806 [Selaginella moellendorffii]
gi|300139609|gb|EFJ06347.1| hypothetical protein SELMODRAFT_6806 [Selaginella moellendorffii]
Length = 79
Score = 43.5 bits (101), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 28/43 (65%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G GY DG +RF++P+S A+ G ++VAD +N IRKI+
Sbjct: 15 GESGYVDGPAAKSRFNRPQSLAICDNGAVFVADTTNLAIRKIS 57
>gi|145487484|ref|XP_001429747.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124396841|emb|CAK62349.1| unnamed protein product [Paramecium tetraurelia]
Length = 450
Score = 43.5 bits (101), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 78 EESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
++ + K +AG G G+ DG G F+ P +D+ GNI+V D +NH IR IT +
Sbjct: 190 QKKITKTIAGICGQSGFKDGLLGMNLFNTPDQMGIDVLGNIFVNDFNNHFIRMITLDGYV 249
Query: 137 NQLI 140
N LI
Sbjct: 250 NTLI 253
>gi|116625892|ref|YP_828048.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116229054|gb|ABJ87763.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 2384
Score = 43.5 bits (101), Expect = 0.044, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 78 EESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ ++ +AG GVQG++ G GSA+ P A D GN+Y+AD N IR+I
Sbjct: 138 QTGMISTVAGTGVQGFAGDGGLAGSAQLSHPAGIAFDSAGNLYIADMGNFRIRRI 192
>gi|290985160|ref|XP_002675294.1| predicted protein [Naegleria gruberi]
gi|284088889|gb|EFC42550.1| predicted protein [Naegleria gruberi]
Length = 1229
Score = 43.5 bits (101), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITN 132
+V +G + GY DG +ARF P +D G++YV+D N VIRKI+N
Sbjct: 608 IVSTFSGGTISGYQDGPIQTARFYFPNGLDIDTTNGDLYVSDTFNCVIRKISN 660
>gi|238062013|ref|ZP_04606722.1| LOW QUALITY PROTEIN: alkyl hydroperoxide reductase/thiol specific
antioxidant/Mal allergen [Micromonospora sp. ATCC 39149]
gi|237883824|gb|EEP72652.1| LOW QUALITY PROTEIN: alkyl hydroperoxide reductase/thiol specific
antioxidant/Mal allergen [Micromonospora sp. ATCC 39149]
Length = 582
Score = 43.5 bits (101), Expect = 0.045, Method: Composition-based stats.
Identities = 30/97 (30%), Positives = 54/97 (55%), Gaps = 16/97 (16%)
Query: 47 PHSVIDRPGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS 100
P + + PG + +L+V DS+R + L+ P E++V+R+ G GV+G +DG G+
Sbjct: 139 PETALRFPGKAIALDGGNLLVSDSARHSLVELA-P-DGETLVRRI-GSGVRGRADGPAGA 195
Query: 101 ARFDKPKSF------AVDMKG-NIYVADKSNHVIRKI 130
A F +P+ ++ G ++ VAD NH++R +
Sbjct: 196 AAFAEPQGLCRLPEHVAEVAGYDVVVADTVNHLLRGV 232
>gi|389875076|ref|YP_006374432.1| NHL repeat-containing protein [Tistrella mobilis KA081020-065]
gi|388532256|gb|AFK57450.1| NHL repeat-containing protein [Tistrella mobilis KA081020-065]
Length = 487
Score = 43.5 bits (101), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 34/72 (47%), Gaps = 5/72 (6%)
Query: 81 VVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQ 138
V L GDG+ G DG +ARF P DM GNI VAD N IR + D
Sbjct: 372 TVSTLVGDGLFEFGMVDGPARTARFQHPLGLCFDMNGNILVADAYNDAIRVV---DPETG 428
Query: 139 LIRQINLKPEDC 150
++ ++ P DC
Sbjct: 429 MVSSLDDPPYDC 440
Score = 40.8 bits (94), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
R G G G DG SARF +P+ + G I+VAD NH +R+I
Sbjct: 210 RRVGSGEPGLVDGPAESARFQRPQGL-IAADGAIFVADTWNHAVRRI 255
>gi|407780651|ref|ZP_11127872.1| NHL repeat-containing protein [Oceanibaculum indicum P24]
gi|407208878|gb|EKE78785.1| NHL repeat-containing protein [Oceanibaculum indicum P24]
Length = 498
Score = 43.5 bits (101), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G G G++ G ARF P+ A D G IYVAD NH IR+I
Sbjct: 217 GSGEAGFASGPADRARFRDPQGLAADT-GAIYVADTGNHAIRRI 259
>gi|290983036|ref|XP_002674235.1| predicted protein [Naegleria gruberi]
gi|284087824|gb|EFC41491.1| predicted protein [Naegleria gruberi]
Length = 737
Score = 43.5 bits (101), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 33/53 (62%), Gaps = 4/53 (7%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG G G+S G+ G +A+ P++ V +G I +AD SNHVIRKIT
Sbjct: 90 IHTIAGTGTSGHS-GDGGLAVNAKLFHPRTVTVTSEGEILIADTSNHVIRKIT 141
>gi|326435583|gb|EGD81153.1| hypothetical protein PTSG_13151 [Salpingoeca sp. ATCC 50818]
Length = 1551
Score = 43.5 bits (101), Expect = 0.048, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
V+ +AG GV G+S G P AR D P+ AV G++ +AD NH +R+
Sbjct: 1437 VITTVAGTGVAGFSGDGGLPTDARLDTPRGVAVHSSGSLAIADSGNHRVRE 1487
>gi|433458786|ref|ZP_20416675.1| NHL repeat containing protein [Arthrobacter crystallopoietes
BAB-32]
gi|432192788|gb|ELK49609.1| NHL repeat containing protein [Arthrobacter crystallopoietes
BAB-32]
Length = 680
Score = 43.1 bits (100), Expect = 0.048, Method: Composition-based stats.
Identities = 23/60 (38%), Positives = 33/60 (55%), Gaps = 2/60 (3%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
PL+ E V LAG G++G DGE +A F + A D G+I++AD +R +T D
Sbjct: 378 PLTGELAV--LAGTGLEGLQDGEANAAWFAQSSGLAQDANGSIWIADSETSALRVLTFSD 435
>gi|345482264|ref|XP_001607897.2| PREDICTED: NHL repeat-containing protein 2-like [Nasonia
vitripennis]
Length = 768
Score = 43.1 bits (100), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ ++ V+ + G G+ DG+ +ARF+ P+ V +K +++VAD NH IRKI
Sbjct: 329 MGKDGKVEHVVGGYSPGFKDGDFETARFNAPQGVCV-LKDDVFVADNENHAIRKIN 383
>gi|283781091|ref|YP_003371846.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
gi|283439544|gb|ADB17986.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
Length = 782
Score = 43.1 bits (100), Expect = 0.048, Method: Composition-based stats.
Identities = 21/44 (47%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G GV G +DG A FD P+ A+D + +YVAD NH++RKI
Sbjct: 332 GSGVIGKADGSFAEASFDHPQGCALDGE-TLYVADTENHLLRKI 374
Score = 40.0 bits (92), Expect = 0.42, Method: Composition-based stats.
Identities = 23/49 (46%), Positives = 32/49 (65%), Gaps = 4/49 (8%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
VK +AG G G SD +P A FD+P A+ + +YVAD +NH+IRK+
Sbjct: 578 VKTIAGSGSPGTSD-DP--ATFDEPAGLAL-VGETLYVADTNNHLIRKV 622
>gi|434403812|ref|YP_007146697.1| thiol-disulfide isomerase-like thioredoxin [Cylindrospermum
stagnale PCC 7417]
gi|428258067|gb|AFZ24017.1| thiol-disulfide isomerase-like thioredoxin [Cylindrospermum
stagnale PCC 7417]
Length = 506
Score = 43.1 bits (100), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 38/85 (44%), Gaps = 7/85 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ P L + DS SF V + G G G++DG A+F P
Sbjct: 177 PGKVLATPAG--LFIADSGHHRLVMSSF----NGEVFHIIGTGKSGFTDGNFSEAQFSAP 230
Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI 130
+ A D + I YVAD NH +RK+
Sbjct: 231 QGMAYDAENQILYVADTENHALRKV 255
>gi|163841552|ref|YP_001625957.1| hypothetical protein RSal33209_2819 [Renibacterium salmoninarum
ATCC 33209]
gi|162955028|gb|ABY24543.1| conserved hypothetical protein [Renibacterium salmoninarum ATCC
33209]
Length = 425
Score = 43.1 bits (100), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 17/112 (15%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKS-------FAVDMKGNIYVADKSNHVIRKIT 131
E+VV+ G G +G++DG ARF++P+ A + ++ VAD NH +R I+
Sbjct: 88 ETVVRAF-GTGTKGFADGAAQMARFNEPQGLVLLPEVLAAQLGYDVVVADSVNHRLRGIS 146
Query: 132 NLD------HGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFSPYLISCKSFW 177
D GN + R ++ P + + SG V P I+ S W
Sbjct: 147 LTDGSVRTLAGNGVQRLLDAGPNTVNDDGEGTSG---VLSGDPLTIALSSPW 195
>gi|307202136|gb|EFN81636.1| NHL repeat-containing protein 2 [Harpegnathos saltator]
Length = 688
Score = 43.1 bits (100), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 39/73 (53%), Gaps = 5/73 (6%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
LI+ DS S + + ++ + G QG+ DG+ SA+F+ P+ + + IY
Sbjct: 236 LIISDSGNSRILITN----QHGEIEHIIGGTNQGFEDGDFKSAKFNSPQGVCM-LDDVIY 290
Query: 119 VADKSNHVIRKIT 131
VAD +NH IRKI
Sbjct: 291 VADNNNHAIRKIN 303
>gi|428225769|ref|YP_007109866.1| NHL repeat containing protein [Geitlerinema sp. PCC 7407]
gi|427985670|gb|AFY66814.1| NHL repeat containing protein [Geitlerinema sp. PCC 7407]
Length = 498
Score = 43.1 bits (100), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 1/53 (1%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
E V+ G G GY DG +A+F P+ D G+ +YVAD NH+IR +
Sbjct: 197 EGKVRATVGSGTPGYQDGPWETAQFRAPQGLTYDPAGDRLYVADTGNHLIRCV 249
Score = 37.4 bits (85), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 38/75 (50%), Gaps = 4/75 (5%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
G +L + D+ L+ L+E V + G G+ G+ DG A F +P A + +
Sbjct: 402 GPGNLWIADTYNHKIKRLN--LAEGHCVN-IVGSGLPGHQDGWGPEASFSEPSGLASEGQ 458
Query: 115 GNIYVADKSNHVIRK 129
+Y+AD +NH IR+
Sbjct: 459 -TLYIADTNNHAIRR 472
>gi|340370782|ref|XP_003383925.1| PREDICTED: NHL repeat-containing protein 2-like [Amphimedon
queenslandica]
Length = 730
Score = 43.1 bits (100), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 5/80 (6%)
Query: 53 RPGSSDLIVL-DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV 111
R G S+L+ + DSS + + +VK++ G G G+ DG A F+ P+ +
Sbjct: 233 RGGVSELLCISDSSNHRVLVVD---AVTGLVKQVYGSGSPGFKDGRGKEAEFNCPQGLVI 289
Query: 112 DMKGNIYVADKSNHVIRKIT 131
+ +YVAD NH+IRKI+
Sbjct: 290 -CEECVYVADTENHLIRKIS 308
Score = 40.0 bits (92), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 4/67 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQL 139
V LAG G G DG A+F +P + +G+ +YVAD +NH IR LD +
Sbjct: 504 VCSTLAGTGSPGLVDGSFEVAQFSEPAGLCMSQEGDTLYVADTNNHAIRI---LDLKEKK 560
Query: 140 IRQINLK 146
+ Q+ L+
Sbjct: 561 VSQVKLR 567
>gi|290989149|ref|XP_002677205.1| predicted protein [Naegleria gruberi]
gi|284090811|gb|EFC44461.1| predicted protein [Naegleria gruberi]
Length = 691
Score = 43.1 bits (100), Expect = 0.054, Method: Composition-based stats.
Identities = 21/63 (33%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKITNLD 134
++ + + +AG +GY DG A+F+ P A+D + G++YV D N+ IRKI+
Sbjct: 333 INRDRTLVTIAGTTEEGYQDGPVRMAKFNNPSDIAIDNETGDVYVVDGGNNYIRKISTTK 392
Query: 135 HGN 137
+ N
Sbjct: 393 YLN 395
Score = 40.8 bits (94), Expect = 0.28, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 34/54 (62%), Gaps = 4/54 (7%)
Query: 81 VVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ +AG G +GY DG+ G AR P AV+ +G++Y +D N+VIRKI+
Sbjct: 226 LIHTVAGFGEKGY-DGDGGFAFYARIRCPTGIAVNQRGDVYFSDSGNNVIRKIS 278
>gi|365864816|ref|ZP_09404493.1| NHL repeat-containing protein [Streptomyces sp. W007]
gi|364005745|gb|EHM26808.1| NHL repeat-containing protein [Streptomyces sp. W007]
Length = 292
Score = 43.1 bits (100), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + + +AG G GYS G SAR + P VD N+Y++D +NH +RK+T
Sbjct: 20 VTADGTISTVAGTGTAGYSGDSGAATSARLNLPLGVVVDSADNLYISDYNNHRVRKVT 77
>gi|405960880|gb|EKC26754.1| NHL repeat-containing protein 2 [Crassostrea gigas]
Length = 699
Score = 43.1 bits (100), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 33/56 (58%), Gaps = 3/56 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
LS++ VV+ G +G+ +G RFD P+ V MK N IY+AD NH IRKI
Sbjct: 240 LSKDGVVQHCIGGKERGFKNGTFTQCRFDSPQ--GVVMKENVIYLADTENHAIRKI 293
>gi|285018581|ref|YP_003376292.1| nhl repeat protein [Xanthomonas albilineans GPE PC73]
gi|283473799|emb|CBA16302.1| hypothetical nhl repeat protein [Xanthomonas albilineans GPE PC73]
Length = 473
Score = 43.1 bits (100), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
R G G + DGE G A F +P+ A+ +G +YVAD NH +R+I
Sbjct: 201 RQFGTGTADFIDGELGQAAFHRPQGLAL-QRGVLYVADTGNHALRRI 246
>gi|284097436|ref|ZP_06385541.1| NHL repeat containing protein [Candidatus Poribacteria sp. WGA-A3]
gi|283831050|gb|EFC35055.1| NHL repeat containing protein [Candidatus Poribacteria sp. WGA-A3]
Length = 387
Score = 43.1 bits (100), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 28/44 (63%), Gaps = 9/44 (20%)
Query: 96 GEPG---------SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
GEPG +A ++PK+ A+D GN+Y+AD NH++RK+
Sbjct: 1 GEPGWGGDAGPATAAMLNEPKTVALDAAGNLYIADSENHLVRKV 44
Score = 37.0 bits (84), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 35/77 (45%), Gaps = 5/77 (6%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVD 112
G L + D S + L V+ + G G GY+ DG PG P AVD
Sbjct: 194 GPDRLYIADQSNNRVRKLDL---TSGVMTTVVGTGESGYNGDGAPGPETALAGPSGLAVD 250
Query: 113 MKGNIYVADKSNHVIRK 129
+GN+Y+AD + IRK
Sbjct: 251 QEGNLYIADTFSGRIRK 267
>gi|74317309|ref|YP_315049.1| hypothetical protein Tbd_1291 [Thiobacillus denitrificans ATCC
25259]
gi|74056804|gb|AAZ97244.1| conserved hypothetical protein containing twin-arginine
translocation pathway signal [Thiobacillus denitrificans
ATCC 25259]
Length = 510
Score = 43.1 bits (100), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 26/47 (55%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ G + DG ARF+ D GN+YVAD+ NHVIRKIT
Sbjct: 332 MVGQKQGAFVDGPANEARFNGCSQIDYDNAGNLYVADRFNHVIRKIT 378
>gi|269926393|ref|YP_003323016.1| alkyl hydroperoxide reductase [Thermobaculum terrenum ATCC BAA-798]
gi|269790053|gb|ACZ42194.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Thermobaculum terrenum ATCC BAA-798]
Length = 493
Score = 43.1 bits (100), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 31/48 (64%), Gaps = 3/48 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR 128
++R+ G+G +G DG A+F+ P+ A+ +GN +YVAD NH +R
Sbjct: 210 IRRIWGNGEEGLVDGSASEAKFNHPQGMAI--RGNELYVADTENHALR 255
>gi|347757619|ref|YP_004865181.1| NHL repeat family protein [Micavibrio aeruginosavorus ARL-13]
gi|347590137|gb|AEP09179.1| NHL repeat family protein [Micavibrio aeruginosavorus ARL-13]
Length = 495
Score = 43.1 bits (100), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 46/105 (43%), Gaps = 24/105 (22%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFP----------------------LSEESVVKRLAGDGVQ 91
P +D+ L +R+ TLSFP L VK G G
Sbjct: 170 PPFTDISTLVPARTKTSTLSFPARIEDAGDFIFVADSGHNRILVLDRTGAVKMTIGSGEL 229
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
G+ DG A+F+ P+ + + N+Y+AD NH++R + NLD G
Sbjct: 230 GHQDGSFVMAKFNHPRGMVL-IGRNLYIADTDNHMLR-VANLDSG 272
>gi|290987128|ref|XP_002676275.1| predicted protein [Naegleria gruberi]
gi|284089876|gb|EFC43531.1| predicted protein [Naegleria gruberi]
Length = 1182
Score = 43.1 bits (100), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 50/102 (49%), Gaps = 11/102 (10%)
Query: 31 GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
G +TT+ + I+P S + P + +DSS T+ + +S +V AG G
Sbjct: 422 GGIITTITPSY---ISPRSPL-VPNVGEYYFVDSS-----TVKYLISSSGIVITKAGSGT 472
Query: 91 QGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
GY +D P SA + PK A G++Y+ D NH IRK+
Sbjct: 473 TGYNADNIPATSAHLNLPKGIARSTNGDLYITDSGNHRIRKV 514
>gi|320107313|ref|YP_004182903.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319925834|gb|ADV82909.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 1821
Score = 42.7 bits (99), Expect = 0.066, Method: Composition-based stats.
Identities = 20/38 (52%), Positives = 27/38 (71%), Gaps = 1/38 (2%)
Query: 95 DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
DG P SA RF+ P A+D G++Y+AD++NH IR IT
Sbjct: 341 DGGPASAGRFNNPLGLAIDSGGSLYIADQANHRIRMIT 378
Score = 40.8 bits (94), Expect = 0.26, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 36/51 (70%), Gaps = 4/51 (7%)
Query: 85 LAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+AG GV Y +G+ G +A+ +P + AVD+ N+Y+AD +NH++RK+++
Sbjct: 386 VAGTGVPRY-NGDGGLAVAAQLQQPAAVAVDVARNLYIADSNNHLVRKVSS 435
Score = 36.6 bits (83), Expect = 4.7, Method: Composition-based stats.
Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 8/80 (10%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARF---DKPKSFA 110
G+ DL + D+ +A L+ ++ +AG GV G S G+ GSA + P A
Sbjct: 244 GAGDLYIADTRNNAIRKLTLA---TGILSTIAGRLGVPG-SSGDGGSATLATLNAPGGLA 299
Query: 111 VDMKGNIYVADKSNHVIRKI 130
+D G +Y+AD N IR+I
Sbjct: 300 MDAAGFLYIADTGNDTIRRI 319
>gi|197118445|ref|YP_002138872.1| lipoprotein [Geobacter bemidjiensis Bem]
gi|197087805|gb|ACH39076.1| repeat-containing lipoprotein, putative [Geobacter bemidjiensis
Bem]
Length = 652
Score = 42.7 bits (99), Expect = 0.067, Method: Composition-based stats.
Identities = 30/85 (35%), Positives = 42/85 (49%), Gaps = 5/85 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDK 105
P + +S L V D+ + +++ V LAG GV G ++G G+ARF
Sbjct: 130 PSGIALSSDNSTLYVSDTGNNTIRSINVA---TGAVTTLAGTAGVVGSTNGTGGAARFSS 186
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
P A D N+YVAD NH +RKI
Sbjct: 187 PSGVATD-GANLYVADSLNHRVRKI 210
Score = 39.7 bits (91), Expect = 0.57, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 32/51 (62%), Gaps = 1/51 (1%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V LAG G+Q ++DG +A F+ P+ A D ++Y+AD+ N +R+I
Sbjct: 215 AAVTTLAGSGLQDFADGTGIAASFNSPRGIATD-GVSLYLADQGNSAVRRI 264
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 23/50 (46%), Positives = 27/50 (54%), Gaps = 2/50 (4%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V LAG G G +DG +ARF P D N+YVAD N+ IRKI
Sbjct: 477 VSTLAGTAGATGSADGAGSAARFHSPNGLTTD-GTNLYVADSGNNAIRKI 525
Score = 36.2 bits (82), Expect = 6.2, Method: Composition-based stats.
Identities = 28/93 (30%), Positives = 42/93 (45%), Gaps = 11/93 (11%)
Query: 45 INPHSVIDRPGS-----SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG--VQGYSDGE 97
+ P S I+ P + + V D R+ + V LAGDG G +D
Sbjct: 275 VQPASGIESPAGIATDGTSVFVTDLERNNLRKVDIA---TQAVSTLAGDGGGSPGSTDAV 331
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G+ARF P ++ G +YVAD N ++RK+
Sbjct: 332 GGAARFSAPAGLVLN-AGVLYVADTGNDLLRKV 363
Score = 36.2 bits (82), Expect = 7.1, Method: Composition-based stats.
Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 3/75 (4%)
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
+ L V+D+ S + S V LAG + G DG ARF+ P A+ +
Sbjct: 83 TSLYVVDTGNSVIRKVVLATGATSTVAGLAG--ITGSGDGTGAVARFNYPSGIALSSDNS 140
Query: 117 -IYVADKSNHVIRKI 130
+YV+D N+ IR I
Sbjct: 141 TLYVSDTGNNTIRSI 155
>gi|290989086|ref|XP_002677176.1| predicted protein [Naegleria gruberi]
gi|284090782|gb|EFC44432.1| predicted protein [Naegleria gruberi]
Length = 2103
Score = 42.7 bits (99), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 34/60 (56%), Gaps = 4/60 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+S ++ +AG+G G+ G+ G A ++P A+ G +Y+AD NH IRK++N
Sbjct: 525 VSNNGIITTIAGNGTAGFC-GDGGLAVNTCLNRPNGIAISSSGELYIADYGNHRIRKVSN 583
Score = 39.3 bits (90), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 45/98 (45%), Gaps = 7/98 (7%)
Query: 38 IDGHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
ID + +N PH V P S ++ D + + + +AG+G Y+ D
Sbjct: 105 IDATKATVNSPHGVAFHPTSGEVYFADYANHRIR----KIYSNGTITTIAGNGEPNYNGD 160
Query: 96 GEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
P SA+ P AV G +++AD N+ IRKI+N
Sbjct: 161 NIPATSAQLGYPIGIAVSSGGEVFIADTLNNRIRKISN 198
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 32/54 (59%), Gaps = 3/54 (5%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT 131
++ +AGDG GY+ DG + A + P A+D N Y+AD +N+ IRK++
Sbjct: 313 IISSIAGDGQSGYNYDGIDATIATLNNPVGIAIDSSNNEFYIADSNNNRIRKVS 366
Score = 37.0 bits (84), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 37/75 (49%), Gaps = 6/75 (8%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +LIV +++ S + + L+ +V +AG G DG S F F + G
Sbjct: 790 SENLIVCETTSS--WKIRKILATTGMVSTIAG----GIGDGLNASNAFLVSTLFDISNSG 843
Query: 116 NIYVADKSNHVIRKI 130
IY+AD NH IRKI
Sbjct: 844 EIYIADTGNHRIRKI 858
>gi|430749689|ref|YP_007212597.1| copper amine oxidase family protein [Thermobacillus composti KWC4]
gi|430733654|gb|AGA57599.1| copper amine oxidase family protein [Thermobacillus composti KWC4]
Length = 187
Score = 42.7 bits (99), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 26/48 (54%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+AG G+ G DG F P AV G IYVAD N+VIRKI N
Sbjct: 4 IAGTGILGLKDGSASQENFYYPMDVAVSPSGVIYVADTLNYVIRKIEN 51
>gi|269957976|ref|YP_003327765.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Xylanimonas cellulosilytica DSM 15894]
gi|269306657|gb|ACZ32207.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Xylanimonas cellulosilytica DSM 15894]
Length = 688
Score = 42.7 bits (99), Expect = 0.072, Method: Composition-based stats.
Identities = 30/91 (32%), Positives = 48/91 (52%), Gaps = 11/91 (12%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P I PG + L+V D+ + L+ P E++V+R+ G G +G DG P ARF +P
Sbjct: 178 PAKAITLPGGT-LLVADAGHHSLAELA-P-DGETLVRRI-GSGQRGLLDGGPDEARFAEP 233
Query: 107 KSFAV---DMKG----NIYVADKSNHVIRKI 130
+ D++G ++ VAD NH +R +
Sbjct: 234 NGLCLVPEDLRGRLGYDVVVADTVNHALRGV 264
>gi|406901359|gb|EKD44040.1| NHL repeat containing protein, partial [uncultured bacterium]
Length = 667
Score = 42.7 bits (99), Expect = 0.074, Method: Composition-based stats.
Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 2/72 (2%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGN-IY 118
V D+ ++ S++ K +AG + G+ DG +A F+ P + +D N +Y
Sbjct: 531 VADTGNEVIRRVTLTGSKQGQTKLIAGSPTESGFKDGTKTAAEFNVPIALTIDSADNYLY 590
Query: 119 VADKSNHVIRKI 130
VAD+ NH IRK+
Sbjct: 591 VADRDNHAIRKV 602
Score = 41.2 bits (95), Expect = 0.23, Method: Composition-based stats.
Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 5/77 (6%)
Query: 60 IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
I+ D + + + T +F + +V + G G G +G +++F P + AVD
Sbjct: 51 IIADGAGNFYVTDTFNGVIRKIDANGIVSTVVGQGGYGDVNGSATTSKFAHPSAVAVDDS 110
Query: 115 GNIYVADKSNHVIRKIT 131
GN+Y+AD N I+K +
Sbjct: 111 GNVYIADAGNGKIKKFS 127
>gi|290986743|ref|XP_002676083.1| predicted protein [Naegleria gruberi]
gi|284089683|gb|EFC43339.1| predicted protein [Naegleria gruberi]
Length = 733
Score = 42.7 bits (99), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 34/55 (61%), Gaps = 4/55 (7%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
+AG+G G+S +G +A+ + P V +Y+AD+SNH IRKI L++GN
Sbjct: 158 IAGNGTSGFSGDNGPATNAQLNNPVGVFVSSNNEVYIADQSNHRIRKI--LENGN 210
Score = 40.4 bits (93), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 4/58 (6%)
Query: 82 VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
+ +AG G G+S +G +A+ P S V +Y+AD+SNH IRKI L++GN
Sbjct: 43 IATIAGKGTCGFSGDNGPATNAQIYYPSSVFVSSNNEVYIADQSNHRIRKI--LENGN 98
Score = 39.3 bits (90), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 35/55 (63%), Gaps = 5/55 (9%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
+AG+G G+S +G +A+ ++P S V +Y+AD +NH IRKI L++GN
Sbjct: 269 IAGNGTSGFSGDNGPATNAQLNRPSSVFVS-NNEVYIADLNNHKIRKI--LENGN 320
>gi|153864226|ref|ZP_01997193.1| NHL repeat protein [Beggiatoa sp. SS]
gi|152146280|gb|EDN72808.1| NHL repeat protein [Beggiatoa sp. SS]
Length = 313
Score = 42.7 bits (99), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 40 GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGE 97
G QL ID GS L ++D++ + +++ LAG GV G+S DG+
Sbjct: 120 GAQLNSPSDIAIDDSGS--LYIIDTNNHRIRKVE---KATNIITTLAG-GVSGFSGDDGQ 173
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
SA+F+ P A+ +G++ +AD NH +R IT +
Sbjct: 174 ALSAQFNTPDGIAISDEGHLLIADSENHRVRMITGV 209
Score = 38.9 bits (89), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + G+GV G GE A+ + P A+D G++Y+ D +NH IRK+
Sbjct: 99 IITTIVGNGVASFGGDGGEAPGAQLNSPSDIAIDDSGSLYIIDTNNHRIRKV 150
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 3/50 (6%)
Query: 84 RLAGD-GVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
R+AG GV G+S G A+ + PK V G +Y+ DK NH IRK+
Sbjct: 45 RIAGTLGVAGFSGDGGRAVEAKLNNPKRIVVADDGVVYITDKGNHRIRKV 94
>gi|184200016|ref|YP_001854223.1| hypothetical protein KRH_03700 [Kocuria rhizophila DC2201]
gi|183580246|dbj|BAG28717.1| hypothetical protein [Kocuria rhizophila DC2201]
Length = 690
Score = 42.7 bits (99), Expect = 0.075, Method: Composition-based stats.
Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 14/104 (13%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ P S +V D+ L S+ + V+R G G +G+SDG ARF++P
Sbjct: 179 PGKVVALPDGS-FMVSDTGHHRLVQLE---SDLTTVRRAIGSGARGWSDGPRDVARFNEP 234
Query: 107 KSFAV-------DMKGNIYVADKSNHVIRKITNLDHGNQLIRQI 143
+ AV ++ ++ VAD NH +R + D G+ +R +
Sbjct: 235 QGIAVLPGRLRSEVGYDLVVADTVNHRLRGV---DLGSGSVRTL 275
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 16/46 (34%), Positives = 26/46 (56%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LAG G +G DG + F +P A D G+++VAD + +R++
Sbjct: 345 LAGTGDEGLRDGSAHESWFAQPSGLAQDAHGDVWVADSESSALRRV 390
>gi|398335921|ref|ZP_10520626.1| hypothetical protein LkmesMB_09964 [Leptospira kmetyi serovar
Malaysia str. Bejo-Iso9]
Length = 947
Score = 42.7 bits (99), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G G +DG +ARF+ PK A D +YVAD NH IRKI
Sbjct: 722 GTSGNADGVGTAARFNAPKGIATDGT-YLYVADTGNHKIRKI 762
>gi|290975761|ref|XP_002670610.1| predicted protein [Naegleria gruberi]
gi|284084171|gb|EFC37866.1| predicted protein [Naegleria gruberi]
Length = 308
Score = 42.7 bits (99), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+S +G +A+F+ P S V K +Y+ D SNH IRKI L++
Sbjct: 39 ENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITDYSNHRIRKI--LEN 96
Query: 136 GN 137
GN
Sbjct: 97 GN 98
Score = 42.7 bits (99), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+S +G +A+F+ P S V K +Y+ D SNH IRKI L++
Sbjct: 151 ENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITDYSNHRIRKI--LEN 208
Query: 136 GN 137
GN
Sbjct: 209 GN 210
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+S +G +A+ P S V +Y+AD NH IRKI L++
Sbjct: 95 ENGNIITIAGNGTVGFSGDNGPATNAQLYNPSSVFVSSNNEVYIADFCNHRIRKI--LEN 152
Query: 136 GN 137
GN
Sbjct: 153 GN 154
>gi|170066897|ref|XP_001868267.1| NHL repeat containing 2 [Culex quinquefasciatus]
gi|167863075|gb|EDS26458.1| NHL repeat containing 2 [Culex quinquefasciatus]
Length = 734
Score = 42.7 bits (99), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 1/54 (1%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
SE +V+ R+ G G+ DG+ ARF+ P+ A I+VAD NH IR+I
Sbjct: 260 SEGTVLHRIGGKK-SGFVDGDFRKARFNAPQGLAFQNDDVIFVADNENHAIRRI 312
>gi|423216521|ref|ZP_17203044.1| hypothetical protein HMPREF1074_04576 [Bacteroides xylanisolvens
CL03T12C04]
gi|392690731|gb|EIY83987.1| hypothetical protein HMPREF1074_04576 [Bacteroides xylanisolvens
CL03T12C04]
Length = 589
Score = 42.7 bits (99), Expect = 0.078, Method: Composition-based stats.
Identities = 20/54 (37%), Positives = 31/54 (57%), Gaps = 1/54 (1%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT 131
++ V +AG G +G DG+ +A+ P + GN IY +D N++IRKIT
Sbjct: 531 DATVSTIAGSGTKGNQDGDGATAQLAMPHGITMTADGNTIYFSDLDNYIIRKIT 584
>gi|336403610|ref|ZP_08584322.1| hypothetical protein HMPREF0127_01635 [Bacteroides sp. 1_1_30]
gi|335945536|gb|EGN07347.1| hypothetical protein HMPREF0127_01635 [Bacteroides sp. 1_1_30]
Length = 419
Score = 42.7 bits (99), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 61/140 (43%), Gaps = 21/140 (15%)
Query: 56 SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
S + IV D +++ S +S + K +AGDGV+G Y DGEPG SA+
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGEPGNPLSAKVGA 329
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT---NLDHGNQLIRQINLKPEDCSKSSQSGSGMFW 162
A G +Y++D + + IRK+T N D+ + I K+ S
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETI----AGSGKAGFSDGKGLK 385
Query: 163 VTVFSPYLI----SCKSFWV 178
T PY I CK+ +V
Sbjct: 386 ATFNQPYEIIITEDCKTMYV 405
>gi|406909052|gb|EKD49392.1| lipoprotein, partial [uncultured bacterium]
Length = 552
Score = 42.7 bits (99), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 5/83 (6%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
V++R G+ V DS + ++ + ++LAG G+ +G A F+ P
Sbjct: 360 VVNRAGTK-AYVTDSGNNMLREINLSTGK---TRKLAGTRSAGFKNGFGPLATFNGPADL 415
Query: 110 AVDMKG-NIYVADKSNHVIRKIT 131
+D G N+YVADK NH IR+I
Sbjct: 416 DLDAAGANLYVADKGNHAIRRIN 438
>gi|423214677|ref|ZP_17201205.1| hypothetical protein HMPREF1074_02737 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692583|gb|EIY85820.1| hypothetical protein HMPREF1074_02737 [Bacteroides xylanisolvens
CL03T12C04]
Length = 419
Score = 42.7 bits (99), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 61/140 (43%), Gaps = 21/140 (15%)
Query: 56 SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
S + IV D +++ S +S + K +AGDGV+G Y DGEPG SA+
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGEPGNPLSAKVGA 329
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT---NLDHGNQLIRQINLKPEDCSKSSQSGSGMFW 162
A G +Y++D + + IRK+T N D+ + I K+ S
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETI----AGSGKAGFSDGKGLK 385
Query: 163 VTVFSPYLI----SCKSFWV 178
T PY I CK+ +V
Sbjct: 386 ATFNQPYEIIITEDCKTMYV 405
>gi|262405589|ref|ZP_06082139.1| NHL repeat containing protein [Bacteroides sp. 2_1_22]
gi|294644531|ref|ZP_06722285.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
gi|294805793|ref|ZP_06764670.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
gi|345510784|ref|ZP_08790345.1| hypothetical protein BSAG_00303 [Bacteroides sp. D1]
gi|262356464|gb|EEZ05554.1| NHL repeat containing protein [Bacteroides sp. 2_1_22]
gi|292640084|gb|EFF58348.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
gi|294447014|gb|EFG15604.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
gi|345454355|gb|EEO48593.2| hypothetical protein BSAG_00303 [Bacteroides sp. D1]
Length = 419
Score = 42.7 bits (99), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 61/140 (43%), Gaps = 21/140 (15%)
Query: 56 SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
S + IV D +++ S +S + K +AGDGV+G Y DGEPG SA+
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGEPGNPLSAKVGA 329
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT---NLDHGNQLIRQINLKPEDCSKSSQSGSGMFW 162
A G +Y++D + + IRK+T N D+ + I K+ S
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETI----AGSGKAGFSDGKGLK 385
Query: 163 VTVFSPYLI----SCKSFWV 178
T PY I CK+ +V
Sbjct: 386 ATFNQPYEIIITEDCKTMYV 405
>gi|168024767|ref|XP_001764907.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683943|gb|EDQ70349.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1040
Score = 42.7 bits (99), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 31/65 (47%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
V LAG G G+ DG + +P A+ G +YVAD +N +IR + G I
Sbjct: 818 VTTLAGTGTAGFKDGTAQEGQLSEPAGIALGPDGKLYVADTNNSLIRVLDLQAQGGPKIE 877
Query: 142 QINLK 146
+ LK
Sbjct: 878 TLELK 882
>gi|412985326|emb|CCO20351.1| predicted protein [Bathycoccus prasinos]
Length = 897
Score = 42.7 bits (99), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 42/98 (42%), Gaps = 9/98 (9%)
Query: 39 DGHQLEIN---PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
DG LE P V+ P D+ + D+ + E V LAG G G +D
Sbjct: 120 DGPSLEARFNKPFGVVASPDGRDIFIADTFNHRIRRIDV---EFGTVTTLAGTGASGSAD 176
Query: 96 G--EPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G + G A F P AV G +YVAD NH IR I
Sbjct: 177 GLAKRGDATFKFPSGVAVSPDGGFVYVADTKNHKIRVI 214
Score = 36.2 bits (82), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
VV+ +AG G+ GY D G+ ARF++P + G+ ++V+D N+ IRK+
Sbjct: 220 VVRTVAGSGLTGYHDDLVGTKARFNQPMDVVMHPDGHTLFVSDAMNNCIRKV 271
>gi|302036371|ref|YP_003796693.1| hypothetical protein NIDE1006 [Candidatus Nitrospira defluvii]
gi|300604435|emb|CBK40767.1| conserved protein of unknown function, contains NHL repeats
[Candidatus Nitrospira defluvii]
Length = 404
Score = 42.7 bits (99), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 14/28 (50%), Positives = 21/28 (75%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++PK VD KGN+Y+AD NHV+R++
Sbjct: 34 LNEPKGLCVDRKGNLYIADSENHVVRRV 61
Score = 40.8 bits (94), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 24/37 (64%), Gaps = 1/37 (2%)
Query: 95 DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
DG P AR + P + AVD GN+Y+AD NH +RK+
Sbjct: 137 DGGPARRARLNFPSAVAVDRAGNLYIADTMNHRVRKV 173
>gi|440789641|gb|ELR10946.1| NHL repeat containing 2 isoform 3, putative [Acanthamoeba
castellanii str. Neff]
Length = 934
Score = 42.7 bits (99), Expect = 0.082, Method: Composition-based stats.
Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 4/83 (4%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
PH V +DR G + V DS LS + + + + D G DG+ +ARF
Sbjct: 298 PHGVAVDREGDR-MFVADSGHHRILVLSLDGAFRTAIG--SDDATTGLVDGDYATARFHS 354
Query: 106 PKSFAVDMKGNIYVADKSNHVIR 128
P + + +YVAD NH IR
Sbjct: 355 PLGLSYAGEDKLYVADSENHCIR 377
>gi|29349674|ref|NP_813177.1| hypothetical protein BT_4266 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341584|gb|AAO79371.1| conserved hypothetical protein with NHL repeat [Bacteroides
thetaiotaomicron VPI-5482]
Length = 490
Score = 42.7 bits (99), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 42/89 (47%), Gaps = 4/89 (4%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLS--EESVVKRLAGDGVQGYSDGEPGSARF 103
NPH + L + D + + LS + +V + G++GY DG P A F
Sbjct: 401 NPHQICFTE-DGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGMKGYKDGGPDIAMF 459
Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
+ P AV G I YVAD N VIRK++
Sbjct: 460 NHPFGVAVSADGQIVYVADTGNKVIRKLS 488
>gi|397781307|ref|YP_006545780.1| Tripartite motif-containing protein 71 [Methanoculleus bourgensis
MS2]
gi|396939809|emb|CCJ37064.1| Tripartite motif-containing protein 71 AltName: Full=Lin-41 homolog
[Methanoculleus bourgensis MS2]
Length = 1996
Score = 42.7 bits (99), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
G+ +FDKP AVD G+IYVAD +NH I+K
Sbjct: 1676 GNGQFDKPWGIAVDTAGDIYVADYNNHRIQK 1706
Score = 42.4 bits (98), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
G+ +FDKP AVD G+IYVAD +NH I+K
Sbjct: 1394 GNGQFDKPWGIAVDAAGDIYVADYNNHRIQK 1424
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+ +FD+P A D+ GN+YV+D NH I+K
Sbjct: 1348 NGQFDRPTGIATDISGNVYVSDYYNHRIQK 1377
Score = 35.8 bits (81), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 21/40 (52%), Gaps = 4/40 (10%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
G G SD E F P A D GN+YVAD SNH IR
Sbjct: 1294 GSGGSSDVE----SFSYPTGIATDATGNVYVADYSNHCIR 1329
>gi|290972869|ref|XP_002669173.1| predicted protein [Naegleria gruberi]
gi|284082716|gb|EFC36429.1| predicted protein [Naegleria gruberi]
Length = 1010
Score = 42.7 bits (99), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 38/79 (48%), Gaps = 6/79 (7%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDM 113
S DL + DSSR F + + AG G GYS DG + F+ P A+
Sbjct: 75 SDDLFISDSSRHVI----FRMFSNGTISLFAGIGFAGYSKDGYSALDSLFNSPNGIAMSP 130
Query: 114 KGNIYVADKSNHVIRKITN 132
GN+Y+AD N IR ++N
Sbjct: 131 NGNLYIADSQNDKIRIVSN 149
>gi|290972086|ref|XP_002668792.1| predicted protein [Naegleria gruberi]
gi|284082314|gb|EFC36048.1| predicted protein [Naegleria gruberi]
Length = 679
Score = 42.7 bits (99), Expect = 0.082, Method: Composition-based stats.
Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+S +G +A+F+ P S V K +Y+ D SNH IRKI L++
Sbjct: 39 ENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITDYSNHSIRKI--LEN 96
Query: 136 GN 137
GN
Sbjct: 97 GN 98
Score = 38.5 bits (88), Expect = 1.5, Method: Composition-based stats.
Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+S +G +A+ P S V +Y+ D SNH IRKI L++
Sbjct: 151 ENGNIITIAGNGTYGFSGDNGPATNAQLYNPYSVFVSSNNEVYITDYSNHRIRKI--LEN 208
Query: 136 GN 137
GN
Sbjct: 209 GN 210
Score = 35.8 bits (81), Expect = 9.9, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 5/62 (8%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+S +G +A+ ++P S V +Y++D+SN IRKI L++
Sbjct: 207 ENGNIVTIAGNGNYGFSGDNGPATNAQLNRPNSVFVS-NNEVYISDQSNQRIRKI--LEN 263
Query: 136 GN 137
GN
Sbjct: 264 GN 265
>gi|428174200|gb|EKX43097.1| hypothetical protein GUITHDRAFT_153351 [Guillardia theta CCMP2712]
Length = 392
Score = 42.4 bits (98), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 6/85 (7%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P ++ P + LIV D T + + S + G+G +GY DG +A F+ P
Sbjct: 150 PSALALAPNQATLIVSDLELHQLLTFNMADGKTS---KFVGNGQRGYQDGA--TASFNGP 204
Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI 130
+ G I VAD NHV+RKI
Sbjct: 205 RGLTFSPDGTYILVADTGNHVVRKI 229
>gi|428179990|gb|EKX48859.1| hypothetical protein GUITHDRAFT_105483 [Guillardia theta CCMP2712]
Length = 388
Score = 42.4 bits (98), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 30/53 (56%), Gaps = 1/53 (1%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKIT 131
S V LAG G GY DG SA+F P AV ++ + V+D+ NH IR IT
Sbjct: 60 SQVTLLAGTGEAGYKDGIGSSAKFKSPSGLAVQSLRTGLCVSDRGNHAIRIIT 112
Score = 38.9 bits (89), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 33/54 (61%), Gaps = 3/54 (5%)
Query: 80 SVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
+VV +AG D G++DGE +A F+ P+ A+ G+ I VAD NH IRK+
Sbjct: 120 AVVDTVAGSDDSGAGFADGEGPNALFNHPEGLAMSPDGSFILVADSGNHRIRKV 173
>gi|406947273|gb|EKD78225.1| lipoprotein [uncultured bacterium]
Length = 883
Score = 42.4 bits (98), Expect = 0.084, Method: Composition-based stats.
Identities = 25/79 (31%), Positives = 44/79 (55%), Gaps = 6/79 (7%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV--DM 113
+ D+ VLD+ + + + ++R+AG G GY + +A+F +P+ AV D
Sbjct: 50 ADDIYVLDTINNRIEKVK----SDGSLERVAGSGEYGYKNDSALNAQFAEPQDIAVFGDN 105
Query: 114 KGNIYVADKSNHVIRKITN 132
I++AD +N+VIRK+ N
Sbjct: 106 AEQIFIADTNNNVIRKLEN 124
Score = 39.7 bits (91), Expect = 0.62, Method: Composition-based stats.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 3/52 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKIT 131
VV L+G+G GY +G +A + PK F D + G +YVAD N VIR I+
Sbjct: 833 VVMTLSGNGNVGYVNGTDTAAEWHNPKGF--DFRAGKLYVADTMNDVIRTIS 882
>gi|380694022|ref|ZP_09858881.1| hypothetical protein BfaeM_08588 [Bacteroides faecis MAJ27]
Length = 490
Score = 42.4 bits (98), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 42/89 (47%), Gaps = 4/89 (4%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLS--EESVVKRLAGDGVQGYSDGEPGSARF 103
NPH + L + D + + LS + +V + G++GY DG P A F
Sbjct: 401 NPHQICFTE-DGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGMKGYKDGGPDIAMF 459
Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
+ P AV G I YVAD N VIRK++
Sbjct: 460 NHPFGVAVSADGQIVYVADTGNKVIRKLS 488
>gi|428184600|gb|EKX53455.1| hypothetical protein GUITHDRAFT_101156 [Guillardia theta CCMP2712]
Length = 367
Score = 42.4 bits (98), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
VV +AG GV GY +G + + + P A+ IYV+D+ NH IR++ ++
Sbjct: 12 VVCSIAGTGVPGYYEGNRMTCKMNCPSGIAMGQGDCIYVSDRQNHCIRRLPGIEE 66
>gi|290975268|ref|XP_002670365.1| predicted protein [Naegleria gruberi]
gi|284083923|gb|EFC37621.1| predicted protein [Naegleria gruberi]
Length = 1034
Score = 42.4 bits (98), Expect = 0.086, Method: Composition-based stats.
Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 9/78 (11%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVD 112
++D++ +D+ R LS + +V LAG G++ DG P SA+ + P ++
Sbjct: 15 ANNDILFIDNDR--LRKLS-----QGIVSTLAGTKEAGFNGDGIPAVSAQLNSPLGVSMS 67
Query: 113 MKGNIYVADKSNHVIRKI 130
G IY+ADK NH IRKI
Sbjct: 68 KGGEIYIADKLNHRIRKI 85
>gi|290997788|ref|XP_002681463.1| predicted protein [Naegleria gruberi]
gi|284095087|gb|EFC48719.1| predicted protein [Naegleria gruberi]
Length = 2297
Score = 42.4 bits (98), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 46/99 (46%), Gaps = 10/99 (10%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY--SDGEPGSARFD 104
P+ V P + +L + D+ + + ++ + + G GV GY G+ A+
Sbjct: 240 PNDVAISPTTGELFIADTGNNVIRKVGL----DNKIITVVGTGVSGYLGDGGQAKQAQLS 295
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQI 143
P S A G + ++D N+VIRK+ + N +IR I
Sbjct: 296 SPTSIAFTSAGEMLISDSDNYVIRKV----YSNGIIRTI 330
Score = 36.2 bits (82), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 78 EESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
E + GDG +G S DG P A + V G++Y+ADK+ H IRK++N
Sbjct: 42 ERFTITSYVGDGNEGNSGDGVSPSHASIGYVDAIEVGSNGDLYIADKTYHRIRKVSN 98
>gi|395646873|ref|ZP_10434733.1| PKD domain containing protein [Methanofollis liminatans DSM 4140]
gi|395443613|gb|EJG08370.1| PKD domain containing protein [Methanofollis liminatans DSM 4140]
Length = 2489
Score = 42.4 bits (98), Expect = 0.088, Method: Composition-based stats.
Identities = 19/44 (43%), Positives = 26/44 (59%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+G + + PG + F PK AVD GN+YVAD +NH I+K
Sbjct: 1671 SGTFITTWGSEPPGDSEFSYPKGVAVDDAGNVYVADTNNHRIQK 1714
Score = 38.5 bits (88), Expect = 1.5, Method: Composition-based stats.
Identities = 16/46 (34%), Positives = 25/46 (54%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
G + + G G FD+P VD GN++V+D N+ I+K T+
Sbjct: 1151 GTFITKWGSGGTGDGEFDRPHGIVVDADGNVFVSDAGNNNIQKFTS 1196
Score = 37.4 bits (85), Expect = 2.9, Method: Composition-based stats.
Identities = 16/34 (47%), Positives = 24/34 (70%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
G A+F +P+ AVD G++YVAD N+ I+K +N
Sbjct: 42 GDAQFIRPEGIAVDPGGDVYVADPGNNRIQKFSN 75
Score = 37.0 bits (84), Expect = 3.6, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 2/50 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRK 129
V++ GDG + G GS +F+ P AVD NIYV D NH ++K
Sbjct: 117 VQKFDGDGTYIATWGTKGSGEGQFESPGGIAVDNASNIYVTDTVNHRVQK 166
Score = 36.2 bits (82), Expect = 6.6, Method: Composition-based stats.
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
G +F+ P+ AVD +GN++VAD NH I+
Sbjct: 1210 GDGQFNVPRGIAVDSRGNVFVADSLNHRIQ 1239
>gi|119961614|ref|YP_949427.1| NHL repeat-containing protein [Arthrobacter aurescens TC1]
gi|403528903|ref|YP_006663790.1| NHL repeat-containing protein [Arthrobacter sp. Rue61a]
gi|119948473|gb|ABM07384.1| putative NHL repeat protein [Arthrobacter aurescens TC1]
gi|403231330|gb|AFR30752.1| NHL repeat-containing protein [Arthrobacter sp. Rue61a]
Length = 657
Score = 42.4 bits (98), Expect = 0.089, Method: Composition-based stats.
Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 2/62 (3%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
PL+ E V LAG G++G DG+ A F + A+D + NI+VAD +R++ D
Sbjct: 359 PLANE--VSILAGSGLEGLLDGKAEEAWFAQSSGLAIDGEDNIWVADSETSSLRRLVISD 416
Query: 135 HG 136
G
Sbjct: 417 SG 418
>gi|428214156|ref|YP_007087300.1| thiol-disulfide isomerase-like thioredoxin [Oscillatoria acuminata
PCC 6304]
gi|428002537|gb|AFY83380.1| thiol-disulfide isomerase-like thioredoxin [Oscillatoria acuminata
PCC 6304]
Length = 506
Score = 42.4 bits (98), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 5/85 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ SS L + DS + V+ + G+G+ G +DG A+F P
Sbjct: 178 PGKVLADAASSRLFIADSGHHRIIVTTL----NGTVQSIIGNGIPGLTDGSFSEAQFFGP 233
Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI 130
+ + D +YVAD NH IR+I
Sbjct: 234 QGMSWDANSQRLYVADTENHAIRQI 258
Score = 38.9 bits (89), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
K + GDG G DG+ ++RF +P ++ G +Y+AD +NH IR++
Sbjct: 437 KTVLGDGTPGLLDGKNTTSRFFEPGGLSI-ASGILYIADTNNHRIRRVA 484
>gi|297848174|ref|XP_002891968.1| hypothetical protein ARALYDRAFT_892829 [Arabidopsis lyrata subsp.
lyrata]
gi|297337810|gb|EFH68227.1| hypothetical protein ARALYDRAFT_892829 [Arabidopsis lyrata subsp.
lyrata]
Length = 1058
Score = 42.4 bits (98), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 38/67 (56%), Gaps = 4/67 (5%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHG 136
E + + ++ G +G+ DG A F++P+ A + K N +YVAD NH +R+I D
Sbjct: 600 EGNFIVQIGSTGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREI---DFV 656
Query: 137 NQLIRQI 143
N+ +R +
Sbjct: 657 NERVRTL 663
Score = 39.3 bits (90), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
P+++ V +AG G G+ DG+ A+ +P A+ G ++VAD +N +IR I +L+
Sbjct: 834 PITKRVVT--VAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI-DLN 890
Query: 135 HG 136
G
Sbjct: 891 KG 892
>gi|219852663|ref|YP_002467095.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546922|gb|ACL17372.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 522
Score = 42.4 bits (98), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 30/47 (63%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
L GD V S G+++F++P S AVD G++YVAD N+ I+K T
Sbjct: 216 LTGDLVATRSISNSGASQFNRPTSVAVDTGGSVYVADTGNNRIQKFT 262
Score = 39.3 bits (90), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 22/33 (66%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G+ +F+ P+ AVD GN+YVAD N+ I K T
Sbjct: 134 GTNQFNSPEGIAVDGAGNVYVADTGNNRIEKFT 166
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 13/83 (15%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYT-LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKS 108
+D G+ + +++ R +F+T S P S+ + +G G +F P
Sbjct: 49 AVDGTGNVYVADMNNDRISFFTKASLPQMPSSIGRIGSGHG------------QFFYPHG 96
Query: 109 FAVDMKGNIYVADKSNHVIRKIT 131
AVD GN+YVAD NH I+K T
Sbjct: 97 VAVDSTGNVYVADTGNHQIQKFT 119
>gi|381205317|ref|ZP_09912388.1| NHL repeat containing protein [SAR324 cluster bacterium JCVI-SC
AAA005]
Length = 458
Score = 42.4 bits (98), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 31/46 (67%), Gaps = 1/46 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG G GY+DG SA+F+ P +F + K +++VAD N+ IR+I
Sbjct: 271 IAGGGSIGYTDGSGISAQFNAPNAF-TNTKDSLFVADTCNNTIRRI 315
>gi|290991506|ref|XP_002678376.1| NHL repeat domain-containing protein [Naegleria gruberi]
gi|284091988|gb|EFC45632.1| NHL repeat domain-containing protein [Naegleria gruberi]
Length = 985
Score = 42.0 bits (97), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 7/87 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE---PGSARF 103
P +++ P S DL+++D+ ++S S VK +AG G+ GY +GE +
Sbjct: 452 PSNLVFHPISGDLLIVDTQNHRIRSMS---KSTSFVKTIAGTGIAGY-NGEGMLSNMTKM 507
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ P AV G I AD N+++R I
Sbjct: 508 NSPSGIAVLSTGEIIFADTFNNLVRMI 534
>gi|383784367|ref|YP_005468936.1| hypothetical protein LFE_1113 [Leptospirillum ferrooxidans C2-3]
gi|383083279|dbj|BAM06806.1| hypothetical protein LFE_1113 [Leptospirillum ferrooxidans C2-3]
Length = 382
Score = 42.0 bits (97), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 26/41 (63%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+G +DG SARF+ P+ G+IYVAD N +IRKIT
Sbjct: 53 RGDNDGNGASARFEFPQGIVAAPDGSIYVADTGNDMIRKIT 93
Score = 40.0 bits (92), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 3/52 (5%)
Query: 82 VKRLAGDG--VQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
V LAG G V G++DG A F +P+ A+ G I Y+AD NH+IRK+
Sbjct: 214 VTTLAGQGALVPGHADGVGTQATFLEPRGLAISPDGQILYIADTRNHLIRKL 265
Score = 36.6 bits (83), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
+ DG +ARF+ P+ A+ G +YVAD N+ IRKI
Sbjct: 113 FRDGAGAAARFNNPEGLAISPDGKTLYVADSRNNRIRKI 151
>gi|290974673|ref|XP_002670069.1| predicted protein [Naegleria gruberi]
gi|284083624|gb|EFC37325.1| predicted protein [Naegleria gruberi]
Length = 821
Score = 42.0 bits (97), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 51/97 (52%), Gaps = 23/97 (23%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL-------------AGDGVQGYSDGE 97
++ PG L+ + + Y + FPLS + V+R+ AG+G G + GE
Sbjct: 101 VENPG---LLAFSPNETTLY-VCFPLSNK--VQRMDFLPNGTEIFSDFAGNGNSGNT-GE 153
Query: 98 PG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G +A+ ++P AVD G++Y++D N +IRK++
Sbjct: 154 GGLAINAQLNQPSGVAVDSNGDVYISDSGNGLIRKVS 190
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 51/102 (50%), Gaps = 13/102 (12%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + P + DL + DS + ++ ++ +AGD GYS G+ G A+ K
Sbjct: 220 NPRGLNFGP-NGDLYIADSDNNVVRRIN----SSGIITTIAGDTTSGYS-GDGGDAKLAK 273
Query: 106 ---PKSFAVDMKGNIYVADKSNHVIRKITNLD----HGNQLI 140
P + V I+++D N+VIRK++N + GN+L+
Sbjct: 274 MKNPINVIVSKTNEIFISDADNNVIRKVSNGNISTIAGNELV 315
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 8/87 (9%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYS--DGEPGSARF 103
P+SV + SSD IV+ + + F + + +AG G QGYS +G S+
Sbjct: 501 PNSV--KVSSSDEIVIADTSNNVIRKVF---KNGTIITIAGTIGSQGYSGDNGLAISSVL 555
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI 130
P A+ G +++AD NHVIRKI
Sbjct: 556 SSPSDVAIAPNGEVFIADWGNHVIRKI 582
>gi|158520859|ref|YP_001528729.1| YD repeat-containing protein [Desulfococcus oleovorans Hxd3]
gi|158509685|gb|ABW66652.1| YD repeat protein [Desulfococcus oleovorans Hxd3]
Length = 2831
Score = 42.0 bits (97), Expect = 0.11, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S + +++ AG GV G+ G AR P AVD GN+++AD N IRK+
Sbjct: 1629 ISPDGIIEAFAGMGVDAGYSGDGGLAVDARLQSPTGLAVDKTGNLFIADSGNFSIRKV 1686
Score = 40.8 bits (94), Expect = 0.26, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 6/50 (12%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+K AGDG G AR P+ +D GNIY+AD NH +R+I+
Sbjct: 1587 MKVFAGDG------GPAIEARLYHPQGMEIDASGNIYIADTDNHCVRRIS 1630
>gi|406944252|gb|EKD76066.1| Serine/threonine protein kinase, partial [uncultured bacterium]
Length = 212
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV--DMKGNIYVADKSNHVIRKI 130
+ + ++ +AG G GY DG A F +P+ A+ D +++AD +N+VIRKI
Sbjct: 100 IGTDGILTNVAGAGDYGYRDGSSDYALFAQPQDIAIYGDTASELFIADTNNNVIRKI 156
>gi|302768701|ref|XP_002967770.1| hypothetical protein SELMODRAFT_17931 [Selaginella moellendorffii]
gi|300164508|gb|EFJ31117.1| hypothetical protein SELMODRAFT_17931 [Selaginella moellendorffii]
Length = 128
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
Query: 79 ESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+S ++ LAG G GY DG + F++P+S A+ G ++VAD N IRKI+ G
Sbjct: 3 DSSLEALAGSLVGESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKISKDGEG 62
Query: 137 NQLI 140
I
Sbjct: 63 MTTI 66
>gi|156399714|ref|XP_001638646.1| predicted protein [Nematostella vectensis]
gi|156225768|gb|EDO46583.1| predicted protein [Nematostella vectensis]
Length = 286
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 1/51 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT 131
V AG+G +G DGE A+F P+S ++D +G+ ++V D NH IR I+
Sbjct: 131 VTTFAGNGSEGLVDGERTRAQFKGPQSLSLDDEGDRLFVGDTDNHAIRVIS 181
>gi|379719861|ref|YP_005311992.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus 3016]
gi|378568533|gb|AFC28843.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus 3016]
Length = 535
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 4/45 (8%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AGD ++DG A+F++P A+D KGN+YV+D N IR I
Sbjct: 235 AGD----FADGPLAQAKFNEPTGIALDAKGNLYVSDSGNQRIRYI 275
Score = 40.0 bits (92), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 47/106 (44%), Gaps = 8/106 (7%)
Query: 29 EDGYTVTTVIDGHQ---LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
+ GY V ++DG L P + P S L V D+ A + + V +
Sbjct: 109 DKGYPVGGLLDGAANASLFQEPLGLSAGPDGS-LYVADAGNHAIRRID----AKGNVTTV 163
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
AG G G DG+ +A F +P A G +YVAD H IR+I+
Sbjct: 164 AGSGRLGAKDGKGAAAEFYRPGDVAAAPDGTLYVADTLGHTIRRIS 209
>gi|337745906|ref|YP_004640068.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus KNP414]
gi|336297095|gb|AEI40198.1| copper amine oxidase domain protein [Paenibacillus mucilaginosus
KNP414]
Length = 537
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 4/45 (8%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AGD ++DG A+F++P A+D KGN+YV+D N IR I
Sbjct: 235 AGD----FADGPLAQAKFNEPTGIALDAKGNLYVSDSGNQRIRYI 275
Score = 40.0 bits (92), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 47/106 (44%), Gaps = 8/106 (7%)
Query: 29 EDGYTVTTVIDGHQ---LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
+ GY V ++DG L P + P S L V D+ A + + V +
Sbjct: 109 DKGYPVGGLLDGAANASLFQEPLGLSAGPDGS-LYVADAGNHAIRRID----AKGNVTTV 163
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
AG G G DG+ +A F +P A G +YVAD H IR+I+
Sbjct: 164 AGSGRLGAKDGKGAAAEFYRPGDVAAAPDGTLYVADTLGHTIRRIS 209
>gi|315500945|ref|YP_004079832.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ mal
allergen [Micromonospora sp. L5]
gi|315407564|gb|ADU05681.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Micromonospora sp. L5]
Length = 610
Score = 42.0 bits (97), Expect = 0.12, Method: Composition-based stats.
Identities = 29/97 (29%), Positives = 53/97 (54%), Gaps = 16/97 (16%)
Query: 47 PHSVIDRPGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS 100
P +V+ PG + +L+V DS+R + L+ E+VV+R+ G G +G +DG +
Sbjct: 167 PDTVLRFPGKALPLDGGNLLVSDSARHSLVELA--ADGETVVRRI-GTGSRGRTDGPASA 223
Query: 101 ARFDKPKSFAV------DMKG-NIYVADKSNHVIRKI 130
A F +P+ + ++ G ++ VAD NH++R +
Sbjct: 224 ATFAEPQGLCLLPPHVAEVAGYDVVVADTVNHLLRGV 260
>gi|290982388|ref|XP_002673912.1| predicted protein [Naegleria gruberi]
gi|284087499|gb|EFC41168.1| predicted protein [Naegleria gruberi]
Length = 2313
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 15/50 (30%), Positives = 31/50 (62%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
LAG ++G++DG +A + P S ++ + +Y +D+ NH IR ++ ++
Sbjct: 500 LAGGSLKGFADGVGSNANLNYPDSISIGLNNMLYFSDRDNHAIRSVSTIN 549
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 33/59 (55%), Gaps = 2/59 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+S +S +AG G G+S G SA + P + D GN+ ++D SN+ IRK+TN
Sbjct: 272 VSLQSGTPIVAGTGTSGFSGDGGVATSALLNGPSALTFDSSGNMLISDSSNNRIRKVTN 330
Score = 40.0 bits (92), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 4/56 (7%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKIT 131
++V ++G G+ GY+ G+ G A + K P S V + G+I DK N IRKIT
Sbjct: 549 NALVTTISGSGIAGYT-GDEGPAIYAKLNLPGSIEVALNGDIIFMDKGNQRIRKIT 603
Score = 39.7 bits (91), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 2/55 (3%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
E ++ +AG G GY+ G SA + P AV +Y AD NHVIRKI+
Sbjct: 134 ENGIISTIAGTGSPGYTASGTATSALINTPLGLAV-YNNEVYFADSLNHVIRKIS 187
Score = 36.6 bits (83), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 29/54 (53%), Gaps = 2/54 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
VV +AG G GYS DG A+ + P+ + G I V+D NH IRKI N
Sbjct: 82 VVSTIAGTGNAGYSGDGANALFAQLNSPQGIGLLSGGAIIVSDTLNHRIRKIEN 135
>gi|296124331|ref|YP_003632109.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
gi|296016671|gb|ADG69910.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
Length = 581
Score = 42.0 bits (97), Expect = 0.12, Method: Composition-based stats.
Identities = 26/85 (30%), Positives = 42/85 (49%), Gaps = 7/85 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P ++ P + + DS+ + S + ++ G G G DG SA+FD P
Sbjct: 250 PGKLLVDPAHERVFISDSNHNRIVVASLA----GQLLKVIGSGKIGAKDGPAESAQFDHP 305
Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI 130
+ A+D GN +YVAD NH++R +
Sbjct: 306 QGMALD--GNTLYVADTENHLLRTV 328
>gi|255542920|ref|XP_002512523.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
gi|223548484|gb|EEF49975.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
Length = 1016
Score = 42.0 bits (97), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 4/61 (6%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQ 142
++ G +G DG A F++P+ A + K N+ YVAD NH +R+I D N+++R
Sbjct: 565 QIGSTGEEGLRDGPFDEATFNRPQGLAYNAKKNLLYVADTENHALREI---DFVNEMVRT 621
Query: 143 I 143
+
Sbjct: 622 L 622
Score = 35.8 bits (81), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
V +AG G G+ DG+ +A+ +P G + +AD +N +IR + +L+ +R
Sbjct: 798 VSTIAGTGKAGFKDGKALAAQLSEPSGIIEAENGRLIIADTNNSIIRYL-DLNKEEAELR 856
Query: 142 QINLK 146
+ LK
Sbjct: 857 TLELK 861
>gi|116621305|ref|YP_823461.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116224467|gb|ABJ83176.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 1042
Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats.
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 3/61 (4%)
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
F L+ V +AG+G G G+ G +A F + A+D GN+Y+ D N+ IRK+
Sbjct: 110 FKLTSAGKVTTVAGNGTSGTYTGDGGPAINAGFSGAYAIALDSTGNLYICDSGNNAIRKV 169
Query: 131 T 131
T
Sbjct: 170 T 170
Score = 38.9 bits (89), Expect = 0.96, Method: Composition-based stats.
Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 8/75 (10%)
Query: 65 SRSAFYTLSFPLSEESVVKRLAGD-------GVQGYSDGEPG-SARFDKPKSFAVDMKGN 116
S+ TLS L + +LAG GV DG P SA + P++ D G
Sbjct: 2 SKHFILTLSAALCLTTAHAQLAGSYVSRTIAGVFPLGDGGPATSAILETPQAVVADSSGT 61
Query: 117 IYVADKSNHVIRKIT 131
IY+AD N VIRK++
Sbjct: 62 IYIADAGNGVIRKVS 76
>gi|302036773|ref|YP_003797095.1| hypothetical protein NIDE1420 [Candidatus Nitrospira defluvii]
gi|190343188|gb|ACE75576.1| putative NHL repeat protein [Candidatus Nitrospira defluvii]
gi|300604837|emb|CBK41169.1| protein of unknown function, contains NHL repeats [Candidatus
Nitrospira defluvii]
Length = 989
Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats.
Identities = 21/41 (51%), Positives = 23/41 (56%), Gaps = 5/41 (12%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
G +G DGE F P AVD GNIYVAD NH I+K
Sbjct: 160 GQEGVEDGE-----FSSPTGIAVDKDGNIYVADTDNHSIQK 195
Score = 36.2 bits (82), Expect = 7.3, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 2/50 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+++ GDG G GS F P+ AVD GN YVAD+ N+ I+K
Sbjct: 293 IQKFKGDGTPLLKWGRDGSFDGAFFFPRGVAVDFVGNTYVADEGNNRIQK 342
>gi|37522270|ref|NP_925647.1| hypothetical protein gll2701 [Gloeobacter violaceus PCC 7421]
gi|35213270|dbj|BAC90642.1| gll2701 [Gloeobacter violaceus PCC 7421]
Length = 351
Score = 42.0 bits (97), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + ++ +AG G Y DG P ARF+ P AVD GN+ VAD NH IR I
Sbjct: 124 VGTDGRIETIAGTGEADYGGDGGPAKQARFNDPAGVAVDALGNVLVADTYNHRIRTI 180
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + ++ +AG G GYS DG P +A R D P AV G I +AD N+ IR I
Sbjct: 180 IGPDGTIRTVAGTGQAGYSGDGGPATAARLDFPWGVAVAPDGRILIADTGNNRIRSI 236
>gi|290986699|ref|XP_002676061.1| predicted protein [Naegleria gruberi]
gi|284089661|gb|EFC43317.1| predicted protein [Naegleria gruberi]
Length = 819
Score = 42.0 bits (97), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG+G QGY+ +G G + +P S +D GNIY ADK N +IRKI
Sbjct: 207 IAGNGKQGYNGEGLSGLDSALKEPYSVDMDSDGNIYFADKGNSIIRKI 254
>gi|324997987|ref|ZP_08119099.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Pseudonocardia sp. P1]
Length = 614
Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats.
Identities = 19/44 (43%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
Y+DG GSARF +P+ V + VAD NH +R+++ LD G
Sbjct: 231 YTDGPAGSARFSEPQGLLVLDPSTVLVADTVNHAVRRVS-LDDG 273
>gi|292491957|ref|YP_003527396.1| hypothetical protein Nhal_1896 [Nitrosococcus halophilus Nc4]
gi|291580552|gb|ADE15009.1| YD repeat protein [Nitrosococcus halophilus Nc4]
Length = 2539
Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 4/58 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S E ++ +AG+G+ GY DG+ G A+ + P + A+ N+Y+AD NH IR++
Sbjct: 1461 VSPEGIITTVAGNGILGY-DGDGGISTGAKLNLPIAVALSPNENLYIADYYNHRIRRV 1517
Score = 41.6 bits (96), Expect = 0.15, Method: Composition-based stats.
Identities = 19/56 (33%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + VV AG+G+ G+S G+ + +P + A+D GN+Y+A+ N+VIR++
Sbjct: 1294 IGTDGVVSIAAGNGIAGFSGTGDLATQAMMRPFAIALDSDGNLYIAESQNNVIRRV 1349
Score = 39.7 bits (91), Expect = 0.64, Method: Composition-based stats.
Identities = 18/55 (32%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 79 ESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ ++ +AG+G +G+S G AR + P A+ G++Y+AD NH IR ++
Sbjct: 1352 DGLITLVAGNGTRGFSGDGGLATQARLNHPNDLAIGPDGSLYIADYDNHRIRLVS 1406
>gi|290987760|ref|XP_002676590.1| predicted protein [Naegleria gruberi]
gi|284090193|gb|EFC43846.1| predicted protein [Naegleria gruberi]
Length = 836
Score = 42.0 bits (97), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 35/58 (60%), Gaps = 5/58 (8%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMK-GNIYVADKSNHVIRKI 130
+ +V+ +AG G QG+S G+ G +A+ +KP S D GNIY+AD NH IR I
Sbjct: 501 TSNEMVEVIAGIGFQGFS-GDGGLAINAQLNKPHSVHFDKNSGNIYIADSGNHRIRMI 557
>gi|301064161|ref|ZP_07204608.1| RHS repeat-associated core domain protein [delta proteobacterium
NaphS2]
gi|300441781|gb|EFK06099.1| RHS repeat-associated core domain protein [delta proteobacterium
NaphS2]
Length = 2050
Score = 41.6 bits (96), Expect = 0.14, Method: Composition-based stats.
Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ +AG+GV G+S DG P A A+D GN+Y+ D SNH +RK+
Sbjct: 917 IITTVAGNGVAGFSGDGGPAVEASLGWAMGIAIDSAGNLYILDGSNHRVRKV 968
Score = 37.7 bits (86), Expect = 2.2, Method: Composition-based stats.
Identities = 19/52 (36%), Positives = 29/52 (55%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ +AG G G+S DG P A P A+D GN+Y+ D N+ +RK+
Sbjct: 804 IITTVAGSGDYGFSGDGGPAIEASLSFPMGIAIDSAGNLYILDSDNNRVRKV 855
>gi|290972323|ref|XP_002668903.1| predicted protein [Naegleria gruberi]
gi|284082440|gb|EFC36159.1| predicted protein [Naegleria gruberi]
Length = 296
Score = 41.6 bits (96), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 25/35 (71%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
+A + P AV G++Y+ADKSNHVIRK++ L+
Sbjct: 108 NAELNFPSGVAVHSNGDVYIADKSNHVIRKVSALN 142
>gi|414163305|ref|ZP_11419552.1| hypothetical protein HMPREF9697_01453 [Afipia felis ATCC 53690]
gi|410881085|gb|EKS28925.1| hypothetical protein HMPREF9697_01453 [Afipia felis ATCC 53690]
Length = 357
Score = 41.6 bits (96), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 5/68 (7%)
Query: 80 SVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
S V ++ DG S GE GS +F+ P S A D KGN+YVAD+ N +I D
Sbjct: 184 SRVAKVDKDGNWIKSWGEKGSEPGQFNTPHSIATDAKGNVYVADRGNA---RIQVFDGDG 240
Query: 138 QLIRQINL 145
+ +RQ+ +
Sbjct: 241 KFLRQMKI 248
>gi|290973047|ref|XP_002669261.1| predicted protein [Naegleria gruberi]
gi|284082806|gb|EFC36517.1| predicted protein [Naegleria gruberi]
Length = 618
Score = 41.6 bits (96), Expect = 0.15, Method: Composition-based stats.
Identities = 23/62 (37%), Positives = 36/62 (58%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+S +G +A+ P S V + +Y+AD+ NH IRKI L++
Sbjct: 148 ENGNIITIAGNGTDGFSGDNGPATNAQLYYPSSVFVSLTNEVYIADQHNHRIRKI--LEN 205
Query: 136 GN 137
GN
Sbjct: 206 GN 207
Score = 38.1 bits (87), Expect = 1.7, Method: Composition-based stats.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 15/79 (18%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+S +G +A+F+ P S V + +AD NH IRKI L++
Sbjct: 39 ENGNIVTIAGNGTAGFSGDNGPATNAQFNYPSSVFVSSNNEVCIADLHNHRIRKI--LEN 96
Query: 136 G-----------NQLIRQI 143
G NQ IR+I
Sbjct: 97 GSIITIAGNGTHNQRIRKI 115
>gi|290979906|ref|XP_002672674.1| predicted protein [Naegleria gruberi]
gi|284086252|gb|EFC39930.1| predicted protein [Naegleria gruberi]
Length = 747
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+ DL + DS A + E ++ +AG+G+ G+ DG A+F+ +V G
Sbjct: 510 NGDLYICDSINHAIRKIDH---ETGIISTIAGNGIAGFKDGNASIAQFNSNLGLSVLPNG 566
Query: 116 NIYVADKSNHVIRK 129
++ +AD +N+ IRK
Sbjct: 567 DLLIADYNNNRIRK 580
>gi|167534306|ref|XP_001748831.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772793|gb|EDQ86441.1| predicted protein [Monosiga brevicollis MX1]
Length = 5844
Score = 41.6 bits (96), Expect = 0.16, Method: Composition-based stats.
Identities = 26/94 (27%), Positives = 43/94 (45%), Gaps = 10/94 (10%)
Query: 57 SDLIVLDSSR--------SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKS 108
SDL+VLD R ++ L + V +AG G++DG GS + ++P+
Sbjct: 896 SDLLVLDEKRILISDEANNSIRLLQLQKDKNWTVTTVAGSRTAGFADGRRGSTQLNQPRG 955
Query: 109 --FAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
F + + +AD NH IR + D +L+
Sbjct: 956 MCFLNQRRTVVAIADAGNHCIRALNTSDWSLKLV 989
>gi|428177953|gb|EKX46830.1| hypothetical protein GUITHDRAFT_70199, partial [Guillardia theta
CCMP2712]
Length = 305
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 46/96 (47%), Gaps = 6/96 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPL----SEESVVKRLAGDGVQGYSDGEPGSAR 102
P V P S L+V DS ++ S + V +AG+G G +DG +A+
Sbjct: 41 PSDVALSPDESFLVVADSRNHVIRRINKGAQGWSSNDVTVTTIAGNGQPGLADGSATTAQ 100
Query: 103 FDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGN 137
F KP + G + VAD +N+ IRK+T L GN
Sbjct: 101 FQKPLGVDISPDGTWVAVADSANNRIRKVT-LATGN 135
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 47/92 (51%), Gaps = 3/92 (3%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P SV P + L V D++ + +S P S K G+ GY++G +A ++ P
Sbjct: 159 PASVSFHPQGTGLAVADTNNNRIRWISLP-SYSVTTKSGRCLGIAGYANGLGKTAAYNSP 217
Query: 107 KSFAVDMKGN-IYVADKSNHVIRKITNLDHGN 137
+ V G IY+AD N+VIR+ +D+G+
Sbjct: 218 QYVRVSPDGVWIYIADGQNNVIRR-ARVDNGD 248
>gi|356571070|ref|XP_003553704.1| PREDICTED: NHL repeat-containing protein 2-like [Glycine max]
Length = 1098
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLI 140
+ ++ G +G DG A F++P+ A + K NI YVAD NH +R+I D N+ +
Sbjct: 644 IVQIGSSGEEGLQDGSFDDATFNRPQGLAYNAKKNILYVADTENHALREI---DFVNEKV 700
Query: 141 RQI 143
R +
Sbjct: 701 RTL 703
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
V +AG G G+ DG A+ +P KG +++AD +N +IR + +L+ +R
Sbjct: 879 VSTIAGTGKAGFKDGTAVKAQLSEPSGIVEGNKGRLFIADTNNSLIRYL-DLNINETELR 937
Query: 142 QINLK 146
+ LK
Sbjct: 938 TLELK 942
>gi|290978635|ref|XP_002672041.1| predicted protein [Naegleria gruberi]
gi|284085614|gb|EFC39297.1| predicted protein [Naegleria gruberi]
Length = 706
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 36/64 (56%), Gaps = 3/64 (4%)
Query: 75 PLSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT 131
P E + +AG+G++G+ +G A+ P AV+ N +Y+AD NH IRK++
Sbjct: 5 PKGMEQTIWTIAGNGIKGFYGDNGLAIEAKLSSPYGVAVNPYSNDVYIADYRNHCIRKVS 64
Query: 132 NLDH 135
LD+
Sbjct: 65 ALDN 68
Score = 40.8 bits (94), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 48/89 (53%), Gaps = 8/89 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARF 103
P+ V P S+D+ + D +S + ++ + +AG GV GYS G+ G A+
Sbjct: 38 PYGVAVNPYSNDVYIADYRNHCIRKVS---ALDNKITTIAGTGVAGYS-GDGGLALHAQL 93
Query: 104 DKPKSFAVDMKGN-IYVADKSNHVIRKIT 131
+ P S ++ K +Y+AD SNH IRKI+
Sbjct: 94 NCPLSVSIHPKNEELYIADYSNHRIRKIS 122
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 58/128 (45%), Gaps = 12/128 (9%)
Query: 23 SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
++G E G+ ++G P + P + D+ + DS+ ++ + ++
Sbjct: 194 TAGFCGEGGFATNAQLNG------PSGLDFNPTTGDVYIADSNNHRVRKVN---CKSGII 244
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQLIR 141
LAG G GYSDG A+ + P + +G IYV D+SN+ I ++ +D +
Sbjct: 245 TTLAGTGKAGYSDGI--DAQLNYPYDVSFCTRGQIIYVTDRSNNRICTVSPIDGRITTVC 302
Query: 142 QINLKPED 149
IN K D
Sbjct: 303 GINEKGFD 310
Score = 37.4 bits (85), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 44/88 (50%), Gaps = 8/88 (9%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARF 103
P V P + D+ ++D + +S + ++ AG+G G+ GE G +A+
Sbjct: 154 PQCVKIHPITFDIYIIDFVNNKIRRIS---NSSGIISTFAGNGTAGFC-GEGGFATNAQL 209
Query: 104 DKPKSFAVD-MKGNIYVADKSNHVIRKI 130
+ P + G++Y+AD +NH +RK+
Sbjct: 210 NGPSGLDFNPTTGDVYIADSNNHRVRKV 237
>gi|288923319|ref|ZP_06417453.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288345335|gb|EFC79730.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 752
Score = 41.6 bits (96), Expect = 0.16, Method: Composition-based stats.
Identities = 32/113 (28%), Positives = 51/113 (45%), Gaps = 27/113 (23%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRL---------AGDGVQGYS-DGEPGSA-RFDKPKS 108
+ +D++R Y +E + ++R+ AG+G G+ DG P +A +P
Sbjct: 423 LTIDTTRGVIYVAD---TENNRIRRIDRTGRITTVAGNGTAGFDGDGGPATAASLQEPSG 479
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMF 161
V G +++AD NH IR+I N +I I + QSGSG F
Sbjct: 480 VVVAPDGTLFIADNGNHRIRRIGT----NGVITTI---------AGQSGSGGF 519
>gi|30696124|ref|NP_564718.2| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|17529226|gb|AAL38840.1| unknown protein [Arabidopsis thaliana]
gi|332195279|gb|AEE33400.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 1055
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 43/95 (45%), Gaps = 4/95 (4%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHG 136
E + + ++ G +G+ DG A F++P+ A + K N +YVAD NH +R+I ++
Sbjct: 597 EGNFIVQIGSSGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREIDFVNER 656
Query: 137 NQLIRQINLKPEDCSKSSQSGSGMF---WVTVFSP 168
Q + K D + + W F P
Sbjct: 657 VQTLAGNGTKGSDYQGGRKGTKQLLNSPWDVCFEP 691
Score = 41.2 bits (95), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 35/61 (57%), Gaps = 4/61 (6%)
Query: 79 ESVVKR---LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
+ V KR LAG G G+ DG+ A+ +P A+ G ++VAD +N +IR I +L+
Sbjct: 830 DPVTKRVVTLAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI-DLNK 888
Query: 136 G 136
G
Sbjct: 889 G 889
>gi|444912221|ref|ZP_21232386.1| putative hemolysin [Cystobacter fuscus DSM 2262]
gi|444717129|gb|ELW57964.1| putative hemolysin [Cystobacter fuscus DSM 2262]
Length = 844
Score = 41.6 bits (96), Expect = 0.17, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 27/56 (48%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S V LAG G G DG A F P AV G +YVAD N ++R IT
Sbjct: 788 ISPAGEVTTLAGSGRFGARDGRGADADFVVPGGLAVGTDGTLYVADSGNALLRAIT 843
Score = 39.7 bits (91), Expect = 0.57, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 4/50 (8%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
VKR++ DG S R + P A D +GN+YV+D N+ IR+IT
Sbjct: 591 VKRVSPDG----SVSTLSHVRLNGPLGIAADAQGNVYVSDSDNYCIRRIT 636
>gi|168702582|ref|ZP_02734859.1| NHL repeat [Gemmata obscuriglobus UQM 2246]
Length = 599
Score = 41.6 bits (96), Expect = 0.17, Method: Composition-based stats.
Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 8/82 (9%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
V D PG L + DS+ ++ V+ G G G +DG A+FD P+
Sbjct: 269 VADEPGRR-LFIADSTHHRVVVTDLAGNKLHVI----GTGAPGKTDGPFDKAQFDDPQGM 323
Query: 110 AVDMKGN-IYVADKSNHVIRKI 130
AV +G+ ++VAD+ NH++R+I
Sbjct: 324 AV--RGDTLFVADRKNHLVREI 343
>gi|149176433|ref|ZP_01855047.1| serine/threonine protein kinase [Planctomyces maris DSM 8797]
gi|148844785|gb|EDL59134.1| serine/threonine protein kinase [Planctomyces maris DSM 8797]
Length = 368
Score = 41.6 bits (96), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG G +G+S DG P SA F +P S A+D N+Y+ D NH IR++
Sbjct: 141 VAGTGEKGFSGDGGPATSATFSRPHSIALDNDNNLYICDIGNHRIRRV 188
Score = 40.8 bits (94), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 19/67 (28%), Positives = 33/67 (49%), Gaps = 1/67 (1%)
Query: 70 YTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+T+ ++ ++ GDG +G DG+P R +P V GN+Y+ D + +R
Sbjct: 301 HTIRVIRKKDGTIETAVGDGKKGDGPDGDPAHCRMARPHGVYVGPNGNVYIGDSETYRVR 360
Query: 129 KITNLDH 135
K+T H
Sbjct: 361 KLTTGTH 367
Score = 36.6 bits (83), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 18/108 (16%)
Query: 39 DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL----------AGD 88
DG ++E P ++ P + D S + + Y L E + V R+ AG
Sbjct: 209 DGAKVEGTP---LNGPRALDFFTDGSGKGSLY---LALREGNKVYRIDLDDQTLHHIAGT 262
Query: 89 GVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
G +GY+ G A PK +V G++Y+AD +H IR I D
Sbjct: 263 GKKGYTGHGGPAKKATLSGPKGISVAPNGDVYLADTESHTIRVIRKKD 310
>gi|334116916|ref|ZP_08491008.1| NHL repeat containing protein [Microcoleus vaginatus FGP-2]
gi|333461736|gb|EGK90341.1| NHL repeat containing protein [Microcoleus vaginatus FGP-2]
Length = 546
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 59/143 (41%), Gaps = 9/143 (6%)
Query: 20 SLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEE 79
L G LL + +D L +SVI S L + DS + + E
Sbjct: 191 QLHQQGALLGEMSQGNESVDETVLPGPANSVIQNLVGSCLFIADSGHNRIVVSTL----E 246
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQ 138
V + G G G +DG+ A F P+ A D + I YVAD NH +RKI D Q
Sbjct: 247 GEVLHVIGSGQPGLTDGDFEEAEFFAPQGMAFDAESQILYVADTENHALRKI---DFTTQ 303
Query: 139 LIRQINLKPEDCSK-SSQSGSGM 160
+ + E + S +SG G+
Sbjct: 304 RVETVAGTGEQSHEISPRSGKGL 326
>gi|428185691|gb|EKX54543.1| hypothetical protein GUITHDRAFT_63359, partial [Guillardia theta
CCMP2712]
Length = 305
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 45/89 (50%), Gaps = 6/89 (6%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
++P + P S L+V DS ++S+ E VV AG G +G+ DG A+FD
Sbjct: 57 LHPKGICPLP--SGLLVCDSGHHRIRSVSY--DGERVVA-FAGSGKRGHRDGPVQVAQFD 111
Query: 105 KPKSFAV-DMKGNIYVADKSNHVIRKITN 132
P S V +I VAD N+ +R+I N
Sbjct: 112 TPCSICVCPSDKSIIVADSGNNAVRRIAN 140
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 9/86 (10%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG----DGVQGYSDGEPGSAR 102
P S+ P +IV DS +A ++ +V LAG D G+ DGE A+
Sbjct: 113 PCSICVCPSDKSIIVADSGNNAVRRIA-----NGMVTTLAGGSGPDRAGGFVDGESEGAK 167
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIR 128
F +P D + + V D NH +R
Sbjct: 168 FRRPTFVMFDKEETLLVIDSGNHCLR 193
>gi|386722458|ref|YP_006188784.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus K02]
gi|384089583|gb|AFH61019.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus K02]
Length = 537
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 4/45 (8%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AGD ++DG A+F++P A+D KGN+YV+D N +R I
Sbjct: 235 AGD----FADGPLAQAKFNEPTGIALDAKGNLYVSDSGNQRVRYI 275
Score = 40.0 bits (92), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 47/106 (44%), Gaps = 8/106 (7%)
Query: 29 EDGYTVTTVIDGHQ---LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
+ GY V ++DG L P + P S L V D+ A + + V +
Sbjct: 109 DKGYPVGGLLDGAANASLFQEPLGLSAGPDGS-LYVADAGNHAIRRID----AKGNVTTV 163
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
AG G G DG+ +A F +P A G +YVAD H IR+I+
Sbjct: 164 AGSGRLGVKDGKGAAAEFYRPGDVAAAPDGTLYVADTLGHTIRRIS 209
>gi|428317077|ref|YP_007114959.1| NHL repeat containing protein [Oscillatoria nigro-viridis PCC 7112]
gi|428240757|gb|AFZ06543.1| NHL repeat containing protein [Oscillatoria nigro-viridis PCC 7112]
Length = 546
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 51/115 (44%), Gaps = 9/115 (7%)
Query: 48 HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK 107
+SVI S L + DS + + E V + G G G +DG+ A F P+
Sbjct: 219 NSVIQNLVGSCLFIADSGHNRIVVSTL----EGEVLHVIGSGKPGLTDGDFEEAEFFAPQ 274
Query: 108 SFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSK-SSQSGSGM 160
A D + I YVAD NH +RKI D Q + + E + S +SG G+
Sbjct: 275 GMAFDAESQILYVADTENHALRKI---DFTTQRVETVAGTGEQSHEISPRSGKGL 326
Score = 36.6 bits (83), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + GDG G D + ARF +P + + +++VAD +NHVIR++
Sbjct: 476 RTMIGDGTAGLVDAKGLKARFFEPSGLSA-IGPHLFVADTNNHVIRRV 522
>gi|290995422|ref|XP_002680294.1| predicted protein [Naegleria gruberi]
gi|284093914|gb|EFC47550.1| predicted protein [Naegleria gruberi]
Length = 1170
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 53/109 (48%), Gaps = 26/109 (23%)
Query: 48 HSVIDRPGSSDLIVLDS-----SRSAFYT-LSFPLSEESV---------VKRLAGD---- 88
+S++DR + +I LD+ + ++ YT +S SE S +K+L
Sbjct: 281 NSLVDRVAGTGIIELDTDGKLGNLTSLYTPISVTQSESSSDIYIGTTIGIKKLTDSNKLI 340
Query: 89 ----GVQGYS--DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G Q YS DG P +A R P+ A+ G ++++DK +H IRKI
Sbjct: 341 SSFAGSQAYSSGDGYPATAARLSSPEGIAISTSGEVFISDKGSHTIRKI 389
Score = 37.4 bits (85), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 17/56 (30%), Positives = 30/56 (53%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ + V+ +AG G GY DG A+ + P A G++ + + SN+ IRK++
Sbjct: 389 IDSKGVISTVAGTGSAGYVDGPALKAKLNGPGFLAFTPNGDLLITESSNNRIRKLS 444
>gi|390369862|ref|XP_798415.2| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
purpuratus]
Length = 322
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 41/86 (47%), Gaps = 9/86 (10%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V P + L V D+ + ++ + VV+ G G++DG ARF P
Sbjct: 173 PGKVTTNPEGTLLAVSDTGHNRVIIVAL----DGVVQHCIGGPETGFNDGLYQEARFHSP 228
Query: 107 KS--FAVDMKGNIYVADKSNHVIRKI 130
+ +A D+ IYVAD NH IRKI
Sbjct: 229 QGLCWAQDV---IYVADTENHAIRKI 251
>gi|430741385|ref|YP_007200514.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
gi|430013105|gb|AGA24819.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
Length = 361
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 44/86 (51%), Gaps = 7/86 (8%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SAR 102
I+P +V G ++ +LD +A + ++ L G G +G S DG PG A+
Sbjct: 218 IDPRAVATD-GKGNVYILDRGGNALRVVD----RSGKIRTLVGTGEKGASGDGGPGRQAK 272
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIR 128
+ PK V+ +G++ +AD NH IR
Sbjct: 273 LNGPKHLCVNPQGDVIIADTENHRIR 298
>gi|290982061|ref|XP_002673749.1| predicted protein [Naegleria gruberi]
gi|284087335|gb|EFC41005.1| predicted protein [Naegleria gruberi]
Length = 185
Score = 41.2 bits (95), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 2/59 (3%)
Query: 74 FPLSEESVVKRLAGDGV-QGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
F + V + +AG+G+ Q DG G+ A + P+ A+DM GN+Y+A++ H IR I
Sbjct: 16 FTSHAQYVTQTVAGNGISQSLGDGFLGTQASLNSPRDVALDMYGNMYIAEEKAHKIRFI 74
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 36/65 (55%), Gaps = 5/65 (7%)
Query: 70 YTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKG-NIYVADKSNH 125
+ + F ++ L G+G+ G S G+ G +A+ P S A+D G N++++D NH
Sbjct: 69 HKIRFINKTTGIISTLVGNGIAG-SVGDGGLAINAQLHLPFSLAIDSTGQNLFISDSGNH 127
Query: 126 VIRKI 130
VIR I
Sbjct: 128 VIRHI 132
>gi|390559793|ref|ZP_10244078.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
gi|390173637|emb|CCF83377.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
Length = 442
Score = 41.2 bits (95), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ + L + DS + +E ++ G G G +DG+ +A+F+ P
Sbjct: 128 PGKVLADAAGNRLFIADSGHNRIVVTRLDGAEPWMI----GSGQPGLADGDAATAQFNHP 183
Query: 107 KSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTV 165
+ A + G+I Y+AD NHVIR++ + RQ+ E + + + GM + +
Sbjct: 184 QGMA--LAGDILYLADTRNHVIRRV------DLASRQV----ETIAGTGE--PGMSYASA 229
Query: 166 FSPYLISCKSFW 177
S I +S W
Sbjct: 230 GSARTIDLRSPW 241
Score = 36.2 bits (82), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 33/60 (55%), Gaps = 4/60 (6%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQLIRQINL 145
GDG G DG +ARF +P V + G+ +Y+AD +NH IR + L+ G +I L
Sbjct: 386 GDGTPGDRDGVREAARFHEPG--GVSLAGDRLYIADTNNHAIR-VAELETGRVTTLRITL 442
>gi|242017281|ref|XP_002429120.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212513984|gb|EEB16382.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 791
Score = 41.2 bits (95), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 26/50 (52%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
E VVK G QG+ DG A FD P A + +Y+AD +NH IR
Sbjct: 237 EGVVKYTIGGKEQGFKDGNFQEALFDTPHGLAFFNESILYIADTNNHAIR 286
>gi|395225280|ref|ZP_10403807.1| NHL repeat protein, partial [Thiovulum sp. ES]
gi|394446527|gb|EJF07348.1| NHL repeat protein, partial [Thiovulum sp. ES]
Length = 4349
Score = 41.2 bits (95), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 25/40 (62%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G DG + RF P+ A D +GN++VAD +N I+KIT
Sbjct: 795 GDEDGNLSTVRFYNPRGLAFDEEGNLFVADSNNSKIKKIT 834
>gi|12323029|gb|AAG51506.1|AC058785_9 hypothetical protein [Arabidopsis thaliana]
Length = 1041
Score = 41.2 bits (95), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 35/61 (57%), Gaps = 4/61 (6%)
Query: 79 ESVVKR---LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
+ V KR LAG G G+ DG+ A+ +P A+ G ++VAD +N +IR I +L+
Sbjct: 816 DPVTKRVVTLAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI-DLNK 874
Query: 136 G 136
G
Sbjct: 875 G 875
Score = 41.2 bits (95), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
E + + ++ G +G+ DG A F++P+ A + K N +YVAD NH +R+I
Sbjct: 597 EGNFIVQIGSSGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREI 650
>gi|291239557|ref|XP_002739689.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 744
Score = 41.2 bits (95), Expect = 0.21, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 30/57 (52%), Gaps = 4/57 (7%)
Query: 87 GDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
GDG YS G+ GS R P + D GN+YV D +NH I+K + HG L R
Sbjct: 650 GDGEYLYSFGDCGNGSDRLQNPLGVSTDRDGNVYVCDHNNHRIQKYDS--HGKYLTR 704
>gi|147918859|ref|YP_687415.1| hypothetical protein RRC374 [Methanocella arvoryzae MRE50]
gi|110622811|emb|CAJ38089.1| hypothetical protein RRC374 [Methanocella arvoryzae MRE50]
Length = 514
Score = 41.2 bits (95), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 27/39 (69%), Gaps = 3/39 (7%)
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
+PG FD+P+ AVD +GNIYV+D NH I ++ N+ H
Sbjct: 363 QPG--EFDRPRGIAVDGRGNIYVSDSYNHRI-QVWNVAH 398
Score = 37.0 bits (84), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 5/44 (11%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR---KITN 132
G D +PG +F P+ AVD GNIYVAD N+ I+ K TN
Sbjct: 170 GSPDDDPG--QFSSPRGIAVDGAGNIYVADTYNYRIQVWNKTTN 211
>gi|406947628|gb|EKD78528.1| NHL repeat containing protein [uncultured bacterium]
Length = 810
Score = 41.2 bits (95), Expect = 0.22, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 34/53 (64%), Gaps = 3/53 (5%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
+ VK +AGDG +GY++G+ + +FD P A KG N+ +AD N +IRK+
Sbjct: 657 DEAVKLVAGDGNRGYNNGDRDNTQFDNPVGLA--RKGDNLLIADSLNDLIRKV 707
Score = 38.5 bits (88), Expect = 1.4, Method: Composition-based stats.
Identities = 23/75 (30%), Positives = 36/75 (48%), Gaps = 5/75 (6%)
Query: 60 IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
IV D + S + +F ++ VV + G G G + G S +F P + +D
Sbjct: 51 IVSDGAGSFYLADTFNGVIRKINSNGVVSTVVGAGGYGDTTGSGSSTKFALPAAVGLDSS 110
Query: 115 GNIYVADKSNHVIRK 129
GN+Y+AD N I+K
Sbjct: 111 GNVYIADTGNGKIKK 125
Score = 35.8 bits (81), Expect = 9.3, Method: Composition-based stats.
Identities = 16/40 (40%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
GY D A+F+ P + +D N +YVAD+ NH IR++
Sbjct: 559 GYQDATRKDAQFNVPIAVTIDNADNYLYVADRDNHTIRQV 598
>gi|344923601|ref|ZP_08777062.1| hypothetical protein COdytL_03000 [Candidatus Odyssella
thessalonicensis L13]
Length = 1165
Score = 41.2 bits (95), Expect = 0.22, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 8/58 (13%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPED 149
G +G +A FD P D G++Y+AD NHVIRKI +IN+ P +
Sbjct: 332 GLVNGSGATALFDGPSGMVQDADGSLYIADSYNHVIRKID--------ANKININPSN 381
Score = 39.7 bits (91), Expect = 0.53, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 3/56 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V +AG G G+++G GSA+F+ P+ V G + VAD +N IR + D GN
Sbjct: 100 VATVAGTGTPGHTNGPGGSAQFNMPQGITVMNDGRLLVADTNNDRIRLV---DPGN 152
>gi|282879766|ref|ZP_06288496.1| NHL repeat protein [Prevotella timonensis CRIS 5C-B1]
gi|281306435|gb|EFA98465.1| NHL repeat protein [Prevotella timonensis CRIS 5C-B1]
Length = 486
Score = 41.2 bits (95), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 1/58 (1%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
P ++ +V + G G+ +G P A+F P+ AV+ G+I YVAD N IRK+T
Sbjct: 427 PDNKATVTTVIGLPGSAGFKEGGPDVAQFRYPRGVAVNADGSIVYVADTGNRCIRKLT 484
>gi|390957686|ref|YP_006421443.1| gluconolactonase [Terriglobus roseus DSM 18391]
gi|390412604|gb|AFL88108.1| gluconolactonase [Terriglobus roseus DSM 18391]
Length = 738
Score = 41.2 bits (95), Expect = 0.23, Method: Composition-based stats.
Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 2/59 (3%)
Query: 76 LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+S V+ +AG G QGY G +A + P + AV G++Y AD NH IR+I +
Sbjct: 96 VSSGGVLAVVAGTGHQGYVGDGGAATAAELNAPTAVAVAPDGSVYFADSGNHCIRRIAS 154
>gi|380511998|ref|ZP_09855405.1| nhl repeat protein [Xanthomonas sacchari NCPPB 4393]
Length = 473
Score = 40.8 bits (94), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
S V R G G + DG G A F +P+ A++ +G +YVAD NH +R+I L
Sbjct: 194 STNGRVLRQFGIGTADFIDGAIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLL 249
>gi|348681654|gb|EGZ21470.1| hypothetical protein PHYSODRAFT_329412 [Phytophthora sojae]
Length = 251
Score = 40.8 bits (94), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 68/138 (49%), Gaps = 16/138 (11%)
Query: 6 VYSSLAITLLSLLFSLVSSGLLLE--DGYTVTTVIDGHQLEINPHSVIDRPGSSDL---- 59
+ S+ IT+ SL SLV G+ ++ + V+ + + ++I P + +S +
Sbjct: 55 ITSTDIITIASLADSLVLQGITVDSSENVYVSRLTEFDVVKIAPDGSVSIFANSSMFTLA 114
Query: 60 --IVLDSSRSAFYT-----LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVD 112
I +DSS + + T L F L+ E V +AG VQG+ + ARF P + +
Sbjct: 115 MGIAIDSSDNIYITDQHRILKFTLAGEMGV--VAGR-VQGFLNALGELARFSTPWALTIG 171
Query: 113 MKGNIYVADKSNHVIRKI 130
GN+YV D N+ IRK+
Sbjct: 172 SDGNLYVVDSDNNCIRKV 189
>gi|290986946|ref|XP_002676184.1| NHL repeat-containing protein [Naegleria gruberi]
gi|284089785|gb|EFC43440.1| NHL repeat-containing protein [Naegleria gruberi]
Length = 1290
Score = 40.8 bits (94), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 33/62 (53%), Gaps = 2/62 (3%)
Query: 71 TLSFPLSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIR 128
T+ + +S +V AG G GY +D P SA + PK A G++++ D NH IR
Sbjct: 450 TVKYLISSSGIVITKAGSGTTGYNADNIPATSAHLNFPKGIARSTNGDLFITDSGNHRIR 509
Query: 129 KI 130
K+
Sbjct: 510 KV 511
>gi|268574858|ref|XP_002642408.1| C. briggsae CBR-NHL-1 protein [Caenorhabditis briggsae]
Length = 979
Score = 40.8 bits (94), Expect = 0.25, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
L GD + + G+A+F+ P AVD G+IYVADK NH ++ D Q I
Sbjct: 792 LEGDHLFSFGGHGQGNAKFNNPWGVAVDDLGSIYVADKDNH---RVQVFDKNGQFI 844
>gi|341877854|gb|EGT33789.1| CBN-NHL-1 protein [Caenorhabditis brenneri]
Length = 976
Score = 40.8 bits (94), Expect = 0.25, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
L GD + + G+A+F+ P AVD G+IYVADK NH ++ D Q I
Sbjct: 789 LEGDHLFSFGGHGQGNAKFNNPWGVAVDDLGSIYVADKDNH---RVQVFDKNGQFI 841
>gi|423298428|ref|ZP_17276486.1| hypothetical protein HMPREF1070_05151 [Bacteroides ovatus
CL03T12C18]
gi|392663340|gb|EIY56891.1| hypothetical protein HMPREF1070_05151 [Bacteroides ovatus
CL03T12C18]
Length = 507
Score = 40.8 bits (94), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 23/41 (56%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
GY DG ARF + + D GNIY+ D+ NH IR + N
Sbjct: 413 GYWDGIFAEARFLRGEQMCSDDAGNIYIVDQDNHAIRMVNN 453
>gi|290982352|ref|XP_002673894.1| predicted protein [Naegleria gruberi]
gi|284087481|gb|EFC41150.1| predicted protein [Naegleria gruberi]
Length = 2807
Score = 40.8 bits (94), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG G GYS G +A+ P++ AV G IY+AD NH IRKI
Sbjct: 572 ISTIAGTGSVGYSGDGGLATNAQLYYPQTVAVSSSGEIYIADAYNHRIRKI 622
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG G GY+ DG SA+ + P A+ G I +AD +NH IRKI
Sbjct: 902 ISTIAGTGTGGYNGDGVLATSAQLNYPNGIAIQENGEILIADNNNHRIRKI 952
Score = 40.0 bits (92), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
G+ DG+ + F SF + G I +AD +NH IRKIT L +
Sbjct: 802 GFGDGDMATTSFISAYSFEFTLNGEIIIADSNNHRIRKITTLGY 845
Score = 39.7 bits (91), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 82 VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG G GYS G SA+ P ++ G IY+AD NH IRK+T
Sbjct: 251 ISTVAGSGTAGYSGDGGLATSAQLYYPLGVSISSAGEIYIADYYNHRIRKVT 302
Score = 39.3 bits (90), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 80 SVVKRLAGDG-VQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
SV ++AG G + GY DG SAR + P A G++Y+AD +N+ IRK+T
Sbjct: 457 SVASKVAGTGTLNGYGGDGNLATSARLNYPYGTAFSSNGDMYIADTNNNRIRKVT 511
Score = 39.3 bits (90), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG G GYS G SA+ P S A+ G IY+AD NH IRKI
Sbjct: 628 ISTIAGTGSVGYSGDGGLATSAQLYYPFSVAISSVGEIYIADTYNHRIRKI 678
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/141 (23%), Positives = 63/141 (44%), Gaps = 26/141 (18%)
Query: 10 LAITLLSLLFSLVSSGLLLEDG----YTVTTVIDG------------HQLEINPHSVIDR 53
+ + LL++L +V S ++ Y +TT+ G QL I I
Sbjct: 1 MKVVLLAILLCIVMSAKFIDTQSLLLYNITTIAGGGSVVGDGLLATNAQLAIPAGVAISS 60
Query: 54 PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFA 110
G ++I+ DS+ + +S + +AG G + G+ G +A+ + P + +
Sbjct: 61 IG--EIIIADSNNNRIR----KISTNGYISTIAGTGSASFG-GDGGLAINAQLNSPSAVS 113
Query: 111 VDMKGNIYVADKSNHVIRKIT 131
+ G++Y++D N IRKI+
Sbjct: 114 ISSNGDMYISDTGNQRIRKIS 134
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 22/31 (70%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
SA+ + P+S A+ G IY+AD +NH IRKI
Sbjct: 536 SAQLNYPRSVAISSSGEIYIADFNNHRIRKI 566
>gi|320103301|ref|YP_004178892.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
gi|319750583|gb|ADV62343.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
Length = 676
Score = 40.8 bits (94), Expect = 0.27, Method: Composition-based stats.
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
L V D+ + + F + ++ L GD +G +D P RFD+P +V + G +Y
Sbjct: 441 LYVADTYNNKIKVIDFS-ARTKTIRTLVGDRRRGDTDDPP---RFDQPGGLSV-LGGQLY 495
Query: 119 VADKSNHVIRKI 130
VAD +NH IR +
Sbjct: 496 VADTNNHKIRVV 507
Score = 40.4 bits (93), Expect = 0.38, Method: Composition-based stats.
Identities = 19/50 (38%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
+K + G+G G DG A F++P+ +D ++ +YVAD NH IR I
Sbjct: 236 LKAVVGNGKIGMRDGAYERASFNRPQGIRLDALRNRLYVADTENHAIRAI 285
>gi|453054414|gb|EMF01867.1| redoxin domain-containing protein [Streptomyces mobaraensis NBRC
13819 = DSM 40847]
Length = 608
Score = 40.8 bits (94), Expect = 0.27, Method: Composition-based stats.
Identities = 28/82 (34%), Positives = 42/82 (51%), Gaps = 7/82 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG L V D++R L+ E+VV+R+ G G +G+ D ARF +P
Sbjct: 183 PGKALALPGGGYL-VSDTTRHQLVELA--ADGETVVRRI-GTGERGFGDS---PARFSEP 235
Query: 107 KSFAVDMKGNIYVADKSNHVIR 128
+ A+ G + VAD NH +R
Sbjct: 236 QGLALLPDGRVAVADTVNHALR 257
>gi|421872066|ref|ZP_16303685.1| NHL repeat family protein [Brevibacillus laterosporus GI-9]
gi|372458678|emb|CCF13234.1| NHL repeat family protein [Brevibacillus laterosporus GI-9]
Length = 633
Score = 40.8 bits (94), Expect = 0.27, Method: Composition-based stats.
Identities = 19/44 (43%), Positives = 27/44 (61%), Gaps = 5/44 (11%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
G +G DG+ F++P A+D K NIYV D++NH I+K N
Sbjct: 487 GQKGTKDGQ-----FNQPMQLAIDSKDNIYVVDRNNHRIQKFDN 525
>gi|348681648|gb|EGZ21464.1| hypothetical protein PHYSODRAFT_329405 [Phytophthora sojae]
Length = 145
Score = 40.8 bits (94), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 5/76 (6%)
Query: 60 IVLDSSRSAFY-----TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
I +DSS S + L ++ + VK +AG G +G+ +G SARF P +
Sbjct: 3 IAIDSSDSIYVLDMHRILKVTVTLDGDVKVVAGSGAKGFVNGFGESARFSTPWALTFGSD 62
Query: 115 GNIYVADKSNHVIRKI 130
G +YV D N IRK+
Sbjct: 63 GYLYVPDLDNDCIRKV 78
>gi|298386035|ref|ZP_06995592.1| hypothetical protein HMPREF9007_02755 [Bacteroides sp. 1_1_14]
gi|298261263|gb|EFI04130.1| hypothetical protein HMPREF9007_02755 [Bacteroides sp. 1_1_14]
Length = 462
Score = 40.8 bits (94), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 84 RLAGDGVQGYSDGE-PGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
R GDG +G++DG+ ARF+ P S D ++G + V D +NH IRKI
Sbjct: 410 RPNGDGTKGFNDGDLRTQARFNYPASIVYDEVRGCLLVGDSNNHRIRKI 458
>gi|17553622|ref|NP_499028.1| Protein NHL-1 [Caenorhabditis elegans]
gi|6226931|sp|Q03601.2|NHL1_CAEEL RecName: Full=RING finger protein nhl-1
gi|3877606|emb|CAA79562.1| Protein NHL-1 [Caenorhabditis elegans]
Length = 974
Score = 40.8 bits (94), Expect = 0.27, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
L GD + + G+A+F+ P AVD G+IYVADK NH ++ D Q I
Sbjct: 787 LEGDHLFSFGGHGQGNAKFNNPWGVAVDDLGSIYVADKDNH---RVQVFDKNGQFI 839
>gi|308799191|ref|XP_003074376.1| NHL repeat (ISS) [Ostreococcus tauri]
gi|116000547|emb|CAL50227.1| NHL repeat (ISS) [Ostreococcus tauri]
Length = 1783
Score = 40.8 bits (94), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 41/87 (47%), Gaps = 11/87 (12%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS--------ARFDKPK 107
+S +V DS + + SE V AG+G +G +DG+ + A F P
Sbjct: 201 ASTCVVGDSGNNVVRVIDLGKSE---VTNFAGNGTEGSTDGKTSANAELGETQALFKGPT 257
Query: 108 SFAVDMKGNIYVADKSNHVIRKITNLD 134
S A+ G++ VAD N +R I+N D
Sbjct: 258 SVALLASGDVLVADTGNKAVRNISNGD 284
Score = 40.0 bits (92), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 46/100 (46%), Gaps = 7/100 (7%)
Query: 34 VTTVIDGHQLEINP-HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
VTTV+ L P V+D D+ VL S+ +S +S +V G
Sbjct: 285 VTTVV--ASLPFAPIELVVDESNGGDMYVLGQSQHGVMKIS--VSTLAVTTIAGSQTTSG 340
Query: 93 YSDGEPG-SARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
+ D G SARF P+ A+D + +YVAD NH +R I
Sbjct: 341 FVDHNTGTSARFTLPRGLALDSLNSKLYVADTGNHAVRMI 380
>gi|290976772|ref|XP_002671113.1| predicted protein [Naegleria gruberi]
gi|284084679|gb|EFC38369.1| predicted protein [Naegleria gruberi]
Length = 520
Score = 40.8 bits (94), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + ++ +AG G GYS G+ G SA KP AVD G IY D +N+ +RK+
Sbjct: 305 ITPDGIITTIAGTGNFGYS-GDGGLATSADISKPTGIAVDSNGTIYFCDNNNNRVRKL 361
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 29/53 (54%), Gaps = 4/53 (7%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG G GYS G+ G +A PKS A+ G I+ D N VIR+IT
Sbjct: 255 ITTIAGTGAYGYS-GDGGLAINAVIRYPKSIAIGKHGEIFFTDSDNQVIRRIT 306
>gi|308501841|ref|XP_003113105.1| CRE-NHL-1 protein [Caenorhabditis remanei]
gi|308265406|gb|EFP09359.1| CRE-NHL-1 protein [Caenorhabditis remanei]
Length = 986
Score = 40.8 bits (94), Expect = 0.28, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
L GD + + G+A+F+ P AVD G+IYVADK NH ++ D Q I
Sbjct: 799 LEGDHLFSFGGHGQGNAKFNNPWGVAVDDLGSIYVADKDNH---RVQVFDKNGQFI 851
>gi|410096564|ref|ZP_11291551.1| hypothetical protein HMPREF1076_00729 [Parabacteroides goldsteinii
CL02T12C30]
gi|409226528|gb|EKN19437.1| hypothetical protein HMPREF1076_00729 [Parabacteroides goldsteinii
CL02T12C30]
Length = 457
Score = 40.8 bits (94), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 25/43 (58%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G GY DG+P A F+KP A G +YVAD N++IR I
Sbjct: 413 GESGYKDGKPQEALFNKPYDVAATPDGILYVADTYNYLIRCIA 455
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 33/58 (56%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
S P ++ ++ + G G +G +A+ + + A D +GN+Y+AD +NHVI K+
Sbjct: 342 STPWITQNEIEHIVGTGKGAAKEGNGKAAQLGEIEGMAADQEGNVYLADYTNHVIWKV 399
>gi|402571882|ref|YP_006621225.1| hypothetical protein Desmer_1347 [Desulfosporosinus meridiei DSM
13257]
gi|402253079|gb|AFQ43354.1| conserved repeat protein [Desulfosporosinus meridiei DSM 13257]
Length = 2025
Score = 40.8 bits (94), Expect = 0.29, Method: Composition-based stats.
Identities = 20/41 (48%), Positives = 26/41 (63%), Gaps = 5/41 (12%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
G G GE F++P+ AVD +GNIYVAD +NH I+K
Sbjct: 966 GSNGSDQGE-----FNRPQGIAVDSEGNIYVADSNNHRIQK 1001
Score = 38.5 bits (88), Expect = 1.4, Method: Composition-based stats.
Identities = 18/43 (41%), Positives = 24/43 (55%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
G+ + + G +F PKS VD GNIYVAD +N I+K
Sbjct: 770 GEYLMKFGSSGSGEGQFRNPKSVTVDSAGNIYVADTTNKRIQK 812
Score = 37.0 bits (84), Expect = 4.5, Method: Composition-based stats.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LA G +G S G+ FD P A D GN+YVAD +N+ I+K+
Sbjct: 1199 LAKWGTRGNSAGQ-----FDNPYGIAADSAGNVYVADTNNNRIQKL 1239
Score = 36.2 bits (82), Expect = 6.4, Method: Composition-based stats.
Identities = 17/43 (39%), Positives = 21/43 (48%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
G QG+ F KP AVD GNIYV + NH ++K
Sbjct: 1053 GGNPQGFGSTGTDEGEFKKPSGVAVDSDGNIYVVEAMNHRMQK 1095
Score = 35.8 bits (81), Expect = 8.7, Method: Composition-based stats.
Identities = 19/38 (50%), Positives = 24/38 (63%), Gaps = 2/38 (5%)
Query: 96 GEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G PG+ ++F P AVD GN+YVAD S + IRK T
Sbjct: 1156 GSPGAGDSQFLFPHGIAVDSAGNVYVADTSANWIRKYT 1193
>gi|309790072|ref|ZP_07684646.1| NHL repeat-containing protein [Oscillochloris trichoides DG-6]
gi|308227927|gb|EFO81581.1| NHL repeat-containing protein [Oscillochloris trichoides DG6]
Length = 1124
Score = 40.8 bits (94), Expect = 0.29, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 2/37 (5%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
GY+ GEPG+ F++P AVD +GN+YVAD N ++
Sbjct: 980 GYAGGEPGA--FNEPVGVAVDAQGNVYVADTWNSRVQ 1014
>gi|290991466|ref|XP_002678356.1| predicted protein [Naegleria gruberi]
gi|284091968|gb|EFC45612.1| predicted protein [Naegleria gruberi]
Length = 1017
Score = 40.8 bits (94), Expect = 0.29, Method: Composition-based stats.
Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 82 VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG+G+ G+S +G A F+ AV G++YVAD NH IRKI+
Sbjct: 574 ISTIAGNGIVGFSGDNGLATKATFNSRNGIAVASNGDVYVADTRNHRIRKIS 625
Score = 40.4 bits (93), Expect = 0.40, Method: Composition-based stats.
Identities = 17/35 (48%), Positives = 25/35 (71%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
+A + P AV G++Y+ADKSNHVIRK++ L+
Sbjct: 108 NAELNFPSGVAVHSNGDVYIADKSNHVIRKVSALN 142
Score = 39.3 bits (90), Expect = 0.74, Method: Composition-based stats.
Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 7/88 (7%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ--GYSDGEPGSAR 102
+N +S P S + I+ DS+ + L +S ++ +AG G G + A+
Sbjct: 820 LNSNSFTTTP-SGEFIIADSNNN----LIRKISTSGIISTIAGTGAATFGGDNANATIAK 874
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ P + AV G I++AD +NH IRKI
Sbjct: 875 LNNPLNVAVSSSGEIFIADTNNHRIRKI 902
Score = 37.0 bits (84), Expect = 4.0, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG+G GYS DG + + + P + AV G I++ D NH IRKI
Sbjct: 908 ITTIAGNGTAGYSGDGLDSTKCQLNYPSAVAVSSGGEIFIVDTHNHRIRKI 958
Score = 37.0 bits (84), Expect = 4.1, Method: Composition-based stats.
Identities = 34/92 (36%), Positives = 46/92 (50%), Gaps = 6/92 (6%)
Query: 49 SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDK 105
SVI P S + S A Y SF S + + +AG G G G+ G SAR K
Sbjct: 429 SVISAPLPS-AVGYKSKLHAMYLDSFLGSLATKITVVAGTGKLGGYAGDGGLATSARIQK 487
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
P S ++ + N+Y+ D NH IRK++ L GN
Sbjct: 488 PTSVVLNDQ-NLYIVDTLNHRIRKVS-LTFGN 517
>gi|108761192|ref|YP_633033.1| lipoprotein [Myxococcus xanthus DK 1622]
gi|108465072|gb|ABF90257.1| putative lipoprotein [Myxococcus xanthus DK 1622]
Length = 868
Score = 40.8 bits (94), Expect = 0.29, Method: Composition-based stats.
Identities = 21/50 (42%), Positives = 26/50 (52%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG G G+SDG +A P V G +YVAD N ++R IT
Sbjct: 818 VTTLAGSGRYGHSDGPGDAADLVLPAGLTVGPDGRLYVADAGNSLVRAIT 867
>gi|290971625|ref|XP_002668588.1| predicted protein [Naegleria gruberi]
gi|284082057|gb|EFC35844.1| predicted protein [Naegleria gruberi]
Length = 731
Score = 40.8 bits (94), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 30/54 (55%), Gaps = 2/54 (3%)
Query: 79 ESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
S + +AG+G GYS DG P +A+ + P V G +Y+AD N +RK+
Sbjct: 177 NSQLSTVAGNGYAGYSGDGGPARNAKLNYPADVTVSSTGEVYIADWGNRAVRKV 230
>gi|224536973|ref|ZP_03677512.1| hypothetical protein BACCELL_01849 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521412|gb|EEF90517.1| hypothetical protein BACCELL_01849 [Bacteroides cellulosilyticus
DSM 14838]
Length = 446
Score = 40.8 bits (94), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 27/40 (67%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+GY+DG A+F++P+ D N YV D++N++IRK+
Sbjct: 347 KGYTDGPGSKAQFNEPQQGCFDNDDNFYVCDQNNNLIRKV 386
>gi|395226792|ref|ZP_10405227.1| Cadherin domain-containing protein, partial [Thiovulum sp. ES]
gi|394444937|gb|EJF05961.1| Cadherin domain-containing protein, partial [Thiovulum sp. ES]
Length = 1326
Score = 40.8 bits (94), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++++ + ++ G G G++DG+ A+ P F D NI+VAD +N +RKIT
Sbjct: 205 INKDGEISKIVG-GSYGFADGDSSIAQLKSPSGFDFDKNWNIFVADTNNGRVRKIT 259
>gi|225450563|ref|XP_002277564.1| PREDICTED: NHL repeat-containing protein 2-like [Vitis vinifera]
Length = 1078
Score = 40.8 bits (94), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 24/89 (26%), Positives = 40/89 (44%), Gaps = 4/89 (4%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQ 142
++ G +G DG A F++P+ A + K N+ YVAD NH +R+I ++ Q +
Sbjct: 627 QIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAG 686
Query: 143 INLKPEDCSKSSQSGSGMF---WVTVFSP 168
K D + + + W F P
Sbjct: 687 NGTKGSDYQGGGKGATQLLNSPWDVCFEP 715
>gi|158318845|ref|YP_001511353.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158114250|gb|ABW16447.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 770
Score = 40.8 bits (94), Expect = 0.30, Method: Composition-based stats.
Identities = 38/113 (33%), Positives = 52/113 (46%), Gaps = 14/113 (12%)
Query: 30 DGYTVTTVIDGHQLEINPHSVIDRPGSSDLIV-LDSSRSAFYTLSFPLSEESVVKRLAGD 88
DG T+TTV L P +V PG LI DS R + L + +AG
Sbjct: 565 DG-TITTVPTPAALR-RPRAVTTGPGGVLLIADTDSHRV------WRLGPGETARVIAGT 616
Query: 89 GVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQL 139
GYS DG P + A +P+S AVD G + +AD IR++ DH ++
Sbjct: 617 AEPGYSGDGGPATRAAIGRPQSLAVDGAGRLLIADPDQRRIRRV---DHAGRI 666
>gi|291223251|ref|XP_002731624.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 434
Score = 40.8 bits (94), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 41/87 (47%), Gaps = 6/87 (6%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
NP +V +D V D+ A +S P E V +AG G +G+ DG+ A F
Sbjct: 183 NPQAVAVDHVYKDRFFVADTDNHAIREVSLPDGE---VTTIAG-GEKGFKDGKGTGATFY 238
Query: 105 KPKSFAVDMKGNI-YVADKSNHVIRKI 130
P +D NI ++AD NH IR I
Sbjct: 239 HPAGVTIDPIRNILFIADHYNHAIRMI 265
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 47/108 (43%), Gaps = 12/108 (11%)
Query: 33 TVTTVIDGHQLEINPHSVIDRPG-------SSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
T+ + G+Q + ++ P + L V DS+ S ++ E V
Sbjct: 107 TLAGSLQGYQDGVGEKAMFHHPAGMSMHRKTRTLFVADSANSRIRAINVVTGE---VTTF 163
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVD--MKGNIYVADKSNHVIRKIT 131
AG G + DG A F P++ AVD K +VAD NH IR+++
Sbjct: 164 AGSGKEELKDGLKTIASFFNPQAVAVDHVYKDRFFVADTDNHAIREVS 211
>gi|196229731|ref|ZP_03128595.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
gi|196226057|gb|EDY20563.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
Length = 360
Score = 40.8 bits (94), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 32/57 (56%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + + AG G +GYS DG P A F+ P D GN Y+AD +NH IR++
Sbjct: 74 VTPDGKIHTAAGIGQKGYSGDGGPALEATFNLPHEIRFDRAGNYYIADMANHAIRRV 130
Score = 39.3 bits (90), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADK 122
+R L F ++ ++ AG G +G++ +G P A PK AVD +GN+++AD
Sbjct: 235 TREGNQVLKFD-AKTGIISIAAGTGKKGFTGNGGPALEATLSGPKGIAVDAQGNVWLADT 293
Query: 123 SNHVIRKI 130
+H IR+I
Sbjct: 294 ESHSIREI 301
>gi|348688202|gb|EGZ28016.1| hypothetical protein PHYSODRAFT_321722 [Phytophthora sojae]
Length = 490
Score = 40.8 bits (94), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 30/59 (50%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQI 143
AG G +G+ DG A F+ P G+I VAD NH IR+I +G +++ I
Sbjct: 134 FAGSGKKGWRDGLALEANFNSPSGVCEYADGSIIVADTGNHCIRQIRRGANGKLVVKTI 192
Score = 36.2 bits (82), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 46/96 (47%), Gaps = 5/96 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES--VVKRLAGDGVQGYSDGEPGSARFD 104
P +V+ P S +L+V D+ + L P S VK + G G++DG A FD
Sbjct: 230 PSTVLAGP-SGELLVADTMNNCIRGLLPPPDGTSPWRVKTVCGQTRPGHADGNCEVALFD 288
Query: 105 KPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQL 139
+P S N +VAD+ N IR++ +GN L
Sbjct: 289 QPLSLCWGEDSNTFFVADRGNACIRQV-GRSYGNGL 323
>gi|405375498|ref|ZP_11029528.1| putative hemolysin [Chondromyces apiculatus DSM 436]
gi|397086225|gb|EJJ17355.1| putative hemolysin [Myxococcus sp. (contaminant ex DSM 436)]
Length = 855
Score = 40.8 bits (94), Expect = 0.31, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 26/50 (52%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG G G++DG A+ P V G +YVAD N ++R IT
Sbjct: 805 VTTLAGSGTYGHADGPGEDAQLVLPAGLVVGPDGRLYVADSGNALVRTIT 854
>gi|395218326|ref|ZP_10401983.1| alkyl hydroperoxide reductase [Pontibacter sp. BAB1700]
gi|394454578|gb|EJF09204.1| alkyl hydroperoxide reductase [Pontibacter sp. BAB1700]
Length = 474
Score = 40.8 bits (94), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
LS++ V + G G +G+++G A F++P A+ +YVAD NH IRK+ +L H
Sbjct: 197 LSQQGQVLEVIGSGERGFNNGGYADATFNEPHGLAL-HGSTLYVADAKNHAIRKV-DLQH 254
>gi|290983716|ref|XP_002674574.1| predicted protein [Naegleria gruberi]
gi|284088165|gb|EFC41830.1| predicted protein [Naegleria gruberi]
Length = 2271
Score = 40.8 bits (94), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 58/121 (47%), Gaps = 15/121 (12%)
Query: 19 FSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRP-----GSSDLIVL-DSSRSAFYTL 72
F + +S L G ++ +DG I +++++ P G ++I D +A ++
Sbjct: 500 FEINNSTLTTLVGRSLKGFLDG----IGSNALLNYPESLIIGPDNMIYFSDRDNNAIRSV 555
Query: 73 SFPLSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
S + ++V LAGD G+ DG P S+R D P + + G I D+ N IRKI
Sbjct: 556 S---TSSALVTTLAGDRFNGFLGDGGPAKSSRLDSPGPIQLTLGGEIIFMDRGNQRIRKI 612
Query: 131 T 131
T
Sbjct: 613 T 613
Score = 40.0 bits (92), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 1/54 (1%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
E ++ LAG+G G + G SA+ + P + V +IY A+ NH+IR+I+
Sbjct: 136 ENGIISTLAGNGSPGLTVGSAISAQLNTPTALIV-ASNDIYFAESGNHLIRRIS 188
Score = 39.7 bits (91), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
AG G GYS DG G SA + P + A D GN+ ++D N+ IRK+ N
Sbjct: 291 FAGSGTSGYSGDGGLGTSALLNGPSALAFDSSGNLLISDSFNNRIRKVAN 340
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 37/79 (46%), Gaps = 6/79 (7%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDM 113
+SD+I D++ + ++ +AG GV YS G +A+ + P+ +
Sbjct: 63 NSDIIFCDTNNHRIRKIDV----NGIMSTIAGTGVANYSGDGGAAVNAQLNSPQGIGILS 118
Query: 114 KGNIYVADKSNHVIRKITN 132
G I +D NH IRKI N
Sbjct: 119 TGAIVFSDTLNHCIRKIEN 137
>gi|296089788|emb|CBI39607.3| unnamed protein product [Vitis vinifera]
Length = 1002
Score = 40.4 bits (93), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 24/89 (26%), Positives = 40/89 (44%), Gaps = 4/89 (4%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQ 142
++ G +G DG A F++P+ A + K N+ YVAD NH +R+I ++ Q +
Sbjct: 551 QIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAG 610
Query: 143 INLKPEDCSKSSQSGSGMF---WVTVFSP 168
K D + + + W F P
Sbjct: 611 NGTKGSDYQGGGKGATQLLNSPWDVCFEP 639
>gi|296122042|ref|YP_003629820.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
gi|296014382|gb|ADG67621.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
Length = 425
Score = 40.4 bits (93), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 49/105 (46%), Gaps = 15/105 (14%)
Query: 30 DGYTVTTVI--DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
DG T + + H+L P + D+++ D +T+ + ++ LAG
Sbjct: 158 DGGPATKALMNEPHELRFTP--------AGDIVIADMR---THTIRKIDGKTGMISTLAG 206
Query: 88 DGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G G+S DG P A+ + P S +D G++ + D NH +RK+
Sbjct: 207 TGTAGFSGDGGPAEKAQLNMPHSIQIDPAGDLLICDTGNHRVRKV 251
Score = 36.2 bits (82), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 38/94 (40%), Gaps = 16/94 (17%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV--------KRLAGDGVQGYSDG 96
+N ID D+I+ +A Y FP E ++ L GDG+
Sbjct: 281 LNGPRSIDFTPEGDMILALREGNAVY--RFPKGEAKLIHIAGVGGKPSLVGDGIDARK-- 336
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
A PK AVD G+IY+AD H IR I
Sbjct: 337 ----AILGAPKGAAVDANGDIYLADTETHTIRVI 366
>gi|290995436|ref|XP_002680301.1| predicted protein [Naegleria gruberi]
gi|284093921|gb|EFC47557.1| predicted protein [Naegleria gruberi]
Length = 699
Score = 40.4 bits (93), Expect = 0.33, Method: Composition-based stats.
Identities = 23/62 (37%), Positives = 36/62 (58%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G+ G+S +G +A+ P S V +Y+AD+ NH IRKI L++
Sbjct: 95 ENGNIITIAGNGIHGFSGDNGLATNAQLYTPCSVFVSSNNEVYIADQGNHRIRKI--LEN 152
Query: 136 GN 137
GN
Sbjct: 153 GN 154
Score = 39.7 bits (91), Expect = 0.54, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G +G S +G +A+ ++P S V +Y+AD+ N+ IRKI L++
Sbjct: 39 ENGTIVTIAGNGTKGSSGDNGLATNAQLNRPYSVFVSSNNEVYIADQGNNRIRKI--LEN 96
Query: 136 GN 137
GN
Sbjct: 97 GN 98
Score = 38.5 bits (88), Expect = 1.5, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 4/63 (6%)
Query: 77 SEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
+E + +AG+G+ G++ +G +AR + P S V +Y+AD N+ IRKI L+
Sbjct: 226 NENGNIITIAGNGIHGFNGDNGLATNARLNHPFSVFVSSNNEVYIADYYNNRIRKI--LE 283
Query: 135 HGN 137
+GN
Sbjct: 284 NGN 286
>gi|256556932|gb|ACU83555.1| thiol-disulfide isomerase/thioredoxin [uncultured bacterium
HF130_AEPn_1]
Length = 497
Score = 40.4 bits (93), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
VK + G G +G DG+ ARF +P A D K + +Y+AD NH I+ +
Sbjct: 221 VKMVIGSGKEGNKDGDIKVARFRRPHGLAFDEKNDLLYIADTDNHSIKSL 270
>gi|436838118|ref|YP_007323334.1| NHL repeat-containing protein [Fibrella aestuarina BUZ 2]
gi|384069531|emb|CCH02741.1| NHL repeat-containing protein [Fibrella aestuarina BUZ 2]
Length = 375
Score = 40.4 bits (93), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 6/63 (9%)
Query: 74 FPLSEESVVKRLAG----DGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
F ++ V RL+G DG GY DG A+F P S +D G +Y+ D NH I
Sbjct: 141 FRINSAGQVTRLSGGPDPDGGYSFGYRDGPLARAKFSNPWSIIMDKAGMLYIGDGLNHCI 200
Query: 128 RKI 130
RK+
Sbjct: 201 RKV 203
>gi|325110177|ref|YP_004271245.1| hypothetical protein Plabr_3626 [Planctomyces brasiliensis DSM
5305]
gi|324970445|gb|ADY61223.1| NHL repeat containing protein [Planctomyces brasiliensis DSM 5305]
Length = 370
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 94 SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
DG P AR PK +VD +GNI++AD +H IR+I
Sbjct: 273 GDGRPARLARLAGPKGISVDKQGNIFLADTESHTIRRI 310
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 34/66 (51%), Gaps = 4/66 (6%)
Query: 69 FYTLSFP--LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
+ L P + E V + GDG GY+ +G A+ ++P + G +YV + +N
Sbjct: 17 LFVLQLPATVQAEGKVITILGDGKAGYTGDNGSADEAQCEQPFGLVIGPDGALYVCETAN 76
Query: 125 HVIRKI 130
HVIR+I
Sbjct: 77 HVIRRI 82
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 34/56 (60%), Gaps = 2/56 (3%)
Query: 77 SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ V+ +AG G +G+S DG+ + A F++P S D G++Y+ D NH +R +
Sbjct: 141 AKTGVISTVAGTGEKGFSGDGKAATEATFNRPHSICFDSAGHLYICDIGNHRVRVV 196
Score = 35.8 bits (81), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 1/60 (1%)
Query: 70 YTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+T+ + ++ L G+G +G DG P + ++P V G +Y+ D SNH +R
Sbjct: 305 HTIRRIDGQTGIITTLVGNGKKGDGPDGNPLECKLNRPHGVFVAADGKVYIGDSSNHRVR 364
>gi|219851088|ref|YP_002465520.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219545347|gb|ACL15797.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 831
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 24/34 (70%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ +F P+ A+D GN+YVAD+SNH + K T++
Sbjct: 173 NGQFSYPEGIAIDGTGNVYVADESNHRVEKFTSI 206
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 24/34 (70%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ +F P+ A+D GN+YVAD+SNH + K T++
Sbjct: 473 NGQFSYPEGIAIDGTGNVYVADESNHRVEKFTSI 506
Score = 39.7 bits (91), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GS +F P A+D GN+YV D+ NH ++K T
Sbjct: 125 GSGQFSSPYGVAIDSAGNVYVVDRGNHRVQKFT 157
Score = 39.7 bits (91), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GS +F P A+D GN+YV D+ NH ++K T
Sbjct: 425 GSGQFSSPYGVAIDGAGNVYVVDRGNHRVQKFT 457
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 38/76 (50%), Gaps = 15/76 (19%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSG 159
+ +F P AVD GN+YVAD+ N+ ++K T+ G L + +S +GSG
Sbjct: 79 NGQFRTPTGIAVDSAGNVYVADRDNNRVQKFTST--GTYLAKW---------GTSGTGSG 127
Query: 160 MFWVTVFSPYLISCKS 175
F SPY ++ S
Sbjct: 128 QF----SSPYGVAIDS 139
Score = 37.0 bits (84), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 14/32 (43%), Positives = 21/32 (65%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +F P AVD GN+YVAD+ N+ ++K T
Sbjct: 379 NGQFRTPTGIAVDSAGNVYVADRDNNRVQKFT 410
Score = 35.8 bits (81), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 5/43 (11%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G +G DG+ F P AVD GN+YVAD N+ ++K T
Sbjct: 214 GTKGSEDGQ-----FAYPDGVAVDSLGNVYVADSGNNRVQKFT 251
Score = 35.8 bits (81), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 5/43 (11%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G +G DG+ F P AVD GN+YVAD N+ ++K T
Sbjct: 514 GTKGSEDGQ-----FAYPDGVAVDSLGNVYVADSGNNRVQKFT 551
>gi|408405002|ref|YP_006862985.1| NHL repeat-containing protein [Candidatus Nitrososphaera gargensis
Ga9.2]
gi|408365598|gb|AFU59328.1| NHL repeat-containing protein [Candidatus Nitrososphaera gargensis
Ga9.2]
Length = 501
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 40/92 (43%), Gaps = 8/92 (8%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFDKPKSFAVDMKGN 116
L V DS SA + VV+ G+G V G+ DG AR P A +
Sbjct: 359 LFVADSEVSAVRRIDLG---RKVVQTAVGEGLFVFGHKDGPLEEARLQHPLGVACESSNK 415
Query: 117 IYVADKSNHVIRKITNLDHGNQLIRQINLKPE 148
IYVAD NH +R I D Q I + +PE
Sbjct: 416 IYVADTYNHAVRLI---DLAEQRISTLVGRPE 444
>gi|348689157|gb|EGZ28971.1| hypothetical protein PHYSODRAFT_474561 [Phytophthora sojae]
Length = 98
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
V GY+DG + F P+S AV G++YVAD +N VI+
Sbjct: 11 VSGYADGYGNAVLFKNPRSLAVGSDGHLYVADDNNRVIK 49
>gi|254383375|ref|ZP_04998727.1| NHL repeat containing protein [Streptomyces sp. Mg1]
gi|194342272|gb|EDX23238.1| NHL repeat containing protein [Streptomyces sp. Mg1]
Length = 607
Score = 40.4 bits (93), Expect = 0.37, Method: Composition-based stats.
Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 8/74 (10%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S + +V DS+R L+ ESVV+R+ G G +G+ G F +P+ A+ G
Sbjct: 191 SGNFLVSDSTRHQLVELA--ADAESVVRRI-GSGERGF-----GPDAFSEPQGLALLPDG 242
Query: 116 NIYVADKSNHVIRK 129
++ VAD NH +R+
Sbjct: 243 SVVVADTVNHALRR 256
>gi|261855185|ref|YP_003262468.1| redoxin [Halothiobacillus neapolitanus c2]
gi|261835654|gb|ACX95421.1| Redoxin domain protein [Halothiobacillus neapolitanus c2]
Length = 498
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
GDG G++DG +A F++P+ A + ++YVAD N IR+I
Sbjct: 234 GDGKAGFADGSFATAEFNRPQGIAW-LGNDLYVADTDNQRIRRI 276
>gi|424792718|ref|ZP_18218917.1| hypothetical protein XTG29_01651 [Xanthomonas translucens pv.
graminis ART-Xtg29]
gi|422796954|gb|EKU25368.1| hypothetical protein XTG29_01651 [Xanthomonas translucens pv.
graminis ART-Xtg29]
Length = 473
Score = 40.4 bits (93), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 30/52 (57%), Gaps = 1/52 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
V R G G + DG G A F +P+ A++ +G +YVAD NH +R+I L
Sbjct: 199 VLRQFGLGTADFIDGGIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLL 249
>gi|291235470|ref|XP_002737667.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 638
Score = 40.0 bits (92), Expect = 0.41, Method: Composition-based stats.
Identities = 17/45 (37%), Positives = 25/45 (55%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G ++ + G RF P +D KGN+ VADK+NH I+ +T
Sbjct: 499 GGYIKSFGSRGDGDCRFKSPYGITIDSKGNVIVADKNNHCIQVLT 543
>gi|449017726|dbj|BAM81128.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 924
Score = 40.0 bits (92), Expect = 0.41, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 32/67 (47%), Gaps = 4/67 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLI 140
V+ LAG G G DG A F P+ A D + YVAD NH IR++ D +
Sbjct: 842 VRTLAGSGEPGLHDGASTQAAFFAPQGIAYDPTVRMAYVADTYNHCIRQV---DVATGTV 898
Query: 141 RQINLKP 147
R + L P
Sbjct: 899 RTLTLTP 905
>gi|397781302|ref|YP_006545775.1| Tripartite motif-containing protein 71 [Methanoculleus bourgensis
MS2]
gi|396939804|emb|CCJ37059.1| Tripartite motif-containing protein 71 AltName: Full=Lin-41 homolog
[Methanoculleus bourgensis MS2]
Length = 786
Score = 40.0 bits (92), Expect = 0.42, Method: Composition-based stats.
Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 5/41 (12%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
G G DGE F P A+D GN+YVADK NH I+K
Sbjct: 542 GSYGRGDGE-----FRNPSGVAIDAAGNVYVADKENHRIQK 577
Score = 38.9 bits (89), Expect = 1.0, Method: Composition-based stats.
Identities = 18/41 (43%), Positives = 25/41 (60%), Gaps = 4/41 (9%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
G + Y++ FD PK AVD GN+YVAD+ N+ I+K
Sbjct: 68 GTKAYTE----DGAFDTPKGVAVDAAGNVYVADRGNNRIQK 104
Score = 36.6 bits (83), Expect = 5.7, Method: Composition-based stats.
Identities = 29/78 (37%), Positives = 36/78 (46%), Gaps = 16/78 (20%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQIN 144
LA G G DGE F +P S AVD GN+YV D N+ I+ + GN L
Sbjct: 112 LAKWGSHGKGDGE-----FRQPSSIAVDAAGNVYVTDVGNNRIQIFDSA--GNFL----- 159
Query: 145 LKPEDCSKSSQSGSGMFW 162
+K E C +G FW
Sbjct: 160 MKWEPCG----TGDAEFW 173
>gi|428308867|ref|YP_007119844.1| thiol-disulfide isomerase-like thioredoxin [Microcoleus sp. PCC
7113]
gi|428250479|gb|AFZ16438.1| thiol-disulfide isomerase-like thioredoxin [Microcoleus sp. PCC
7113]
Length = 515
Score = 40.0 bits (92), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 28/48 (58%), Gaps = 1/48 (2%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + G G G+ DG+ ARF +P + + ++YVAD +NH IR I
Sbjct: 440 QTMIGQGTAGHQDGQSIKARFSEPSGLSA-IAAHLYVADTNNHAIRCI 486
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQLI 140
V+++ G G G +DG A+F P+ +D + +YVAD NH +R++ D NQ +
Sbjct: 213 VQQVIGTGKPGLTDGSFEEAQFFAPQGMTLDSENQLLYVADTENHCLRQV---DLKNQQV 269
Query: 141 RQI 143
+ I
Sbjct: 270 KTI 272
>gi|290975813|ref|XP_002670636.1| predicted protein [Naegleria gruberi]
gi|284084197|gb|EFC37892.1| predicted protein [Naegleria gruberi]
Length = 2442
Score = 40.0 bits (92), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + V+ +AG G+ +G +A + P+S V+ KG IYV+D NH +RKI
Sbjct: 857 ITSDGVISLVAGSYYGGFGGDNGPAANAVLNSPESLFVNSKGEIYVSDTVNHRVRKI 913
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQG-YSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG G+ G Y DG + A+ P V G+IY+AD NH IRK+
Sbjct: 919 ITTVAGSGLAGFYGDGGLATNAKLSYPMGIYVSKNGDIYIADYGNHRIRKV 969
>gi|196228192|ref|ZP_03127059.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
gi|196227595|gb|EDY22098.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
Length = 357
Score = 40.0 bits (92), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+K + G G +G S D PG A + PK A+D G++ +AD NHVIR+
Sbjct: 248 IKTVVGTGAKGLSGDAGPGLQATMNGPKYIALDRDGSVLIADAENHVIRR 297
>gi|383100947|emb|CCD74491.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis halleri subsp. halleri]
Length = 1028
Score = 40.0 bits (92), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 1/56 (1%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITN 132
E + + ++ G +G+ DG A F++P+ A + K N +YVAD NH +R + N
Sbjct: 600 EGNFIVQIGSTGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALRYLLN 655
Score = 40.0 bits (92), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 35/61 (57%), Gaps = 4/61 (6%)
Query: 79 ESVVKRL---AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
+ V KR+ AG G G+ DG+ A+ +P A+ G ++VAD +N +IR I +L+
Sbjct: 803 DPVTKRVVTVAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI-DLNK 861
Query: 136 G 136
G
Sbjct: 862 G 862
>gi|302821973|ref|XP_002992647.1| hypothetical protein SELMODRAFT_7079 [Selaginella moellendorffii]
gi|300139611|gb|EFJ06349.1| hypothetical protein SELMODRAFT_7079 [Selaginella moellendorffii]
Length = 124
Score = 40.0 bits (92), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 26/43 (60%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G GY DG + F++P+S A+ G ++VAD N IRKI+
Sbjct: 15 GESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKIS 57
>gi|440804160|gb|ELR25038.1| Bbox zinc finger domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 690
Score = 40.0 bits (92), Expect = 0.46, Method: Composition-based stats.
Identities = 15/42 (35%), Positives = 26/42 (61%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
G V+ + G G +F +P+ AVD +G+++VAD NH ++
Sbjct: 514 GSFVRTFGSGGSGPGQFQRPRGVAVDKEGHVFVADSENHRVQ 555
>gi|372267905|ref|ZP_09503953.1| hypothetical protein AlS89_08394 [Alteromonas sp. S89]
Length = 2227
Score = 40.0 bits (92), Expect = 0.46, Method: Composition-based stats.
Identities = 18/52 (34%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ +AG G +G+S DG P + A P +D +GN+Y+ D N IR++
Sbjct: 1056 IISTVAGSGARGFSGDGGPATDASLSAPSGITLDPEGNLYIVDSGNRRIRRV 1107
>gi|219851969|ref|YP_002466401.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546228|gb|ACL16678.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 676
Score = 40.0 bits (92), Expect = 0.48, Method: Composition-based stats.
Identities = 15/35 (42%), Positives = 24/35 (68%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
PG +F+ P+ AVD GN+Y+AD+ N+ I+K +
Sbjct: 86 PGDGQFNNPEGVAVDNNGNVYIADRDNNRIQKFNS 120
Score = 38.1 bits (87), Expect = 1.6, Method: Composition-based stats.
Identities = 21/63 (33%), Positives = 33/63 (52%), Gaps = 2/63 (3%)
Query: 69 FYTLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
YT P +S V++ +G + + G +F++P AVD GN+YVAD N+
Sbjct: 537 LYTCGPPELIQSRVQKFTPNGTFITEWGSLGSGDGQFNRPSGVAVDSAGNVYVADSGNNR 596
Query: 127 IRK 129
I+K
Sbjct: 597 IQK 599
Score = 35.8 bits (81), Expect = 7.8, Method: Composition-based stats.
Identities = 14/34 (41%), Positives = 22/34 (64%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
G +F++P A+D GN+YV DK N+ I+K +
Sbjct: 134 GDGQFNQPSGVALDSAGNVYVTDKQNNRIQKFNS 167
>gi|229822379|ref|YP_002883905.1| alkyl hydroperoxide reductase [Beutenbergia cavernae DSM 12333]
gi|229568292|gb|ACQ82143.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Beutenbergia cavernae DSM 12333]
Length = 641
Score = 40.0 bits (92), Expect = 0.49, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 33/73 (45%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S DL + + +TL E+ + RLAG +G DG A F +P AV G
Sbjct: 324 SPDLAAFVVAMAGNHTLWAFDDEQGSLTRLAGTEHEGLRDGPAAEAWFAQPSGLAVAQDG 383
Query: 116 NIYVADKSNHVIR 128
I+VAD +R
Sbjct: 384 RIWVADSETSALR 396
>gi|357417484|ref|YP_004930504.1| NHL repeat containing protein [Pseudoxanthomonas spadix BD-a59]
gi|355335062|gb|AER56463.1| NHL repeat containing protein [Pseudoxanthomonas spadix BD-a59]
Length = 474
Score = 40.0 bits (92), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
V R G+G DGEP A F +P ++ ++ +YVAD NH++R+I NL G
Sbjct: 200 VLRRFGNGTADALDGEPEIASFCRPNGLSL-LREELYVADTGNHLLRRI-NLRSG 252
>gi|116620267|ref|YP_822423.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223429|gb|ABJ82138.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 380
Score = 40.0 bits (92), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +V+ AG G +GYS DG P A ++P D GN+++A+ NHV+R++
Sbjct: 98 KTNVISTAAGSGEKGYSGDGGPALGAALNEPYEVRFDRVGNMFIAEMQNHVVRRV 152
Score = 36.6 bits (83), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 85 LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG G GYS DG P A+ PK A G++Y+AD +H IR++
Sbjct: 276 VAGTGESGYSGDGGPAKLAKLSGPKGIAWAPDGSLYLADTESHTIRRV 323
>gi|72094692|ref|XP_795849.1| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
purpuratus]
Length = 706
Score = 40.0 bits (92), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 41/86 (47%), Gaps = 9/86 (10%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V P + L V D+ + ++ + VV+ G G++DG ARF P
Sbjct: 212 PGKVTTNPEGTLLAVSDTGHNRVIIVAL----DGVVQHCIGGPETGFNDGLYREARFHSP 267
Query: 107 KS--FAVDMKGNIYVADKSNHVIRKI 130
+ +A D+ IYVAD NH IRKI
Sbjct: 268 QGLCWAQDV---IYVADTENHAIRKI 290
>gi|308159172|gb|EFO61716.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia P15]
Length = 967
Score = 40.0 bits (92), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 42/86 (48%), Gaps = 6/86 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDK 105
PHS++ P + + + D+ TLS + VK LAG GV GY DG + F++
Sbjct: 143 PHSIVSDPVNGIIYLTDTKNHCIRTLSLMTRK---VKTLAGTPGVFGYKDGL--NPLFNE 197
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
P + GN+ V D N +R I+
Sbjct: 198 PLGLVLTEDGNLIVCDSKNGALRYIS 223
>gi|374855656|dbj|BAL58511.1| NHL repeat containing protein [uncultured candidate division OP1
bacterium]
Length = 419
Score = 40.0 bits (92), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G +FD+P AVD +G IYVA++ NH I+KI
Sbjct: 385 GDGQFDQPVDLAVDPEGVIYVAEEGNHRIQKI 416
>gi|194877446|ref|XP_001973882.1| GG16549 [Drosophila erecta]
gi|190657069|gb|EDV54282.1| GG16549 [Drosophila erecta]
Length = 717
Score = 40.0 bits (92), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 30/56 (53%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
L+ VV+ G G+ DG +ARF+ P+ A + + VAD NH IR+I+
Sbjct: 252 LTGGGVVQHKIGGYQPGFVDGNSTAARFNNPQGIAFLNENTLIVADTKNHAIRQIS 307
>gi|219852041|ref|YP_002466473.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546300|gb|ACL16750.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 930
Score = 40.0 bits (92), Expect = 0.52, Method: Composition-based stats.
Identities = 26/80 (32%), Positives = 39/80 (48%), Gaps = 11/80 (13%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLK 146
G ++ + G +F P AVD GN+YVAD N+ I+K T++ I+Q
Sbjct: 76 GTFIKKWGSSGSGDGQFSSPSGVAVDSAGNVYVADTGNNRIQKFTSM---GIFIKQWG-- 130
Query: 147 PEDCSKSSQSGSGMFWVTVF 166
SS SG+G F+ + F
Sbjct: 131 ------SSGSGNGQFFSSPF 144
>gi|430760419|ref|YP_007216276.1| YD repeat protein [Thioalkalivibrio nitratireducens DSM 14787]
gi|430010043|gb|AGA32795.1| YD repeat protein [Thioalkalivibrio nitratireducens DSM 14787]
Length = 2395
Score = 40.0 bits (92), Expect = 0.53, Method: Composition-based stats.
Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 8/79 (10%)
Query: 60 IVLDSSRSAFYTLS------FPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAV 111
+ +D+ S F+T F + + V +AG+G + ++ DG P +A F P+ AV
Sbjct: 1125 VAVDAGGSVFFTRRNGGPRIFRANPDGTVHHVAGNGSRTFAGDGGPAVNASFIDPRGIAV 1184
Query: 112 DMKGNIYVADKSNHVIRKI 130
G++YV D H +R+I
Sbjct: 1185 GADGSLYVGDFRTHRVRRI 1203
>gi|398337816|ref|ZP_10522521.1| hypothetical protein LkmesMB_21054 [Leptospira kmetyi serovar
Malaysia str. Bejo-Iso9]
Length = 359
Score = 40.0 bits (92), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 33/69 (47%)
Query: 62 LDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
LD Y ++ VV AG G DG+ +A F P +D GNIYV++
Sbjct: 37 LDCILKECYLCRMKVTGNPVVTLFAGTGTASSVDGDISTATFQNPFGIELDSFGNIYVSE 96
Query: 122 KSNHVIRKI 130
++ + IRKI
Sbjct: 97 QNVNKIRKI 105
>gi|290971645|ref|XP_002668597.1| predicted protein [Naegleria gruberi]
gi|284082068|gb|EFC35853.1| predicted protein [Naegleria gruberi]
Length = 223
Score = 40.0 bits (92), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S+ +++ +AG+G +G+S DG A+ + P S V+ G IY+ D +N IRK+
Sbjct: 86 VSKSGIIETIAGNGNEGFSGDDGLATQAQLNCPMSTFVNSNGEIYITDSNNFRIRKV 142
>gi|290978083|ref|XP_002671766.1| predicted protein [Naegleria gruberi]
gi|284085337|gb|EFC39022.1| predicted protein [Naegleria gruberi]
Length = 258
Score = 39.7 bits (91), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S+ +++ +AG+G +G+S DG A+ + P S V+ G IY+ D +N IRK+
Sbjct: 198 VSKSGIIETIAGNGNEGFSGDDGLATQAQLNCPMSTFVNSNGEIYITDSNNFRIRKV 254
>gi|425444428|ref|ZP_18824479.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
gi|389735830|emb|CCI00727.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
Length = 358
Score = 39.7 bits (91), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 20/30 (66%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
G +FD P AVD GNIYVAD+ NH ++
Sbjct: 245 GDGQFDDPVGIAVDSGGNIYVADRDNHRVQ 274
>gi|338536093|ref|YP_004669427.1| putative lipoprotein [Myxococcus fulvus HW-1]
gi|337262189|gb|AEI68349.1| putative lipoprotein [Myxococcus fulvus HW-1]
Length = 855
Score = 39.7 bits (91), Expect = 0.56, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 26/50 (52%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG+G G++DG A P V G +YVAD N ++R IT
Sbjct: 805 VTTLAGNGRYGHADGPGDKAELVLPAGLTVGPDGRLYVADAGNGLVRAIT 854
Score = 36.6 bits (83), Expect = 4.8, Method: Composition-based stats.
Identities = 36/104 (34%), Positives = 51/104 (49%), Gaps = 8/104 (7%)
Query: 27 LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLA 86
L+ +G+ VTT + G L P V S L V++S S + L+ SVV A
Sbjct: 699 LVAEGHPVTT-LQGDWL-YRPSGVAVSADGSTLYVVESGMSRVVRIRDGLT--SVV---A 751
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G GY DG P SA+F AV G++ VAD N+ +R++
Sbjct: 752 GT-TPGYRDGAPTSAQFLPYLGIAVLKDGSLAVADPGNYRVRRV 794
>gi|408792280|ref|ZP_11203890.1| concanavalin A-like lectin/glucanases family protein [Leptospira
meyeri serovar Hardjo str. Went 5]
gi|408463690|gb|EKJ87415.1| concanavalin A-like lectin/glucanases family protein [Leptospira
meyeri serovar Hardjo str. Went 5]
Length = 1234
Score = 39.7 bits (91), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 32/55 (58%), Gaps = 1/55 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
L+ + V +AGDGVQ +DG +A+F+ P D N+++++ S H IR +
Sbjct: 475 LTGTNAVDTIAGDGVQASTDGIGTAAQFNDPHGLTTDGT-NLFISEWSGHKIRHL 528
>gi|290971764|ref|XP_002668649.1| predicted protein [Naegleria gruberi]
gi|284082135|gb|EFC35905.1| predicted protein [Naegleria gruberi]
Length = 556
Score = 39.7 bits (91), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 53/109 (48%), Gaps = 26/109 (23%)
Query: 48 HSVIDRPGSSDLIVLDS-----SRSAFYT-LSFPLSEESV---------VKRLAGD---- 88
+S++DR + +I LD+ + ++ YT +S SE S +K+L
Sbjct: 338 NSLVDRVAGTGIIELDTDGKLGNLTSLYTPISVTQSESSSDIYIGTTIGIKKLTDSNKLI 397
Query: 89 ----GVQGYS--DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G Q YS DG P +A R P+ A+ G ++++DK +H IRKI
Sbjct: 398 SSFAGSQAYSSGDGYPATAARLSSPEGIAISTSGEVFISDKGSHTIRKI 446
>gi|290995458|ref|XP_002680312.1| predicted protein [Naegleria gruberi]
gi|284093932|gb|EFC47568.1| predicted protein [Naegleria gruberi]
Length = 744
Score = 39.7 bits (91), Expect = 0.57, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 32/55 (58%), Gaps = 4/55 (7%)
Query: 85 LAGDGVQGYSDGE-PG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
+AG+G +GY D P +A ++P S V +Y+AD NH IRKI L +GN
Sbjct: 177 IAGNGKEGYGDDNVPAVNAMLNRPSSVFVTSNNEVYIADTGNHRIRKI--LRNGN 229
Score = 37.0 bits (84), Expect = 3.6, Method: Composition-based stats.
Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 4/55 (7%)
Query: 85 LAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
+AG+G+ G+ + + + PKS V G +Y+AD NH IRKI L +GN
Sbjct: 233 IAGNGLVGFHEDNILATNTSLNFPKSVFVSSNGEVYIADTGNHRIRKI--LANGN 285
>gi|188501588|gb|ACD54714.1| NHL repeat containing hypothetical protein RRC374-like protein
[Adineta vaga]
Length = 493
Score = 39.7 bits (91), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 50/108 (46%), Gaps = 14/108 (12%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRP------GSSDLIVLDSSRSAFYTLSFPLSEESV 81
L++G V T I G + N + P SS++ V+D+ Y + +V
Sbjct: 236 LQEGSLVGTTIAGTGIAGNSAGQLKIPLDLFVDSSSNIYVVDNDN---YRVMLWRKNATV 292
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
++AG G QG + ++ F +P VD GN+YV+D + H + K
Sbjct: 293 GIKVAGSGSQGST-----ASTFREPNGITVDSMGNVYVSDFTTHRVMK 335
>gi|223940572|ref|ZP_03632418.1| NHL repeat containing protein [bacterium Ellin514]
gi|223890764|gb|EEF57279.1| NHL repeat containing protein [bacterium Ellin514]
Length = 791
Score = 39.7 bits (91), Expect = 0.58, Method: Composition-based stats.
Identities = 20/40 (50%), Positives = 23/40 (57%), Gaps = 2/40 (5%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
V G G PG F P S A D KGN+YVAD NH ++K
Sbjct: 736 VLGGVGGAPG--EFSNPWSIAFDSKGNLYVADSQNHRVQK 773
>gi|302535478|ref|ZP_07287820.1| NHL repeat containing protein [Streptomyces sp. C]
gi|302444373|gb|EFL16189.1| NHL repeat containing protein [Streptomyces sp. C]
Length = 612
Score = 39.7 bits (91), Expect = 0.58, Method: Composition-based stats.
Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 23/107 (21%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +L+V DS+R L E+VV+R+ G G +G+ G F +P+ A+ G
Sbjct: 198 SGNLLVSDSTRHQLVELE--ADGETVVRRI-GSGERGF-----GPDAFSEPQGLALLPDG 249
Query: 116 NIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFW 162
+ VAD NH +R L PE + + +G+G W
Sbjct: 250 RVVVADTVNHALRV---------------LDPETGAVETVAGTGRQW 281
>gi|224125014|ref|XP_002319481.1| predicted protein [Populus trichocarpa]
gi|222857857|gb|EEE95404.1| predicted protein [Populus trichocarpa]
Length = 1065
Score = 39.7 bits (91), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
+ ++ G +G DG A F++P+ A + K NI YVAD NH +R++
Sbjct: 610 IAQIGSSGEEGLRDGSFDDATFNRPQGLAYNSKKNILYVADTENHALREV 659
Score = 36.6 bits (83), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 5/76 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
V +AG G G+ DG+ +A+ +P G + +AD +N VIR + +L+ G +
Sbjct: 846 VTTIAGTGKAGFKDGKALTAQLSEPAGLIEAENGRLIIADTNNSVIRYL-DLNKGEAELL 904
Query: 142 QINLK----PEDCSKS 153
+ LK P SKS
Sbjct: 905 TLELKGVQPPASKSKS 920
>gi|162448821|ref|YP_001611188.1| NHL repeat-containing protein [Sorangium cellulosum So ce56]
gi|161159403|emb|CAN90708.1| hypothetical protein with NHL repeat [Sorangium cellulosum So ce56]
Length = 1889
Score = 39.7 bits (91), Expect = 0.60, Method: Composition-based stats.
Identities = 16/38 (42%), Positives = 23/38 (60%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
G DG +ARF +P D +GN+Y+AD NH +R+
Sbjct: 949 GNVDGVGTTARFAEPSGIVADGRGNLYIADAFNHSLRR 986
Score = 37.4 bits (85), Expect = 3.0, Method: Composition-based stats.
Identities = 14/42 (33%), Positives = 27/42 (64%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
S+V +AG+G G +DG +AR P++ A+D G++++ +
Sbjct: 992 SMVTTVAGNGTAGSADGVGTAARLTSPQALAIDGSGSVFIGE 1033
>gi|423214678|ref|ZP_17201206.1| hypothetical protein HMPREF1074_02738 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692584|gb|EIY85821.1| hypothetical protein HMPREF1074_02738 [Bacteroides xylanisolvens
CL03T12C04]
Length = 593
Score = 39.7 bits (91), Expect = 0.61, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 36/60 (60%), Gaps = 1/60 (1%)
Query: 72 LSFPLSEES-VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LSF + S V +AG+ +DG+ +A F P+ A+ G+I++A++ N+VIRK+
Sbjct: 281 LSFTYEDLSYTVATVAGNSATTSTDGKGTAASFKFPQGLALAPNGDIWIAERGNNVIRKM 340
Score = 37.7 bits (86), Expect = 2.2, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 3/54 (5%)
Query: 79 ESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +K +AG G +G DG +A F+ P S V K +Y+AD+ NH+IR IT
Sbjct: 539 KGTLKTIAGIPGTKGKIDGSALTATFNCPASVLVSDK--VYIADEQNHLIRTIT 590
>gi|290976760|ref|XP_002671107.1| predicted protein [Naegleria gruberi]
gi|284084673|gb|EFC38363.1| predicted protein [Naegleria gruberi]
Length = 1349
Score = 39.7 bits (91), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 30/54 (55%), Gaps = 2/54 (3%)
Query: 79 ESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
S + +AG+G GYS DG P +A+ + P V G +Y+AD N +RK+
Sbjct: 553 NSQLSTVAGNGYAGYSGDGGPARNAKLNYPADVTVSSTGEVYIADWGNRAVRKV 606
>gi|294644532|ref|ZP_06722286.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
gi|294805794|ref|ZP_06764671.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
gi|336403611|ref|ZP_08584323.1| hypothetical protein HMPREF0127_01636 [Bacteroides sp. 1_1_30]
gi|345510783|ref|ZP_08790344.1| hypothetical protein BSAG_00302 [Bacteroides sp. D1]
gi|229442801|gb|EEO48592.1| hypothetical protein BSAG_00302 [Bacteroides sp. D1]
gi|292640085|gb|EFF58349.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
gi|294447015|gb|EFG15605.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
gi|335945537|gb|EGN07348.1| hypothetical protein HMPREF0127_01636 [Bacteroides sp. 1_1_30]
Length = 593
Score = 39.7 bits (91), Expect = 0.61, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 36/60 (60%), Gaps = 1/60 (1%)
Query: 72 LSFPLSEES-VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LSF + S V +AG+ +DG+ +A F P+ A+ G+I++A++ N+VIRK+
Sbjct: 281 LSFTYEDLSYTVATVAGNSATTSTDGKGTAASFKFPQGLALAPNGDIWIAERGNNVIRKM 340
Score = 37.7 bits (86), Expect = 2.2, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 3/54 (5%)
Query: 79 ESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +K +AG G +G DG +A F+ P S V K +Y+AD+ NH+IR IT
Sbjct: 539 KGTLKTIAGIPGTKGKIDGSALTATFNCPASVLVSDK--VYIADEQNHLIRTIT 590
>gi|157112324|ref|XP_001657496.1| hypothetical protein AaeL_AAEL000965 [Aedes aegypti]
gi|108883769|gb|EAT47994.1| AAEL000965-PA [Aedes aegypti]
Length = 812
Score = 39.7 bits (91), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 1/50 (2%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ R+ G G+ DG+ ARF+ P+ A ++VAD NH IRKI
Sbjct: 345 ILYRIGGKRA-GFVDGDFRKARFNAPQGVAFQNDTTLFVADNENHAIRKI 393
>gi|402571883|ref|YP_006621226.1| hypothetical protein Desmer_1348 [Desulfosporosinus meridiei DSM
13257]
gi|402253080|gb|AFQ43355.1| conserved repeat protein [Desulfosporosinus meridiei DSM 13257]
Length = 1977
Score = 39.7 bits (91), Expect = 0.61, Method: Composition-based stats.
Identities = 25/66 (37%), Positives = 34/66 (51%), Gaps = 3/66 (4%)
Query: 77 SEESVVKRLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
S + KR AG GV + G+ +F P AVD GN+YVAD N I+K + +
Sbjct: 112 SNHRIQKRNAGTGVWTTWGSSGSGNKQFSSPSGVAVDADGNVYVADTFNSRIQKFDS--N 169
Query: 136 GNQLIR 141
GN L +
Sbjct: 170 GNYLTQ 175
Score = 38.9 bits (89), Expect = 1.0, Method: Composition-based stats.
Identities = 17/31 (54%), Positives = 21/31 (67%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
G+ +F P AVD GN+YVAD SNH I+K
Sbjct: 88 GAGKFYSPAGVAVDGAGNVYVADTSNHRIQK 118
Score = 38.5 bits (88), Expect = 1.2, Method: Composition-based stats.
Identities = 18/35 (51%), Positives = 22/35 (62%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
G +F S AVD NIYVAD NH I+++TNL
Sbjct: 556 GDGQFSGIYSLAVDRNDNIYVADSQNHRIQRLTNL 590
>gi|392945711|ref|ZP_10311353.1| serine/threonine protein kinase [Frankia sp. QA3]
gi|392289005|gb|EIV95029.1| serine/threonine protein kinase [Frankia sp. QA3]
Length = 859
Score = 39.7 bits (91), Expect = 0.61, Method: Composition-based stats.
Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S V+ +AG G GYS DG P +A + + + G++Y +D NH IRKI
Sbjct: 622 ISPAGVISTIAGTGTAGYSGDGGPATAAQLNNAEQVTTGPDGSVYFSDYENHRIRKI 678
>gi|440733650|ref|ZP_20913345.1| nhl repeat protein [Xanthomonas translucens DAR61454]
gi|440359672|gb|ELP96967.1| nhl repeat protein [Xanthomonas translucens DAR61454]
Length = 473
Score = 39.7 bits (91), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
G G + DG G A F +P+ A++ +G +YVAD NH +R+I L
Sbjct: 204 GLGTADFIDGGIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLL 249
>gi|167041535|gb|ABZ06284.1| putative NHL repeat protein [uncultured marine microorganism
HF4000_008B14]
Length = 426
Score = 39.7 bits (91), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
+ G+G G DG A+F +P ++D G IY+AD +NH IR + +L+ G
Sbjct: 367 MLGNGQPGDRDGPGNQAQFSEPSGLSID-GGKIYIADTNNHAIR-VADLESGE 417
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 5/84 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ S+ L + DS+ + + + ++ G G G DG +A FD P
Sbjct: 111 PGKVVADADSNRLFIADSNHNRIIVTTL----DGDALKVVGSGEPGMVDGSFAAASFDHP 166
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
+ + + +YVAD NH IRK+
Sbjct: 167 QGMVL-VGDLLYVADAENHAIRKV 189
>gi|290992324|ref|XP_002678784.1| predicted protein [Naegleria gruberi]
gi|284092398|gb|EFC46040.1| predicted protein [Naegleria gruberi]
Length = 2331
Score = 39.7 bits (91), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 34/51 (66%), Gaps = 1/51 (1%)
Query: 82 VKRLAGDGVQGYSDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG+G GYSD +P A F+ S ++ G++Y++D +N+VIRK++
Sbjct: 509 ISSVAGNGYLGYSDNTQPLRASFNLAVSVLINNVGDMYISDYANNVIRKMS 559
>gi|302834543|ref|XP_002948834.1| hypothetical protein VOLCADRAFT_80356 [Volvox carteri f.
nagariensis]
gi|300266025|gb|EFJ50214.1| hypothetical protein VOLCADRAFT_80356 [Volvox carteri f.
nagariensis]
Length = 774
Score = 39.7 bits (91), Expect = 0.63, Method: Composition-based stats.
Identities = 26/83 (31%), Positives = 39/83 (46%), Gaps = 3/83 (3%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
+P +V+ S + V DS L+ P + E V LAG G G+ DG +A+F +
Sbjct: 498 HPLAVLSSADGSAIYVADSYNHRIKALN-PNTNEIVT--LAGSGAAGFRDGVGTAAQFSE 554
Query: 106 PKSFAVDMKGNIYVADKSNHVIR 128
P G I +AD +N +R
Sbjct: 555 PAGLCRGPNGTILIADTNNSAVR 577
>gi|206602867|gb|EDZ39348.1| Conserved protein of unknown function [Leptospirillum sp. Group II
'5-way CG']
Length = 374
Score = 39.7 bits (91), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFD 104
P + P L V DS +A + + V LAG G + G +DG A F
Sbjct: 174 PRGLAISPDGKTLYVADSGNNAIRKIDL---ATNTVTTLAGAGKLMSGSADGVGVQATFH 230
Query: 105 KPKSFAVDMKGNI-YVADKSNHVIRKI 130
+P+S A+ G + Y+AD N++IRK+
Sbjct: 231 EPRSLAISGDGQVLYIADTRNNLIRKM 257
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
Y DG +ARF+ P+ A+ G +Y+AD N++IRKI
Sbjct: 105 YRDGVGANARFNNPEGMAISPDGKTLYIADSRNNMIRKI 143
Score = 36.2 bits (82), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 35/86 (40%), Gaps = 4/86 (4%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + P L + DS + + V +AG DG A F+
Sbjct: 117 NPEGMAISPDGKTLYIADSRNNMIRKIDL---ATKTVSTIAGHSFPSSGDGVGKEAGFET 173
Query: 106 PKSFAVDMKG-NIYVADKSNHVIRKI 130
P+ A+ G +YVAD N+ IRKI
Sbjct: 174 PRGLAISPDGKTLYVADSGNNAIRKI 199
>gi|433676068|ref|ZP_20508221.1| RING finger protein nhl-1 [Xanthomonas translucens pv. translucens
DSM 18974]
gi|430818813|emb|CCP38486.1| RING finger protein nhl-1 [Xanthomonas translucens pv. translucens
DSM 18974]
Length = 473
Score = 39.7 bits (91), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
G G + DG G A F +P+ A++ +G +YVAD NH +R+I L
Sbjct: 204 GLGTADFIDGGIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLL 249
>gi|380692796|ref|ZP_09857655.1| hypothetical protein BfaeM_02286 [Bacteroides faecis MAJ27]
Length = 465
Score = 39.7 bits (91), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 37/79 (46%), Gaps = 2/79 (2%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE-PGSARFDKPKSFAVDMKGN 116
D +D TL+ + R GDG +G++DG+ ARF+ P S D K
Sbjct: 387 DFYFVDKQNHCVRTLTPTGRVKIYAGRPNGDGTKGFNDGDLRKEARFNYPASIVWDEKRE 446
Query: 117 -IYVADKSNHVIRKITNLD 134
+ V D +NH IRKI D
Sbjct: 447 CLLVGDSNNHRIRKIAMED 465
>gi|160883851|ref|ZP_02064854.1| hypothetical protein BACOVA_01824 [Bacteroides ovatus ATCC 8483]
gi|156110581|gb|EDO12326.1| NHL repeat protein [Bacteroides ovatus ATCC 8483]
Length = 494
Score = 39.7 bits (91), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 39/96 (40%), Gaps = 18/96 (18%)
Query: 64 SSRSAFYTLSFPLSEESVVKRLAGDGV----------------QGYSDGE-PGSARFDKP 106
+ + Y F + +++L DG+ GY DG ARFD P
Sbjct: 387 AGKDEIYDFYFTDRDNHCIRKLTPDGITSTFAGRGSVGMNIHANGYVDGALREEARFDSP 446
Query: 107 KSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIR 141
+ A D NI YV D +NH IRKI LIR
Sbjct: 447 AALAYDEVNNIFYVGDVNNHRIRKIALEKIPEDLIR 482
>gi|186683945|ref|YP_001867141.1| redoxin domain-containing protein [Nostoc punctiforme PCC 73102]
gi|186466397|gb|ACC82198.1| Redoxin domain protein [Nostoc punctiforme PCC 73102]
Length = 509
Score = 39.7 bits (91), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 30/48 (62%), Gaps = 1/48 (2%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + GDG G DG+ ++RF +P +++ +Y+AD +NH IR++
Sbjct: 432 QTIQGDGTAGLQDGQGKNSRFFEPSGLSIN-GSYLYIADTNNHAIRRV 478
Score = 39.7 bits (91), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 35/76 (46%), Gaps = 5/76 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+ L + DS SF + + L G G G +DG A+F P+ A D +
Sbjct: 184 QTGLFIADSGHHRLIMSSF----DGEILHLIGTGKSGLTDGAFNEAQFFAPQGMAYDAEN 239
Query: 116 NI-YVADKSNHVIRKI 130
I YVAD NH +R++
Sbjct: 240 QILYVADTENHTLRRV 255
>gi|423072880|ref|ZP_17061629.1| repeat protein [Desulfitobacterium hafniense DP7]
gi|361856375|gb|EHL08282.1| repeat protein [Desulfitobacterium hafniense DP7]
Length = 2017
Score = 39.7 bits (91), Expect = 0.67, Method: Composition-based stats.
Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 5/41 (12%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
G G +GE F++P AVD GNIYVAD +NH I+K
Sbjct: 956 GSNGSDEGE-----FNRPYGIAVDSDGNIYVADSNNHRIQK 991
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 18/31 (58%), Positives = 21/31 (67%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
G +F PKS AVD GNIYVAD +N I+K
Sbjct: 772 GDGQFRNPKSVAVDNDGNIYVADTTNKRIQK 802
Score = 38.1 bits (87), Expect = 1.9, Method: Composition-based stats.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 5/45 (11%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
LA G +G S G+ FD P AVD GNIYVAD +N+ ++K
Sbjct: 1189 LAKWGTRGNSAGQ-----FDNPSGIAVDNDGNIYVADTNNNRVQK 1228
Score = 37.7 bits (86), Expect = 2.2, Method: Composition-based stats.
Identities = 18/47 (38%), Positives = 26/47 (55%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
AG + G+ +F P+S AVD GNIYV+D N+ I+K +
Sbjct: 254 AGSFIMGWGGAGIDDGQFATPRSIAVDSAGNIYVSDSMNNRIQKFAS 300
Score = 35.8 bits (81), Expect = 8.2, Method: Composition-based stats.
Identities = 15/35 (42%), Positives = 22/35 (62%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
G +F+ PK AVD GN+YVAD N ++K ++
Sbjct: 1008 GLGQFNSPKGIAVDSNGNVYVADIENDRVQKFDSM 1042
>gi|288923318|ref|ZP_06417452.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288345334|gb|EFC79729.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 849
Score = 39.7 bits (91), Expect = 0.67, Method: Composition-based stats.
Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + ++ LAG DG + DG P S A P + A+D G +Y+AD SN+ IR+I
Sbjct: 678 VGSDGIITTLAGQDGEGSFGDGGPASKALLAFPLAVALDRFGRLYIADTSNNRIRRI 734
>gi|290994196|ref|XP_002679718.1| predicted protein [Naegleria gruberi]
gi|284093336|gb|EFC46974.1| predicted protein [Naegleria gruberi]
Length = 762
Score = 39.7 bits (91), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
E + +AG+G G+S DG +A+ + P S V K +++AD+ NH IRKI
Sbjct: 67 ESGRIVTIAGNGKGGFSGDDGLATNAQLNCPSSVFVSNKNEVFIADQYNHRIRKI 121
Score = 39.3 bits (90), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 35/62 (56%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E V +AG G QG+S DG P + A+ P V + IY AD +N+ IRK+ L++
Sbjct: 283 ENGNVVTVAGTGQQGFSGDGGPATRAQLACPTGLFVTLNNEIYFADSANNRIRKV--LEN 340
Query: 136 GN 137
GN
Sbjct: 341 GN 342
Score = 36.2 bits (82), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 4/61 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G +G+S +G SAR + P S V +Y D +N+ +RKI L++
Sbjct: 123 ESGRIVTIAGNGEEGFSGDNGPATSARLNCPMSIFVSNMNEVYFVDSNNNRVRKI--LEN 180
Query: 136 G 136
G
Sbjct: 181 G 181
>gi|354612773|ref|ZP_09030715.1| NHL repeat containing protein [Saccharomonospora paurometabolica
YIM 90007]
gi|353222911|gb|EHB87206.1| NHL repeat containing protein [Saccharomonospora paurometabolica
YIM 90007]
Length = 436
Score = 39.7 bits (91), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 2/61 (3%)
Query: 77 SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
+ + + +AG G +G S DG P + A P AVD GN+++ D +H +RK++ D
Sbjct: 208 ATDHTISTVAGTGREGDSGDGGPATDAELRFPNCVAVDGHGNVFLTDPRSHRVRKVSATD 267
Query: 135 H 135
H
Sbjct: 268 H 268
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 14/30 (46%), Positives = 19/30 (63%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
P S AVD GN+Y D NH +R+++ DH
Sbjct: 296 PNSLAVDGAGNLYFGDTGNHRVRRVSAADH 325
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%), Gaps = 2/53 (3%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
+AG G G S DG P A P + A+D GN+Y+ D + +RK++ DH
Sbjct: 159 VAGTGQNGDSGDGGPAVDAELRHPTTVALDRHGNVYLTDPESRRVRKVSATDH 211
>gi|298388124|ref|ZP_06997668.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|298259116|gb|EFI01996.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 435
Score = 39.7 bits (91), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 26/45 (57%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AG+ +G +G P + F +P+ G IYVAD NHVIRKI
Sbjct: 388 AGNRSEGLVNGLPLKSSFRRPEGLTRSKDGVIYVADHDNHVIRKI 432
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+++G ARF++P D GN+YV+D+ N+ IRKIT
Sbjct: 341 FANGIGEEARFNQPCQGVFDEDGNLYVSDRDNNCIRKIT 379
>gi|383111216|ref|ZP_09932033.1| hypothetical protein BSGG_2316 [Bacteroides sp. D2]
gi|313694781|gb|EFS31616.1| hypothetical protein BSGG_2316 [Bacteroides sp. D2]
Length = 419
Score = 39.7 bits (91), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 61/140 (43%), Gaps = 21/140 (15%)
Query: 56 SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
S + IV D +++ S +S + K +AGDGV+G Y DG+PG SA+
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGDPGNPLSAKVGA 329
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT---NLDHGNQLIRQINLKPEDCSKSSQSGSGMFW 162
A G +Y++D + + IRK+T + D+ + I KS S
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDASGDYSKGTLETI----AGSGKSGFSDGKGLK 385
Query: 163 VTVFSPYLI----SCKSFWV 178
T PY I CK+ +V
Sbjct: 386 ATFNQPYEIIITKDCKTMYV 405
>gi|452910311|ref|ZP_21958992.1| hypothetical protein C884_01866 [Kocuria palustris PEL]
gi|452834558|gb|EME37358.1| hypothetical protein C884_01866 [Kocuria palustris PEL]
Length = 632
Score = 39.3 bits (90), Expect = 0.70, Method: Composition-based stats.
Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 7/56 (12%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-------NIYVADKSNHVIRKI 130
V+ + GDG +G +DG P A F++P+ A+ +G ++ VAD NH +R +
Sbjct: 207 VRSVIGDGTRGPADGGPEEAHFNEPRGLALLPQGVREQVGYDVIVADTVNHRLRGV 262
>gi|29348891|ref|NP_812394.1| hypothetical protein BT_3482 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383125072|ref|ZP_09945731.1| hypothetical protein BSIG_5391 [Bacteroides sp. 1_1_6]
gi|29340797|gb|AAO78588.1| conserved hypothetical protein containing NHL repeat [Bacteroides
thetaiotaomicron VPI-5482]
gi|382983525|gb|EES65521.2| hypothetical protein BSIG_5391 [Bacteroides sp. 1_1_6]
Length = 513
Score = 39.3 bits (90), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 13/83 (15%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-------GYSDGE-PGSARFDKPKSF 109
D + D A L+ PL + V AG G GY +G+ ARF++PK+
Sbjct: 429 DFYIADKHNHAIRILT-PLGQ---VSTFAGRGSASLNSNPWGYVNGDLRKEARFERPKAI 484
Query: 110 AVDMK-GNIYVADKSNHVIRKIT 131
A D K G +YV D NH IRKI
Sbjct: 485 AYDEKTGIVYVGDAYNHRIRKIA 507
>gi|290986797|ref|XP_002676110.1| predicted protein [Naegleria gruberi]
gi|284089710|gb|EFC43366.1| predicted protein [Naegleria gruberi]
Length = 1218
Score = 39.3 bits (90), Expect = 0.72, Method: Composition-based stats.
Identities = 21/50 (42%), Positives = 32/50 (64%), Gaps = 4/50 (8%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+ G GV+GY+ D A+ +KP S +VD G+IY++D S +RKI+N
Sbjct: 675 ICGTGVEGYNGDDILATDAQLNKPTSVSVDQNGDIYISDNSR--LRKISN 722
>gi|424869138|ref|ZP_18292858.1| uncharacterized protein [Leptospirillum sp. Group II 'C75']
gi|124514239|gb|EAY55753.1| conserved protein of unknown function [Leptospirillum rubarum]
gi|387220840|gb|EIJ75456.1| uncharacterized protein [Leptospirillum sp. Group II 'C75']
Length = 389
Score = 39.3 bits (90), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFD 104
P + P L V DS +A + + V LAG G + G +DG A F
Sbjct: 189 PRGLAISPDGKTLYVADSGNNAIRKIDL---ATNTVTTLAGAGKLMSGSADGVGVQATFH 245
Query: 105 KPKSFAVDMKGNI-YVADKSNHVIRKI 130
+P+S A+ G + Y+AD N++IRK+
Sbjct: 246 EPRSLAISGDGQVLYIADTRNNLIRKM 272
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
Y DG +ARF+ P+ A+ G +YVAD N++IRKI
Sbjct: 120 YRDGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKI 158
Score = 36.6 bits (83), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 35/86 (40%), Gaps = 4/86 (4%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + P L V DS + + V +AG DG A F+
Sbjct: 132 NPEGMAISPDGKTLYVADSRNNMIRKIDL---ATKTVSTIAGHSFPSSGDGVGKEAGFET 188
Query: 106 PKSFAVDMKG-NIYVADKSNHVIRKI 130
P+ A+ G +YVAD N+ IRKI
Sbjct: 189 PRGLAISPDGKTLYVADSGNNAIRKI 214
Score = 35.8 bits (81), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 23/41 (56%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+G DG ARF+ P+ G IY+AD N +IRKI+
Sbjct: 47 RGAIDGPGREARFEFPQGILAAPDGTIYIADTGNDMIRKIS 87
>gi|350561579|ref|ZP_08930417.1| YD repeat protein [Thioalkalivibrio thiocyanoxidans ARh 4]
gi|349780611|gb|EGZ34929.1| YD repeat protein [Thioalkalivibrio thiocyanoxidans ARh 4]
Length = 2546
Score = 39.3 bits (90), Expect = 0.73, Method: Composition-based stats.
Identities = 24/79 (30%), Positives = 40/79 (50%), Gaps = 8/79 (10%)
Query: 60 IVLDSSRSAFYTLS------FPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAV 111
+ +D+ S F+T F + + V +AG+G ++ DG P +A F P+ AV
Sbjct: 1276 VAVDAGGSVFFTRRNGGPRIFRANPDGTVHHVAGNGSSTFAGDGGPAVNASFIDPRGIAV 1335
Query: 112 DMKGNIYVADKSNHVIRKI 130
G++YV D H +R+I
Sbjct: 1336 GADGSLYVGDFRTHRVRRI 1354
>gi|20092638|ref|NP_618713.1| cell surface protein [Methanosarcina acetivorans C2A]
gi|19917919|gb|AAM07193.1| cell surface protein [Methanosarcina acetivorans C2A]
Length = 526
Score = 39.3 bits (90), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 25/41 (60%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + G G+ +F P AVD GNIYVAD +NH ++K+
Sbjct: 84 IDKWGSGGSGNGQFYTPNGVAVDSMGNIYVADYNNHRVQKL 124
>gi|290980328|ref|XP_002672884.1| predicted protein [Naegleria gruberi]
gi|284086464|gb|EFC40140.1| predicted protein [Naegleria gruberi]
Length = 1380
Score = 39.3 bits (90), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 43/94 (45%), Gaps = 5/94 (5%)
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMK 114
SD + S+ + L + ++ + GDG GYS D P S+ PK+F V
Sbjct: 845 SDQTIYFSTNTKIRVLELAQNGSYYLRTIVGDGNGGYSGDNLPALSSSVQNPKAFHVKKN 904
Query: 115 GNIYVADKSNHVIRKITNLDHGNQLIRQINLKPE 148
G+I D+ N++IRK L++ I KP
Sbjct: 905 GDIVFVDEGNYLIRKFV---KATGLLKNIAGKPR 935
>gi|219851898|ref|YP_002466330.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546157|gb|ACL16607.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 387
Score = 39.3 bits (90), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
FDKP AVD GN+YVAD N+ ++K T
Sbjct: 233 FDKPSGVAVDSAGNVYVADWGNNWVQKFT 261
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
F+KP AVD GN+YVAD +N+ I+K T
Sbjct: 136 GGAFNKPGGVAVDSVGNVYVADSANNQIQKFT 167
>gi|410478931|ref|YP_006766568.1| hypothetical protein LFML04_1390 [Leptospirillum ferriphilum ML-04]
gi|406774183|gb|AFS53608.1| hypothetical protein LFML04_1390 [Leptospirillum ferriphilum ML-04]
Length = 427
Score = 39.3 bits (90), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFD 104
P + P L V DS +A + + V LAG G + G +DG A F
Sbjct: 227 PRGLAISPDGKTLYVADSGNNAIRKIDL---ATNTVTTLAGAGKLMSGSADGVGVQATFH 283
Query: 105 KPKSFAVDMKGNI-YVADKSNHVIRKI 130
+P+S A+ G + Y+AD N++IRK+
Sbjct: 284 EPRSLAISGDGQVLYIADTRNNLIRKM 310
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
Y DG +ARF+ P+ A+ G +YVAD N++IRKI
Sbjct: 158 YRDGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKI 196
Score = 36.6 bits (83), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 35/86 (40%), Gaps = 4/86 (4%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + P L V DS + + V +AG DG A F+
Sbjct: 170 NPEGMAISPDGKTLYVADSRNNMIRKIDL---ATKTVSTIAGHSFPSSGDGVGKEAGFET 226
Query: 106 PKSFAVDMKG-NIYVADKSNHVIRKI 130
P+ A+ G +YVAD N+ IRKI
Sbjct: 227 PRGLAISPDGKTLYVADSGNNAIRKI 252
Score = 35.8 bits (81), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 23/41 (56%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+G DG ARF+ P+ G IY+AD N +IRKI+
Sbjct: 85 RGAIDGPGREARFEFPQGILAAPDGTIYIADTGNDMIRKIS 125
>gi|290994368|ref|XP_002679804.1| predicted protein [Naegleria gruberi]
gi|284093422|gb|EFC47060.1| predicted protein [Naegleria gruberi]
Length = 1359
Score = 39.3 bits (90), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V +AG G++G+ DG ++ F+ P + IYVAD +NHVIRKIT
Sbjct: 520 TVSLIAGQPGIKGHIDGIALNSTFNHPSGIHAEYSA-IYVADSNNHVIRKIT 570
>gi|290978007|ref|XP_002671728.1| predicted protein [Naegleria gruberi]
gi|284085299|gb|EFC38984.1| predicted protein [Naegleria gruberi]
Length = 2520
Score = 39.3 bits (90), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 26/44 (59%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G G GY+DG +A ++ P A ++Y+AD+ NHV+R +
Sbjct: 228 GVGSAGYADGSAFNALYNAPTGVAYGPNKDLYIADRGNHVVRVV 271
>gi|290976488|ref|XP_002670972.1| predicted protein [Naegleria gruberi]
gi|284084536|gb|EFC38228.1| predicted protein [Naegleria gruberi]
Length = 704
Score = 39.3 bits (90), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 81 VVKRLAGD-GVQGYSDGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ +AG G G++D SA+ + P++ A+ G+IY+AD N+ IRKIT
Sbjct: 211 IITTIAGTAGSSGFTDNVLSTSAKLNGPQAVAIMSNGDIYIADTQNNRIRKIT 263
>gi|380693318|ref|ZP_09858177.1| hypothetical protein BfaeM_04967 [Bacteroides faecis MAJ27]
Length = 472
Score = 39.3 bits (90), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 33/64 (51%), Gaps = 9/64 (14%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS--------ARFDKPKSFAVDMKGNI-YVADKSNHV 126
L+ E +V AG G G +D G+ ARFD+P + A D N+ YV D NH
Sbjct: 391 LTPEGLVTTFAGRGSAGLNDKPYGNVNGDLREEARFDQPAAIAYDSINNVFYVGDIENHS 450
Query: 127 IRKI 130
IR+I
Sbjct: 451 IRRI 454
>gi|440683842|ref|YP_007158637.1| NHL repeat containing protein [Anabaena cylindrica PCC 7122]
gi|428680961|gb|AFZ59727.1| NHL repeat containing protein [Anabaena cylindrica PCC 7122]
Length = 506
Score = 39.3 bits (90), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 15/48 (31%), Positives = 30/48 (62%), Gaps = 1/48 (2%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + GDG+ G+ DG+ + RF +P + + +Y++D +NH IR++
Sbjct: 431 QTVLGDGLPGFQDGQGKNTRFSEPSGLS-GIDSYLYISDTNNHAIRRV 477
>gi|219853190|ref|YP_002467622.1| SMP-30/Gluconolaconase/LRE domain-containing protein
[Methanosphaerula palustris E1-9c]
gi|219547449|gb|ACL17899.1| SMP-30/Gluconolaconase/LRE domain protein [Methanosphaerula
palustris E1-9c]
Length = 343
Score = 39.3 bits (90), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 21/34 (61%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
PG +F P+ AVD GN+YV D N+ I+K T
Sbjct: 105 PGDGQFKSPEGIAVDNAGNVYVVDTVNNQIQKFT 138
Score = 35.8 bits (81), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 49/103 (47%), Gaps = 7/103 (6%)
Query: 32 YTVTTVIDGHQLEINPHSVIDRPGS-SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
Y V TV + Q + + I + G+ + I +DS + + T + + V++ DG
Sbjct: 125 YVVDTVNNQIQKFTSDGTFITKWGTRASGIAVDSDGNVYVTDA----TSNQVQKFTSDGT 180
Query: 91 QGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ G GS +F P+ AVD GN+YV D N ++K T
Sbjct: 181 HLLNWGTSGSENGQFSSPEGIAVDNAGNVYVNDVGNQRVQKFT 223
>gi|149279484|ref|ZP_01885614.1| hypothetical protein PBAL39_16831 [Pedobacter sp. BAL39]
gi|149229777|gb|EDM35166.1| hypothetical protein PBAL39_16831 [Pedobacter sp. BAL39]
Length = 742
Score = 39.3 bits (90), Expect = 0.78, Method: Composition-based stats.
Identities = 16/52 (30%), Positives = 30/52 (57%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + ++ +G+ DG A+F + + A+D GNIY+ D N+ IRK+
Sbjct: 688 QNTISTVSAQTTRGFKDGGLNEAQFMQRDAMAMDKDGNIYIVDMGNNAIRKL 739
Score = 38.1 bits (87), Expect = 1.8, Method: Composition-based stats.
Identities = 23/63 (36%), Positives = 30/63 (47%)
Query: 68 AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
+ Y F L R AG+G G +DG SA PK F D KGN+Y +D +
Sbjct: 622 SLYIGRFDLVNSIYDLRWAGNGNFGNADGPIKSAMLVSPKGFVGDGKGNMYFSDGFSQNT 681
Query: 128 RKI 130
RK+
Sbjct: 682 RKV 684
>gi|290985668|ref|XP_002675547.1| predicted protein [Naegleria gruberi]
gi|284089144|gb|EFC42803.1| predicted protein [Naegleria gruberi]
Length = 1037
Score = 39.3 bits (90), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 2/53 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
V+ + G+G+ G+S DGE +A+ ++P VD G + D SN+ IR I+N
Sbjct: 28 VEVIGGNGISGFSGDGELAINAKLNRPTKIVVDSAGRVVFCDNSNNRIRMISN 80
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 45/86 (52%), Gaps = 6/86 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG-SARFDK 105
P+ V+ S D+ + D+S +S S ++ +AG G G+ D A+ +
Sbjct: 445 PYGVV-TSSSGDVFIADTSNCRIRKVS---SSTGIITTVAG-GTCGFGDNVLAVDAQLNT 499
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
P +V+ KG +++AD +NH IRK++
Sbjct: 500 PYGISVNSKGELFIADTNNHRIRKVS 525
>gi|339008739|ref|ZP_08641312.1| cell surface protein [Brevibacillus laterosporus LMG 15441]
gi|338774539|gb|EGP34069.1| cell surface protein [Brevibacillus laterosporus LMG 15441]
Length = 633
Score = 39.3 bits (90), Expect = 0.82, Method: Composition-based stats.
Identities = 18/44 (40%), Positives = 27/44 (61%), Gaps = 5/44 (11%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
G +G +G+ F++P A+D K NIYV D++NH I+K N
Sbjct: 487 GQKGTKEGQ-----FNQPMQLAIDSKDNIYVVDRNNHRIQKFDN 525
>gi|209522300|ref|ZP_03270926.1| NHL repeat containing protein [Burkholderia sp. H160]
gi|209497260|gb|EDZ97489.1| NHL repeat containing protein [Burkholderia sp. H160]
Length = 383
Score = 39.3 bits (90), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINL 145
G+ ++ + D G F P S AVD ++YVAD+SN R+I D + +RQ +
Sbjct: 220 GNWLKSWGDRGTGPGEFHTPHSIAVDANDHVYVADRSN---RRIQVFDTEGRFLRQFTI 275
>gi|219851408|ref|YP_002465840.1| carbohydrate binding family 6 [Methanosphaerula palustris E1-9c]
gi|219545667|gb|ACL16117.1| Carbohydrate binding family 6 [Methanosphaerula palustris E1-9c]
Length = 627
Score = 39.3 bits (90), Expect = 0.84, Method: Composition-based stats.
Identities = 31/93 (33%), Positives = 40/93 (43%), Gaps = 20/93 (21%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPE 148
G G DG+ F+ P+ AVD GNIYVAD N ++K T+ G + PE
Sbjct: 39 GTNGSGDGQ-----FNYPQGIAVDGAGNIYVADNGNKQVQKFTS--SGTFVTAWGTAGPE 91
Query: 149 DCSKSSQSG-------------SGMFWVTVFSP 168
D S SG +G F V F+P
Sbjct: 92 DDRFSEPSGIAVDSEGNVYVTDTGKFRVQKFTP 124
Score = 35.8 bits (81), Expect = 8.7, Method: Composition-based stats.
Identities = 16/46 (34%), Positives = 23/46 (50%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+G V + P RF +P AVD +GN+YV D ++K T
Sbjct: 78 SGTFVTAWGTAGPEDDRFSEPSGIAVDSEGNVYVTDTGKFRVQKFT 123
>gi|378548897|ref|ZP_09824113.1| hypothetical protein CCH26_02380 [Citricoccus sp. CH26A]
Length = 708
Score = 39.3 bits (90), Expect = 0.84, Method: Composition-based stats.
Identities = 17/46 (36%), Positives = 26/46 (56%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
AG G++G DG+P A F + A G I+VAD + +R+I+
Sbjct: 396 AGTGLEGLEDGDPTVAWFAQSSGLAESADGTIWVADSESSALRRIS 441
>gi|115376708|ref|ZP_01463935.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
gi|310822897|ref|YP_003955255.1| hypothetical protein STAUR_5665 [Stigmatella aurantiaca DW4/3-1]
gi|115366257|gb|EAU65265.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
gi|309395969|gb|ADO73428.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
Length = 856
Score = 39.3 bits (90), Expect = 0.86, Method: Composition-based stats.
Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 10/71 (14%)
Query: 77 SEESVVKRLAGDGVQ----------GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
S+ ++R+A DG G DG +ARF++P A G + VAD N V
Sbjct: 635 SDNHCIRRIAPDGTATVFAGAVMEPGQGDGTAKAARFNQPAGLAFGPGGELLVADLGNGV 694
Query: 127 IRKITNLDHGN 137
IR++ GN
Sbjct: 695 IRRVDLSAPGN 705
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 21/49 (42%), Positives = 25/49 (51%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V LAG G G DGE SA P AV G +YVAD N ++R +
Sbjct: 806 VTTLAGSGRFGSRDGEGQSADLVLPAGLAVGEDGTLYVADAGNALLRAV 854
>gi|290971380|ref|XP_002668485.1| predicted protein [Naegleria gruberi]
gi|284081915|gb|EFC35741.1| predicted protein [Naegleria gruberi]
Length = 277
Score = 39.3 bits (90), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 8/89 (8%)
Query: 44 EINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARF 103
+IN H + R SSD ++ + + TL+ P+S + + +G DG +
Sbjct: 138 DINNHRI--RKVSSDGVISTIAGTGRTTLN-PISPNTYRQEGGYNG-----DGLATATDL 189
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+ PK VD KG++Y AD N +IRK++N
Sbjct: 190 NYPKYAQVDSKGDVYFADTYNGLIRKVSN 218
>gi|158318814|ref|YP_001511322.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158114219|gb|ABW16416.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 732
Score = 39.3 bits (90), Expect = 0.88, Method: Composition-based stats.
Identities = 19/48 (39%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG G GYS DG P A +P++ AVD +G + +AD+ + IR++
Sbjct: 581 VAGSGTPGYSGDGGPAVHASLTRPQAVAVDAQGRLLIADQEHRRIRRV 628
>gi|159113256|ref|XP_001706855.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia ATCC
50803]
gi|157434955|gb|EDO79181.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia ATCC
50803]
Length = 969
Score = 39.3 bits (90), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 42/86 (48%), Gaps = 6/86 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDK 105
PHS++ P + + + D+ TLS + VK LAG GV GY DG + F++
Sbjct: 143 PHSIVSDPVNGIIYLTDAKNHCIRTLSLMTRK---VKTLAGTPGVFGYKDGL--NPLFNE 197
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
P + GN+ V D N +R ++
Sbjct: 198 PLGLVLTEDGNVIVCDSKNGALRYVS 223
>gi|47219684|emb|CAG12606.1| unnamed protein product [Tetraodon nigroviridis]
Length = 805
Score = 39.3 bits (90), Expect = 0.89, Method: Composition-based stats.
Identities = 40/137 (29%), Positives = 62/137 (45%), Gaps = 27/137 (19%)
Query: 16 SLLFSLVSSG-----LLLEDG----YTVTTVIDGHQLEIN------PHSVIDRPGSSDLI 60
+LLFSLV G +L D Y ++ H + + P S++ PG I
Sbjct: 168 NLLFSLVGEGHWDKQMLFTDAALRYYRERGLLRTHAVPLKLLRDSLPPSILSFPGK---I 224
Query: 61 VLD--SSRSAF----YTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
+D S R A + +S + ++ G G DG+ A F+ P+ A+ K
Sbjct: 225 AIDDRSKRVAIADTGHHRVLLVSTTGQLLQVIGGPESGRKDGDLSEASFNSPQGVAI--K 282
Query: 115 GN-IYVADKSNHVIRKI 130
G+ +YVAD NH+IRK+
Sbjct: 283 GDTVYVADTENHLIRKV 299
>gi|260904571|ref|ZP_05912893.1| NHL repeat-containing protein [Brevibacterium linens BL2]
Length = 647
Score = 39.3 bits (90), Expect = 0.90, Method: Composition-based stats.
Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 12/95 (12%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V + P S +L+V DS + + + S +++++R+ G G +G +DG+ SA F +P
Sbjct: 186 PGKVTELP-SGNLLVADSGHHSL--VEYDASGQNIIRRI-GTGERGANDGDFTSASFSEP 241
Query: 107 KSFAV---DMKG----NIYVADKSNHVIRKITNLD 134
V D+ ++ VAD NH +R I NLD
Sbjct: 242 GGITVLPDDVAAKAGYHLVVADTVNHTLRGI-NLD 275
Score = 37.7 bits (86), Expect = 2.1, Method: Composition-based stats.
Identities = 21/95 (22%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
Query: 39 DGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
DG L++ +P V+ P +++++V + ++ E ++ L+G +G +D
Sbjct: 308 DGPALDVKLSSPWDVLYIPATAEVVVAMAGNHTIWSFD---PETGSIRLLSGTMNEGLAD 364
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
GE +A F + + G++++AD IR++
Sbjct: 365 GEAEAAWFAQTSGLDLSSDGDVFIADSETSAIRRL 399
>gi|392403177|ref|YP_006439789.1| NHL repeat containing protein [Turneriella parva DSM 21527]
gi|390611131|gb|AFM12283.1| NHL repeat containing protein [Turneriella parva DSM 21527]
Length = 392
Score = 38.9 bits (89), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AGD G +DG ARF+ P+ D N++VAD N +RKI
Sbjct: 174 AGDTTSGDTDGTANDARFNNPQGIVTDGT-NLFVADSLNRKVRKI 217
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 41/89 (46%), Gaps = 10/89 (11%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGY--SDGEPGSA 101
NP ++ ++L V DS + +V LAG GV G +DG +A
Sbjct: 193 NPQGIVTD--GTNLFVADSLNRKVRKIVI---ASGIVSTLAGPAQGVTGSGDTDGSANTA 247
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
RF P + D N+Y+ D SNH IRKI
Sbjct: 248 RFGLPGAMTTDGV-NLYLCDSSNHKIRKI 275
Score = 35.8 bits (81), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 30/54 (55%), Gaps = 5/54 (9%)
Query: 81 VVKRLAGDG----VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
VV LAG G + G DG +ARF ++ D N+YVAD SN+ IRKI
Sbjct: 107 VVSTLAGPGPGTSLSGDVDGIGEAARFSSMRAITTDGT-NLYVADNSNNKIRKI 159
>gi|290979059|ref|XP_002672252.1| predicted protein [Naegleria gruberi]
gi|284085827|gb|EFC39508.1| predicted protein [Naegleria gruberi]
Length = 1239
Score = 38.9 bits (89), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 48/100 (48%), Gaps = 10/100 (10%)
Query: 39 DGHQLEINPHS-----VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
DG+ L N + V++R G +L ++ LSF + + +AG+G GY
Sbjct: 42 DGNALVTNLKNPRGLFVVERNGG-ELYFVEEENHIIRKLSFVTNN---ITTIAGNGTAGY 97
Query: 94 SDGEPGSARFDKPKSFAV-DMKGNIYVADKSNHVIRKITN 132
SDG + + P S V + G IY+AD N +R++ N
Sbjct: 98 SDGVATLSMLNYPHSVHVSNTTGEIYIADTFNCKLRRVFN 137
>gi|290973961|ref|XP_002669715.1| predicted protein [Naegleria gruberi]
gi|284083266|gb|EFC36971.1| predicted protein [Naegleria gruberi]
Length = 710
Score = 38.9 bits (89), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 31/49 (63%), Gaps = 3/49 (6%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AG+G +G+S +G +A+ + P V +Y+AD SNHVIRKI+
Sbjct: 144 IAGNGTKGFSGDNGPATNAQLNGPAGVFVS-NNEVYIADYSNHVIRKIS 191
Score = 36.2 bits (82), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S+ + +AG+G G+S +G +A+ P V +Y++D NHVIRKI
Sbjct: 190 ISQNGTIVTIAGNGKPGFSGDNGLATNAQLYNPSGTFVSSNNEVYISDCFNHVIRKI 246
>gi|145340877|ref|XP_001415544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575767|gb|ABO93836.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1675
Score = 38.9 bits (89), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 38/81 (46%), Gaps = 10/81 (12%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGD-----GVQGYS----DGEPGSARFDKPKSFA 110
+V D S A Y L + S + +G G QG S + P SARF+ P+ A
Sbjct: 186 MVFDESDGAIYFLGSSQNHVSKIAYGSGTVSTYAGSQGTSGLVDNVTPSSARFNAPEGLA 245
Query: 111 VDMKGNI-YVADKSNHVIRKI 130
+D I YVAD NH +R I
Sbjct: 246 LDGANRILYVADTGNHAVRAI 266
>gi|427430403|ref|ZP_18920257.1| hypothetical protein C882_1420 [Caenispirillum salinarum AK4]
gi|425878863|gb|EKV27574.1| hypothetical protein C882_1420 [Caenispirillum salinarum AK4]
Length = 498
Score = 38.9 bits (89), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 41/90 (45%), Gaps = 5/90 (5%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTL---SFPLSEESVVKRLAGDGVQ--GYSDGEPGS 100
P + P +S L DS SA + + L + + V+ L G G+ G+ DG +
Sbjct: 344 QPSGLALAPDASALYFADSETSAVRKVILDGWRLGQTARVETLVGRGLFDFGHDDGPMTA 403
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AR P AV G + VAD NH IR I
Sbjct: 404 ARMQHPLGVAVMPDGRVAVADSYNHAIRLI 433
>gi|325917587|ref|ZP_08179787.1| NHL repeat protein [Xanthomonas vesicatoria ATCC 35937]
gi|325536221|gb|EGD08017.1| NHL repeat protein [Xanthomonas vesicatoria ATCC 35937]
Length = 472
Score = 38.9 bits (89), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL +G
Sbjct: 198 VLRQFGHGNADLIDGGTGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRNGQ 251
>gi|254447301|ref|ZP_05060768.1| NHL repeat containing protein [gamma proteobacterium HTCC5015]
gi|198263440|gb|EDY87718.1| NHL repeat containing protein [gamma proteobacterium HTCC5015]
Length = 2515
Score = 38.9 bits (89), Expect = 1.00, Method: Composition-based stats.
Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ E +++ AG +GYS G A + P+ AV G +++AD NH IRK+
Sbjct: 1213 ITPEGEIEKYAGKETRGYSGDGGAATEAELNLPRGMAVSSGGTLFIADWGNHRIRKV 1269
>gi|116007528|ref|NP_001036460.1| CG12547, isoform A [Drosophila melanogaster]
gi|442622271|ref|NP_001260704.1| CG12547, isoform B [Drosophila melanogaster]
gi|17945781|gb|AAL48938.1| RE33981p [Drosophila melanogaster]
gi|30923523|gb|EAA46001.1| CG12547, isoform A [Drosophila melanogaster]
gi|220948326|gb|ACL86706.1| CG12547-PA [synthetic construct]
gi|440214080|gb|AGB93239.1| CG12547, isoform B [Drosophila melanogaster]
Length = 717
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 30/56 (53%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
L+ VV+ G G+ DG+ ARF+ P+ A + + VAD NH IR+I+
Sbjct: 252 LTAGGVVQHRIGGHQPGFVDGDLTVARFNNPQGIAFLNENVLIVADTKNHAIRQIS 307
>gi|299145011|ref|ZP_07038079.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
gi|298515502|gb|EFI39383.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
Length = 459
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 26/40 (65%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G+++G+ A+ ++P +D +GNIYVA + H I KIT
Sbjct: 364 GFTEGQGNEAKLEEPCQMDLDEEGNIYVAVRKKHRIAKIT 403
>gi|423300918|ref|ZP_17278942.1| hypothetical protein HMPREF1057_02083 [Bacteroides finegoldii
CL09T03C10]
gi|408472253|gb|EKJ90781.1| hypothetical protein HMPREF1057_02083 [Bacteroides finegoldii
CL09T03C10]
Length = 459
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 25/43 (58%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G G+ +G+ AR +P +D +GNIYVA + H I KIT
Sbjct: 361 GKTGFVEGQGNDARLQEPCQMDLDEEGNIYVAVRKKHRIAKIT 403
>gi|119509213|ref|ZP_01628363.1| NHL repeat protein [Nodularia spumigena CCY9414]
gi|119466055|gb|EAW46942.1| NHL repeat protein [Nodularia spumigena CCY9414]
Length = 502
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 51/124 (41%), Gaps = 10/124 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ P + L + DS S+ + + L G G G +DG A+F P
Sbjct: 177 PGKVLATP--AGLFIADSGHHRVVMSSY----DGEILHLIGTGKSGLTDGAFNEAQFFAP 230
Query: 107 KSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTV 165
+ A D + +YVAD NH +R++ D Q+++ I E G +
Sbjct: 231 QGMAFDPENQMLYVADTENHTLRRV---DLKRQIVKTIAGTGEQSRNIYPHGGAGIETAL 287
Query: 166 FSPY 169
SP+
Sbjct: 288 NSPW 291
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+ + GDG G DG+ + RF +P +V M +Y++D +NH IR+
Sbjct: 433 QTILGDGSAGLQDGQGQNTRFFEPSGLSV-MGDYLYISDTNNHAIRR 478
>gi|317419409|emb|CBN81446.1| NHL repeat-containing protein 2 [Dicentrarchus labrax]
Length = 719
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 61/141 (43%), Gaps = 27/141 (19%)
Query: 16 SLLFSLVSSG-----LLLEDG----YTVTTVIDGHQLEIN------PHSVIDRPGSSDLI 60
+LLFS+V G +L D Y ++ H ++I P S + PG +
Sbjct: 168 NLLFSIVGEGHRDRLMLFTDCALRYYGEQGLLKTHAVKIKLYRDSLPTSFLSFPGK---V 224
Query: 61 VLDSSRSAFYTLS------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
+DSS LS + + G G DG+ A F+ P+ A+ K
Sbjct: 225 AIDSSEKRLAIADTGHHRILLLSSTGQLLHVIGGPESGRRDGDLSEASFNSPQGVAI--K 282
Query: 115 GN-IYVADKSNHVIRKITNLD 134
G+ +YVAD NH+IRKI L+
Sbjct: 283 GDTVYVADTENHLIRKIDLLE 303
>gi|428166914|gb|EKX35881.1| hypothetical protein GUITHDRAFT_79273, partial [Guillardia theta
CCMP2712]
Length = 318
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 44/93 (47%), Gaps = 5/93 (5%)
Query: 40 GHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP 98
G + IN P+ V P + +IV D+ + + E LAG + G ++G
Sbjct: 107 GTKARINTPYGVKVSPCGNYVIVSDTGNNMIRKVDI---ESGYTNTLAGQSLAGTANGVG 163
Query: 99 GSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
A+F+ P VD + YV+D+ N+ IRKI
Sbjct: 164 TLAQFNMPVDVTVDWNETVAYVSDQGNNCIRKI 196
>gi|386875467|ref|ZP_10117636.1| NHL repeat protein, partial [Candidatus Nitrosopumilus salaria
BD31]
gi|386806721|gb|EIJ66171.1| NHL repeat protein, partial [Candidatus Nitrosopumilus salaria
BD31]
Length = 65
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G+ FD P AVD GNIYVAD +N+ ++K
Sbjct: 23 GNGNFDAPSGVAVDSAGNIYVADTNNNRVQKF 54
>gi|78047814|ref|YP_363989.1| hypothetical protein XCV2258 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78036244|emb|CAJ23935.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 499
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G + DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 225 VLRQFGHGNADFIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 278
>gi|449670570|ref|XP_002158620.2| PREDICTED: NHL repeat-containing protein 2-like [Hydra
magnipapillata]
Length = 629
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 29/44 (65%), Gaps = 1/44 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
LAG+GV G +G+ SA F++P + + IY+AD +NH+IR
Sbjct: 496 LAGNGVVGIKNGDFLSAEFNEPGGI-ICLDDTIYIADTNNHLIR 538
Score = 38.5 bits (88), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
VVKR+ G G G+ DGE ++F P+ + ++ I+VAD NH IR+I
Sbjct: 244 VVKRI-GTGFPGFKDGEAHVSQFSSPQGLSW-IENVIFVADTENHSIREIN 292
>gi|219852087|ref|YP_002466519.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546346|gb|ACL16796.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 668
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 26/82 (31%), Positives = 38/82 (46%), Gaps = 5/82 (6%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
+D+PGS V SR Y + + V G +Q S G G +FD P A
Sbjct: 139 LDQPGS----VAVDSRGQIYVADWGNNRVQVFNSTGGYLMQWGSSGS-GDGQFDGPNGIA 193
Query: 111 VDMKGNIYVADKSNHVIRKITN 132
+D GN+YV D N+ I++ +
Sbjct: 194 IDSTGNVYVTDAYNNRIQEFNS 215
Score = 36.2 bits (82), Expect = 6.8, Method: Composition-based stats.
Identities = 15/47 (31%), Positives = 26/47 (55%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
AG + + PG+ +F P A+D GN+Y+ D N+ ++K T+
Sbjct: 450 AGGYLSQWGSHGPGAGQFSGPNGIALDSTGNVYITDWGNNRVQKFTS 496
>gi|383115566|ref|ZP_09936322.1| hypothetical protein BSGG_2564 [Bacteroides sp. D2]
gi|313695029|gb|EFS31864.1| hypothetical protein BSGG_2564 [Bacteroides sp. D2]
Length = 459
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 25/40 (62%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G+ +G+ AR ++P +D +GNIYVA + H I KIT
Sbjct: 364 GFVEGQGNDARLEEPCQMDLDEEGNIYVAVRKKHRIAKIT 403
>gi|291227073|ref|XP_002733512.1| PREDICTED: tripartite motif protein-like, partial [Saccoglossus
kowalevskii]
Length = 594
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 3/67 (4%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSG 159
+A F++P VD KGN+ VAD N ++I LD + I QI L E + + + +G
Sbjct: 429 NAEFNQPVGVTVDKKGNVLVADFGN---KRIQILDSNGKYIDQILLDSERPTGVAMAPNG 485
Query: 160 MFWVTVF 166
V VF
Sbjct: 486 NIVVVVF 492
>gi|336399828|ref|ZP_08580628.1| NHL repeat containing protein [Prevotella multisaccharivorax DSM
17128]
gi|336069564|gb|EGN58198.1| NHL repeat containing protein [Prevotella multisaccharivorax DSM
17128]
Length = 494
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 16/50 (32%), Positives = 25/50 (50%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
+G+ DG +A+F+ P+ G +Y+AD NH IR I N +
Sbjct: 392 KGWEDGSLKNAKFNYPRQICFTKDGRLYIADSQNHCIRSIDTTQGANARV 441
>gi|383123946|ref|ZP_09944616.1| hypothetical protein BSIG_4035 [Bacteroides sp. 1_1_6]
gi|251838822|gb|EES66907.1| hypothetical protein BSIG_4035 [Bacteroides sp. 1_1_6]
Length = 451
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/108 (22%), Positives = 51/108 (47%), Gaps = 8/108 (7%)
Query: 30 DGYTVTTVIDGHQLEI-----NPHSVI-DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
D + T VI+G++ + N ++I D G ++ + D+ + ++ V
Sbjct: 344 DNTSTTQVINGYRTDCLFNWNNQMTIIVDESGQKNMYIADAGTQTIRKID--MNTGMVTT 401
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ V+G G P A F+ PK + +G++Y++D + +RK++
Sbjct: 402 VVGRQNVKGNQSGTPLEATFNWPKGVGLTAEGDLYISDCGSGCVRKLS 449
>gi|348175195|ref|ZP_08882089.1| NHL repeat-containing protein [Saccharopolyspora spinosa NRRL
18395]
Length = 630
Score = 38.9 bits (89), Expect = 1.2, Method: Composition-based stats.
Identities = 30/95 (31%), Positives = 50/95 (52%), Gaps = 13/95 (13%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P VI PG +L+V DS+ + + F E+VV+R+ G G +G +DG +A F +P
Sbjct: 194 PGKVIALPGG-NLLVSDSANHSL--VEFAADGETVVRRI-GTGQRGRADGV--AAEFAEP 247
Query: 107 KSFAV-------DMKGNIYVADKSNHVIRKITNLD 134
A+ + ++ VAD NH++R + +D
Sbjct: 248 AGLALLPQRVAEQVGYDVVVADTVNHLLRGVRLVD 282
>gi|219851493|ref|YP_002465925.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219545752|gb|ACL16202.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 390
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 20/33 (60%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G +FD P AVD GN+YVAD N I+K T
Sbjct: 39 GIGQFDHPGGVAVDSAGNVYVADTENDRIQKFT 71
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G FD+P AVD GN+YVAD N+ ++K T
Sbjct: 180 GVGEFDEPFGIAVDGTGNVYVADTYNNRVQKFT 212
>gi|383454552|ref|YP_005368541.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
gi|380728713|gb|AFE04715.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
Length = 861
Score = 38.5 bits (88), Expect = 1.2, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 37/70 (52%), Gaps = 9/70 (12%)
Query: 77 SEESVVKRLAGDG--------VQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVI 127
+E +V++R++ +G GY DG SA F++P + G + VAD +N VI
Sbjct: 642 TENAVIRRISPEGKAEVFAGTTWGYQDGPALSAAFNQPAGLSFTPDGTALLVADLNNSVI 701
Query: 128 RKITNLDHGN 137
R+I + GN
Sbjct: 702 RRIDMVTPGN 711
Score = 36.2 bits (82), Expect = 6.9, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 32/73 (43%), Gaps = 1/73 (1%)
Query: 59 LIVLDSSRSAFYTLSFPLS-EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
L + D L+F S E + LAG G G DG A+ P A+ G +
Sbjct: 787 LAISDPGNYRVRRLTFNASGEPEKLTTLAGSGRYGAEDGTGREAQLVLPAGLALGPDGTL 846
Query: 118 YVADKSNHVIRKI 130
YVAD N ++R I
Sbjct: 847 YVADAGNSLVRAI 859
>gi|290995104|ref|XP_002680171.1| predicted protein [Naegleria gruberi]
gi|284093791|gb|EFC47427.1| predicted protein [Naegleria gruberi]
Length = 928
Score = 38.5 bits (88), Expect = 1.2, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+S +G +A+ + P S V +Y+ D +NH IRKI L +
Sbjct: 95 ENGKIVTIAGNGTGGFSGDNGPATNAQLNNPYSVFVSSNNEVYIVDYNNHRIRKI--LKN 152
Query: 136 GN 137
GN
Sbjct: 153 GN 154
Score = 38.5 bits (88), Expect = 1.3, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+S +G +A+ + P V +Y+AD NH IRKI L++
Sbjct: 478 ENGNIVTIAGNGTGGFSGDNGPATNAQLNNPMGVFVSSNNEVYIADYYNHRIRKI--LEN 535
Query: 136 GN 137
GN
Sbjct: 536 GN 537
Score = 38.1 bits (87), Expect = 1.6, Method: Composition-based stats.
Identities = 21/55 (38%), Positives = 32/55 (58%), Gaps = 4/55 (7%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
+AG+G G+S +G +A+ + P V +Y+AD NH IRKI L++GN
Sbjct: 158 IAGNGTGGFSGDNGPATNAQLNNPMGVFVSSNNEVYIADYYNHRIRKI--LENGN 210
Score = 37.4 bits (85), Expect = 3.0, Method: Composition-based stats.
Identities = 23/71 (32%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 69 FYTLSFPLSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
F L L++ + +AG+G G+ +G +A+ P S V +Y+ D +NH
Sbjct: 340 FIDLKRSLTKNGNIVTIAGNGTAGFRGDNGPATNAQLYNPYSVFVSSNNEVYIVDYNNHR 399
Query: 127 IRKITNLDHGN 137
IRKI L +GN
Sbjct: 400 IRKI--LKNGN 408
Score = 36.6 bits (83), Expect = 5.6, Method: Composition-based stats.
Identities = 19/48 (39%), Positives = 27/48 (56%), Gaps = 2/48 (4%)
Query: 85 LAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG+G G+ +G +A+ P S V +Y+AD SNH IRKI
Sbjct: 46 IAGNGTAGFRGDNGPATNAQLYNPYSVFVSSNNEVYIADFSNHRIRKI 93
>gi|290995072|ref|XP_002680155.1| predicted protein [Naegleria gruberi]
gi|284093775|gb|EFC47411.1| predicted protein [Naegleria gruberi]
Length = 1951
Score = 38.5 bits (88), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 2/67 (2%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ ++ +AG G GYS DG SA+ P A+ IY+ D +N+ IRK++
Sbjct: 184 ITTSGIISTIAGTGTSGYSGDGSSAISAQLYNPYGVAISSSDEIYITDTNNNRIRKLSPW 243
Query: 134 DHGNQLI 140
GN ++
Sbjct: 244 CTGNAIL 250
>gi|195164496|ref|XP_002023083.1| GL21149 [Drosophila persimilis]
gi|194105168|gb|EDW27211.1| GL21149 [Drosophila persimilis]
Length = 595
Score = 38.5 bits (88), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 31/56 (55%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S + VV+ G G+ DG+ +RF+ P+ A + + VAD NH +RKI+
Sbjct: 252 VSADGVVEHKIGGLQAGFVDGKLALSRFNSPQGVAFLNESTLIVADTENHALRKIS 307
>gi|147818279|emb|CAN64724.1| hypothetical protein VITISV_026725 [Vitis vinifera]
Length = 423
Score = 38.5 bits (88), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 28/48 (58%), Gaps = 1/48 (2%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
++ G +G DG A F++P+ A + K N+ YVAD NH +R+I
Sbjct: 54 QIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREI 101
>gi|197120448|ref|YP_002132399.1| fibronectin type III domain-containing protein [Anaeromyxobacter
sp. K]
gi|196170297|gb|ACG71270.1| Fibronectin type III domain protein [Anaeromyxobacter sp. K]
Length = 460
Score = 38.5 bits (88), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 35/77 (45%), Gaps = 6/77 (7%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGN 116
L V D+ +A L VV LAGD GY+DG +ARF P A+ G
Sbjct: 281 LYVADTMNNAIRALDL---ATGVVSTLAGDPASYAGYADGRGTAARFSAPYGLAM-HGGE 336
Query: 117 IYVADKSNHVIRKITNL 133
+ VAD N IR + L
Sbjct: 337 LLVADSGNSAIRAVDPL 353
>gi|290995833|ref|XP_002680487.1| serine/threonine kinase [Naegleria gruberi]
gi|284094108|gb|EFC47743.1| serine/threonine kinase [Naegleria gruberi]
Length = 1078
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 8/94 (8%)
Query: 39 DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP 98
D + +IN H + R SSD ++ + + TL+ P+S + + +G DG
Sbjct: 133 DVYICDINNHRI--RKVSSDGVISTIAGTGRTTLN-PISPNTYRQEGGYNG-----DGLA 184
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+ + PK VD KG++Y AD N +IRK++N
Sbjct: 185 TATDLNYPKYAQVDSKGDVYFADTYNGLIRKVSN 218
>gi|295700390|ref|YP_003608283.1| NHL repeat containing protein [Burkholderia sp. CCGE1002]
gi|295439603|gb|ADG18772.1| NHL repeat containing protein [Burkholderia sp. CCGE1002]
Length = 383
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINL 145
G+ ++ + D G +F P S AVD ++YVAD+SN R+I D +RQ +
Sbjct: 220 GNWLKSWGDRGTGPGQFHTPHSIAVDAHDHVYVADRSN---RRIQVFDTEGTFLRQFTI 275
>gi|391330742|ref|XP_003739813.1| PREDICTED: NHL repeat-containing protein 2-like [Metaseiulus
occidentalis]
Length = 685
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ + G G G+ DG G RF P+ G I+VAD NH IR++
Sbjct: 258 VQLIVGTGEAGFLDGAYGICRFSSPQGVCF-YDGGIFVADAGNHAIRRV 305
>gi|383124876|ref|ZP_09945537.1| hypothetical protein BSIG_1376 [Bacteroides sp. 1_1_6]
gi|382983584|gb|EES69051.2| hypothetical protein BSIG_1376 [Bacteroides sp. 1_1_6]
Length = 479
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 9/65 (13%)
Query: 76 LSEESVVKRLAGDG-------VQGYSDGE-PGSARFDKPKSFAVDMKGN-IYVADKSNHV 126
++ + ++ AG G V GY DGE ARFD P A D + Y+AD+ NH
Sbjct: 413 ITPDGIISTYAGRGSVSSDGRVNGYIDGELRTEARFDSPCGIAYDEETQTFYIADRENHR 472
Query: 127 IRKIT 131
IR I+
Sbjct: 473 IRTIS 477
>gi|307106270|gb|EFN54516.1| hypothetical protein CHLNCDRAFT_31604 [Chlorella variabilis]
Length = 1072
Score = 38.5 bits (88), Expect = 1.3, Method: Composition-based stats.
Identities = 24/83 (28%), Positives = 38/83 (45%), Gaps = 4/83 (4%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
+P +V+ P S + V DS L + ++ LAG G GY DG +A+ +
Sbjct: 813 HPLAVLAAP-SGKVYVADSYNHRLKELD---PATNTIRTLAGSGAAGYRDGVGTTAQLSE 868
Query: 106 PKSFAVDMKGNIYVADKSNHVIR 128
P A G + + D +N +IR
Sbjct: 869 PAGLAAGPDGTVIICDTNNSLIR 891
>gi|397779898|ref|YP_006544371.1| NHL repeat-containing protein 2 [Methanoculleus bourgensis MS2]
gi|396938400|emb|CCJ35655.1| NHL repeat-containing protein 2 [Methanoculleus bourgensis MS2]
Length = 483
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 1/56 (1%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKIT 131
S + + + G G G +DG A F P+ A D + G +YVAD NH IR+++
Sbjct: 196 SRDGAILGVIGTGAPGNADGSSSEAAFYLPEGLAFDEEAGVLYVADAGNHTIRQVS 251
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
G G +GY +G G AR +P V++ G Y+AD NH +R + H
Sbjct: 427 GSGDRGYQNGVSGDARLSEPGGL-VNLGGLWYIADTGNHAVRVYDPVSH 474
>gi|253748633|gb|EET02664.1| Haloacid dehalogenase-like hydrolase family [Giardia intestinalis
ATCC 50581]
Length = 963
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 6/86 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDK 105
PHSV+ P + + + D+ TLS + V+ LAG GV GY DG + F++
Sbjct: 143 PHSVVSDPVNGIIYLTDTKNHCIRTLSLATRK---VRTLAGTPGVFGYRDGL--NPLFNE 197
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
P + NI V D N +R I+
Sbjct: 198 PLGLVLTEDSNIIVCDSKNGALRHIS 223
>gi|254417234|ref|ZP_05030979.1| Redoxin family [Coleofasciculus chthonoplastes PCC 7420]
gi|196176040|gb|EDX71059.1| Redoxin family [Coleofasciculus chthonoplastes PCC 7420]
Length = 496
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 34/63 (53%), Gaps = 4/63 (6%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQ 142
+ L G+G G+ DG+ A F +P + + +Y+AD +NHVIR + D G +
Sbjct: 435 QTLLGNGKAGHRDGQSHEAEFAEPSGVSA-IAARLYIADTNNHVIRCV---DLGTLEVTT 490
Query: 143 INL 145
+NL
Sbjct: 491 LNL 493
>gi|383111215|ref|ZP_09932032.1| hypothetical protein BSGG_2315 [Bacteroides sp. D2]
gi|313694780|gb|EFS31615.1| hypothetical protein BSGG_2315 [Bacteroides sp. D2]
Length = 593
Score = 38.5 bits (88), Expect = 1.3, Method: Composition-based stats.
Identities = 35/109 (32%), Positives = 54/109 (49%), Gaps = 13/109 (11%)
Query: 33 TVTTVIDGHQLEINPHSVIDRPG-----SSDLIVLDSSRSAFYTLSFPLSE----ESVVK 83
TV T DG Q +++ ++ G L + D + TL+ P +E + +K
Sbjct: 485 TVATYSDGEQNDLSKATMGATFGIAFGPDEILYITDYTMHTIRTLT-PDAEGDYTKGTLK 543
Query: 84 RLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AG G +G DG +A F+ P S V K +Y+AD+ NH+IR IT
Sbjct: 544 TIAGIPGTKGKIDGSALTATFNCPASVLVSDK--VYIADEQNHLIRSIT 590
Score = 38.5 bits (88), Expect = 1.3, Method: Composition-based stats.
Identities = 20/60 (33%), Positives = 36/60 (60%), Gaps = 1/60 (1%)
Query: 72 LSFPLSEES-VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LSF + S V+ +AG+ +DG+ +A F P+ A+ G+I++A++ N+ IRK+
Sbjct: 281 LSFTYEDLSYTVETVAGNSATTSTDGKGTAASFKFPQGLALAPNGDIWIAERGNNTIRKM 340
>gi|73667964|ref|YP_303979.1| hypothetical protein Mbar_A0416 [Methanosarcina barkeri str.
Fusaro]
gi|72395126|gb|AAZ69399.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 346
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 24/35 (68%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
G+ +F+ P S AVD GN+YVADK N I+K ++
Sbjct: 234 GNGQFEFPLSIAVDSSGNVYVADKYNQRIQKFNSI 268
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G+ +F+ P+ AVD GN+YVAD N+ I+K
Sbjct: 187 GNGQFNDPEGVAVDSSGNVYVADSGNNRIQKFN 219
Score = 37.0 bits (84), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 19/33 (57%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
GS +F P AVD GN+YV D NH I+K
Sbjct: 46 GSGQFACPTGVAVDSSGNVYVTDTGNHRIQKFN 78
>gi|423214448|ref|ZP_17200976.1| hypothetical protein HMPREF1074_02508 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692863|gb|EIY86099.1| hypothetical protein HMPREF1074_02508 [Bacteroides xylanisolvens
CL03T12C04]
Length = 464
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 26/40 (65%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G+++G+ A+ ++P +D +GNIYVA + H I KIT
Sbjct: 369 GFTEGQGNEAKLEEPCQMDLDEEGNIYVAVRRKHRIAKIT 408
>gi|428175994|gb|EKX44881.1| hypothetical protein GUITHDRAFT_60229, partial [Guillardia theta
CCMP2712]
Length = 181
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
LAG GY DG+ RF+ P A G+ + VAD +NH IR+I
Sbjct: 1 LAGPSSAGYQDGQAKIVRFNSPLDIAFSKDGSWLAVADTNNHAIRRI 47
Score = 36.2 bits (82), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 26/40 (65%), Gaps = 1/40 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G DG +A F+ P S A+D + I+VAD SN++IR+I
Sbjct: 68 GSQDGPASAASFNSPTSIALDPQDKFIFVADTSNNMIRQI 107
>gi|29348707|ref|NP_812210.1| hypothetical protein BT_3298 [Bacteroides thetaiotaomicron
VPI-5482]
gi|298387071|ref|ZP_06996625.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
gi|29340612|gb|AAO78404.1| conserved hypothetical protein, with a conserved domain
[Bacteroides thetaiotaomicron VPI-5482]
gi|298260221|gb|EFI03091.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
Length = 470
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 9/65 (13%)
Query: 76 LSEESVVKRLAGDG-------VQGYSDGE-PGSARFDKPKSFAVDMKGN-IYVADKSNHV 126
++ + ++ AG G V GY DGE ARFD P A D + Y+AD+ NH
Sbjct: 404 ITPDGIISTYAGRGSVSSDGRVNGYIDGELRTEARFDSPCGIAYDEETQTFYIADRENHR 463
Query: 127 IRKIT 131
IR I+
Sbjct: 464 IRTIS 468
>gi|218190788|gb|EEC73215.1| hypothetical protein OsI_07299 [Oryza sativa Indica Group]
Length = 1103
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 41/85 (48%), Gaps = 2/85 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQINL 145
G +G DG +A F++P+ A + K NI YVAD NH +R+I + + +
Sbjct: 622 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINFVSETVKTLAGNGT 681
Query: 146 KPEDCSKSSQ-SGSGMFWVTVFSPY 169
K D Q + F++ + SP+
Sbjct: 682 KGSDYRGGGQGTNQACFFMVLNSPW 706
Score = 36.2 bits (82), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 31/64 (48%), Gaps = 1/64 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
V +AG G GY DG SA+ +P G + VAD +N IR I L+ +R
Sbjct: 857 VTTIAGTGRAGYKDGPALSAQLSEPAGLVEVGDGRLLVADTNNSTIRYIV-LNEKGAEVR 915
Query: 142 QINL 145
++L
Sbjct: 916 TLDL 919
>gi|108707787|gb|ABF95582.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
protein, expressed [Oryza sativa Japonica Group]
Length = 772
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
G +G DG +A F++P+ A + K NI YVAD NH +R+I
Sbjct: 610 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREIN 655
>gi|290971766|ref|XP_002668650.1| predicted protein [Naegleria gruberi]
gi|284082136|gb|EFC35906.1| predicted protein [Naegleria gruberi]
Length = 728
Score = 38.5 bits (88), Expect = 1.4, Method: Composition-based stats.
Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 4/65 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+S +G A+ + P V +Y+AD NH IRKI L++
Sbjct: 39 ENGNIVTIAGNGTAGFSGDNGIATKAQLNGPVGVFVSSNNEVYIADYDNHRIRKI--LEN 96
Query: 136 GNQLI 140
GN +I
Sbjct: 97 GNIVI 101
Score = 37.0 bits (84), Expect = 3.9, Method: Composition-based stats.
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 6/63 (9%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLD 134
E + +AG G G+S G+ G A +K P+ V +Y+AD+ NH IRKI L+
Sbjct: 207 ENGNIVTIAGKGTAGFS-GDNGLATKEKLNFPRCVFVSSNNEVYIADQINHRIRKI--LE 263
Query: 135 HGN 137
+GN
Sbjct: 264 NGN 266
Score = 37.0 bits (84), Expect = 3.9, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 33/62 (53%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+ +G +A+ + P V IY+AD NH IRKI L++
Sbjct: 151 ENGNIVTIAGNGPYGFCGDNGLATNAQLNSPAGVFVSSNNEIYIADYDNHRIRKI--LEN 208
Query: 136 GN 137
GN
Sbjct: 209 GN 210
Score = 36.6 bits (83), Expect = 4.6, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 6/56 (10%)
Query: 85 LAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
+AG G G+S G+ G A +K P+ V +Y+AD+ NH IRKI L++GN
Sbjct: 102 IAGKGTAGFS-GDNGLATKEKLNFPRCVFVSSNNEVYIADQINHRIRKI--LENGN 154
>gi|147920395|ref|YP_685830.1| hypothetical protein RCIX1195 [Methanocella arvoryzae MRE50]
gi|110621226|emb|CAJ36504.1| hypothetical protein RCIX1195 [Methanocella arvoryzae MRE50]
Length = 673
Score = 38.5 bits (88), Expect = 1.4, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 24/37 (64%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
G G G +FD P S +D +GNIYVAD++N+ IR
Sbjct: 360 GTGIGGAGDDQFDDPMSVDLDSEGNIYVADRNNNRIR 396
Score = 36.2 bits (82), Expect = 6.7, Method: Composition-based stats.
Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 3/45 (6%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQI 143
G ++FD+P S AVD G+IYVAD N+ K+ D +R I
Sbjct: 169 GDSQFDRPMSVAVDSAGSIYVADYMNN---KVKIFDGAGTYLRSI 210
>gi|336180276|ref|YP_004585651.1| NHL repeat containing protein [Frankia symbiont of Datisca
glomerata]
gi|334861256|gb|AEH11730.1| NHL repeat containing protein [Frankia symbiont of Datisca
glomerata]
Length = 398
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
A + P + AVD GN+Y++D NH IRKI L
Sbjct: 133 AELNSPATTAVDSDGNVYISDSHNHRIRKIDPL 165
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ +AG G G+S DG P +A + ++P AV +IY++D N IRKI L
Sbjct: 167 IITTIAGTGTAGFSGDGGPATAAQLNEPYGLAVATDDSIYISDYENQRIRKIDPL 221
>gi|290988149|ref|XP_002676784.1| predicted protein [Naegleria gruberi]
gi|284090388|gb|EFC44040.1| predicted protein [Naegleria gruberi]
Length = 2167
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 21/38 (55%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
Y DG A+F+ P A IY+ADK NHVIR I
Sbjct: 231 YIDGNTTIAKFNSPTGVAYGPSKEIYIADKMNHVIRMI 268
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 30/57 (52%), Gaps = 1/57 (1%)
Query: 77 SEESVVKRLAGDGVQGY-SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S + +AG G G ++ SA+ P AV + G + +AD SNH IR+++N
Sbjct: 69 SNSGAITIVAGTGAFGSDTNTNATSAKLYSPSGLAVTIDGKLLIADTSNHAIREVSN 125
>gi|219853189|ref|YP_002467621.1| PKD domain-containing protein [Methanosphaerula palustris E1-9c]
gi|219547448|gb|ACL17898.1| PKD domain containing protein [Methanosphaerula palustris E1-9c]
Length = 930
Score = 38.5 bits (88), Expect = 1.5, Method: Composition-based stats.
Identities = 17/46 (36%), Positives = 26/46 (56%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
G+ + + G+ +F+ P AVD GNIYV D NH ++K T+
Sbjct: 76 GEFITQWGSEGSGNGQFENPGGIAVDSAGNIYVTDTWNHRVQKFTS 121
Score = 37.0 bits (84), Expect = 4.0, Method: Composition-based stats.
Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 11/76 (14%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+ ++ V+D S + F F + E + K G +G DGE F P+ AVD G
Sbjct: 526 AGNVYVVDESNNRFQ--KFTSTGEFITKW----GSEGLGDGE-----FTYPRDVAVDSGG 574
Query: 116 NIYVADKSNHVIRKIT 131
N+Y+ D+SN I+K +
Sbjct: 575 NVYIVDESNSRIQKFS 590
Score = 35.8 bits (81), Expect = 9.6, Method: Composition-based stats.
Identities = 30/91 (32%), Positives = 46/91 (50%), Gaps = 15/91 (16%)
Query: 44 EIN-PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA 101
E+N PH V +D G+ +I ++R +T S + +A G +G DG+
Sbjct: 326 ELNHPHGVALDDAGNVYIIDTWNNRVQIFT--------STGEFIAKWGSKGSEDGQ---- 373
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
F P S AVD GN+YV D N+ ++K T+
Sbjct: 374 -FIYPYSIAVDSAGNVYVVDTGNNRVQKFTS 403
Score = 35.8 bits (81), Expect = 9.8, Method: Composition-based stats.
Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 13/82 (15%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
+DR G+ ++ ++R +T S + + + G G + DG+ F+ P A
Sbjct: 476 VDRAGNVYVVDTGNNRVQIFT-----STGTFIAQWGGSGSR---DGQ-----FNYPGGIA 522
Query: 111 VDMKGNIYVADKSNHVIRKITN 132
VD GN+YV D+SN+ +K T+
Sbjct: 523 VDSAGNVYVVDESNNRFQKFTS 544
>gi|108707786|gb|ABF95581.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
protein, expressed [Oryza sativa Japonica Group]
Length = 1064
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
G +G DG +A F++P+ A + K NI YVAD NH +R +T
Sbjct: 610 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALRTLT 655
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 32/64 (50%), Gaps = 1/64 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
V +AG G GY DG SA+ +P G + VAD +N IR I + G + +R
Sbjct: 842 VTTIAGTGRAGYKDGPALSAQLSEPAGLVEVGDGRLLVADTNNSTIRYIVLNEKGAE-VR 900
Query: 142 QINL 145
++L
Sbjct: 901 TLDL 904
>gi|348508723|ref|XP_003441903.1| PREDICTED: NHL repeat-containing protein 2-like [Oreochromis
niloticus]
Length = 719
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 62/142 (43%), Gaps = 29/142 (20%)
Query: 16 SLLFSLVSSG-----LLLEDG----YTVTTVIDGHQLEIN------PHSVIDRPG----- 55
+LLFSLV G +L D Y ++ H + I P S++ PG
Sbjct: 168 NLLFSLVGEGHRDRLMLFTDCALRYYGGRGLLKAHTVGIKLYKDALPPSILSFPGKVAID 227
Query: 56 --SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
+ L++ D+ +S + + G G DG+ A F+ P+ A+
Sbjct: 228 NSTKRLVIADTGHHRILVVS----STGDLLHVIGGPESGRKDGDVSEASFNSPQGVAI-- 281
Query: 114 KGN-IYVADKSNHVIRKITNLD 134
KG+ +YVAD NH+IRKI L+
Sbjct: 282 KGDTVYVADTENHLIRKIDLLE 303
>gi|251771416|gb|EES51995.1| conserved protein of unknown function [Leptospirillum
ferrodiazotrophum]
Length = 373
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 40/87 (45%), Gaps = 6/87 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV--QGYSDGEPGSARFD 104
P ++ P L + DS + L V +AG G G DG G+A F
Sbjct: 173 PRALALTPDGKTLYISDSGNNMIRKLDV---ATGTVTTVAGKGALAPGMDDGIGGAASFR 229
Query: 105 KPKSFAVDMKGNI-YVADKSNHVIRKI 130
P+ A+ G++ YVAD N++IRKI
Sbjct: 230 DPRGIALSSDGSVLYVADTRNNLIRKI 256
Score = 36.2 bits (82), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 30/51 (58%), Gaps = 1/51 (1%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
V LAG G +DG G+A F P++ A+ G +Y++D N++IRK+
Sbjct: 148 AVTTLAGRAFPGSNDGTGGAAGFYGPRALALTPDGKTLYISDSGNNMIRKL 198
>gi|222624800|gb|EEE58932.1| hypothetical protein OsJ_10596 [Oryza sativa Japonica Group]
Length = 1017
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 41/85 (48%), Gaps = 2/85 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQINL 145
G +G DG +A F++P+ A + K NI YVAD NH +R+I + + +
Sbjct: 536 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINFVSETVKTLAGNGT 595
Query: 146 KPEDCSKSSQ-SGSGMFWVTVFSPY 169
K D Q + F++ + SP+
Sbjct: 596 KGSDYRGGGQGTNQACFFMVLNSPW 620
Score = 36.2 bits (82), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 31/64 (48%), Gaps = 1/64 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
V +AG G GY DG SA+ +P G + VAD +N IR I L+ +R
Sbjct: 771 VTTIAGTGRAGYKDGPALSAQLSEPAGLVEVGDGRLLVADTNNSTIRYIV-LNEKGAEVR 829
Query: 142 QINL 145
++L
Sbjct: 830 TLDL 833
>gi|307727531|ref|YP_003910744.1| NHL repeat containing protein [Burkholderia sp. CCGE1003]
gi|307588056|gb|ADN61453.1| NHL repeat containing protein [Burkholderia sp. CCGE1003]
Length = 396
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 2/55 (3%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V +AG G GYS DG P + A P A+D GN+++A+ HV+RK+
Sbjct: 80 DNGTVTIIAGTGQAGYSGDGGPATQAMLHYPTGVALDRAGNVFIAEPFAHVVRKV 134
>gi|290971688|ref|XP_002668616.1| predicted protein [Naegleria gruberi]
gi|284082092|gb|EFC35872.1| predicted protein [Naegleria gruberi]
Length = 348
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 31/49 (63%), Gaps = 3/49 (6%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AG+G +G+S +G +A+ + P V +Y+AD SNHVIRKI+
Sbjct: 143 IAGNGTKGFSGDNGPATNAQLNGPAGVFVS-NNEVYIADYSNHVIRKIS 190
Score = 35.8 bits (81), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 19/58 (32%), Positives = 32/58 (55%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S+ + +AG+G G+S +G +A+ P V +Y++D NHVIRKI+
Sbjct: 189 ISQNGTIVTIAGNGKPGFSGDNGLATNAQLYNPSGTFVSSNNEVYISDCFNHVIRKIS 246
>gi|326781271|ref|ZP_08240536.1| Ricin B lectin [Streptomyces griseus XylebKG-1]
gi|326661604|gb|EGE46450.1| Ricin B lectin [Streptomyces griseus XylebKG-1]
Length = 508
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + V+ +AG GV G + DG P + A+ +KP AVD +G +YVA+ H +R++
Sbjct: 124 VGADGVIVTVAGTGVAGSAGDGGPAAKAQLNKPFGVAVDGEGVLYVAEFGGHRVRRV 180
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AG G G+ DG SA+ P AV G +YV+D SNH +R+IT
Sbjct: 21 IAGSGTAGFGGDDGAAVSAQLKHPYEMAVSGTGILYVSDYSNHRVRRIT 69
>gi|428304139|ref|YP_007140964.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
gi|428245674|gb|AFZ11454.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
Length = 537
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 5/85 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P VI S+ L + D+ S ++V+ G G + DG+ A+F +P
Sbjct: 206 PSKVIACQQSNSLFIADTGHHRLVIASLNGETQAVI----GTGSPAWVDGDLEIAQFCEP 261
Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI 130
D + IYVAD NH++RKI
Sbjct: 262 MGMVFDYEQQVIYVADTVNHLLRKI 286
>gi|290972172|ref|XP_002668832.1| predicted protein [Naegleria gruberi]
gi|284082360|gb|EFC36088.1| predicted protein [Naegleria gruberi]
Length = 2254
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 35/56 (62%), Gaps = 5/56 (8%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKS-FAVDMKGNIYVADKSNHVIRKITNL 133
+ +AG +G SDG+ G +++ + P+ F D+ G+++VAD +NH IRKI N
Sbjct: 86 ISTIAGQMTKG-SDGDGGLATASQMNSPRGVFFDDVTGSLFVADTNNHKIRKIDNF 140
>gi|116751477|ref|YP_848164.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
gi|116700541|gb|ABK19729.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
Length = 1763
Score = 38.1 bits (87), Expect = 1.6, Method: Composition-based stats.
Identities = 20/53 (37%), Positives = 33/53 (62%), Gaps = 4/53 (7%)
Query: 81 VVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ +AG+G GYS G+ G+A + P AVD + N+Y++D +H IRK+
Sbjct: 504 IITTVAGNGSCGYS-GDGGAAVEAQLGYPAFVAVDSRENLYISDTESHRIRKV 555
Score = 37.0 bits (84), Expect = 3.6, Method: Composition-based stats.
Identities = 17/43 (39%), Positives = 24/43 (55%), Gaps = 4/43 (9%)
Query: 88 DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
DG QG +D A+ + P A D GN+Y+AD N +RK+
Sbjct: 590 DGDQGPAD----QAKLNNPSGVATDASGNLYIADTGNRRLRKV 628
>gi|260798200|ref|XP_002594088.1| hypothetical protein BRAFLDRAFT_118791 [Branchiostoma floridae]
gi|229279321|gb|EEN50099.1| hypothetical protein BRAFLDRAFT_118791 [Branchiostoma floridae]
Length = 603
Score = 38.1 bits (87), Expect = 1.6, Method: Composition-based stats.
Identities = 21/48 (43%), Positives = 29/48 (60%), Gaps = 4/48 (8%)
Query: 96 GEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
G GS +FD+P AVD +G I VADK NH I+ T HG+ +++
Sbjct: 379 GSAGSRNGQFDRPAGVAVDNQGRIIVADKDNHRIQIFTF--HGDFMLK 424
>gi|160942444|ref|ZP_02089750.1| hypothetical protein CLOBOL_07327 [Clostridium bolteae ATCC
BAA-613]
gi|158434643|gb|EDP12410.1| hypothetical protein CLOBOL_07327 [Clostridium bolteae ATCC
BAA-613]
Length = 329
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 24/45 (53%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AG G G++DG A F P+ G +YV+D N V+RKI
Sbjct: 237 AGTGEDGFADGPVSQAMFSGPQRITAAEDGALYVSDTGNSVVRKI 281
>gi|158293254|ref|XP_314572.4| AGAP010610-PA [Anopheles gambiae str. PEST]
gi|157016870|gb|EAA09865.4| AGAP010610-PA [Anopheles gambiae str. PEST]
Length = 730
Score = 38.1 bits (87), Expect = 1.6, Method: Composition-based stats.
Identities = 29/119 (24%), Positives = 50/119 (42%), Gaps = 18/119 (15%)
Query: 16 SLLFSLVSSGLLLED----GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYT 71
SL +LVSSG + + G +V G + G+ ++++DS+ + +
Sbjct: 215 SLPINLVSSGAIASNMKYPGKVACSVPTGAGGSEEQLFAVSDSGNHRVLIVDSAGTVLHK 274
Query: 72 LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ G G+ DG ARF+ P+ A ++VAD NH +R+I
Sbjct: 275 V--------------GGKQSGFVDGNFTKARFNAPQGVAFQGTDVVFVADNENHAVRRI 319
>gi|242041143|ref|XP_002467966.1| hypothetical protein SORBIDRAFT_01g037250 [Sorghum bicolor]
gi|241921820|gb|EER94964.1| hypothetical protein SORBIDRAFT_01g037250 [Sorghum bicolor]
Length = 360
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 2/68 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQL-- 139
V +AG G GY DG +A+ +P G + VAD +N+ IR IT D G ++
Sbjct: 137 VTTIAGTGRAGYKDGPGLAAQLSEPAGLVEVGDGRLLVADTNNNTIRYITLSDKGAEVKT 196
Query: 140 IRQINLKP 147
+ I ++P
Sbjct: 197 LELIGVQP 204
>gi|195387367|ref|XP_002052367.1| GJ17513 [Drosophila virilis]
gi|194148824|gb|EDW64522.1| GJ17513 [Drosophila virilis]
Length = 736
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 28/51 (54%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
VV+ G G+ DG +ARF+ P+ A K + VAD NH +R+I+
Sbjct: 255 VVEYKVGGLEAGFVDGNLNTARFNNPQGVAFLDKDTLIVADTDNHALRQIS 305
>gi|288921011|ref|ZP_06415303.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288347586|gb|EFC81871.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 801
Score = 38.1 bits (87), Expect = 1.6, Method: Composition-based stats.
Identities = 25/73 (34%), Positives = 33/73 (45%), Gaps = 12/73 (16%)
Query: 71 TLSFPLSEESVVKRLAGDGVQGYSDGEPG------------SARFDKPKSFAVDMKGNIY 118
TL S V+R++ DG G PG SA+ P S AV +G +Y
Sbjct: 658 TLYVADSSAHTVRRISTDGKITLVAGTPGAYGYSGDGGAATSAKLYNPHSLAVGPEGELY 717
Query: 119 VADKSNHVIRKIT 131
+AD N VIR +T
Sbjct: 718 IADTYNDVIRVVT 730
>gi|312372811|gb|EFR20689.1| hypothetical protein AND_19679 [Anopheles darlingi]
Length = 745
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 25/50 (50%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ G G+ DG ARF+ P+ A +YVAD NH IR+I
Sbjct: 258 TVRYTIGGKSSGFVDGGFREARFNAPQGVAFRGGDELYVADNENHAIRRI 307
>gi|300771890|ref|ZP_07081761.1| NHL repeat containing protein [Sphingobacterium spiritivorum ATCC
33861]
gi|300761276|gb|EFK58101.1| NHL repeat containing protein [Sphingobacterium spiritivorum ATCC
33861]
Length = 658
Score = 38.1 bits (87), Expect = 1.6, Method: Composition-based stats.
Identities = 26/85 (30%), Positives = 42/85 (49%), Gaps = 6/85 (7%)
Query: 50 VIDRPGSSDLIVLDSSRSA---FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
V+D G DL D++ + ++F S V +AG G +DG +A F++P
Sbjct: 406 VVDNKG--DLYFTDNNATGGNRIRKITFDASGNGTVSTIAGQASTGNADGIGAAASFNRP 463
Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI 130
A+D NI YV++ + IR+I
Sbjct: 464 SGLAIDKNNNILYVSEWIGNRIRRI 488
>gi|428304361|ref|YP_007141186.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
gi|428245896|gb|AFZ11676.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
Length = 491
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P VI S+ L + D+ S ++V+ G G G+ DG+ A+F +P
Sbjct: 184 PSKVIACQQSNSLFIADTGHHRLVIASLDGETKAVI----GTGSAGWVDGDLEIAQFCEP 239
Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI 130
+ + IYVAD NH++RKI
Sbjct: 240 MGMVFNHEQQVIYVADTVNHLLRKI 264
>gi|290997205|ref|XP_002681172.1| predicted protein [Naegleria gruberi]
gi|284094795|gb|EFC48428.1| predicted protein [Naegleria gruberi]
Length = 2212
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 27/47 (57%), Gaps = 8/47 (17%)
Query: 85 LAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LAGDG QG YS + +P+ V G+IY+ D N+VIRKI
Sbjct: 480 LAGDGYQGIYS-------QLSQPRGVCVGPTGDIYIVDAGNYVIRKI 519
>gi|290970020|ref|XP_002668012.1| predicted protein [Naegleria gruberi]
gi|284081028|gb|EFC35268.1| predicted protein [Naegleria gruberi]
Length = 243
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 30/55 (54%), Gaps = 2/55 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
E + +AG+G GYS +G +A+ +P S V +Y+AD NH IRKI
Sbjct: 115 ENGRIVTIAGNGQPGYSGDNGPATTAQLHRPYSVFVSANNEVYIADTFNHSIRKI 169
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 2/48 (4%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG+G QGY +G SA+ + P V +++ADK+N+ IRKI
Sbjct: 11 IAGNGKQGYGGDNGLATSAQLNYPSGVFVSTNNEVFIADKNNNRIRKI 58
>gi|336313754|ref|ZP_08568676.1| thiol-disulfide isomerase-like thioredoxin [Rheinheimera sp. A13L]
gi|335881693|gb|EGM79570.1| thiol-disulfide isomerase-like thioredoxin [Rheinheimera sp. A13L]
Length = 498
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 2/50 (4%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+VKRL G G+ DG S+ F P+ + K +YVAD NH IR+I
Sbjct: 223 LVKRL-GSGIAELKDGHSDSSAFSSPQGLVLTDKA-LYVADTGNHAIRRI 270
>gi|241997422|ref|XP_002433360.1| nhl repeat-containing protein, putative [Ixodes scapularis]
gi|215490783|gb|EEC00424.1| nhl repeat-containing protein, putative [Ixodes scapularis]
Length = 631
Score = 38.1 bits (87), Expect = 1.7, Method: Composition-based stats.
Identities = 24/70 (34%), Positives = 36/70 (51%), Gaps = 10/70 (14%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQI--NLK 146
G++G S G F P+S +D G IYV D NH +I LDH + +I +++
Sbjct: 446 GMKGRSAGH-----FRSPESICIDNTGRIYVCDTCNH---RIQVLDHDGIFLHEIGTHVQ 497
Query: 147 PEDCSKSSQS 156
PE+ S+S
Sbjct: 498 PENAFNRSRS 507
>gi|29349077|ref|NP_812580.1| hypothetical protein BT_3669 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340984|gb|AAO78774.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 196
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 78 EESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
E ++ +AG G GY+ G P +R ++P + + +IYVAD N VI K+
Sbjct: 140 ETKLMSTVAGIGGKSGYAAGNPTVSRMNRPYGICITPENDIYVADAGNKVIMKL 193
>gi|290985545|ref|XP_002675486.1| predicted protein [Naegleria gruberi]
gi|284089082|gb|EFC42742.1| predicted protein [Naegleria gruberi]
Length = 819
Score = 38.1 bits (87), Expect = 1.7, Method: Composition-based stats.
Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG G +G+S DG P SA+ D P V +Y+ D +NH IRKI
Sbjct: 567 IAGTGEKGFSGDGGPATSAQLDSPCGVFVSNNDEVYIVDYNNHRIRKI 614
>gi|281423977|ref|ZP_06254890.1| NHL repeat-containing domain protein [Prevotella oris F0302]
gi|281401902|gb|EFB32733.1| NHL repeat-containing domain protein [Prevotella oris F0302]
Length = 469
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 27/54 (50%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
V++ AG G DG SA F P A D+ GN+YV + IRKI + D
Sbjct: 359 TVEQFAGFNGVGSQDGPRMSASFKNPTGMAFDVDGNMYVTESMGFTIRKIGHTD 412
>gi|182440597|ref|YP_001828316.1| hypothetical protein SGR_6804 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|178469113|dbj|BAG23633.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC
13350]
Length = 508
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 5/59 (8%)
Query: 78 EESVVKRL---AGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+E V R+ AG G G+ DG SA+ P AV G +YV+D SNH +R+IT
Sbjct: 11 DEGFVPRISAIAGSGTAGFGGDDGAAVSAQLKHPYEMAVSGTGILYVSDYSNHRVRRIT 69
Score = 37.4 bits (85), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + V+ +AG GV G + DG P + A+ +KP AVD +G +YVA+ H +R++
Sbjct: 124 VGADGVMVTVAGTGVAGSAGDGGPAAKAQLNKPFGVAVDGEGVLYVAEFGGHRVRRV 180
>gi|220914451|ref|YP_002489760.1| NHL repeat containing protein [Arthrobacter chlorophenolicus A6]
gi|219861329|gb|ACL41671.1| NHL repeat containing protein [Arthrobacter chlorophenolicus A6]
Length = 674
Score = 38.1 bits (87), Expect = 1.8, Method: Composition-based stats.
Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 2/67 (2%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
PL+ V +AG+G++G DG + F + A D GNI+VAD +R + D
Sbjct: 373 PLT--GAVAIVAGNGLEGLLDGPAHESWFAQSSGLAEDADGNIWVADSETSALRTLVIDD 430
Query: 135 HGNQLIR 141
G+ ++
Sbjct: 431 TGSLTVK 437
Score = 37.0 bits (84), Expect = 4.5, Method: Composition-based stats.
Identities = 29/115 (25%), Positives = 48/115 (41%), Gaps = 4/115 (3%)
Query: 19 FSLVSSGLLLEDGYTVTTVIDG--HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPL 76
F ++ + + G + ++DG H+ S + ++ V DS SA TL
Sbjct: 371 FDPLTGAVAIVAGNGLEGLLDGPAHESWFAQSSGLAEDADGNIWVADSETSALRTLVIDD 430
Query: 77 SEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+ VK G G+ G+ DG AR P V G++ +AD N +R+
Sbjct: 431 TGSLTVKTAVGKGLFDFGFRDGTAAEARLQHPLGVTVLPDGSVAIADTYNGAVRR 485
>gi|116754580|ref|YP_843698.1| NHL repeat-containing protein [Methanosaeta thermophila PT]
gi|116666031|gb|ABK15058.1| NHL repeat containing protein [Methanosaeta thermophila PT]
Length = 504
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 49/105 (46%), Gaps = 12/105 (11%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
G ++ + DS+ + +S + + + G G +GYSDG+ AR +P+ A+ +
Sbjct: 202 GGDNIFIADSNNNRILIVS----PDGDLMDVIGSGERGYSDGDFSEARLFRPQGIAI-VG 256
Query: 115 GNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSG 159
+Y+AD NH++R L R+ ++ KS G G
Sbjct: 257 DVVYIADTGNHMVRAA-------DLRRRTLVRMAGTGKSRHPGLG 294
>gi|414866501|tpg|DAA45058.1| TPA: hypothetical protein ZEAMMB73_922990 [Zea mays]
Length = 237
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 30/58 (51%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQL 139
V +AG G GY DG +A+ +P G + VAD +N+ IR IT D G ++
Sbjct: 14 VTTIAGTGSAGYRDGPGLTAQLSEPAGLVEVGDGRLLVADTNNNTIRYITLSDKGAEV 71
>gi|299142776|ref|ZP_07035905.1| NHL repeat-containing protein [Prevotella oris C735]
gi|298575805|gb|EFI47682.1| NHL repeat-containing protein [Prevotella oris C735]
Length = 448
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 27/55 (49%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
V++ AG G DG SA F P A D+ GN+YV + IRKI + D
Sbjct: 337 QTVEQFAGFNGVGSQDGPRMSASFKNPTGMAFDVDGNMYVTESMGFTIRKIGHTD 391
>gi|91076402|ref|XP_969236.1| PREDICTED: similar to CG12547 CG12547-PA [Tribolium castaneum]
gi|270002449|gb|EEZ98896.1| hypothetical protein TcasGA2_TC004511 [Tribolium castaneum]
Length = 712
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/55 (30%), Positives = 25/55 (45%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
L + + G G G+ DG F+ P+ + ++VAD NH IRKI
Sbjct: 245 LKSDGTILHQVGSGEIGFRDGSFTQCEFNAPQGLVFQNENTLFVADTENHAIRKI 299
>gi|144899324|emb|CAM76188.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense
MSR-1]
Length = 477
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 3/56 (5%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQ 142
G G+ G DG A F+ P+ A+D + IYVAD NH+IR+I G++ R+
Sbjct: 210 GSGLAGRDDGI--DASFNGPEGVAID-RAFIYVADTRNHLIRRIDRQSGGDRHHRR 262
>gi|290985345|ref|XP_002675386.1| predicted protein [Naegleria gruberi]
gi|284088982|gb|EFC42642.1| predicted protein [Naegleria gruberi]
Length = 788
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ +AG G +GYS DG P +A+ + V +Y+AD +NH IRKI
Sbjct: 297 IINTIAGTGEEGYSGDGGPAINAQISNVYNIFVSQNDEVYIADTNNHRIRKI 348
>gi|227540165|ref|ZP_03970214.1| conserved protein of hypothetical function [Sphingobacterium
spiritivorum ATCC 33300]
gi|227240026|gb|EEI90041.1| conserved protein of hypothetical function [Sphingobacterium
spiritivorum ATCC 33300]
Length = 658
Score = 38.1 bits (87), Expect = 1.9, Method: Composition-based stats.
Identities = 26/85 (30%), Positives = 42/85 (49%), Gaps = 6/85 (7%)
Query: 50 VIDRPGSSDLIVLDSSRSA---FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
V+D G DL D++ + ++F S V +AG G +DG +A F++P
Sbjct: 406 VVDNKG--DLYFTDNNATGGNRIRKITFDASGNGTVSTIAGQASTGNADGIGAAASFNRP 463
Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI 130
A+D NI YV++ + IR+I
Sbjct: 464 SGLAIDKNNNILYVSEWIGNRIRRI 488
>gi|167997357|ref|XP_001751385.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697366|gb|EDQ83702.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 663
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 33/62 (53%), Gaps = 4/62 (6%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPE 148
+ G+ DG SAR +P S D K +Y+AD NH IR+ ++++ + KP+
Sbjct: 218 IAGFEDGSFESARLRRPSSIVFDKEKKLLYIADCENHAIRRA---HISSRMVDTLYPKPD 274
Query: 149 DC 150
+C
Sbjct: 275 EC 276
>gi|383124309|ref|ZP_09944974.1| hypothetical protein BSIG_5606 [Bacteroides sp. 1_1_6]
gi|382983730|gb|EIC72829.1| hypothetical protein BSIG_5606 [Bacteroides sp. 1_1_6]
Length = 468
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 78 EESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
E ++ +AG G GY+ G P +R ++P + + +IYVAD N VI K+
Sbjct: 412 ETKLMSTVAGIGGKSGYAAGNPTVSRMNRPYGICITPENDIYVADAGNKVIMKL 465
>gi|290978917|ref|XP_002672181.1| predicted protein [Naegleria gruberi]
gi|284085756|gb|EFC39437.1| predicted protein [Naegleria gruberi]
Length = 1457
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 7/48 (14%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKIT 131
AGDG G A F++P A+D G++YV D+ NH IRKIT
Sbjct: 125 FAGDG------GMAIKALFNRPVHIAIDHSNGDLYVTDQGNHAIRKIT 166
>gi|290975594|ref|XP_002670527.1| predicted protein [Naegleria gruberi]
gi|284084087|gb|EFC37783.1| predicted protein [Naegleria gruberi]
Length = 443
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 35/62 (56%), Gaps = 4/62 (6%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
E + +AG+G G+S +G SA+ P V +Y++D +N++IRKI L++
Sbjct: 95 ENGNIITIAGNGTVGFSGDNGPATSAQLYNPFGIFVSANNEVYISDSNNNIIRKI--LEN 152
Query: 136 GN 137
GN
Sbjct: 153 GN 154
>gi|326431736|gb|EGD77306.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
Length = 1384
Score = 38.1 bits (87), Expect = 2.0, Method: Composition-based stats.
Identities = 22/65 (33%), Positives = 31/65 (47%), Gaps = 2/65 (3%)
Query: 81 VVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQ 138
V+ LAG GV G+ G P +AR + P+ A G I +AD N IR + G
Sbjct: 1270 VITALAGTGVAGFRGDGGAPMAARLNTPRGIAAMPAGEIAIADTGNSRIRTLNVGGQGAG 1329
Query: 139 LIRQI 143
+I +
Sbjct: 1330 VIETV 1334
>gi|290978674|ref|XP_002672060.1| predicted protein [Naegleria gruberi]
gi|284085634|gb|EFC39316.1| predicted protein [Naegleria gruberi]
Length = 1219
Score = 38.1 bits (87), Expect = 2.0, Method: Composition-based stats.
Identities = 19/52 (36%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
Query: 82 VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
+ +AG G+ GY+ D SA+ + P +D G I+++D NH IRKI
Sbjct: 114 ITTIAGTGIAGYNGDDKSADSAQLNTPSGIVIDPNNGEIFISDSKNHRIRKI 165
Score = 37.7 bits (86), Expect = 2.2, Method: Composition-based stats.
Identities = 22/49 (44%), Positives = 32/49 (65%), Gaps = 3/49 (6%)
Query: 85 LAGDGVQGYS-DG-EPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKI 130
+AG G +GY+ DG E SA+ + P A+++ G IY+AD N+ IRKI
Sbjct: 397 IAGTGTKGYNGDGIEATSAQLNTPHDVALNLATGEIYIADTENNRIRKI 445
Score = 36.2 bits (82), Expect = 6.7, Method: Composition-based stats.
Identities = 21/49 (42%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 85 LAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
+AG+G GY+ DG + SA+ + P VD +Y+AD +NH IRKI
Sbjct: 60 IAGNGFAGYNGDGLDATSAQLNNPVGIYVDTNSREVYIADSNNHRIRKI 108
>gi|425734502|ref|ZP_18852820.1| NHL repeat-containing protein [Brevibacterium casei S18]
gi|425481116|gb|EKU48277.1| NHL repeat-containing protein [Brevibacterium casei S18]
Length = 666
Score = 37.7 bits (86), Expect = 2.0, Method: Composition-based stats.
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 13/95 (13%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV---D 112
S +L+V DS + + + ++V++R+ G G +G +DG+ SA+F +P V D
Sbjct: 208 SGNLLVADSGHHSL--VEYTPDGQTVLRRI-GTGERGLTDGDFASAQFSEPGGITVLPED 264
Query: 113 MKG----NIYVADKSNHVIRKITNLDHGNQLIRQI 143
+ + VAD NHV+R I D + +R I
Sbjct: 265 IAARAGYQLVVADTVNHVLRGI---DLDGETVRTI 296
>gi|224539142|ref|ZP_03679681.1| hypothetical protein BACCELL_04044 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519246|gb|EEF88351.1| hypothetical protein BACCELL_04044 [Bacteroides cellulosilyticus
DSM 14838]
Length = 470
Score = 37.7 bits (86), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 1/54 (1%)
Query: 78 EESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
E +++ +AG G Q G+S G P ++ + P + G+IY+AD N VI K+
Sbjct: 414 ETNMMTTVAGIGKQKGFSTGSPKESKLNWPAGICLSPNGDIYIADAGNRVIMKL 467
>gi|111219555|ref|YP_710349.1| serine/threonine-protein kinase [Frankia alni ACN14a]
gi|111147087|emb|CAJ58734.1| putative serine/threonine-protein kinase (partial match) [Frankia
alni ACN14a]
Length = 899
Score = 37.7 bits (86), Expect = 2.0, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 29/52 (55%), Gaps = 2/52 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ +AG G GYS DG P +A + + + G++Y +D NH IRKI
Sbjct: 667 VISTVAGTGTAGYSGDGGPATAAQLNSAEKVTTGPDGSVYFSDYDNHRIRKI 718
Score = 37.0 bits (84), Expect = 3.7, Method: Composition-based stats.
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+A+ D P S A D GNIY+ D N+ IRK+T
Sbjct: 632 AAQLDGPGSTARDKAGNIYIGDAKNNRIRKVT 663
>gi|196009344|ref|XP_002114537.1| hypothetical protein TRIADDRAFT_11855 [Trichoplax adhaerens]
gi|190582599|gb|EDV22671.1| hypothetical protein TRIADDRAFT_11855 [Trichoplax adhaerens]
Length = 507
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
+G+ DG ARF+ P+ G I+VAD +NH IRKI
Sbjct: 214 EGFVDGVFQKARFNNPQGITCSRNGKTIFVADTNNHAIRKI 254
>gi|377811375|ref|YP_005043815.1| NHL repeat containing protein [Burkholderia sp. YI23]
gi|357940736|gb|AET94292.1| NHL repeat containing protein [Burkholderia sp. YI23]
Length = 382
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 30/59 (50%), Gaps = 3/59 (5%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINL 145
G+ ++ + + G +F P S AVD G +YVAD+SN R+I D RQ +
Sbjct: 219 GNWLKSWGERGKGPGQFHTPHSIAVDANGMVYVADRSN---RRIQVFDGEGNFQRQFTI 274
>gi|340724286|ref|XP_003400514.1| PREDICTED: tripartite motif-containing protein 71-like [Bombus
terrestris]
Length = 711
Score = 37.7 bits (86), Expect = 2.1, Method: Composition-based stats.
Identities = 27/80 (33%), Positives = 37/80 (46%), Gaps = 12/80 (15%)
Query: 52 DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV 111
DR G +IV D S + + + S V+R G G +FD+P AV
Sbjct: 454 DREGH--MIVADRSNNRIQI--YRQEDGSFVRRFGSYGT--------GPGQFDRPAGVAV 501
Query: 112 DMKGNIYVADKSNHVIRKIT 131
D + I VADK NH I+ +T
Sbjct: 502 DARRRIIVADKDNHRIQILT 521
>gi|390341274|ref|XP_003725419.1| PREDICTED: tripartite motif-containing protein 71-like
[Strongylocentrotus purpuratus]
Length = 778
Score = 37.7 bits (86), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 87 GDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNH 125
DG + G PG+ +FD+P AV+ +GNI +ADK NH
Sbjct: 545 ADGSFSHKFGSPGTRNGQFDRPAGVAVNGEGNIIIADKDNH 585
>gi|406916662|gb|EKD55633.1| NHL repeat containing protein, partial [uncultured bacterium]
Length = 188
Score = 37.7 bits (86), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQS 156
GS +F++P +D K NIYV D N R+IT D ++ +I LK KS+ +
Sbjct: 49 GSGKFNEPVDIGIDSKSNIYVLDAGN---RRITGFDEYGNILGEIPLKKIGGLKSAGA 103
>gi|380300595|ref|ZP_09850288.1| NHL repeat protein [Brachybacterium squillarum M-6-3]
Length = 644
Score = 37.7 bits (86), Expect = 2.2, Method: Composition-based stats.
Identities = 39/115 (33%), Positives = 49/115 (42%), Gaps = 9/115 (7%)
Query: 19 FSLVSSGLLLEDGYTVTTVIDGHQLE---INPHSVIDRPGSSDLIVLDSSRSAFYTLSFP 75
F V+ L++ G T ++DG + P S ID L+V DS SA L P
Sbjct: 342 FDPVTGSLMVLAGTTQEGLVDGPAVTSWWAQP-SGIDELPDGRLVVADSESSAIRVLD-P 399
Query: 76 LSEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+ E V L G G+ G+ DG AR P V G I VAD N IR
Sbjct: 400 QTME--VSTLVGQGLFDFGHVDGPAEVARLQHPLGVTVLPDGRIAVADTYNGAIR 452
>gi|325923856|ref|ZP_08185461.1| NHL repeat protein [Xanthomonas gardneri ATCC 19865]
gi|325545661|gb|EGD16910.1| NHL repeat protein [Xanthomonas gardneri ATCC 19865]
Length = 472
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251
>gi|290973007|ref|XP_002669241.1| predicted protein [Naegleria gruberi]
gi|284082786|gb|EFC36497.1| predicted protein [Naegleria gruberi]
Length = 747
Score = 37.7 bits (86), Expect = 2.3, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG GV G S DG+ + A+ + P+ A+ G IY AD SN IR+I
Sbjct: 244 INTIAGTGVLGLSGDGDLATKAQLNTPRGIAISSTGEIYFADTSNQRIRRI 294
Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats.
Identities = 19/55 (34%), Positives = 31/55 (56%), Gaps = 2/55 (3%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
E ++ +AG G + DG+ + A+ + P+ A+ G IY+AD NH IR+I
Sbjct: 128 ENGIITTIAGTGDSRFGGDGDLATKAQLNSPRGIAISSTGEIYIADTYNHRIRRI 182
>gi|317504749|ref|ZP_07962709.1| NHL repeat containing protein [Prevotella salivae DSM 15606]
gi|315664131|gb|EFV03838.1| NHL repeat containing protein [Prevotella salivae DSM 15606]
Length = 448
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 27/54 (50%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
+++ AG G DG SA F P A D+ GN+Y+ D IRKI + D
Sbjct: 338 TIEQFAGFNGVGSMDGPRMSASFRNPTGMAFDVDGNMYITDSMGFTIRKIGHAD 391
>gi|320103824|ref|YP_004179415.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
gi|319751106|gb|ADV62866.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
Length = 417
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 48/98 (48%), Gaps = 13/98 (13%)
Query: 41 HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPG 99
+L+++P ++ ++D+ A + + VV LAG+G +G DG P
Sbjct: 327 KELDVSPEGLV--------AIVDTENQAIRLYN---PKTRVVTTLAGNGQRGREGDGVPA 375
Query: 100 SA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
+A R D+P A G I ++D +NH IR + ++ G
Sbjct: 376 TAARLDRPHGIAFGPGGVIAISDTNNHRIRLVKSVPVG 413
>gi|152964340|ref|YP_001360124.1| fibronectin type III domain-containing protein [Kineococcus
radiotolerans SRS30216]
gi|151358857|gb|ABS01860.1| Fibronectin type III domain protein [Kineococcus radiotolerans
SRS30216]
Length = 841
Score = 37.7 bits (86), Expect = 2.4, Method: Composition-based stats.
Identities = 19/48 (39%), Positives = 25/48 (52%), Gaps = 1/48 (2%)
Query: 85 LAGDGVQGYSD-GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
AG G G G+ + P A+D GN++VAD NHV+ KIT
Sbjct: 192 FAGTGFAGSPQAGDANKSPLASPTGVALDAAGNLHVADADNHVVEKIT 239
>gi|328783461|ref|XP_396242.3| PREDICTED: NHL repeat-containing protein 2 [Apis mellifera]
Length = 699
Score = 37.7 bits (86), Expect = 2.4, Method: Composition-based stats.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ + G + DG+ +ARF+ P+ + + IYVAD NH IRKI
Sbjct: 262 VQHVIGGSNPDFRDGDFENARFNAPQGVCI-LDTFIYVADNENHAIRKI 309
>gi|290976601|ref|XP_002671028.1| predicted protein [Naegleria gruberi]
gi|284084593|gb|EFC38284.1| predicted protein [Naegleria gruberi]
Length = 925
Score = 37.7 bits (86), Expect = 2.4, Method: Composition-based stats.
Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 7/96 (7%)
Query: 38 IDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
++ Q+ I+P V +DR + ++ V D S + + + + AG+G+ GYS D
Sbjct: 50 VNAGQVSISPTKVKVDRK-TGNVYVSDRINSRIRKIDINTGKATTI---AGNGIIGYSGD 105
Query: 96 GEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G+ + A + P + +IY AD NH IRKI
Sbjct: 106 GQLATQANINNPGKVELSSTNDIYFADTENHRIRKI 141
>gi|452822384|gb|EME29404.1| haloacid dehalogenase-like hydrolase family protein [Galdieria
sulphuraria]
Length = 679
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIR 128
+ L GD GY DG A+F P A D + +Y+ D+ NHVIR
Sbjct: 609 RWLCGDSQLGYVDGPKTYAKFHCPCDVAWDPISQRLYIVDRENHVIR 655
>gi|340378860|ref|XP_003387945.1| PREDICTED: hypothetical protein LOC100632455 [Amphimedon
queenslandica]
Length = 1157
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 14/32 (43%), Positives = 22/32 (68%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ RF P+ A+D +G +YVAD+ NH I+K +
Sbjct: 150 NGRFQSPRDIAIDSQGLVYVADQCNHRIQKFS 181
>gi|334335815|ref|YP_004540967.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ Mal
allergen [Isoptericola variabilis 225]
gi|334106183|gb|AEG43073.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Isoptericola variabilis 225]
Length = 668
Score = 37.7 bits (86), Expect = 2.4, Method: Composition-based stats.
Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 4/71 (5%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA----VDMKGNIYVADKSNHVIRKI 130
P E V+ LAG +G DG G+A F +P A D G++++AD +R +
Sbjct: 358 PGPESGTVRLLAGTMNEGLEDGPGGTAWFAQPSGLAPVLGSDADGSLWLADAETSALRLV 417
Query: 131 TNLDHGNQLIR 141
T G +R
Sbjct: 418 TPAADGQVQVR 428
>gi|427706390|ref|YP_007048767.1| alkyl hydroperoxide reductase [Nostoc sp. PCC 7107]
gi|427358895|gb|AFY41617.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Nostoc sp. PCC 7107]
Length = 505
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 4/63 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLI 140
V + G G G +DG +A+F P+ D + I Y+AD NH +R++ D Q++
Sbjct: 206 VLHIIGTGKSGLTDGSFSTAQFSAPQGMVFDSENQIFYIADTENHALRRV---DLKQQIV 262
Query: 141 RQI 143
I
Sbjct: 263 ETI 265
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 15/48 (31%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ + GDG G DG+ ++RF +P + + +Y++D +NH IR++
Sbjct: 433 QTILGDGFAGLQDGQGKNSRFFEPSGLSA-INSYLYISDTNNHAIRRV 479
>gi|290978943|ref|XP_002672194.1| predicted protein [Naegleria gruberi]
gi|284085769|gb|EFC39450.1| predicted protein [Naegleria gruberi]
Length = 1483
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 36/77 (46%), Gaps = 6/77 (7%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S+ IV+ S++ L F L EE + LAG G+ D P P +
Sbjct: 733 STKYIVISDSKNN--RLRFYLKEEKTIYTLAG----GFGDNGPSENGIGTPFGVTISPNN 786
Query: 116 NIYVADKSNHVIRKITN 132
IYV+D N++IR+I N
Sbjct: 787 EIYVSDVLNNLIRRIKN 803
>gi|290994583|ref|XP_002679911.1| predicted protein [Naegleria gruberi]
gi|284093530|gb|EFC47167.1| predicted protein [Naegleria gruberi]
Length = 2449
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 44/89 (49%), Gaps = 9/89 (10%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGE-PGSARF 103
N SV P + D + +D R + + L V +AG SDG+ SA+F
Sbjct: 87 NVVSVFKSPATGDCVYMDQKRVRMISKTTNL-----VTTIAGSLTSTNTSDGQFATSAKF 141
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITN 132
PK+F +D GN+YV D N IRK++N
Sbjct: 142 -FPKAFTIDEYGNLYVVD-FNVRIRKVSN 168
>gi|58581916|ref|YP_200932.1| hypothetical protein XOO2293 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58426510|gb|AAW75547.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 472
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251
>gi|269837841|ref|YP_003320069.1| alkyl hydroperoxide reductase [Sphaerobacter thermophilus DSM
20745]
gi|269787104|gb|ACZ39247.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Sphaerobacter thermophilus DSM 20745]
Length = 494
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 36/84 (42%), Gaps = 5/84 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ + LIV DS S+ V+ G G G DG A F P
Sbjct: 180 PGKVLADAATDRLIVADSGHHRLIVARLDGSDAWVI----GSGEAGLRDGPGAIAAFHDP 235
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
+ A+ +YVAD NHV+R++
Sbjct: 236 QGLAL-YGDTLYVADTRNHVVRRV 258
Score = 37.4 bits (85), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 38/74 (51%), Gaps = 5/74 (6%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGN 116
L V DS SA T++FP +E V+ L G G+ + D G R P + A G
Sbjct: 356 LYVADSETSAVRTVTFPPGDE--VRTLVGTGLFDFGDVDGIGDEVRLQHPLAVACG-DGV 412
Query: 117 IYVADKSNHVIRKI 130
+YVAD NH I+++
Sbjct: 413 LYVADSYNHKIKRL 426
>gi|184201467|ref|YP_001855674.1| hypothetical protein KRH_18210 [Kocuria rhizophila DC2201]
gi|183581697|dbj|BAG30168.1| hypothetical protein [Kocuria rhizophila DC2201]
Length = 246
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 36/84 (42%), Gaps = 5/84 (5%)
Query: 55 GSSDLIVLDSSRSAFYTLSF----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
G L +D S+ L+ P + VV + G GY DG AR D P+
Sbjct: 111 GDGRLWFVDRDSSSLRCLTIDTDRPDGDPHVVTVVGRHGA-GYQDGPATEARLDHPEDLQ 169
Query: 111 VDMKGNIYVADKSNHVIRKITNLD 134
+ G++ VAD NH +R + D
Sbjct: 170 MLYDGSVVVADTGNHALRHVDVTD 193
>gi|188576512|ref|YP_001913441.1| hypothetical protein PXO_00583 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|188520964|gb|ACD58909.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
PXO99A]
Length = 472
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251
>gi|160882316|ref|ZP_02063319.1| hypothetical protein BACOVA_00264 [Bacteroides ovatus ATCC 8483]
gi|156112324|gb|EDO14069.1| IPT/TIG domain protein [Bacteroides ovatus ATCC 8483]
Length = 465
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 31/60 (51%), Gaps = 5/60 (8%)
Query: 76 LSEESVVKRLAGDGVQ---GYSDGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
L+ E V AG G GY DGE S A F+ P S A DMK Y+ D NH +RKI
Sbjct: 402 LTPEGRVTTYAGRGNSREWGYVDGELRSQALFNHPTSIAYDMKRKCFYIGDCDNHRVRKI 461
>gi|380016202|ref|XP_003692077.1| PREDICTED: membrane-associated protein Hem-like [Apis florea]
Length = 1439
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + G + DG+ +ARF+ P+ + + +YVAD NH IRKI
Sbjct: 256 IQHVIGGSNPDFRDGDFENARFNAPQGVCI-LDTFVYVADNENHAIRKI 303
>gi|346725095|ref|YP_004851764.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
pv. citrumelo F1]
gi|418518756|ref|ZP_13084891.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
pv. malvacearum str. GSPB1386]
gi|346649842|gb|AEO42466.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
pv. citrumelo F1]
gi|410702450|gb|EKQ60955.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
pv. malvacearum str. GSPB1386]
Length = 472
Score = 37.4 bits (85), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251
>gi|384428019|ref|YP_005637378.1| hypothetical protein XCR_2380 [Xanthomonas campestris pv. raphani
756C]
gi|341937121|gb|AEL07260.1| conserved hypothetical protein [Xanthomonas campestris pv. raphani
756C]
Length = 472
Score = 37.4 bits (85), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALERE-ELYVADTGNHALRRI-NLRSGQ 251
>gi|289665900|ref|ZP_06487481.1| hypothetical protein XcampvN_23187 [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 472
Score = 37.4 bits (85), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251
>gi|118575759|ref|YP_875502.1| hypothetical protein CENSYa_0565 [Cenarchaeum symbiosum A]
gi|118194280|gb|ABK77198.1| hypothetical protein CENSYa_0565 [Cenarchaeum symbiosum A]
Length = 1566
Score = 37.4 bits (85), Expect = 2.6, Method: Composition-based stats.
Identities = 21/47 (44%), Positives = 26/47 (55%), Gaps = 5/47 (10%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
L G G QG + GE F P+ A D G+IYVAD NH I++ T
Sbjct: 81 LFGWGTQGSNSGE-----FLLPRGIAADPGGDIYVADTGNHRIQRFT 122
>gi|449435944|ref|XP_004135754.1| PREDICTED: NHL repeat-containing protein 2-like [Cucumis sativus]
Length = 1086
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
++ G G DG A F++P+ A + K N+ YVAD NH +R++
Sbjct: 639 QIGSTGEDGLRDGNFDDATFNRPQGLAYNAKKNLLYVADTENHALREV 686
>gi|188991459|ref|YP_001903469.1| hypothetical protein xccb100_2064 [Xanthomonas campestris pv.
campestris str. B100]
gi|167733219|emb|CAP51417.1| Conserved hypothetical protein [Xanthomonas campestris pv.
campestris]
Length = 481
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 207 VLRQFGHGNADLIDGGVGEAAFRRPQGLALERE-ELYVADTGNHALRRI-NLRSGQ 260
>gi|418523208|ref|ZP_13089230.1| hypothetical protein WS7_19596 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410700170|gb|EKQ58738.1| hypothetical protein WS7_19596 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 472
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251
>gi|423303024|ref|ZP_17281045.1| hypothetical protein HMPREF1057_04186 [Bacteroides finegoldii
CL09T03C10]
gi|408470353|gb|EKJ88888.1| hypothetical protein HMPREF1057_04186 [Bacteroides finegoldii
CL09T03C10]
Length = 465
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 31/60 (51%), Gaps = 5/60 (8%)
Query: 76 LSEESVVKRLAGDGVQ---GYSDGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
L+ E V AG G GY DGE S A F+ P S A DMK Y+ D NH +RKI
Sbjct: 402 LTPEGRVTTYAGRGNSREWGYVDGELRSQALFNHPTSIAYDMKRKCFYIGDCDNHRVRKI 461
>gi|381173685|ref|ZP_09882761.1| NHL repeat family protein [Xanthomonas citri pv. mangiferaeindicae
LMG 941]
gi|380685855|emb|CCG39248.1| NHL repeat family protein [Xanthomonas citri pv. mangiferaeindicae
LMG 941]
Length = 472
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251
>gi|390991569|ref|ZP_10261830.1| NHL repeat family protein [Xanthomonas axonopodis pv. punicae str.
LMG 859]
gi|372553665|emb|CCF68805.1| NHL repeat family protein [Xanthomonas axonopodis pv. punicae str.
LMG 859]
Length = 472
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251
>gi|299144811|ref|ZP_07037879.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
gi|298515302|gb|EFI39183.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
Length = 459
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 12/59 (20%)
Query: 85 LAGD-GVQGYSDGEPGSARFDKP--------KSFAVDMKGNI---YVADKSNHVIRKIT 131
AG+ G GY D SARFD P + + + + +I Y+ D +NH IRKIT
Sbjct: 336 FAGEFGSYGYKDASQNSARFDNPMQGCFVLNEEYVAEQRADIYDFYLTDAANHCIRKIT 394
>gi|219852941|ref|YP_002467373.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219547200|gb|ACL17650.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 579
Score = 37.4 bits (85), Expect = 2.7, Method: Composition-based stats.
Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 82 VKRLAGDGV--QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
V+R G+ + G G +F P AVD GN+YVAD N+ ++K ++
Sbjct: 119 VQRFTSTGIFLARWGTGGTGDGQFKSPSGVAVDSAGNVYVADMYNYRVQKFSS 171
Score = 36.6 bits (83), Expect = 5.4, Method: Composition-based stats.
Identities = 19/48 (39%), Positives = 27/48 (56%), Gaps = 5/48 (10%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
LA G +G DG+ FD P AVD + N+YV D N+ ++K T+
Sbjct: 176 LAKWGTEGGGDGQ-----FDYPTGIAVDSENNVYVVDSYNNRVQKFTS 218
>gi|340382512|ref|XP_003389763.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 24-like [Amphimedon
queenslandica]
Length = 1354
Score = 37.4 bits (85), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 28/54 (51%), Gaps = 6/54 (11%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
E VK G+G G + +F P+ A+D +G +YVAD NH I+K +
Sbjct: 1006 EGKKVKSFGGEG------GRSANGQFQYPRDIAIDSQGLVYVADSYNHRIQKFS 1053
>gi|290996889|ref|XP_002681014.1| predicted protein [Naegleria gruberi]
gi|284094637|gb|EFC48270.1| predicted protein [Naegleria gruberi]
Length = 1023
Score = 37.4 bits (85), Expect = 2.8, Method: Composition-based stats.
Identities = 16/30 (53%), Positives = 22/30 (73%)
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
KPK +D GNIY+AD +N+ +RKI+ LD
Sbjct: 356 KPKGAVIDSLGNIYIADTNNNRVRKISYLD 385
Score = 37.4 bits (85), Expect = 3.1, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 3/54 (5%)
Query: 85 LAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
+AG G G++ D A+ P S +D N+Y+AD NH IRKI L +GN
Sbjct: 427 VAGVGSSGFNGDILATDAKLSNPVSVTIDSNDNVYIADTYNHRIRKI--LQNGN 478
Score = 35.8 bits (81), Expect = 8.9, Method: Composition-based stats.
Identities = 22/60 (36%), Positives = 34/60 (56%), Gaps = 4/60 (6%)
Query: 72 LSFPLSEESVVKRLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S +++ES V +AG GV GY + A P S + G+IY+AD SNH +R++
Sbjct: 11 VSTTVAQESFVSTIAGGGVCDGYLATQ---ASLAYPGSPTIGPDGSIYIADSSNHRVRQV 67
>gi|237718531|ref|ZP_04549012.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|229452238|gb|EEO58029.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
Length = 457
Score = 37.4 bits (85), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 31/60 (51%), Gaps = 5/60 (8%)
Query: 76 LSEESVVKRLAGDGVQ---GYSDGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
L+ E V AG G GY DGE S A F+ P S A DMK Y+ D NH +RKI
Sbjct: 394 LTPEGRVTTYAGRGNSREWGYVDGELRSQALFNHPTSIAYDMKRKCFYIGDCDNHRVRKI 453
>gi|449524308|ref|XP_004169165.1| PREDICTED: NHL repeat-containing protein 2-like, partial [Cucumis
sativus]
Length = 892
Score = 37.4 bits (85), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 26/107 (24%), Positives = 42/107 (39%), Gaps = 11/107 (10%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQ 142
++ G G DG A F++P+ A + K N+ YVAD NH +R++ + + +
Sbjct: 639 QIGSTGEDGLRDGNFDDATFNRPQGLAYNAKKNLLYVADTENHALREVDFVKERVRTLAG 698
Query: 143 INLKPEDCSKSSQSGSGMF---WVTVFSPY-------LISCKSFWVH 179
K D + S + W F P + WVH
Sbjct: 699 DGSKGSDYQGGKEGTSQLLNSPWDVCFEPINEKVYIAMAGQHQIWVH 745
>gi|397781344|ref|YP_006545817.1| Internalin-A [Methanoculleus bourgensis MS2]
gi|396939846|emb|CCJ37101.1| Internalin-A [Methanoculleus bourgensis MS2]
Length = 2759
Score = 37.4 bits (85), Expect = 2.9, Method: Composition-based stats.
Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
G+ +F+ P+ AV+ GN+YVAD NH I+K + GN L +
Sbjct: 91 GNGQFNGPQGIAVNAAGNVYVADTYNHRIQKFDS--SGNLLTK 131
>gi|325926967|ref|ZP_08188244.1| NHL repeat protein [Xanthomonas perforans 91-118]
gi|325542663|gb|EGD14128.1| NHL repeat protein [Xanthomonas perforans 91-118]
Length = 443
Score = 37.4 bits (85), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251
>gi|156406833|ref|XP_001641249.1| predicted protein [Nematostella vectensis]
gi|156228387|gb|EDO49186.1| predicted protein [Nematostella vectensis]
Length = 708
Score = 37.4 bits (85), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Query: 77 SEESVVKRLAG--DGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
S E VV + G DG G+ DG A+F P+ A+ + IYVAD NH IR+I NL
Sbjct: 229 STEGVVHEVIGGEDGFSAGFQDGTFKEAKFHAPQGVAM-LGEVIYVADTENHAIREI-NL 286
Query: 134 D 134
D
Sbjct: 287 D 287
>gi|302821385|ref|XP_002992355.1| hypothetical protein SELMODRAFT_6804 [Selaginella moellendorffii]
gi|300139771|gb|EFJ06505.1| hypothetical protein SELMODRAFT_6804 [Selaginella moellendorffii]
Length = 78
Score = 37.4 bits (85), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 1/43 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G GY DG +RF++P+S A+ G ++V D +N IRKI+
Sbjct: 15 GESGYVDGPAAKSRFNRPQSLAICDNGAVFV-DTTNLAIRKIS 56
>gi|84623828|ref|YP_451200.1| hypothetical protein XOO_2171 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|84367768|dbj|BAE68926.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 499
Score = 37.4 bits (85), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 225 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 278
>gi|422325157|ref|ZP_16406193.1| hypothetical protein HMPREF0737_01303 [Rothia mucilaginosa M508]
gi|353343435|gb|EHB87752.1| hypothetical protein HMPREF0737_01303 [Rothia mucilaginosa M508]
Length = 656
Score = 37.4 bits (85), Expect = 3.0, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 30/62 (48%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
V AG GV+G +DG A F +P G+++VA +R IT D+G Q+
Sbjct: 363 VSIFAGSGVEGLADGPAADAWFAQPSGIIEARDGSLWVACSETSGLRHITFTDNGVQVTS 422
Query: 142 QI 143
+
Sbjct: 423 AV 424
>gi|293372451|ref|ZP_06618835.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
gi|292632634|gb|EFF51228.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
Length = 446
Score = 37.4 bits (85), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 12/59 (20%)
Query: 85 LAGD-GVQGYSDGEPGSARFDKP--------KSFAVDMKGNI---YVADKSNHVIRKIT 131
AG+ G GY D SARFD P + + + + +I Y+ D +NH IRKIT
Sbjct: 323 FAGEFGSYGYKDASQNSARFDNPMQGCFVLNEEYVAEQRADIYDFYLTDAANHCIRKIT 381
>gi|195119117|ref|XP_002004078.1| GI18254 [Drosophila mojavensis]
gi|193914653|gb|EDW13520.1| GI18254 [Drosophila mojavensis]
Length = 734
Score = 37.4 bits (85), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
VVK G G+ DG +RF+ P+ A K + VAD NH +R+I+ L++G
Sbjct: 255 VVKYKIGGLEAGFVDGSLNMSRFNNPQGVAFLDKNTLIVADTDNHALRQIS-LNNG 309
>gi|290975356|ref|XP_002670409.1| predicted protein [Naegleria gruberi]
gi|284083967|gb|EFC37665.1| predicted protein [Naegleria gruberi]
Length = 2200
Score = 37.4 bits (85), Expect = 3.0, Method: Composition-based stats.
Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 6/56 (10%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+E S +GDG G A+ + P ++ G IY+AD NH IRKI +
Sbjct: 158 NEHSDCDGFSGDG------GLASRAKLNSPNGLSISQNGEIYIADTLNHRIRKINS 207
>gi|21242830|ref|NP_642412.1| hypothetical protein XAC2091 [Xanthomonas axonopodis pv. citri str.
306]
gi|21108317|gb|AAM36948.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 499
Score = 37.4 bits (85), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 225 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 278
>gi|321469552|gb|EFX80532.1| hypothetical protein DAPPUDRAFT_304106 [Daphnia pulex]
Length = 689
Score = 37.4 bits (85), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 6/76 (7%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE-PGSARFDKPKSFAVDMKGN- 116
L V DS T+ PL++ + G+G G +DG +F++P +D +G
Sbjct: 464 LYVADSYNHKIKTVD-PLTK--LCSTYLGNGKAGLADGSSEDEIQFNEPGGLCIDEEGGG 520
Query: 117 -IYVADKSNHVIRKIT 131
+YVAD +NH IRK+
Sbjct: 521 ILYVADTNNHCIRKVN 536
>gi|393912207|gb|EFO20548.2| RING finger protein nhl-1 [Loa loa]
Length = 1144
Score = 37.4 bits (85), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 24/48 (50%), Gaps = 8/48 (16%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
VK+ G G P S RF+ P AVD G IYV DK NH ++
Sbjct: 962 CVKKFGGHG--------PSSGRFNNPWGVAVDDMGMIYVVDKDNHRVQ 1001
>gi|289669010|ref|ZP_06490085.1| hypothetical protein XcampmN_11072, partial [Xanthomonas campestris
pv. musacearum NCPPB 4381]
Length = 284
Score = 37.4 bits (85), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251
>gi|410896258|ref|XP_003961616.1| PREDICTED: NHL repeat-containing protein 2-like [Takifugu rubripes]
Length = 716
Score = 37.4 bits (85), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 3/48 (6%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
++ G G DG+ A F+ P+ A+ KG+ +YVAD NH+IRKI
Sbjct: 254 QVIGGPESGRKDGDLSEASFNSPQGVAI--KGDTVYVADTENHLIRKI 299
>gi|21231560|ref|NP_637477.1| hypothetical protein XCC2116 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66768320|ref|YP_243082.1| hypothetical protein XC_1999 [Xanthomonas campestris pv. campestris
str. 8004]
gi|21113245|gb|AAM41401.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66573652|gb|AAY49062.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 499
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL G
Sbjct: 225 VLRQFGHGNADLIDGGVGEAAFRRPQGLALERE-ELYVADTGNHALRRI-NLRSGQ 278
>gi|326434843|gb|EGD80413.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
Length = 1449
Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats.
Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 80 SVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
S + +AG +G + G +A+ D+P AVD GN+Y+++K NH +
Sbjct: 1158 STITTIAGTKTRGSTGVGGLATAAQLDRPTCVAVDTMGNVYISEKGNHRV 1207
>gi|194767578|ref|XP_001965892.1| GF16360 [Drosophila ananassae]
gi|190619368|gb|EDV34892.1| GF16360 [Drosophila ananassae]
Length = 300
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 30/59 (50%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
+S +V+ + GD G DG+ ARF P+ + + VAD NH +RKI+ D
Sbjct: 72 VSSRGLVEHIIGDHKAGLIDGKFTEARFKHPQGLTFLDEHTLIVADTENHALRKISLAD 130
>gi|260822773|ref|XP_002606776.1| hypothetical protein BRAFLDRAFT_226091 [Branchiostoma floridae]
gi|229292120|gb|EEN62786.1| hypothetical protein BRAFLDRAFT_226091 [Branchiostoma floridae]
Length = 238
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 20/70 (28%), Positives = 38/70 (54%), Gaps = 11/70 (15%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
++V D + + + ++ ++VK++ G G G +FD+P VD + NI+
Sbjct: 76 VVVADPGKHSIFLFE---ADGTLVKQVGGQG--------QGEEQFDEPCFVTVDKEDNIF 124
Query: 119 VADKSNHVIR 128
VAD++NH I+
Sbjct: 125 VADQNNHRIQ 134
>gi|385204242|ref|ZP_10031112.1| NHL repeat protein [Burkholderia sp. Ch1-1]
gi|385184133|gb|EIF33407.1| NHL repeat protein [Burkholderia sp. Ch1-1]
Length = 321
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSAR--FDKPKSFAVDMKGNIYVADKSNHVIR 128
+ + + DG + S GEPGS++ F+ P + + D G +YVAD+ NH I+
Sbjct: 146 IHKYSPDGKRILSWGEPGSSKGQFNLPHNISCDEDGWVYVADRENHRIQ 194
>gi|91777978|ref|YP_553186.1| hypothetical protein Bxe_B2150 [Burkholderia xenovorans LB400]
gi|91690638|gb|ABE33836.1| Conserved hypothetical protein [Burkholderia xenovorans LB400]
Length = 321
Score = 37.0 bits (84), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSAR--FDKPKSFAVDMKGNIYVADKSNHVIR 128
+ + + DG + S GEPGS++ F+ P + + D G +YVAD+ NH I+
Sbjct: 146 IHKYSPDGKRILSWGEPGSSKGQFNLPHNISCDEDGWVYVADRENHRIQ 194
>gi|389844275|ref|YP_006346355.1| NHL repeat protein [Mesotoga prima MesG1.Ag.4.2]
gi|387859021|gb|AFK07112.1| NHL repeat protein [Mesotoga prima MesG1.Ag.4.2]
Length = 496
Score = 37.0 bits (84), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 34/66 (51%), Gaps = 3/66 (4%)
Query: 63 DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK 122
++ R LS P +++ + G G+ G DG +AR +KP+ V G +YVAD
Sbjct: 191 NNDRILLTELSTPFVAKTIDQ--IGSGLSGLEDGPFENARLNKPQGI-VYSNGRLYVADT 247
Query: 123 SNHVIR 128
NH +R
Sbjct: 248 ENHALR 253
Score = 35.8 bits (81), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 61/130 (46%), Gaps = 20/130 (15%)
Query: 31 GYTVTTVIDGHQLEIN---PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLA 86
G V + DGH + N P + +DR + L V DS SA + F E V+ +A
Sbjct: 324 GSGVENIGDGHLKDANFAQPSGLFLDR---NSLYVADSEVSA---IRFVDLEAGTVQTVA 377
Query: 87 GDGVQ--GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQLIRQI 143
G G+ GY DG + F P + +G +YVAD NH IRKI D G + + +
Sbjct: 378 GSGLFSFGYVDGILKRSLFQHP--IGIHGEGRFLYVADTYNHAIRKI---DLGIRRVETV 432
Query: 144 --NLKPEDCS 151
NL+ C+
Sbjct: 433 IKNLREGTCT 442
>gi|402881549|ref|XP_003904331.1| PREDICTED: NHL repeat-containing protein 2 [Papio anubis]
Length = 688
Score = 37.0 bits (84), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
G DG + F+ P+ A+ M IYVAD NH+IRK+T D G+++ R
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKLTKGD-GSEVQR 314
>gi|158521169|ref|YP_001529039.1| YD repeat-containing protein [Desulfococcus oleovorans Hxd3]
gi|158509995|gb|ABW66962.1| YD repeat protein [Desulfococcus oleovorans Hxd3]
Length = 2961
Score = 37.0 bits (84), Expect = 3.5, Method: Composition-based stats.
Identities = 18/54 (33%), Positives = 35/54 (64%), Gaps = 2/54 (3%)
Query: 79 ESVVKRLAGDGVQGYS-DGEPGSAR-FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ ++ +AG+G++G DG A + P+S VD+KGN+Y +++ ++ +RKI
Sbjct: 1769 DGIISTVAGNGLEGNGPDGISALATPLNDPESIVVDVKGNLYFSERDHYKVRKI 1822
>gi|408794504|ref|ZP_11206109.1| NHL repeat protein [Leptospira meyeri serovar Hardjo str. Went 5]
gi|408461739|gb|EKJ85469.1| NHL repeat protein [Leptospira meyeri serovar Hardjo str. Went 5]
Length = 677
Score = 37.0 bits (84), Expect = 3.5, Method: Composition-based stats.
Identities = 23/82 (28%), Positives = 36/82 (43%), Gaps = 6/82 (7%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
+DRP + L Y F +E ++ L+G + + G +F P S
Sbjct: 182 LDRP-----VSLAYHNQKIYVADFA-RDEVLLFDLSGSFISSFGGSGKGQGQFRGPSSIY 235
Query: 111 VDMKGNIYVADKSNHVIRKITN 132
+D GNI+VAD N I+K +
Sbjct: 236 IDSNGNIFVADSGNARIQKFNS 257
>gi|290975266|ref|XP_002670364.1| predicted protein [Naegleria gruberi]
gi|284083922|gb|EFC37620.1| predicted protein [Naegleria gruberi]
Length = 1584
Score = 37.0 bits (84), Expect = 3.6, Method: Composition-based stats.
Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 6/56 (10%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+E S +GDG G A+ + P ++ G IY+AD NH IRKI +
Sbjct: 158 NEHSDCDGFSGDG------GLASRAKLNSPNGLSISQNGEIYIADTLNHRIRKINS 207
>gi|72392883|ref|XP_847242.1| flagellum-adhesion glycoprotein [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62358460|gb|AAX78922.1| flagellum-adhesion glycoprotein, putative [Trypanosoma brucei]
gi|70803272|gb|AAZ13176.1| flagellum-adhesion glycoprotein, putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
Length = 590
Score = 37.0 bits (84), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 1/59 (1%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPED 149
+G DG G A F+ P S V + +IYVAD+ N+ IR+I + + Q KP+D
Sbjct: 108 RGNKDGPKGDALFNNPTS-VVSVNDDIYVADRDNNCIRRIDAEGNVTRYGPQDLNKPKD 165
>gi|383852866|ref|XP_003701946.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
71-like [Megachile rotundata]
Length = 722
Score = 37.0 bits (84), Expect = 3.6, Method: Composition-based stats.
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G +FD+P AVD + I VADK NH I+ +T
Sbjct: 500 GPGQFDRPAGVAVDARRRIVVADKDNHRIQVLT 532
>gi|327277470|ref|XP_003223487.1| PREDICTED: NHL repeat-containing protein 2-like [Anolis
carolinensis]
Length = 724
Score = 37.0 bits (84), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 17/82 (20%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
VI G ++V+ + YT+ P S G DG A F+ P+
Sbjct: 235 VIADTGHHRILVVKKTGQILYTVGGPDS--------------GRKDGTYSEATFNSPQGV 280
Query: 110 AVDMKGNI-YVADKSNHVIRKI 130
A+ K NI YVAD NH+IRK+
Sbjct: 281 AI--KNNIIYVADTENHLIRKV 300
>gi|261330448|emb|CBH13432.1| flagellum-adhesion glycoprotein, putative [Trypanosoma brucei
gambiense DAL972]
Length = 588
Score = 37.0 bits (84), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 1/59 (1%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPED 149
+G DG G A F+ P S V + +IYVAD+ N+ IR+I + + Q KP+D
Sbjct: 108 RGNKDGPKGDALFNNPTS-VVSVNDDIYVADRDNNCIRRIDAEGNVTRYGPQDLNKPKD 165
>gi|148654423|ref|YP_001274628.1| NHL repeat-containing protein [Roseiflexus sp. RS-1]
gi|148566533|gb|ABQ88678.1| NHL repeat containing protein [Roseiflexus sp. RS-1]
Length = 1146
Score = 37.0 bits (84), Expect = 3.7, Method: Composition-based stats.
Identities = 32/122 (26%), Positives = 51/122 (41%), Gaps = 23/122 (18%)
Query: 11 AITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHS--VIDRPGSSDLIVLDSSRSA 68
A+ + LV++ + E G ++ + I+P ++ + LIV D S +
Sbjct: 818 AVAATGSIIRLVATDIFGEFGNAPGQLVQPRGIAIDPQGNIIVSDSANHRLIVFDPSGTP 877
Query: 69 FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
T+ G G DG+ F +P+ AVD +GNIYVAD N I
Sbjct: 878 IRTI----------------GSFGSGDGQ-----FYEPRGVAVDAQGNIYVADTWNARIV 916
Query: 129 KI 130
K+
Sbjct: 917 KL 918
>gi|374312921|ref|YP_005059351.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
gi|358754931|gb|AEU38321.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
Length = 654
Score = 37.0 bits (84), Expect = 3.7, Method: Composition-based stats.
Identities = 26/80 (32%), Positives = 45/80 (56%), Gaps = 7/80 (8%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVD 112
P ++ I +D+S +A+ T S V +L+G G V +G G A D+P+S A+D
Sbjct: 374 PTAAGNIAIDTSGNAWVT-----SGNFSVTKLSGTGTVLSVVNGSSGGA-LDEPQSLAID 427
Query: 113 MKGNIYVADKSNHVIRKITN 132
GN++V D N+ + +I++
Sbjct: 428 GSGNVWVNDFENNSVTEISS 447
>gi|72392873|ref|XP_847237.1| flagellum-adhesion glycoprotein [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62358465|gb|AAX78927.1| flagellum-adhesion glycoprotein, putative [Trypanosoma brucei]
gi|70803267|gb|AAZ13171.1| flagellum-adhesion glycoprotein, putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
Length = 590
Score = 37.0 bits (84), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 1/59 (1%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPED 149
+G DG G A F+ P S V + +IYVAD+ N+ IR+I + + Q KP+D
Sbjct: 108 RGNKDGPKGDALFNNPTS-VVSVNDDIYVADRDNNCIRRIDAEGNVTRYGPQDLNKPKD 165
>gi|290993799|ref|XP_002679520.1| predicted protein [Naegleria gruberi]
gi|284093137|gb|EFC46776.1| predicted protein [Naegleria gruberi]
Length = 1414
Score = 37.0 bits (84), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 17/57 (29%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S+ ++ +AG G GY D + ++ + PK V G +++AD +N+ +RKI+
Sbjct: 850 VSKSGIISTIAGLGAAGYIDNVLATESQLNAPKGVVVAPSGEVFIADSNNNKVRKIS 906
>gi|290983130|ref|XP_002674282.1| predicted protein [Naegleria gruberi]
gi|284087871|gb|EFC41538.1| predicted protein [Naegleria gruberi]
Length = 1572
Score = 37.0 bits (84), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 6/74 (8%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGN-I 117
++L SRS+ L + +AG GV GY DG+ F + S AVD N I
Sbjct: 590 LILSHSRSSIRMLK---KSSGFITNIAGTFGVSGYKDGKTSLFSF-RLNSLAVDRNDNSI 645
Query: 118 YVADKSNHVIRKIT 131
Y++D N IRK+T
Sbjct: 646 YISDNGNLAIRKLT 659
>gi|390562751|ref|ZP_10244924.1| putative Peptidylamidoglycolate lyase [Nitrolancetus hollandicus
Lb]
gi|390172671|emb|CCF84237.1| putative Peptidylamidoglycolate lyase [Nitrolancetus hollandicus
Lb]
Length = 527
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 45/105 (42%), Gaps = 26/105 (24%)
Query: 27 LLEDGYTVTTVIDG--HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS---FPLSEESV 81
L DG +VT G H ++ +D G ++ V DS + LS F L+E S
Sbjct: 437 LTSDGTSVTNEWTGFDHPTDV----AVDAEG--NIFVADSGNNRVVKLSSAGFQLAEWSN 490
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
V G+ R D+P+ AVD KG +YV D H+
Sbjct: 491 VS---------------GNGRLDRPRGIAVDAKGMVYVTDLGGHI 520
Score = 36.6 bits (83), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
F P+ AVD GNIYVAD NH + K++
Sbjct: 280 FVSPQGLAVDSIGNIYVADSGNHTVAKLS 308
>gi|326435919|gb|EGD81489.1| hypothetical protein PTSG_02206 [Salpingoeca sp. ATCC 50818]
Length = 6977
Score = 37.0 bits (84), Expect = 3.9, Method: Composition-based stats.
Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 5/87 (5%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
+P V+ P + D++V D + ++ E V G G +G+ DG R
Sbjct: 912 SPQDVLSLP-NGDIVVCDRDNARVRVIT----PEGKVSTALGVGKRGHQDGVGAGVRLAG 966
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITN 132
P+ G+ V D NH IR++++
Sbjct: 967 PRGMTQLASGSFVVTDAENHCIREVSS 993
>gi|295837701|ref|ZP_06824634.1| NHL repeat protein [Streptomyces sp. SPB74]
gi|197695928|gb|EDY42861.1| NHL repeat protein [Streptomyces sp. SPB74]
Length = 610
Score = 37.0 bits (84), Expect = 3.9, Method: Composition-based stats.
Identities = 26/84 (30%), Positives = 45/84 (53%), Gaps = 4/84 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + +V D++R A L ++ V+R G G +G DG+ +ARF++P
Sbjct: 183 PGKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGGGERGLLDGDAATARFNEP 238
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
+ A+ + VAD NH +R++
Sbjct: 239 QGLALLPDRAVVVADTVNHALRRV 262
>gi|373253064|ref|ZP_09541182.1| thiol-disulfide isomerase-like thioredoxin [Nesterenkonia sp. F]
Length = 659
Score = 37.0 bits (84), Expect = 3.9, Method: Composition-based stats.
Identities = 23/78 (29%), Positives = 38/78 (48%), Gaps = 3/78 (3%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESV-VKRLAGDGVQ--GYSDGEPGSARFDKPKSFAV 111
+ D+ + DS S+ L ++ V+ + G+G+ G+ DG+P AR P A
Sbjct: 391 AAGDVWIADSETSSLRVLRTAEDPKAPRVETIVGEGMFDFGFRDGDPAQARLQHPLGVAA 450
Query: 112 DMKGNIYVADKSNHVIRK 129
G++ VAD N IR+
Sbjct: 451 LPDGSVLVADTYNGAIRR 468
>gi|340380228|ref|XP_003388625.1| PREDICTED: hypothetical protein LOC100634990 [Amphimedon
queenslandica]
Length = 616
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 35/59 (59%), Gaps = 3/59 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKIT 131
+S++ +++++ DG S G+ A D P+ A+D +G +YVAD+ NH I+K +
Sbjct: 286 VSDDHRIQKISIDGYLKASVGQVYIADRDNHRYPRDIAIDSQGLVYVADRGNHCIQKFS 344
>gi|405967973|gb|EKC33082.1| hypothetical protein CGI_10023982 [Crassostrea gigas]
Length = 617
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 34/62 (54%), Gaps = 5/62 (8%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPED 149
+QG+ SA F+ P VD +GN+YV K++ + ++T + +R+I L P +
Sbjct: 530 LQGHVTFAFRSAEFENPAGLCVDYQGNVYVCGKTSRNVFQVTKENE----VREI-LSPSE 584
Query: 150 CS 151
CS
Sbjct: 585 CS 586
>gi|290995663|ref|XP_002680402.1| predicted protein [Naegleria gruberi]
gi|284094023|gb|EFC47658.1| predicted protein [Naegleria gruberi]
Length = 974
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 8/76 (10%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKG 115
+LI+ D+ R L+ ++ LAG V G+S DG P + A F +D
Sbjct: 646 ELIISDAKRIRKIDLN------GIIVTLAGSNVAGFSGDGGPAADATFGTLGGIYLDSND 699
Query: 116 NIYVADKSNHVIRKIT 131
+IYV+D NH IRKI+
Sbjct: 700 DIYVSDPDNHRIRKIS 715
Score = 35.8 bits (81), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 57/115 (49%), Gaps = 18/115 (15%)
Query: 30 DGYTVTTVIDGHQLEINP------HSVIDRPGSSDLIVLDS------SRSAFYTLSFPLS 77
+G T V G + + +P +++I PG+ I+ DS S +F+ + L+
Sbjct: 210 EGRIATVVGSGQKGQESPDGVLAVNAIIGAPGN---IIFDSIGNMYLSDRSFHKVRKVLT 266
Query: 78 EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++V +AG+G+ Y+ DG SA +P A+ G +Y+A+ H IRK+
Sbjct: 267 NGTIVT-IAGNGMSAYNGDGILAVSASLFRPSGLALSSTGELYIAESYGHRIRKV 320
>gi|340053072|emb|CCC47357.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 434
Score = 37.0 bits (84), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 54/122 (44%), Gaps = 11/122 (9%)
Query: 10 LAITLLSLLFSLVSSG-LLLED----GYTVTTVIDGHQLEINPHSVID-RPGSSDLIVLD 63
LA++ + + LVS+G L+D T +V G + + P V PG+ IV
Sbjct: 51 LAVSRVQAIIELVSNGDAWLQDCGSTNGTFLSVNAGLGIRLKPQRVYQLTPGNH--IVFG 108
Query: 64 SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVA 120
R+ F SFPL E V K G E G A D P VD++G+++ +
Sbjct: 109 DVRTLFVDESFPLCREYVRKLEVTGGTASLQSTECGLSSGANDDAPTDTHVDLQGSLFSS 168
Query: 121 DK 122
DK
Sbjct: 169 DK 170
>gi|290976432|ref|XP_002670944.1| predicted protein [Naegleria gruberi]
gi|284084508|gb|EFC38200.1| predicted protein [Naegleria gruberi]
Length = 1514
Score = 37.0 bits (84), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 24/87 (27%), Positives = 47/87 (54%), Gaps = 6/87 (6%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DG-EPGSARF 103
NP ++ + +L + D++ + + L+ +++ +AG G GY+ DG + SA+
Sbjct: 156 NPSGIVVDSKTGELFIADTANNRIRKI---LTNGTIIT-IAGTGETGYNGDGIDATSAQI 211
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI 130
A+D+ +Y+AD +N+ IRKI
Sbjct: 212 RGVYGIALDVNSELYIADANNYRIRKI 238
>gi|87311714|ref|ZP_01093830.1| NHL repeat protein [Blastopirellula marina DSM 3645]
gi|87285608|gb|EAQ77526.1| NHL repeat protein [Blastopirellula marina DSM 3645]
Length = 652
Score = 37.0 bits (84), Expect = 4.1, Method: Composition-based stats.
Identities = 17/44 (38%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G+G G +DG+ +A F + A+D +YVAD NH++RK+
Sbjct: 199 GNGAIGTTDGDYATASFHHLQGMALD-GDTLYVADTENHMLRKV 241
>gi|156401400|ref|XP_001639279.1| predicted protein [Nematostella vectensis]
gi|156226406|gb|EDO47216.1| predicted protein [Nematostella vectensis]
Length = 820
Score = 37.0 bits (84), Expect = 4.2, Method: Composition-based stats.
Identities = 20/54 (37%), Positives = 29/54 (53%), Gaps = 5/54 (9%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ VV R A G +G +GE F P+ A D GN+ V D++NH + K T
Sbjct: 729 QDDVVTRRAVIGREGNKEGE-----FSYPRGIAFDSVGNLIVCDRNNHRLLKFT 777
>gi|290995474|ref|XP_002680320.1| predicted protein [Naegleria gruberi]
gi|284093940|gb|EFC47576.1| predicted protein [Naegleria gruberi]
Length = 992
Score = 37.0 bits (84), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 21/90 (23%), Positives = 42/90 (46%), Gaps = 5/90 (5%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG--YSDGEPGSAR 102
++P+ V+ P + D+ + D + + + +V +AG G G +G A+
Sbjct: 444 VDPYGVVVDPSNGDVFISDGYLNCVRKID---GKSGIVTTVAGTGEAGDVGDNGPSNKAQ 500
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
P ++ G++ +AD N IRK++N
Sbjct: 501 LFSPSGLSLTSSGDLLIADNGNQAIRKVSN 530
>gi|431798072|ref|YP_007224976.1| NHL repeat protein,IPT/TIG domain-containing protein [Echinicola
vietnamensis DSM 17526]
gi|430788837|gb|AGA78966.1| NHL repeat protein,IPT/TIG domain-containing protein [Echinicola
vietnamensis DSM 17526]
Length = 492
Score = 37.0 bits (84), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 35/87 (40%), Gaps = 13/87 (14%)
Query: 53 RPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-------GYSDGE-PGSARFD 104
+P D D A LS E V AG G GY DG+ ARFD
Sbjct: 392 KPDEYDFYFTDQYNHAIRKLS----PEGSVTTFAGRGSSSINPDPYGYVDGDLREEARFD 447
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
+P A G Y+ D+ NH IRKI
Sbjct: 448 RPSGIAYS-DGAFYIGDQMNHRIRKIA 473
>gi|195456364|ref|XP_002075106.1| GK23381 [Drosophila willistoni]
gi|194171191|gb|EDW86092.1| GK23381 [Drosophila willistoni]
Length = 721
Score = 37.0 bits (84), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 30/53 (56%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ VV+ G G+ DG+ ++RF+ P+ A + + VAD NH +R+I+
Sbjct: 259 DGVVEHKIGGFESGFIDGDLTTSRFNNPQGIAFLDEDTLIVADTKNHALRQIS 311
>gi|430743901|ref|YP_007203030.1| thiol-disulfide isomerase-like thioredoxin [Singulisphaera
acidiphila DSM 18658]
gi|430015621|gb|AGA27335.1| thiol-disulfide isomerase-like thioredoxin [Singulisphaera
acidiphila DSM 18658]
Length = 696
Score = 37.0 bits (84), Expect = 4.4, Method: Composition-based stats.
Identities = 24/84 (28%), Positives = 37/84 (44%), Gaps = 5/84 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ S L + D++ + + VV G G G DG+ A F++P
Sbjct: 229 PGKVLADEKSKQLFIADTAHNRIVLTDLDGRKSVVV----GSGGIGMVDGDYAKAEFNRP 284
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
+ + + +YVAD NH IR I
Sbjct: 285 QGLCL-VDDTLYVADTENHAIRAI 307
>gi|380693132|ref|ZP_09857991.1| hypothetical protein BfaeM_04015 [Bacteroides faecis MAJ27]
Length = 472
Score = 36.6 bits (83), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 32/67 (47%), Gaps = 9/67 (13%)
Query: 74 FPLSEESVVKRLAGDG-------VQGYSDGE-PGSARFDKPKSFAVDMKGN-IYVADKSN 124
F ++ + V AG G V G DGE ARFD P A D + Y+ADK N
Sbjct: 404 FKITPDGNVSLFAGRGSVSSDNKVHGNIDGELKTEARFDNPCGIAYDEETQTFYIADKEN 463
Query: 125 HVIRKIT 131
H IR I+
Sbjct: 464 HSIRTIS 470
>gi|290973162|ref|XP_002669318.1| predicted protein [Naegleria gruberi]
gi|284082864|gb|EFC36574.1| predicted protein [Naegleria gruberi]
Length = 832
Score = 36.6 bits (83), Expect = 4.6, Method: Composition-based stats.
Identities = 33/112 (29%), Positives = 57/112 (50%), Gaps = 14/112 (12%)
Query: 23 SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
S G ++G +IDG P+S+ P S DL +D + Y + +S + ++
Sbjct: 268 SKGYSGDEGNAADAMIDG------PYSLAFHPVSGDLTFVDINN---YRIR-KISNKGII 317
Query: 83 KRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+AG+G +G S G+ GSA + S + G +Y+A++ N+ IRKI+
Sbjct: 318 STIAGNGEKG-SIGDGGSALNAQIYYSVSISFSPNGELYIANEWNNRIRKIS 368
>gi|326431538|gb|EGD77108.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
Length = 1455
Score = 36.6 bits (83), Expect = 4.6, Method: Composition-based stats.
Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 80 SVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
S + +AG +G + G +A+ D+P AVD GN+Y+++K NH +
Sbjct: 1164 STITTIAGTKTRGSAGVGGLATAAQLDRPTCVAVDTIGNVYISEKGNHRV 1213
>gi|290976072|ref|XP_002670765.1| predicted protein [Naegleria gruberi]
gi|284084327|gb|EFC38021.1| predicted protein [Naegleria gruberi]
Length = 854
Score = 36.6 bits (83), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 30/52 (57%), Gaps = 4/52 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +AG G GY +G+ G A P+ AV G +Y++D +N++IRK+
Sbjct: 73 ISTIAGTGSAGY-NGDGGLAIETMLSSPQGVAVSESGEVYLSDSTNNIIRKV 123
>gi|156740315|ref|YP_001430444.1| NHL repeat-containing protein [Roseiflexus castenholzii DSM 13941]
gi|156231643|gb|ABU56426.1| NHL repeat containing protein [Roseiflexus castenholzii DSM 13941]
Length = 1163
Score = 36.6 bits (83), Expect = 4.7, Method: Composition-based stats.
Identities = 33/141 (23%), Positives = 58/141 (41%), Gaps = 26/141 (18%)
Query: 11 AITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHS--VIDRPGSSDLIVLDSSRSA 68
A+ + LV++ + E G +I + ++P ++ G+ LIV D S +
Sbjct: 835 AVAATGSVIKLVATDIFGEFGSEPGQLIQPRGVAVDPQGNVIVSDSGNHRLIVFDPSGTP 894
Query: 69 FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
T+ G+ +G+ +F +P+ AVD GN+YVAD N
Sbjct: 895 IRTIG------------------GFGNGD---GQFYEPRGVAVDAAGNMYVADTWN---A 930
Query: 129 KITNLDHGNQLIRQINLKPED 149
+I LD + + + ED
Sbjct: 931 RIVKLDPQGRFLSSWGVGRED 951
>gi|153806006|ref|ZP_01958674.1| hypothetical protein BACCAC_00251 [Bacteroides caccae ATCC 43185]
gi|149130683|gb|EDM21889.1| IPT/TIG domain protein [Bacteroides caccae ATCC 43185]
Length = 530
Score = 36.6 bits (83), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 38/86 (44%), Gaps = 14/86 (16%)
Query: 64 SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG--------------EPGSARFDKPKSF 109
S+ SA ++ + + +V AG G G +D E A F P S
Sbjct: 445 STVSADQSIRYIDQKTGMVTTFAGSGTSGNTDSSFEVLSYAGVNSALESLPAAFGAPSSV 504
Query: 110 AVDMKGNIYVADKSNHVIRKITNLDH 135
VD G +YV D+ N+ ++KIT ++
Sbjct: 505 CVDKDGTVYVLDRRNNCVKKITTVEK 530
>gi|163788357|ref|ZP_02182803.1| hypothetical protein FBALC1_08248 [Flavobacteriales bacterium
ALC-1]
gi|159876677|gb|EDP70735.1| hypothetical protein FBALC1_08248 [Flavobacteriales bacterium
ALC-1]
Length = 500
Score = 36.6 bits (83), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 13/66 (19%)
Query: 70 YTLSFPLSEESVVKRLAGD--------GVQG----YSDGEPGSARFDKPKSFAVDMKGNI 117
Y+L FPL++ ++ K+L G +Q YS G+ R D+P++FA D+ GN
Sbjct: 342 YSLHFPLTQNALQKKLNGGMDIFISKLNIQNSSIVYSTYLGGNKR-DRPRNFATDLNGNA 400
Query: 118 YVADKS 123
Y+ K+
Sbjct: 401 YIIGKT 406
>gi|374855491|dbj|BAL58347.1| NHL repeat containing protein [uncultured candidate division OP1
bacterium]
Length = 325
Score = 36.6 bits (83), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 14/32 (43%), Positives = 23/32 (71%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G+ +F+ ++ AVD +GN+YVAD NH ++K
Sbjct: 39 GNGQFNALEAVAVDAQGNVYVADTENHRVQKF 70
>gi|290990712|ref|XP_002677980.1| predicted protein [Naegleria gruberi]
gi|284091590|gb|EFC45236.1| predicted protein [Naegleria gruberi]
Length = 1555
Score = 36.6 bits (83), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 1/51 (1%)
Query: 81 VVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+++ +AG GV+G++ DG S D P F+++ IY AD N+ IR +
Sbjct: 605 IIQTIAGSGVKGFNGDGPALSTHLDTPMGFSLNSDWIIYFADSGNNRIRYV 655
>gi|149177401|ref|ZP_01856005.1| hypothetical protein PM8797T_18906 [Planctomyces maris DSM 8797]
gi|148843734|gb|EDL58093.1| hypothetical protein PM8797T_18906 [Planctomyces maris DSM 8797]
Length = 683
Score = 36.6 bits (83), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 5/84 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P ++ P L + DS+ + S + + + G G G DG+ +A FD P
Sbjct: 223 PGKLLADPAQQRLFISDSNHNRIVIASL----DGKLIDVIGSGQIGNKDGDYKTASFDHP 278
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
+ A+ + +YVAD NH IR +
Sbjct: 279 QGMAL-VGNTLYVADTENHSIRTV 301
>gi|148657095|ref|YP_001277300.1| PA14 domain-containing protein [Roseiflexus sp. RS-1]
gi|148569205|gb|ABQ91350.1| PA14 domain protein [Roseiflexus sp. RS-1]
Length = 1293
Score = 36.6 bits (83), Expect = 4.9, Method: Composition-based stats.
Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
S+ ++R + G + G PG+ +F P+S AV G +YVAD NH I++ +
Sbjct: 76 SDNHRIQRFSAAGELLGAWGSPGTGDGQFSSPRSVAVAPDGTVYVADTGNHRIQRFS 132
>gi|37519956|ref|NP_923333.1| hypothetical protein gll0387 [Gloeobacter violaceus PCC 7421]
gi|35210948|dbj|BAC88328.1| gll0387 [Gloeobacter violaceus PCC 7421]
Length = 544
Score = 36.6 bits (83), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G G+ G+ DG A F P+ A+ G ++V D NH++RKI
Sbjct: 251 GSGIPGWRDGNFEEAEFWAPQGLALSADGRTLFVCDTENHLLRKI 295
>gi|290996598|ref|XP_002680869.1| predicted protein [Naegleria gruberi]
gi|284094491|gb|EFC48125.1| predicted protein [Naegleria gruberi]
Length = 1407
Score = 36.6 bits (83), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 56/131 (42%), Gaps = 18/131 (13%)
Query: 14 LLSLLFSLVSSGLLLED---GYTVTTV---------IDGHQLEINPHSVIDRPGSSDLIV 61
L L+ + SGL++ Y +TT+ ID I+ +S S DL +
Sbjct: 11 LCLLILTYHGSGLVIGAISVKYNITTIAGYNSLSSGIDAKMANIDVNSCFKDLSSGDLFI 70
Query: 62 LDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYV 119
D+S + + V +AG GV GY+ + + +A+ + P V IY+
Sbjct: 71 SDNSNHRIVRV---FAANGTVLTIAGTGVSGYNGDNIQATTAQLNTPTGVFV-FNSEIYI 126
Query: 120 ADKSNHVIRKI 130
AD N +RKI
Sbjct: 127 ADSQNSRVRKI 137
>gi|290988662|ref|XP_002677016.1| predicted protein [Naegleria gruberi]
gi|284090621|gb|EFC44272.1| predicted protein [Naegleria gruberi]
Length = 450
Score = 36.6 bits (83), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 24/37 (64%)
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+DG P S P A+D +GNI+++D N++IRK+
Sbjct: 188 TDGPPRSIPLYYPTGIAMDEEGNIFISDTRNNLIRKV 224
>gi|206603181|gb|EDZ39661.1| NHL repeat domain protein [Leptospirillum sp. Group II '5-way CG']
Length = 494
Score = 36.6 bits (83), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 17/55 (30%), Positives = 30/55 (54%), Gaps = 2/55 (3%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKS--FAVDMKGNIYVADKSNHVIRKI 130
E++ ++ + G G+ G +DG A+F P+ F D VAD NH++R++
Sbjct: 210 EKAALREVIGQGIPGSADGSFDQAQFRDPQGVRFCPDDPDTAIVADTGNHLLRRV 264
>gi|241584846|ref|XP_002403871.1| NHL repeat-containing protein [Ixodes scapularis]
gi|215500274|gb|EEC09768.1| NHL repeat-containing protein [Ixodes scapularis]
Length = 621
Score = 36.6 bits (83), Expect = 5.4, Method: Composition-based stats.
Identities = 33/122 (27%), Positives = 56/122 (45%), Gaps = 5/122 (4%)
Query: 39 DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE- 97
DG L++ + SSD +L + S + + ++ + LAG GV G + G
Sbjct: 469 DGSALDVRLQHPLGVCWSSDKQLLYVADSYNHKIRKVDVQKRLCTTLAGTGVAGDATGSF 528
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSG 157
+FD+P V + +YVAD +NH + K+ +LD G + ++ L D + S
Sbjct: 529 SDEVQFDEPGGLCV-VGSRLYVADTNNHCV-KLVHLDLG--FVEKVTLTLPDATDSRDED 584
Query: 158 SG 159
S
Sbjct: 585 SA 586
>gi|147918860|ref|YP_687414.1| hypothetical protein RRC373 [Methanocella arvoryzae MRE50]
gi|110622810|emb|CAJ38088.1| hypothetical protein RRC373 [Methanocella arvoryzae MRE50]
Length = 539
Score = 36.6 bits (83), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 39/87 (44%), Gaps = 8/87 (9%)
Query: 52 DRPGSSDL---IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKS 108
D PG D I +D + + + T +F + K G EPG +FD+
Sbjct: 82 DEPGQFDQTSGIAVDDTGNIYVTDTFNHRIQVWNKATNTWTTMGSHGDEPG--QFDQTSG 139
Query: 109 FAVDMKGNIYVADKSNHVIR---KITN 132
AVD GNIYV D NH I+ K TN
Sbjct: 140 IAVDDTGNIYVTDTINHRIQVWNKATN 166
>gi|170573447|ref|XP_001892474.1| Hypothetical RING finger protein F54G8.4 in chromosome III,
putative [Brugia malayi]
gi|158601961|gb|EDP38697.1| Hypothetical RING finger protein F54G8.4 in chromosome III,
putative [Brugia malayi]
Length = 1024
Score = 36.6 bits (83), Expect = 5.5, Method: Composition-based stats.
Identities = 17/42 (40%), Positives = 23/42 (54%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
G V+ + P + RF+ P AVD G IYV DK NH ++
Sbjct: 840 GHCVRKFGGHGPSNGRFNNPWGVAVDDMGMIYVVDKDNHRVQ 881
>gi|290976289|ref|XP_002670873.1| predicted protein [Naegleria gruberi]
gi|284084436|gb|EFC38129.1| predicted protein [Naegleria gruberi]
Length = 1395
Score = 36.6 bits (83), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 49/100 (49%), Gaps = 9/100 (9%)
Query: 33 TVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
T+TT+ + L P S+ P + DL + +S + LS + + +AG G +G
Sbjct: 558 TITTLSTSNLLNY-PTSLFYTP-NGDLYIANSGGNQILKLS-----KGTISVIAGTGTRG 610
Query: 93 -YSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
DG+ SA+ P + V G I++AD N+ IRKI
Sbjct: 611 NQGDGKQATSAQLSYPLAVTVTSNGVIFIADSGNNAIRKI 650
>gi|198421727|ref|XP_002130106.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 454
Score = 36.6 bits (83), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 20/86 (23%), Positives = 38/86 (44%), Gaps = 1/86 (1%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + + L + D+ A ++ + V G ++G+ DG A +
Sbjct: 176 NPQQLEYDDTNDRLFIADTDNHAIRIINVDSATPKVTTLTGGPRMEGFKDGSFAEAMWRH 235
Query: 106 PKSFAVDMKGNI-YVADKSNHVIRKI 130
P A D++ ++ YV+D NH +R +
Sbjct: 236 PTGMAYDVRKDVLYVSDHYNHAVRML 261
>gi|326923951|ref|XP_003208196.1| PREDICTED: NHL repeat-containing protein 2-like [Meleagris
gallopavo]
Length = 727
Score = 36.6 bits (83), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 24/40 (60%), Gaps = 3/40 (7%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G DG A F+ P+ A+ K N IYVAD NH+IRKI
Sbjct: 262 GRKDGRFSEAAFNSPQGVAI--KNNVIYVADTENHLIRKI 299
>gi|255513298|gb|EET89564.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Candidatus Micrarchaeum acidiphilum ARMAN-2]
Length = 579
Score = 36.6 bits (83), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 33/50 (66%), Gaps = 4/50 (8%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLD 134
AG+G +G SDG ++ F +P+ ++ GN IY+AD N+ +R+I NLD
Sbjct: 286 AGNGAKGLSDGSFETSEFYRPQ--GLEWVGNRIYIADTENNAVREI-NLD 332
>gi|57529823|ref|NP_001006504.1| NHL repeat-containing protein 2 [Gallus gallus]
gi|82081010|sp|Q5ZI67.1|NHLC2_CHICK RecName: Full=NHL repeat-containing protein 2
gi|53136494|emb|CAG32576.1| hypothetical protein RCJMB04_29n5 [Gallus gallus]
Length = 727
Score = 36.2 bits (82), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 24/40 (60%), Gaps = 3/40 (7%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G DG A F+ P+ A+ K N IYVAD NH+IRKI
Sbjct: 262 GRKDGRFSEAAFNSPQGVAI--KNNVIYVADTENHLIRKI 299
>gi|428162946|gb|EKX32046.1| hypothetical protein GUITHDRAFT_61646, partial [Guillardia theta
CCMP2712]
Length = 254
Score = 36.2 bits (82), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
LAG G GY+D A+F+KP +D + YV+D +N+ IR+I
Sbjct: 154 LAGSGTAGYNDAVGVLAQFNKPVDVTLDWTESYAYVSDYNNNCIRRI 200
>gi|124003658|ref|ZP_01688507.1| NHL repeat [Microscilla marina ATCC 23134]
gi|123991227|gb|EAY30679.1| NHL repeat [Microscilla marina ATCC 23134]
Length = 508
Score = 36.2 bits (82), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 3/45 (6%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G G QG DG+ A F +P A +KG+ +YVAD N+VIR++
Sbjct: 245 GSGTQGLKDGDFDKATFYEPHGLA--LKGDFLYVADTKNNVIRRV 287
>gi|229029660|ref|ZP_04185736.1| Cell surface protein [Bacillus cereus AH1271]
gi|228731664|gb|EEL82570.1| Cell surface protein [Bacillus cereus AH1271]
Length = 617
Score = 36.2 bits (82), Expect = 6.0, Method: Composition-based stats.
Identities = 19/43 (44%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
G A F +P AVD K NIYV D+ N+ ++K N GN L +
Sbjct: 476 GEAEFFQPMQLAVDSKDNIYVVDRINNRVQKFDN--EGNFLTK 516
Score = 36.2 bits (82), Expect = 6.9, Method: Composition-based stats.
Identities = 16/34 (47%), Positives = 22/34 (64%), Gaps = 2/34 (5%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
PG +F P A+D KGNI++AD NH I+K +
Sbjct: 430 PGFFQF--PSGIAIDSKGNIFIADSENHRIQKFS 461
>gi|423219561|ref|ZP_17206057.1| hypothetical protein HMPREF1061_02830 [Bacteroides caccae
CL03T12C61]
gi|392624766|gb|EIY18844.1| hypothetical protein HMPREF1061_02830 [Bacteroides caccae
CL03T12C61]
Length = 543
Score = 36.2 bits (82), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 38/86 (44%), Gaps = 14/86 (16%)
Query: 64 SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG--------------EPGSARFDKPKSF 109
S+ SA ++ + + +V AG G G +D E A F P S
Sbjct: 458 STVSADQSIRYIDQKTGMVTTFAGSGTSGNTDSSFEVLSYAGVNSALESLLAAFGAPSSV 517
Query: 110 AVDMKGNIYVADKSNHVIRKITNLDH 135
VD G +YV D+ N+ ++KIT ++
Sbjct: 518 CVDKDGTVYVLDRRNNCVKKITTVEK 543
>gi|384419507|ref|YP_005628867.1| hypothetical protein XOC_2585 [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353462420|gb|AEQ96699.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 472
Score = 36.2 bits (82), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 29/56 (51%), Gaps = 2/56 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
V R G G DG G A F +P+ A++ +YVAD NH +R+I NL G
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALESD-QLYVADTGNHALRRI-NLRSGQ 251
>gi|290977893|ref|XP_002671671.1| predicted protein [Naegleria gruberi]
gi|284085242|gb|EFC38927.1| predicted protein [Naegleria gruberi]
Length = 3330
Score = 36.2 bits (82), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 37/61 (60%), Gaps = 5/61 (8%)
Query: 74 FPLSEES-VVKRLAGD-GVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+ SEES V+R+AG GY+ DG+P + +P+S V G +Y+ D +NH+IR+
Sbjct: 2025 YSASEESGTVQRIAGSLSSSGYNGDGKPAIDSLLSQPQSVFVK-DGEVYICDTNNHLIRR 2083
Query: 130 I 130
I
Sbjct: 2084 I 2084
>gi|374994067|ref|YP_004969566.1| gluconolactonase [Desulfosporosinus orientis DSM 765]
gi|357212433|gb|AET67051.1| gluconolactonase [Desulfosporosinus orientis DSM 765]
Length = 1526
Score = 36.2 bits (82), Expect = 6.2, Method: Composition-based stats.
Identities = 32/102 (31%), Positives = 46/102 (45%), Gaps = 23/102 (22%)
Query: 30 DGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD 88
DG + T+ G NP+ V +D G + V DSS +A +KR+ D
Sbjct: 198 DGTNIVTLGSGFS---NPNGVAVDSSGK--IYVADSSNNA-------------IKRMDAD 239
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G + G F P AVD G IYVAD +N+ I+++
Sbjct: 240 GTNIVTLG----TGFSTPNGVAVDSSGKIYVADTNNNAIKRM 277
Score = 36.2 bits (82), Expect = 6.7, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 4/51 (7%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ +KR+ DG + G + F P AVD G IYVAD SN+ I+++
Sbjct: 190 NAIKRMDADGTNIVTLG----SGFSNPNGVAVDSSGKIYVADSSNNAIKRM 236
>gi|86738767|ref|YP_479167.1| serine/threonine protein kinase [Frankia sp. CcI3]
gi|86565629|gb|ABD09438.1| serine/threonine protein kinase [Frankia sp. CcI3]
Length = 863
Score = 36.2 bits (82), Expect = 6.2, Method: Composition-based stats.
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+A+ D P S A D GNIY+ D N+ IRKI+
Sbjct: 596 AAQLDGPGSTARDKAGNIYIGDAKNNRIRKIS 627
>gi|340725457|ref|XP_003401086.1| PREDICTED: NHL repeat-containing protein 2-like [Bombus terrestris]
Length = 692
Score = 36.2 bits (82), Expect = 6.3, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 3/55 (5%)
Query: 93 YSDGEP-GSARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLDHGNQLIRQINL 145
Y G P S FD+P A++ KG IY+AD +NH + K+ N + GN IN+
Sbjct: 509 YGVGIPHASFLFDEPSGIAINAEKGLIYIADTNNHAV-KMINTERGNITTLPINI 562
>gi|228985056|ref|ZP_04145224.1| Cell surface protein [Bacillus thuringiensis serovar tochigiensis
BGSC 4Y1]
gi|228774744|gb|EEM23142.1| Cell surface protein [Bacillus thuringiensis serovar tochigiensis
BGSC 4Y1]
Length = 598
Score = 36.2 bits (82), Expect = 6.4, Method: Composition-based stats.
Identities = 16/33 (48%), Positives = 22/33 (66%), Gaps = 2/33 (6%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
PG +F P A+D KGNI++AD NH I+K+
Sbjct: 411 PGFFQF--PSGIAIDSKGNIFIADSENHRIQKL 441
>gi|229155542|ref|ZP_04283650.1| Cell surface protein [Bacillus cereus ATCC 4342]
gi|228627860|gb|EEK84579.1| Cell surface protein [Bacillus cereus ATCC 4342]
Length = 598
Score = 36.2 bits (82), Expect = 6.4, Method: Composition-based stats.
Identities = 16/33 (48%), Positives = 22/33 (66%), Gaps = 2/33 (6%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
PG +F P A+D KGNI++AD NH I+K+
Sbjct: 411 PGFFQF--PSGIAIDSKGNIFIADSENHRIQKL 441
>gi|47568330|ref|ZP_00239032.1| cell surface protein [Bacillus cereus G9241]
gi|47555023|gb|EAL13372.1| cell surface protein [Bacillus cereus G9241]
Length = 617
Score = 36.2 bits (82), Expect = 6.4, Method: Composition-based stats.
Identities = 16/33 (48%), Positives = 22/33 (66%), Gaps = 2/33 (6%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
PG +F P A+D KGNI++AD NH I+K+
Sbjct: 430 PGFFQF--PSGIAIDSKGNIFIADSENHRIQKL 460
>gi|402587633|gb|EJW81568.1| RING finger protein nhl-1 [Wuchereria bancrofti]
Length = 794
Score = 36.2 bits (82), Expect = 6.5, Method: Composition-based stats.
Identities = 17/42 (40%), Positives = 23/42 (54%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
G V+ + P + RF+ P AVD G IYV DK NH ++
Sbjct: 610 GHCVRKFGGHGPSNGRFNNPWGVAVDDMGMIYVVDKDNHRVQ 651
>gi|435849842|ref|YP_007301783.1| PDK repeat-containing protein [Methanomethylovorans hollandica DSM
15978]
gi|433663330|gb|AGB50755.1| PDK repeat-containing protein [Methanomethylovorans hollandica DSM
15978]
Length = 1881
Score = 36.2 bits (82), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 14/33 (42%), Positives = 22/33 (66%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+ +F+ P + VD GN+YVAD +N+ I+K N
Sbjct: 47 TGQFNNPSAVTVDSYGNVYVADANNYRIQKFDN 79
>gi|290972811|ref|XP_002669144.1| predicted protein [Naegleria gruberi]
gi|284082687|gb|EFC36400.1| predicted protein [Naegleria gruberi]
Length = 1362
Score = 36.2 bits (82), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 86 AGDGVQGYSDGEP--GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AG G+ GYS S F+ P A + + +++AD +NH IRK+
Sbjct: 429 AGTGLVGYSGNYTLRNSTLFNSPNKLAFNSRNELFIADSNNHRIRKV 475
>gi|288801999|ref|ZP_06407440.1| conserved hypothetical protein [Prevotella melaninogenica D18]
gi|288335434|gb|EFC73868.1| conserved hypothetical protein [Prevotella melaninogenica D18]
Length = 486
Score = 36.2 bits (82), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 15/40 (37%), Positives = 22/40 (55%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+G+ DG +A+F P+ G +Y+AD NH IR I
Sbjct: 384 KGWEDGALKNAKFCYPRQMTFTKDGKLYIADSGNHCIRMI 423
>gi|260798148|ref|XP_002594062.1| hypothetical protein BRAFLDRAFT_68498 [Branchiostoma floridae]
gi|229279295|gb|EEN50073.1| hypothetical protein BRAFLDRAFT_68498 [Branchiostoma floridae]
Length = 515
Score = 36.2 bits (82), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 36/77 (46%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
G + LI++ + + F + P+ V + + +G + +FDKP AV +
Sbjct: 246 GKAPLIIVQTEPAVFLPIDTPVPVLGHVMLPSLPNTISFGEGGSRNGQFDKPSGVAVSDE 305
Query: 115 GNIYVADKSNHVIRKIT 131
G I+V D NH I+ T
Sbjct: 306 GEIFVTDLRNHRIQVFT 322
>gi|323320709|gb|ADX36386.1| putative RTX-toxin [Vibrio vulnificus]
Length = 1838
Score = 36.2 bits (82), Expect = 7.1, Method: Composition-based stats.
Identities = 28/103 (27%), Positives = 48/103 (46%), Gaps = 13/103 (12%)
Query: 70 YTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR- 128
Y+ P +E+S+V +G + Y D + K + +D+KGN + D + H++R
Sbjct: 161 YSFEAPHAEKSIVSFWSGKNFKQYRDILDNAQTDGKKVVYDIDVKGNAFAIDLNKHLMRW 220
Query: 129 --KITNLDHG--NQLIRQINLKPEDCSKSSQSGSGMFWVTVFS 167
N D+ NQL I D + S +G FW +V++
Sbjct: 221 GGLFPNPDNAEQNQLKSSI-----DAATFSNTG---FWSSVYA 255
>gi|300743712|ref|ZP_07072732.1| putative NHL repeat protein [Rothia dentocariosa M567]
gi|300380073|gb|EFJ76636.1| putative NHL repeat protein [Rothia dentocariosa M567]
Length = 667
Score = 36.2 bits (82), Expect = 7.2, Method: Composition-based stats.
Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 4/69 (5%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TN 132
P+SE + AG GV+G DG A F +P GN++VA +R + T
Sbjct: 364 PVSE--TISIFAGAGVEGLQDGTAEDAWFAQPSGIIEARDGNLWVACSETSGLRHVTFTR 421
Query: 133 LDHGNQLIR 141
+HG+Q ++
Sbjct: 422 DEHGHQSVQ 430
>gi|290974456|ref|XP_002669961.1| predicted protein [Naegleria gruberi]
gi|284083515|gb|EFC37217.1| predicted protein [Naegleria gruberi]
Length = 1106
Score = 36.2 bits (82), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 4/88 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYS-DGEPGSARFD 104
P S+ PG L+ D S + + E +V++R+AG + V GY+ DGE +
Sbjct: 893 PQSLAVSPGGDALVFSDLSTIGIFHYNLN-DENAVIRRIAGFENVIGYNGDGEATTTLIS 951
Query: 105 KPKSFAVDMKG-NIYVADKSNHVIRKIT 131
S A G ++Y D SN++IRK++
Sbjct: 952 HVPSLAFSNDGLSVYFVDSSNNLIRKLS 979
>gi|390960114|ref|YP_006423871.1| gluconolactonase [Terriglobus roseus DSM 18391]
gi|390415032|gb|AFL90536.1| gluconolactonase [Terriglobus roseus DSM 18391]
Length = 1836
Score = 36.2 bits (82), Expect = 7.5, Method: Composition-based stats.
Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 1/76 (1%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
G+ +L V DS + ++ + + + Y+ G+ ++ P A+D
Sbjct: 190 GAGELFVSDSGNNRVRKVTPAPATAPAKSDITKGTIVTYAGGDVATS-LSSPAGLAIDGA 248
Query: 115 GNIYVADKSNHVIRKI 130
GN+YVAD N+VIR+I
Sbjct: 249 GNLYVADSGNNVIRRI 264
>gi|269796827|ref|YP_003316282.1| thiol-disulfide isomerase-like thioredoxin [Sanguibacter keddieii
DSM 10542]
gi|269099012|gb|ACZ23448.1| thiol-disulfide isomerase-like thioredoxin [Sanguibacter keddieii
DSM 10542]
Length = 634
Score = 35.8 bits (81), Expect = 7.7, Method: Composition-based stats.
Identities = 17/63 (26%), Positives = 32/63 (50%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
VV+++ G +G DG A F +P +V G++++AD +R++ D G+ I
Sbjct: 345 VVEQVGGTQNEGLLDGPLAQAWFAQPSGLSVGPDGSVWLADAETSALRRVDVADDGSATI 404
Query: 141 RQI 143
+
Sbjct: 405 TSL 407
>gi|428164545|gb|EKX33567.1| hypothetical protein GUITHDRAFT_120259 [Guillardia theta CCMP2712]
Length = 396
Score = 35.8 bits (81), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR 128
GY DG+ SARF++P+ F + G I +AD +N+ +R
Sbjct: 100 GYVDGDGLSARFNQPRDFVMSTDGTKILIADTNNYCLR 137
>gi|432922359|ref|XP_004080313.1| PREDICTED: NHL repeat-containing protein 2-like [Oryzias latipes]
Length = 720
Score = 35.8 bits (81), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 3/45 (6%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
G G DG A F+ P+ A KG+ +YVAD NH+IRKI
Sbjct: 257 GGPASGRKDGSLSEASFNSPQGVA--FKGDAVYVADTENHLIRKI 299
>gi|219852010|ref|YP_002466442.1| periplasmic copper-binding protein [Methanosphaerula palustris
E1-9c]
gi|219546269|gb|ACL16719.1| periplasmic copper-binding [Methanosphaerula palustris E1-9c]
Length = 709
Score = 35.8 bits (81), Expect = 8.1, Method: Composition-based stats.
Identities = 20/75 (26%), Positives = 40/75 (53%), Gaps = 6/75 (8%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFDKPKSFAVDMKGNI 117
+ +D + + F T ++ S +++ + G + + G G +F P + A+D N+
Sbjct: 232 VTVDDAGNVFVTDTY----NSRIQKFSSVGTFITKWGTGGTGEGQFGLPSATAIDTSENV 287
Query: 118 YVADKSNHVIRKITN 132
YVAD +N+ I+K T+
Sbjct: 288 YVADTANNRIQKFTS 302
>gi|290996648|ref|XP_002680894.1| predicted protein [Naegleria gruberi]
gi|284094516|gb|EFC48150.1| predicted protein [Naegleria gruberi]
Length = 966
Score = 35.8 bits (81), Expect = 8.2, Method: Composition-based stats.
Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 4/50 (8%)
Query: 85 LAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
AG+G G+S G+ G +A+ + P+ AV G +Y+AD N +RK++
Sbjct: 436 FAGNGTSGFS-GDGGLAINAQLNSPRCVAVSGSGEVYIADSMNSRVRKVS 484
>gi|427739164|ref|YP_007058708.1| thiol-disulfide isomerase-like thioredoxin [Rivularia sp. PCC 7116]
gi|427374205|gb|AFY58161.1| thiol-disulfide isomerase-like thioredoxin [Rivularia sp. PCC 7116]
Length = 487
Score = 35.8 bits (81), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 24/50 (48%), Gaps = 1/50 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
V L G G G DG F P+ A D NI YVAD N+ IRK+
Sbjct: 217 VLHLIGSGKAGLKDGSFEETEFFAPQGMAFDEVDNILYVADTENNAIRKV 266
>gi|428172619|gb|EKX41527.1| hypothetical protein GUITHDRAFT_153825 [Guillardia theta CCMP2712]
Length = 285
Score = 35.8 bits (81), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 35/74 (47%), Gaps = 2/74 (2%)
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMK 114
++ + D+ A ++ V +AG G G+ DGE SARF+ P AV +
Sbjct: 133 ENIFIADTYNHAIRHITRDEDRNWTVTLIAGSRSGQSGFEDGEGESARFNCPTGLAVVNE 192
Query: 115 GNIYVADKSNHVIR 128
I ++D SN IR
Sbjct: 193 QEILISDFSNSAIR 206
>gi|224003203|ref|XP_002291273.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973049|gb|EED91380.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 264
Score = 35.8 bits (81), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 61/129 (47%), Gaps = 21/129 (16%)
Query: 41 HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEE----SVVKRLAGDGVQGYSDG 96
H+ H ++RP +SD I T +F +SE+ S R+A D V+ +
Sbjct: 38 HRQPAQVHHALERPDASDAI----------TDAFQMSEKYGTTSKEARIAWDVVEEIYET 87
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKS---NHVIRK-ITNLDHGNQLIRQI-NLKPEDCS 151
P + FD+ S AVD + Y D+ NH++ + +NL +L+ QI L+ ED S
Sbjct: 88 SPPLSSFDEQTSTAVDAPQDYY--DRIHFLNHLLMESQSNLGQVKELVAQIKELELEDPS 145
Query: 152 KSSQSGSGM 160
+ + GM
Sbjct: 146 LARLADDGM 154
>gi|386811933|ref|ZP_10099158.1| conserved hypothetical protein [planctomycete KSU-1]
gi|386404203|dbj|GAB62039.1| conserved hypothetical protein [planctomycete KSU-1]
Length = 669
Score = 35.8 bits (81), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 40/84 (47%), Gaps = 5/84 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ S+ L + DS+ + + + V + G+G G DG A F+ P
Sbjct: 213 PGKVLADEISNRLFIADSNHNRIVIATL----DGKVLDVVGNGKIGKDDGTFTEASFNHP 268
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
+ A + +YVAD NH+IRK+
Sbjct: 269 QGMAFH-RDYLYVADTENHLIRKL 291
>gi|391334887|ref|XP_003741830.1| PREDICTED: tripartite motif-containing protein 71-like [Metaseiulus
occidentalis]
Length = 800
Score = 35.8 bits (81), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
G + +PG +FD+P A+D G I VADK NH I+
Sbjct: 576 GSAGSQPG--QFDRPAGVAMDSTGRIIVADKDNHRIQ 610
>gi|290996232|ref|XP_002680686.1| predicted protein [Naegleria gruberi]
gi|284094308|gb|EFC47942.1| predicted protein [Naegleria gruberi]
Length = 655
Score = 35.8 bits (81), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 39/75 (52%), Gaps = 4/75 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S D+ DSS + ++ + ++V +AG G G+S+G S++ +KP V G
Sbjct: 113 SGDIYFADSSNNLIRVMN---AATNIVSTIAGTGSFGFSEGNATSSKLNKPTE-PVYWNG 168
Query: 116 NIYVADKSNHVIRKI 130
+ AD N+ IRK+
Sbjct: 169 QLLFADSLNYRIRKV 183
>gi|194365180|ref|YP_002027790.1| NHL repeat containing protein [Stenotrophomonas maltophilia R551-3]
gi|194347984|gb|ACF51107.1| NHL repeat containing protein [Stenotrophomonas maltophilia R551-3]
Length = 470
Score = 35.8 bits (81), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 17/52 (32%), Positives = 29/52 (55%), Gaps = 1/52 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ R G G + DG A F +P++ ++ + ++YVAD NH +R+I L
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDSLYVADTGNHAVRRINLL 247
>gi|290979581|ref|XP_002672512.1| predicted protein [Naegleria gruberi]
gi|284086089|gb|EFC39768.1| predicted protein [Naegleria gruberi]
Length = 1118
Score = 35.8 bits (81), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+K + G G GYS D P S++ + KSF V G+I + D N++IRK
Sbjct: 584 LKSIVGTGTDGYSGDNLPAVSSKLSQIKSFYVKENGDIIILDSGNNIIRK 633
>gi|156399563|ref|XP_001638571.1| predicted protein [Nematostella vectensis]
gi|156225692|gb|EDO46508.1| predicted protein [Nematostella vectensis]
Length = 1122
Score = 35.8 bits (81), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 15/46 (32%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+AG G QG+++G A+F+KP F V+ + +++ D+ N +R +
Sbjct: 427 IAGSGEQGHANGGGRKAKFNKPSGF-VEYESKLFICDQGNSRLRVV 471
>gi|428162955|gb|EKX32055.1| hypothetical protein GUITHDRAFT_58597, partial [Guillardia theta
CCMP2712]
Length = 186
Score = 35.8 bits (81), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 27/48 (56%), Gaps = 2/48 (4%)
Query: 85 LAGDGVQGYSDGEPG-SARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
LAGDG G D G +A+F P AVD G ++ VAD +H IR I
Sbjct: 83 LAGDGTNGLVDNVNGAAAKFYNPTGVAVDFAGQSLLVADSRDHTIRII 130
>gi|73671050|ref|YP_307065.1| surface layer protein [Methanosarcina barkeri str. Fusaro]
gi|72398212|gb|AAZ72485.1| putative surface layer protein [Methanosarcina barkeri str. Fusaro]
Length = 752
Score = 35.8 bits (81), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
G+ +F + +D GNIYVAD N+ I+K+ N +GN L++
Sbjct: 162 GNVQFKNARDICLDSSGNIYVADTGNNRIQKLDN--NGNYLVQ 202
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.136 0.400
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,968,844,411
Number of Sequences: 23463169
Number of extensions: 115418245
Number of successful extensions: 312288
Number of sequences better than 100.0: 936
Number of HSP's better than 100.0 without gapping: 530
Number of HSP's successfully gapped in prelim test: 406
Number of HSP's that attempted gapping in prelim test: 308829
Number of HSP's gapped (non-prelim): 3547
length of query: 191
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 57
effective length of database: 9,215,130,721
effective search space: 525262451097
effective search space used: 525262451097
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)