BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 024317
(269 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224079468|ref|XP_002305875.1| predicted protein [Populus trichocarpa]
gi|222848839|gb|EEE86386.1| predicted protein [Populus trichocarpa]
Length = 241
Score = 333 bits (855), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 164/245 (66%), Positives = 194/245 (79%), Gaps = 6/245 (2%)
Query: 27 LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV-VKRL 85
+LEDGY VTTV+DGH+L INPH+V R SS+++VLDSSRS FYTL FP+S+ SV VKRL
Sbjct: 1 MLEDGYMVTTVLDGHKLNINPHAVQLR--SSEIVVLDSSRSVFYTLPFPISQASVMVKRL 58
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
+G+G GY DGEPG ARF+KPKSFAVD++GN+YVAD+ NH +RKI+N GVT+ G +
Sbjct: 59 SGEGKTGYIDGEPGLARFNKPKSFAVDLRGNVYVADQQNHAVRKISNSGVTSTIVGNYSQ 118
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
GR DGP + A+FS+DFE+ FVP ICALLISDHGNQL+RQI+LKPEDC SQ SALGA
Sbjct: 119 TGRQDGPGKTATFSSDFEVLFVPQICALLISDHGNQLLRQIDLKPEDCVIGSQ--SALGA 176
Query: 206 VSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPP-LQHDMEALPNQSGETSTDLLLRH 264
V WVL LSCL+ +VIG RPY+I HTGRL P Q DMEALPNQSGE+STD LLR
Sbjct: 177 VKFWVLGLALSCLLGIVIGIATRPYVIPHTGRLQTPSFQQDMEALPNQSGESSTDELLRR 236
Query: 265 QKRNC 269
QKRNC
Sbjct: 237 QKRNC 241
>gi|224065156|ref|XP_002301692.1| predicted protein [Populus trichocarpa]
gi|222843418|gb|EEE80965.1| predicted protein [Populus trichocarpa]
Length = 264
Score = 320 bits (821), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 157/246 (63%), Positives = 190/246 (77%), Gaps = 6/246 (2%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV-VKR 84
++LEDGY VTTV+DGH+L +NPH+V R SSDL+VLDSS+S FYTL FP+S++ V VKR
Sbjct: 23 IMLEDGYMVTTVMDGHKLNVNPHAVQLR--SSDLVVLDSSKSVFYTLPFPISQDGVMVKR 80
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
L+G G +GY DGEPG ARF+KPKSF VD++GN+YVAD+ NH +RKI++ G+TT G
Sbjct: 81 LSGSGDKGYIDGEPGLARFNKPKSFTVDLRGNVYVADQLNHAVRKISSSGMTTTIAGNYS 140
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
+ GR DGP + A+FS DFE+ FVP ICALLISDHGNQL+RQ++LK EDC SQ SALG
Sbjct: 141 QIGRQDGPGETATFSTDFEVLFVPQICALLISDHGNQLLRQVDLKQEDCIIGSQ--SALG 198
Query: 205 AVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPP-LQHDMEALPNQSGETSTDLLLR 263
AV WVL VLSCL L GF RPY+I HTGR P Q DMEALPNQSGE++T+ LLR
Sbjct: 199 AVKFWVLGLVLSCLFGLATGFAIRPYVIPHTGRRQTPSFQQDMEALPNQSGESNTEELLR 258
Query: 264 HQKRNC 269
QKRNC
Sbjct: 259 RQKRNC 264
>gi|18395326|ref|NP_564209.1| NHL domain-containing protein [Arabidopsis thaliana]
gi|15724248|gb|AAL06517.1|AF412064_1 At1g23890/T23E23_13 [Arabidopsis thaliana]
gi|16974539|gb|AAL31179.1| At1g23890/T23E23_13 [Arabidopsis thaliana]
gi|110742589|dbj|BAE99208.1| hypothetical protein [Arabidopsis thaliana]
gi|332192326|gb|AEE30447.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 261
Score = 317 bits (813), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 159/260 (61%), Positives = 202/260 (77%), Gaps = 5/260 (1%)
Query: 7 YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
+SS+++ L ++F+LVS ++LE+GY VTTV+DGH+ +NP+++ PGSS+LIVLDSS
Sbjct: 5 FSSVSLFLF-VVFNLVSGKIVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63
Query: 67 SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
S FYT SFPLS +SV+ R AGDG G+ DG+ G++RF KP+ FAVD KGN+YVADKSN
Sbjct: 64 STFYTTSFPLSVDSVINRFAGDGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKA 123
Query: 127 IRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
IRKI++ G VTTIAGG SK G DGPAQNA+FS+DFE+TFVP C LL+SDHGN++IRQ
Sbjct: 124 IRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQ 183
Query: 186 INLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPL-QH 244
INLK EDC ++S S LG S+W + VLSC++ + IGF RPY+IRHTGR +PPL H
Sbjct: 184 INLKEEDCLENSHSN--LGTYSLWSIGIVLSCILGVAIGFAVRPYVIRHTGRSEPPLVHH 241
Query: 245 DMEALPNQSGETSTDLLLRH 264
DMEA PNQ G TS+DLLL H
Sbjct: 242 DMEASPNQIGGTSSDLLLLH 261
>gi|147798774|emb|CAN69795.1| hypothetical protein VITISV_022545 [Vitis vinifera]
Length = 459
Score = 316 bits (809), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 163/275 (59%), Positives = 211/275 (76%), Gaps = 12/275 (4%)
Query: 1 MASHFVYSSLAITLLSLLFSLV----SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
MA++ SL++ ++ +F++ S+ L+LEDGYTV TV DG++LEINPHS++ R GS
Sbjct: 1 MATYL--QSLSLFIVFTIFTIAAIHGSADLVLEDGYTVRTVFDGNKLEINPHSILPRYGS 58
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
SD I+LDSS+S FYT+ PLS+ES + RL+G G+SDG+ SA F KP+SFAVD+KGN
Sbjct: 59 SDFIILDSSKSVFYTVXSPLSQESEINRLSGSSA-GFSDGDSASAMFSKPRSFAVDLKGN 117
Query: 117 IYVADKSNHVIRKITNLGVTT-IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
+YVAD+SN VIRKITN GVTT IAGG ++K G+ DGPAQNASFS DFEL FVP CA+L+
Sbjct: 118 VYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPEKCAVLV 177
Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSV-LSCLVSLVIGFVARPYIIRH 234
SD G+QL+RQI+LK EDC +S Q S LG +WVL+ + +SCLV ++G ++RPY+I H
Sbjct: 178 SDRGSQLVRQIDLKVEDCRRSPQ--SVLGGAFLWVLLGLGVSCLVGFIVGIISRPYVIPH 235
Query: 235 TGRLDPPL-QHDMEALPNQSGETSTDLLLRHQKRN 268
TG L P L Q D+EALPN SGETSTD LLRHQK++
Sbjct: 236 TGSLLPTLFQRDVEALPNPSGETSTDALLRHQKQS 270
>gi|255560013|ref|XP_002521025.1| conserved hypothetical protein [Ricinus communis]
gi|223539862|gb|EEF41442.1| conserved hypothetical protein [Ricinus communis]
Length = 408
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 154/225 (68%), Positives = 184/225 (81%), Gaps = 4/225 (1%)
Query: 14 LLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS 73
LL L + V GL+LEDGYTVTT+IDGH+LEINPH+V+ RP SSDLI+LDSS S YT+S
Sbjct: 18 LLHLPSNYVLGGLILEDGYTVTTIIDGHKLEINPHAVLSRPQSSDLILLDSSHSTIYTIS 77
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
FP+S+ESVVKRL+GDGV G SDGEPGSARF+KP+SFAVD KGNIYVAD+ N IRKITN
Sbjct: 78 FPISQESVVKRLSGDGVAGLSDGEPGSARFNKPRSFAVDNKGNIYVADRLNGTIRKITNS 137
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
GV+TIAGG SK GR DGPAQNA+FS+DFE+ FV CALLISDHGNQL+R++ LKP+DC
Sbjct: 138 GVSTIAGGYSKGFGREDGPAQNATFSSDFEVAFVAEECALLISDHGNQLVRRLPLKPDDC 197
Query: 194 SKSSQSGSALGAVSVWV--LVSVLSCLVSLVIGFVARPYIIRHTG 236
+ +S SALGAVS WV L V+SCL+ + IGFV RP+I+ + G
Sbjct: 198 ATASH--SALGAVSFWVLGLGLVMSCLIGIAIGFVIRPHIVPYEG 240
>gi|297850862|ref|XP_002893312.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297339154|gb|EFH69571.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 398
Score = 281 bits (720), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 138/229 (60%), Positives = 179/229 (78%), Gaps = 4/229 (1%)
Query: 7 YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
+SS+++ L ++F+LVS ++LEDGY VTTV+DGH+ +NP+++ PGSS+LIVLDSS
Sbjct: 5 FSSVSLFLF-VVFNLVSGKIVLEDGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63
Query: 67 SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
S FYT SFPLS +SV+ R AGDG GY DG+ G++RF KP+ FA+D KGN+YVAD+SN
Sbjct: 64 STFYTTSFPLSFDSVIHRFAGDGTSGYVDGKAGNSRFKKPRGFAIDAKGNVYVADRSNKA 123
Query: 127 IRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
IRKI++ G VTTIAGG SK+ G DGPAQNA+FS+DFE+TFVP C LL+SDHGN+++RQ
Sbjct: 124 IRKISSSGYVTTIAGGISKEFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMVRQ 183
Query: 186 INLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRH 234
INLK EDC +SS S LGA S+W + LSC++ + IGF RPYIIRH
Sbjct: 184 INLKEEDCLESSHSN--LGAYSLWSIGIFLSCILGVAIGFAVRPYIIRH 230
>gi|388505270|gb|AFK40701.1| unknown [Medicago truncatula]
Length = 384
Score = 278 bits (711), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 133/206 (64%), Positives = 162/206 (78%), Gaps = 2/206 (0%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD 88
E+GYT+TT++DGH+L INP S++ RP SSDLIVLDS+ S FYT+ P+S+ESV KR +G+
Sbjct: 31 EEGYTITTILDGHKLHINPFSILQRPISSDLIVLDSTNSTFYTVQLPISQESVFKRFSGN 90
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
G GY DG+ G ARFDKP+SFAVD +GN+YVAD+ N VIRKI+ GVTTIAGG S+K
Sbjct: 91 GSPGYEDGDVGLARFDKPRSFAVDFRGNVYVADRVNKVIRKISTNGVTTIAGGSSEKSSI 150
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSV 208
DGP QNASFSNDFELTF+P +CALL+SDH +QL+ QINLK EDC+ S+ SALGAV
Sbjct: 151 KDGPVQNASFSNDFELTFIPALCALLVSDHMHQLVHQINLKGEDCTLGSK--SALGAVMT 208
Query: 209 WVLVSVLSCLVSLVIGFVARPYIIRH 234
W L LSC++ LVIG V RPYII H
Sbjct: 209 WTLGLGLSCILGLVIGIVIRPYIIPH 234
>gi|42571623|ref|NP_973902.1| NHL domain-containing protein [Arabidopsis thaliana]
gi|332192325|gb|AEE30446.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 400
Score = 277 bits (709), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 137/229 (59%), Positives = 178/229 (77%), Gaps = 4/229 (1%)
Query: 7 YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
+SS+++ L ++F+LVS ++LE+GY VTTV+DGH+ +NP+++ PGSS+LIVLDSS
Sbjct: 5 FSSVSLFLF-VVFNLVSGKIVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63
Query: 67 SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
S FYT SFPLS +SV+ R AGDG G+ DG+ G++RF KP+ FAVD KGN+YVADKSN
Sbjct: 64 STFYTTSFPLSVDSVINRFAGDGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKA 123
Query: 127 IRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
IRKI++ G VTTIAGG SK G DGPAQNA+FS+DFE+TFVP C LL+SDHGN++IRQ
Sbjct: 124 IRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQ 183
Query: 186 INLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRH 234
INLK EDC ++S S LG S+W + VLSC++ + IGF RPY+IRH
Sbjct: 184 INLKEEDCLENSHSN--LGTYSLWSIGIVLSCILGVAIGFAVRPYVIRH 230
>gi|225424715|ref|XP_002265272.1| PREDICTED: uncharacterized protein LOC100243227 [Vitis vinifera]
Length = 438
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 139/238 (58%), Positives = 184/238 (77%), Gaps = 11/238 (4%)
Query: 1 MASHFVYSSLAITLLSLLFSLV----SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
MA++ SL++ ++ +F++ S+ L+LEDGYTV TV DG++LEINPHS++ R GS
Sbjct: 1 MATYL--QSLSLFIVFTIFTIAAIHGSADLVLEDGYTVRTVFDGNKLEINPHSILPRYGS 58
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
SD I+LDSS+S FYT+S PLS+ES +KRL+G G+SDG+ SA F KP+SFAVD+KGN
Sbjct: 59 SDFIILDSSKSVFYTVSSPLSQESEIKRLSGSSA-GFSDGDSASATFSKPRSFAVDLKGN 117
Query: 117 IYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
+YVAD+SN VIRKITN GV TTIAGG ++K G+ DGPAQNASFS DFEL FVP CA+L+
Sbjct: 118 VYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPEKCAVLV 177
Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSV-LSCLVSLVIGFVARPYII 232
SD G+QL+RQI+LK EDC +S Q S LG +WVL+ + +SCLV ++G ++RPY+I
Sbjct: 178 SDRGSQLVRQIDLKVEDCRRSPQ--SVLGGAFLWVLLGLGVSCLVGFIVGIISRPYVI 233
>gi|296086532|emb|CBI32121.3| unnamed protein product [Vitis vinifera]
Length = 369
Score = 274 bits (700), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 139/238 (58%), Positives = 184/238 (77%), Gaps = 11/238 (4%)
Query: 1 MASHFVYSSLAITLLSLLFSLV----SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
MA++ SL++ ++ +F++ S+ L+LEDGYTV TV DG++LEINPHS++ R GS
Sbjct: 1 MATYL--QSLSLFIVFTIFTIAAIHGSADLVLEDGYTVRTVFDGNKLEINPHSILPRYGS 58
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
SD I+LDSS+S FYT+S PLS+ES +KRL+G G+SDG+ SA F KP+SFAVD+KGN
Sbjct: 59 SDFIILDSSKSVFYTVSSPLSQESEIKRLSGSSA-GFSDGDSASATFSKPRSFAVDLKGN 117
Query: 117 IYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
+YVAD+SN VIRKITN GV TTIAGG ++K G+ DGPAQNASFS DFEL FVP CA+L+
Sbjct: 118 VYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPEKCAVLV 177
Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSV-LSCLVSLVIGFVARPYII 232
SD G+QL+RQI+LK EDC +S Q S LG +WVL+ + +SCLV ++G ++RPY+I
Sbjct: 178 SDRGSQLVRQIDLKVEDCRRSPQ--SVLGGAFLWVLLGLGVSCLVGFIVGIISRPYVI 233
>gi|356526828|ref|XP_003532018.1| PREDICTED: uncharacterized protein LOC100816542 [Glycine max]
Length = 400
Score = 253 bits (647), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 134/212 (63%), Positives = 161/212 (75%), Gaps = 3/212 (1%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
++ E+GYTVTTV DGH+ I P +V+ RP SSDLI+LDS S FYT FP++EESV RL
Sbjct: 24 VITEEGYTVTTVFDGHKPHIFPFTVLQRPFSSDLILLDSVNSTFYTAQFPITEESVFTRL 83
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG-SK 144
+GDG GYSDG+ GSARF KP+SFA DM+GN+YVADKSN IRKI+ GVTTIAGG S+
Sbjct: 84 SGDGSVGYSDGDVGSARFAKPRSFAFDMRGNVYVADKSNRAIRKISAKGVTTIAGGEFSE 143
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
K DGPA NASFSNDF+LTF+P +CALL+SDH ++L+RQINL EDC+ S+ G LG
Sbjct: 144 KSRTKDGPALNASFSNDFDLTFIPGLCALLVSDHMHRLVRQINLMEEDCTLGSKPG--LG 201
Query: 205 AVSVWVLVSVLSCLVSLVIGFVARPYIIRHTG 236
AV W L LSCL+ LVIG V RPYII + G
Sbjct: 202 AVMTWTLGLGLSCLLGLVIGIVVRPYIIPNKG 233
>gi|9369410|gb|AAF87158.1|AC002423_23 T23E23.6 [Arabidopsis thaliana]
Length = 370
Score = 243 bits (621), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 123/212 (58%), Positives = 161/212 (75%), Gaps = 7/212 (3%)
Query: 7 YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
+SS+++ L ++F+LVS ++LE+GY VTTV+DGH+ +NP+++ PGSS+LIVLDSS
Sbjct: 5 FSSVSLFLF-VVFNLVSGKIVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63
Query: 67 SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
S FYT SFPLS +SV+ R AGDG G+ DG+ G++RF KP+ FAVD KGN+YVADKSN
Sbjct: 64 STFYTTSFPLSVDSVINRFAGDGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKA 123
Query: 127 IRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
IRKI++ G VTTIAGG SK G DGPAQNA+FS+DFE+TFVP C LL+SDHGN++IRQ
Sbjct: 124 IRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQ 183
Query: 186 INLKPEDCSKSSQSG-----SALGAVSVWVLV 212
INLK EDC ++S S + L + W L+
Sbjct: 184 INLKEEDCLENSHSSIQEEVNHLSFIMTWKLL 215
>gi|148909539|gb|ABR17864.1| unknown [Picea sitchensis]
Length = 264
Score = 224 bits (571), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 126/267 (47%), Positives = 176/267 (65%), Gaps = 21/267 (7%)
Query: 14 LLSLLFSLVS---SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFY 70
LL +LFSL++ L+ E+GY V TV+ G+++++ PHS++ P + D+++LDS S +
Sbjct: 8 LLIILFSLITFTEPSLIPEEGYRVETVLYGNKMDVYPHSIL--PFNGDILLLDSVNSTLF 65
Query: 71 TLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+ PLS++S +K AG + V G++DG A+F+ PKSF +D KGNIYVAD++NH IR
Sbjct: 66 RIGLPLSQDSTIKVFAGSRNTVPGFADGGFLDAQFNHPKSFTLDSKGNIYVADRANHAIR 125
Query: 129 KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
KI+ GV+TIAGG + K G ADGP+Q A FSND++LTF+P ICALL+SD GN++IRQI L
Sbjct: 126 KISKSGVSTIAGGTAGKTGHADGPSQEAMFSNDYDLTFIPSICALLVSDRGNRMIRQIKL 185
Query: 189 KPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSL-----VIGFVARPYIIRHTGRL-DPPL 242
P DC + S SG+ WV V++ VS ++ F+ +PYI +TG L
Sbjct: 186 PPGDCVQHSGSGTH------WV-TPVIASGVSFLLGLLLMAFL-QPYIAAYTGGLWSSSS 237
Query: 243 QHDMEALPNQSGETSTDLLLRHQKRNC 269
ME +PN GETS D LL H KRNC
Sbjct: 238 NASMERIPNNHGETSADELLCHVKRNC 264
>gi|115476416|ref|NP_001061804.1| Os08g0414600 [Oryza sativa Japonica Group]
gi|37806208|dbj|BAC99711.1| unknown protein [Oryza sativa Japonica Group]
gi|113623773|dbj|BAF23718.1| Os08g0414600 [Oryza sativa Japonica Group]
gi|215715379|dbj|BAG95130.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215715380|dbj|BAG95131.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 275
Score = 223 bits (569), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 126/253 (49%), Positives = 166/253 (65%), Gaps = 11/253 (4%)
Query: 25 GLLLEDGYTVTTVID----GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
GL+L+DGYTVTT D G +P++++ RP + DL++LDS+ SA YTL+ PLS +
Sbjct: 26 GLVLDDGYTVTTAADLNHPGAPAAPHPYALLPRPRAGDLVLLDSAASALYTLALPLSGGA 85
Query: 81 VVKRLAGDGVQGYSDG--EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ LAG G EP A FD+P+S AVD N+YVAD+ N +RKI G TT
Sbjct: 86 AARTLAGGGGGPAGFADGEPRDAAFDRPRSLAVDHADNVYVADRMNGAVRKIAPSGFTTT 145
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS 197
IAGG SK GR DGPAQNA+FS+DFEL +VP +CALL++D GN+LIRQINLK EDC++ +
Sbjct: 146 IAGGRSKGPGRKDGPAQNATFSSDFELVYVPKMCALLVTDRGNRLIRQINLKQEDCARET 205
Query: 198 QSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPL-QHDMEALPNQSGET 256
Q G LG SV + ++VL L+ VIGF R + H G L PL Q D +A+ + +
Sbjct: 206 QPG--LGTTSVSI-IAVLCALLGSVIGFSIRHFYPAHAGSLHQPLFQEDADAVQDNPEDG 262
Query: 257 STDLLLRHQKRNC 269
+D LLRHQKR+C
Sbjct: 263 RSDQLLRHQKRSC 275
>gi|226495023|ref|NP_001144050.1| uncharacterized protein LOC100276874 precursor [Zea mays]
gi|195636116|gb|ACG37526.1| hypothetical protein [Zea mays]
gi|414870559|tpg|DAA49116.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 272
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 161/248 (64%), Gaps = 8/248 (3%)
Query: 27 LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+LEDGYTVTT D + L +P++V+ R + DL++LDS+ SA YT+S S +
Sbjct: 28 VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
G G++DG PG A FD+P+S AVD N+YVAD+ + +RK+ G TTIAGG
Sbjct: 88 LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S G DGPAQNA+FS DF L +VP ICALL++D GN+++RQINLKPEDC+ QSG
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSG-- 205
Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPL-QHDMEALPNQSGETSTDLL 261
LG V ++VL L+ +IGF+AR + + G L PL Q D EA+P + E ++D L
Sbjct: 206 LGNTLVS-FIAVLCTLLGSLIGFLARHFYPVNAGSLRQPLFQQDTEAIPEDTEEGNSDQL 264
Query: 262 LRHQKRNC 269
LRH+KR+C
Sbjct: 265 LRHKKRSC 272
>gi|222640551|gb|EEE68683.1| hypothetical protein OsJ_27311 [Oryza sativa Japonica Group]
Length = 420
Score = 195 bits (496), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 110/217 (50%), Positives = 144/217 (66%), Gaps = 10/217 (4%)
Query: 25 GLLLEDGYTVTTVID----GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
GL+L+DGYTVTT D G +P++++ RP + DL++LDS+ SA YTL+ PLS +
Sbjct: 26 GLVLDDGYTVTTAADLNHPGAPAAPHPYALLPRPRAGDLVLLDSAASALYTLALPLSGGA 85
Query: 81 VVKRLAGDGVQGYSDG--EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ LAG G EP A FD+P+S AVD N+YVAD+ N +RKI G TT
Sbjct: 86 AARTLAGGGGGPAGFADGEPRDAAFDRPRSLAVDHADNVYVADRMNGAVRKIAPSGFTTT 145
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS 197
IAGG SK GR DGPAQNA+FS+DFEL +VP +CALL++D GN+LIRQINLK EDC++ +
Sbjct: 146 IAGGRSKGPGRKDGPAQNATFSSDFELVYVPKMCALLVTDRGNRLIRQINLKQEDCARET 205
Query: 198 QSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRH 234
Q G LG SV + ++VL L+ VIGF R + H
Sbjct: 206 QPG--LGTTSVSI-IAVLCALLGSVIGFSIRHFYPAH 239
>gi|414870561|tpg|DAA49118.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 280
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 100/206 (48%), Positives = 134/206 (65%), Gaps = 7/206 (3%)
Query: 27 LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+LEDGYTVTT D + L +P++V+ R + DL++LDS+ SA YT+S S +
Sbjct: 28 VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
G G++DG PG A FD+P+S AVD N+YVAD+ + +RK+ G TTIAGG
Sbjct: 88 LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S G DGPAQNA+FS DF L +VP ICALL++D GN+++RQINLKPEDC+ QSG
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGLG 207
Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVAR 228
VS ++VL L+ +IGF+AR
Sbjct: 208 NTLVS---FIAVLCTLLGSLIGFLAR 230
>gi|224028415|gb|ACN33283.1| unknown [Zea mays]
gi|414870560|tpg|DAA49117.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 418
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 100/206 (48%), Positives = 134/206 (65%), Gaps = 7/206 (3%)
Query: 27 LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+LEDGYTVTT D + L +P++V+ R + DL++LDS+ SA YT+S S +
Sbjct: 28 VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
G G++DG PG A FD+P+S AVD N+YVAD+ + +RK+ G TTIAGG
Sbjct: 88 LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S G DGPAQNA+FS DF L +VP ICALL++D GN+++RQINLKPEDC+ QSG
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGLG 207
Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVAR 228
VS ++VL L+ +IGF+AR
Sbjct: 208 NTLVS---FIAVLCTLLGSLIGFLAR 230
>gi|242079119|ref|XP_002444328.1| hypothetical protein SORBIDRAFT_07g020240 [Sorghum bicolor]
gi|241940678|gb|EES13823.1| hypothetical protein SORBIDRAFT_07g020240 [Sorghum bicolor]
Length = 416
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 102/209 (48%), Positives = 141/209 (67%), Gaps = 12/209 (5%)
Query: 27 LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+LEDGYTVTT + + L +P++++ RP + DL++LDS+ SA YTLS S +
Sbjct: 29 VLEDGYTVTTFANFNPLPASGPHPYAILPRPRAGDLLLLDSAGSALYTLSLS-SSPGEPR 87
Query: 84 RLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
RLAG + G+ DG+ A FD+P+S AVD N+YVAD+ + +RK+ G TT+AGG
Sbjct: 88 RLAGGKRRSGFDDGD---AAFDRPRSVAVDAADNVYVADQRHGAVRKVAPSGYTTTVAGG 144
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
S G DG AQNA+FS DFEL +VP ICALL++D GN+++RQINLKPEDC+ QSG
Sbjct: 145 LSSGPGHRDGLAQNATFSADFELVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSG- 203
Query: 202 ALGAVSVWVLVSVLSCLVSLVIGFVARPY 230
LG SV V +++L L+ L+IGF+ R +
Sbjct: 204 -LGTTSVSV-IAILCALLGLIIGFLVRHF 230
>gi|414870562|tpg|DAA49119.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 207
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 90/178 (50%), Positives = 119/178 (66%), Gaps = 4/178 (2%)
Query: 27 LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+LEDGYTVTT D + L +P++V+ R + DL++LDS+ SA YT+S S +
Sbjct: 28 VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
G G++DG PG A FD+P+S AVD N+YVAD+ + +RK+ G TTIAGG
Sbjct: 88 LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
S G DGPAQNA+FS DF L +VP ICALL++D GN+++RQINLKPEDC+ QSG
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSG 205
>gi|326503362|dbj|BAJ99306.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 452
Score = 170 bits (430), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 97/212 (45%), Positives = 135/212 (63%), Gaps = 13/212 (6%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+GL+LEDGYTV+TV D + +P++++ RP + DL++LDS+ S YTL P+S ++ +
Sbjct: 54 AGLVLEDGYTVSTVSDLNPSGTHPYALLPRPRAGDLVLLDSAGSTLYTLPLPVSADAGPR 113
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
RLAG + P+S AVD N+YVAD++N IRK+ G TTIAG
Sbjct: 114 RLAGGAG---------ALGDGHPRSIAVDGADNVYVADRANGSIRKVAPSGYTTTIAGAY 164
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S G D PAQNA+FS DFEL +VP ICALL++D GN+LIRQI LKPEDC+ +Q G
Sbjct: 165 SAGTGHRDEPAQNATFSADFELIYVPQICALLVADRGNRLIRQIKLKPEDCAHENQKG-- 222
Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVARPYIIRH 234
LG S + +++L+ L +IGF+ R + H
Sbjct: 223 LGTTSTSI-IAILAALFGSIIGFLVRHFYPFH 253
>gi|115448467|ref|NP_001048013.1| Os02g0730400 [Oryza sativa Japonica Group]
gi|46390465|dbj|BAD15926.1| putative NHL repeat-containing protein [Oryza sativa Japonica
Group]
gi|46390861|dbj|BAD16365.1| putative NHL repeat-containing protein [Oryza sativa Japonica
Group]
gi|113537544|dbj|BAF09927.1| Os02g0730400 [Oryza sativa Japonica Group]
gi|125541002|gb|EAY87397.1| hypothetical protein OsI_08804 [Oryza sativa Indica Group]
gi|125583566|gb|EAZ24497.1| hypothetical protein OsJ_08258 [Oryza sativa Japonica Group]
gi|215715228|dbj|BAG94979.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 508
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 99/235 (42%), Positives = 140/235 (59%), Gaps = 17/235 (7%)
Query: 8 SSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
S+ A +++ L+SL S S + E GYTV TV DG +L I P++V G +
Sbjct: 43 SNAASSVMKRLWSLKSTTKTGSGGKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQGG-E 101
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
L+V+DS S Y ++ PLS S K +AG +G G+ DG P AR + PK F VD +GN
Sbjct: 102 LLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGFPGHVDGRPREARMNHPKGFTVDGRGN 161
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
IYVAD N IRKI++ GVTTIAGG S + G DGP+ +A FS DFE+ ++ C+LL+
Sbjct: 162 IYVADAMNMAIRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIGSSCSLLVI 221
Query: 177 DHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSL----VIGFVA 227
D GNQ IR+I L +DC ++G LG V+V + + +++L V+G V+
Sbjct: 222 DRGNQAIREIQLNFDDCVYQYEAGFPLG-VAVLLAAAFFGYMLALLQRRVLGMVS 275
>gi|115467514|ref|NP_001057356.1| Os06g0269300 [Oryza sativa Japonica Group]
gi|53793193|dbj|BAD54399.1| NHL repeat-containing protein-like [Oryza sativa Japonica Group]
gi|113595396|dbj|BAF19270.1| Os06g0269300 [Oryza sativa Japonica Group]
gi|215768552|dbj|BAH00781.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222635365|gb|EEE65497.1| hypothetical protein OsJ_20925 [Oryza sativa Japonica Group]
Length = 517
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 98/237 (41%), Positives = 134/237 (56%), Gaps = 18/237 (7%)
Query: 8 SSLAITLLSLLFSLVSSG------------LLLEDGYTVTTVIDGHQLEINPHSVIDRPG 55
SS A ++ L+SL S+ + E GY V TV DG +L I PH V P
Sbjct: 45 SSTASAVVKQLWSLKSTATRTGSAAAGRSMVKYEGGYAVETVFDGSKLGIEPHDVEVTP- 103
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDM 113
S +L+VLDS S Y + PLS S K +AG +G+ G+ DG A+ + PK F VD
Sbjct: 104 SGELLVLDSMNSNVYRVQLPLSRYSRPKLVAGSPEGLSGHVDGRLREAKMNHPKGFTVDD 163
Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
+GNIYVAD N IRKI++ GVTTIAGG S + G DGP+ +A FS DFE+ ++ C+L
Sbjct: 164 RGNIYVADAMNMAIRKISDTGVTTIAGGKSMRGGHLDGPSDDAKFSTDFEIRYISSSCSL 223
Query: 174 LISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWV---LVSVLSCLVSLVIGFVA 227
L+ D GNQ IR+I L +DC ++G LG ++ +L+ L V+G V+
Sbjct: 224 LVIDRGNQAIREIALHNDDCEYQYEAGFPLGIALLFAAGFFGYMLALLQRQVLGMVS 280
>gi|218197942|gb|EEC80369.1| hypothetical protein OsI_22479 [Oryza sativa Indica Group]
Length = 517
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 98/237 (41%), Positives = 134/237 (56%), Gaps = 18/237 (7%)
Query: 8 SSLAITLLSLLFSLVSSG------------LLLEDGYTVTTVIDGHQLEINPHSVIDRPG 55
SS A ++ L+SL S+ + E GY V TV DG +L I PH V P
Sbjct: 45 SSTASAVVKQLWSLKSTATRTGSAAAGRSMVKYEGGYAVETVFDGSKLGIEPHDVEVTP- 103
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDM 113
S +L+VLDS S Y + PLS S K +AG +G+ G+ DG A+ + PK F VD
Sbjct: 104 SGELLVLDSMNSNVYRVQLPLSRYSRPKLVAGSPEGLSGHVDGRLREAKMNHPKGFTVDD 163
Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
+GNIYVAD N IRKI++ GVTTIAGG S + G DGP+ +A FS DFE+ ++ C+L
Sbjct: 164 RGNIYVADAMNMAIRKISDTGVTTIAGGKSMRGGHLDGPSDDAKFSTDFEIRYISSSCSL 223
Query: 174 LISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWV---LVSVLSCLVSLVIGFVA 227
L+ D GNQ IR+I L +DC ++G LG ++ +L+ L V+G V+
Sbjct: 224 LVIDRGNQAIREIALHNDDCEYQYEAGFPLGIALLFAAGFFGYMLALLQRQVLGMVS 280
>gi|242095458|ref|XP_002438219.1| hypothetical protein SORBIDRAFT_10g009760 [Sorghum bicolor]
gi|241916442|gb|EER89586.1| hypothetical protein SORBIDRAFT_10g009760 [Sorghum bicolor]
Length = 507
Score = 160 bits (405), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 88/178 (49%), Positives = 112/178 (62%), Gaps = 3/178 (1%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GY V TV DG +L I PH+V P + DL+VLDS S Y + PLS S K LAG
Sbjct: 76 EGGYAVDTVFDGSKLGIEPHAVEITP-AGDLLVLDSINSNIYRVQLPLSPYSRPKLLAGS 134
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
+G+ G+ DG AR + P F VD KGNIYVAD N IRKI++ GVTTIAGG S +
Sbjct: 135 PEGLSGHVDGRLREARMNHPNGFTVDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSIRG 194
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
G DGP+ +A FS DFE+ ++ C+LL+ D GNQ IR+I L +DC+ ++G LG
Sbjct: 195 GHIDGPSDDAKFSTDFEIQYISSSCSLLVIDRGNQAIREIPLNDDDCAYQYETGFPLG 252
>gi|326512916|dbj|BAK03365.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326529495|dbj|BAK04694.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326529987|dbj|BAK08273.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 503
Score = 160 bits (405), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 95/234 (40%), Positives = 136/234 (58%), Gaps = 15/234 (6%)
Query: 8 SSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
S+ A +L L+SL S+ + E GYTV TV DG +L I P++V G +
Sbjct: 40 SNAASAVLKRLWSLKSTAKTASGAKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQGG-E 98
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
L+V+DS S Y ++ PLS S K +AG +G+ G+ DG A+ + PK F VD +GN
Sbjct: 99 LLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGIPGHIDGRLREAKMNHPKGFTVDGRGN 158
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
IYVAD N +RKI++ GVTTIAGG S + G DGP+ +A FS DFE+ ++ C+LL+
Sbjct: 159 IYVADAMNMAVRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIASSCSLLVI 218
Query: 177 DHGNQLIRQINLKPEDCSKSSQSGSALGA---VSVWVLVSVLSCLVSLVIGFVA 227
D GNQ IR+I L +DC ++G LG ++ +L+ L V+G V+
Sbjct: 219 DRGNQAIREIQLNFDDCVYQYEAGFPLGVALLLAAGFFGYMLALLQRRVLGMVS 272
>gi|413923863|gb|AFW63795.1| hypothetical protein ZEAMMB73_706161 [Zea mays]
Length = 515
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/208 (45%), Positives = 125/208 (60%), Gaps = 12/208 (5%)
Query: 8 SSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
S+ A ++ L+SL S S + E GYTV TV DG +L I P+SV + S +
Sbjct: 44 SNAASAVVKRLWSLKSTTKTASGGKSMVKYEGGYTVETVFDGSKLGIEPYSV-EVTQSGE 102
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
L+V+DS S Y ++ PLS S K +AG +G G+ DG+ AR + PK F VD KGN
Sbjct: 103 LLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDKGN 162
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
IYVAD N IRKI++ GVTTIAGG S + G DGP+ A FS DFE+ ++ C+LL+
Sbjct: 163 IYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLVI 222
Query: 177 DHGNQLIRQINLKPEDCSKSSQSGSALG 204
D GNQ IR+I L +DC ++G LG
Sbjct: 223 DRGNQAIREIQLHFDDCVYQYEAGFPLG 250
>gi|226530993|ref|NP_001143690.1| uncharacterized protein LOC100276420 precursor [Zea mays]
gi|195624742|gb|ACG34201.1| hypothetical protein [Zea mays]
Length = 515
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/209 (44%), Positives = 125/209 (59%), Gaps = 13/209 (6%)
Query: 8 SSLAITLLSLLFSLVS----------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSS 57
S+ A ++ L+SL S S + E GYTV TV DG +L I P+SV + S
Sbjct: 44 SNAASAVVKRLWSLKSTTKTAASGGKSMVKYEGGYTVETVFDGSKLGIEPYSV-EVTQSG 102
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+L+V+DS S Y ++ PLS S K +AG +G G+ DG+ AR + PK F VD KG
Sbjct: 103 ELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDKG 162
Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
NIYVAD N IRKI++ GVTTIAGG S + G DGP+ A FS DFE+ ++ C+LL+
Sbjct: 163 NIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLV 222
Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALG 204
D GNQ IR+I L +DC ++G LG
Sbjct: 223 IDRGNQAIREIQLHFDDCVYQYEAGFPLG 251
>gi|413923861|gb|AFW63793.1| hypothetical protein ZEAMMB73_706161 [Zea mays]
Length = 516
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 94/209 (44%), Positives = 125/209 (59%), Gaps = 13/209 (6%)
Query: 8 SSLAITLLSLLFSLVS----------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSS 57
S+ A ++ L+SL S S + E GYTV TV DG +L I P+SV + S
Sbjct: 44 SNAASAVVKRLWSLKSTTKTAASGGKSMVKYEGGYTVETVFDGSKLGIEPYSV-EVTQSG 102
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+L+V+DS S Y ++ PLS S K +AG +G G+ DG+ AR + PK F VD KG
Sbjct: 103 ELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDKG 162
Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
NIYVAD N IRKI++ GVTTIAGG S + G DGP+ A FS DFE+ ++ C+LL+
Sbjct: 163 NIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLV 222
Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALG 204
D GNQ IR+I L +DC ++G LG
Sbjct: 223 IDRGNQAIREIQLHFDDCVYQYEAGFPLG 251
>gi|255552640|ref|XP_002517363.1| conserved hypothetical protein [Ricinus communis]
gi|223543374|gb|EEF44905.1| conserved hypothetical protein [Ricinus communis]
Length = 494
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 94/221 (42%), Positives = 131/221 (59%), Gaps = 14/221 (6%)
Query: 14 LLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDS 64
L+ L+SL S S + E GYTV TV DG +L I PH+V P S +L+VLDS
Sbjct: 42 LVKWLWSLKSNTKTAASSRSMMKFEGGYTVETVFDGSKLGIEPHTVDVSP-SGELLVLDS 100
Query: 65 SRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK 122
S Y +S PLS S K +AG +G G+ DG AR + PK VD +GNIY+AD
Sbjct: 101 ENSNIYKISTPLSRYSRPKLIAGSPEGYSGHVDGRLREARMNHPKGLTVDNRGNIYIADT 160
Query: 123 SNHVIRKITNLGVTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
N IRKI++ GVTTIAGG ++ G DGP+++A FSNDF++ ++ C+LL+ D GNQ
Sbjct: 161 MNMAIRKISDGGVTTIAGGKWTRAGGHVDGPSEDAKFSNDFDIAYIGSSCSLLVIDRGNQ 220
Query: 182 LIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV 222
IR+I L +DC+ LG ++V V + +++L+
Sbjct: 221 AIREIQLNDDDCNNQYDGTFHLG-IAVLVAAAFFGYMLALL 260
>gi|195621162|gb|ACG32411.1| hypothetical protein [Zea mays]
gi|219884599|gb|ACL52674.1| unknown [Zea mays]
gi|413944257|gb|AFW76906.1| hypothetical protein ZEAMMB73_849037 [Zea mays]
Length = 508
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/178 (48%), Positives = 112/178 (62%), Gaps = 3/178 (1%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GY V TV DG +L I PH+V P + +L+VLDS S Y + PLS S K LAG
Sbjct: 73 EGGYAVDTVFDGSKLGIEPHAVEITP-AGNLLVLDSINSNIYRVQLPLSPYSRPKLLAGS 131
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
G+ G+ DG AR + PK F VD +GNIYVAD N IRKI++ GVTTIAGG S +
Sbjct: 132 PKGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADGMNMAIRKISDTGVTTIAGGKSIRG 191
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
G DGP+ +A FS DFE+ ++ C+LL+ D GNQ IR+I L +DC+ ++G LG
Sbjct: 192 GHIDGPSDDAKFSTDFEIKYISSSCSLLVIDRGNQAIREIPLNDDDCAYQYEAGFPLG 249
>gi|293331401|ref|NP_001168100.1| uncharacterized protein LOC100381837 precursor [Zea mays]
gi|223946007|gb|ACN27087.1| unknown [Zea mays]
gi|413938717|gb|AFW73268.1| hypothetical protein ZEAMMB73_298159 [Zea mays]
Length = 510
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 93/208 (44%), Positives = 126/208 (60%), Gaps = 12/208 (5%)
Query: 8 SSLAITLLSLLFSL-----VSSG----LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
S+ A ++ L+SL ++SG + E GYTV TV DG +L I P+SV G +
Sbjct: 44 SNAASAVVKRLWSLKSTTKIASGGKSMVKYEGGYTVETVFDGSKLGIEPYSVEVTQGG-E 102
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
L+V+DS S Y ++ PLS S K +AG +G G+ DG+ AR + PK F VD +GN
Sbjct: 103 LLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDRGN 162
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
IYVAD N IRKI++ GVTTIAGG S + G DGP+ A FS DFE+ ++ C+LL+
Sbjct: 163 IYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLVI 222
Query: 177 DHGNQLIRQINLKPEDCSKSSQSGSALG 204
D GNQ IR+I L +DC ++G LG
Sbjct: 223 DRGNQAIREIQLHFDDCVYQYEAGFPLG 250
>gi|357137830|ref|XP_003570502.1| PREDICTED: uncharacterized protein LOC100839308 [Brachypodium
distachyon]
Length = 495
Score = 157 bits (397), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 96/234 (41%), Positives = 135/234 (57%), Gaps = 15/234 (6%)
Query: 8 SSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
S+ A +L L+SL S S + E GYTV TV DG +L I P++V G +
Sbjct: 39 SNAASAVLKRLWSLKSTTKTATGAKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQGG-E 97
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
L+V+DS S Y ++ PLS S K +AG +G G+ DG A+ + PK F VD +GN
Sbjct: 98 LLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGYPGHVDGRLREAKMNHPKGFTVDGRGN 157
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
IYVAD N +RKI++ GVTTIAGG S + G DGP+ +A FS DFE+ ++ C+LL+
Sbjct: 158 IYVADAMNMAVRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIASSCSLLVI 217
Query: 177 DHGNQLIRQINLKPEDCSKSSQSGSALG---AVSVWVLVSVLSCLVSLVIGFVA 227
D GNQ IR+I L +DC ++G LG ++ +L+ L V+G V+
Sbjct: 218 DRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLGMVS 271
>gi|242066098|ref|XP_002454338.1| hypothetical protein SORBIDRAFT_04g029020 [Sorghum bicolor]
gi|241934169|gb|EES07314.1| hypothetical protein SORBIDRAFT_04g029020 [Sorghum bicolor]
Length = 518
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 93/209 (44%), Positives = 124/209 (59%), Gaps = 13/209 (6%)
Query: 8 SSLAITLLSLLFSLVS----------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSS 57
S+ A ++ ++SL S S + E GYTV TV DG +L I P+SV G
Sbjct: 44 SNAASAVVKRMWSLKSTTKTAASGGKSMVKYEGGYTVETVFDGSKLGIEPYSVEVTQGG- 102
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+L+V+DS S Y ++ PLS S K +AG +G G+ DG+ AR + PK F VD +G
Sbjct: 103 ELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDRG 162
Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
NIYVAD N IRKI++ GVTTIAGG S + G DGP+ A FS DFE+ +V C+LL+
Sbjct: 163 NIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYVGSSCSLLV 222
Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALG 204
D GNQ IR+I L +DC ++G LG
Sbjct: 223 IDRGNQAIREIQLHFDDCVYQYEAGFPLG 251
>gi|413938718|gb|AFW73269.1| hypothetical protein ZEAMMB73_298159 [Zea mays]
Length = 511
Score = 157 bits (396), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 93/209 (44%), Positives = 124/209 (59%), Gaps = 13/209 (6%)
Query: 8 SSLAITLLSLLFSLVS----------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSS 57
S+ A ++ L+SL S S + E GYTV TV DG +L I P+SV G
Sbjct: 44 SNAASAVVKRLWSLKSTTKIAASGGKSMVKYEGGYTVETVFDGSKLGIEPYSVEVTQGG- 102
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+L+V+DS S Y ++ PLS S K +AG +G G+ DG+ AR + PK F VD +G
Sbjct: 103 ELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDRG 162
Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
NIYVAD N IRKI++ GVTTIAGG S + G DGP+ A FS DFE+ ++ C+LL+
Sbjct: 163 NIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLV 222
Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALG 204
D GNQ IR+I L +DC ++G LG
Sbjct: 223 IDRGNQAIREIQLHFDDCVYQYEAGFPLG 251
>gi|357118152|ref|XP_003560822.1| PREDICTED: uncharacterized protein LOC100834977 [Brachypodium
distachyon]
Length = 544
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 88/204 (43%), Positives = 122/204 (59%), Gaps = 6/204 (2%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E Y V TV DG +L I P++V P + +L+VLDS S Y + PLS S K ++G
Sbjct: 79 EGWYEVETVFDGSKLGIEPYAVEVTP-AGELLVLDSMNSNIYRVQLPLSRYSRPKLVSGS 137
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
+G+ G+ DG AR + PK F VD +GNIYVAD N IRKI++ GVTTIAGG S +
Sbjct: 138 PEGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSVRG 197
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAV 206
G DGP+ +A FS DFE+ ++ C+LL+ D GNQ IR+I L+ +DC ++G LG
Sbjct: 198 GHTDGPSDDAKFSTDFEIRYISSSCSLLVIDRGNQAIREIPLQLDDCEYQHEAGFPLGVA 257
Query: 207 SVWV---LVSVLSCLVSLVIGFVA 227
++ +L+ L V G V+
Sbjct: 258 LLFAAGFFGYMLALLQRRVFGMVS 281
>gi|449446849|ref|XP_004141183.1| PREDICTED: uncharacterized protein LOC101216407 [Cucumis sativus]
gi|449489545|ref|XP_004158343.1| PREDICTED: uncharacterized protein LOC101227198 [Cucumis sativus]
Length = 417
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 84/216 (38%), Positives = 129/216 (59%), Gaps = 5/216 (2%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
+ E GY V T++DG ++ I P+SV P S +L++LD+ S + +S P+S+ K
Sbjct: 2 IKFERGYVVETLLDGSKMGIEPYSVGVSP-SGELLILDAENSNVHKISMPVSQFCRPKLF 60
Query: 86 AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
AG +G G+ DG+ AR P+ VD +GNIY+AD N IRKI++ GVTTIAGG
Sbjct: 61 AGSSEGYSGHVDGKLRDARMSHPRGLTVDQRGNIYIADTKNKAIRKISDAGVTTIAGGKW 120
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
+K G DGP++ + FSNDF+L +V C+LL+ D GNQ IR+I L+ EDC++ GS L
Sbjct: 121 RKSGHIDGPSEESKFSNDFDLVYVGSSCSLLVVDRGNQAIREIQLRAEDCTE--YDGSFL 178
Query: 204 GAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLD 239
+++ +L +++ V + ++ R+D
Sbjct: 179 LGIALLTAAMLLGYMLARFQFRVLATFSSKNDSRVD 214
>gi|225461726|ref|XP_002285515.1| PREDICTED: uncharacterized protein LOC100245328 [Vitis vinifera]
gi|302142876|emb|CBI20171.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 154 bits (388), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 88/204 (43%), Positives = 125/204 (61%), Gaps = 7/204 (3%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L EDGY V TV++G++L + PHS+ + G +L +D+ ++ ++ PLS+ S +
Sbjct: 57 LQFEDGYLVETVVEGNELGVVPHSIRVSEDG--ELFAVDAVKNNIVRITPPLSQYSRARL 114
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
+AG G G+ DG+P ARF+ PK +D KGN+YVAD SN IRKI + GVTTIAGG
Sbjct: 115 VAGSFQGHTGHVDGKPSDARFNGPKGVTMDDKGNVYVADTSNLAIRKIGDSGVTTIAGGK 174
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S G DGP+++A FS+DF++ +V C+LL+ D GN +RQI+L EDC + S SA
Sbjct: 175 SNVAGYRDGPSEDAKFSSDFDVVYVRPTCSLLVVDRGNAALRQISLNQEDCDYQNSSISA 234
Query: 203 LGAVSV--WVLVSVLSCLVSLVIG 224
V V+V SCL+ G
Sbjct: 235 TDIFMVIGAVMVGYASCLLQKGFG 258
>gi|224102017|ref|XP_002312513.1| predicted protein [Populus trichocarpa]
gi|222852333|gb|EEE89880.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 153 bits (386), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 141/238 (59%), Gaps = 16/238 (6%)
Query: 6 VYSSLAITLLSLLFSL------VSSG---LLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
++S++ L+ L+SL V SG + E GYTV TV DG +L I P+SV P S
Sbjct: 38 LFSNVVSALMKWLWSLKATTKTVISGRPMMKFESGYTVETVFDGSKLGIEPYSVQVLP-S 96
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
+L++LDS+ S Y +S LS S K +AG +G G+ DG+ A+ + PK VD +
Sbjct: 97 GELLILDSANSNIYRISASLSLYSRPKLVAGSHEGYSGHVDGKLREAKMNHPKGLTVDDR 156
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
GNIYVAD N IRKI++ GVTTIAGG + DG +++A+FSNDF++ ++ C+LL
Sbjct: 157 GNIYVADTMNMAIRKISDAGVTTIAGGKWGRGSHVDGASEDANFSNDFDVVYIGSSCSLL 216
Query: 175 ISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV---IGFVARP 229
+ D GN+ IR+I L +DC+ SG LG ++V V +++L+ +G + P
Sbjct: 217 VIDRGNRAIREIQLHFDDCAYQYGSGFPLG-IAVLVAAGFFGYMLALLQRRVGMIVSP 273
>gi|224117004|ref|XP_002317450.1| predicted protein [Populus trichocarpa]
gi|222860515|gb|EEE98062.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 153 bits (386), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 92/235 (39%), Positives = 130/235 (55%), Gaps = 14/235 (5%)
Query: 3 SHFVYS-SLAITLLSLLFSLVSSG-------LLLEDGYTVTTVIDGHQLEINPHSVIDRP 54
SHF Y+ LA+ L+ F + L EDGY V TV+ G+ + + P+ I
Sbjct: 4 SHFSYTLVLAVLSLACTFQFQAHAAPPDENVLQFEDGYLVETVVKGNAMGVVPYK-IRLS 62
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVD 112
+L +D S ++ PLS+ S + +AG G G+ DG+P RF+ P+ +D
Sbjct: 63 EDGELYAVDEVNSNVVKITPPLSQYSRARLVAGSFQGYTGHIDGKPNEVRFNHPRGLTMD 122
Query: 113 MKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
KGNIYVAD NH IRKI + GVTTIAGG S G DGP+++A FSNDF++ +V C+
Sbjct: 123 DKGNIYVADSLNHAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVHSTCS 182
Query: 173 LLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSV---WVLVSVLSCLVSLVIG 224
LL+ D GN +RQI+L EDC S S + V + VL+ +C++ L G
Sbjct: 183 LLVVDRGNAALRQISLNQEDCDYQSSSFTMTEDVLMVVGAVLIGYATCMLQLGFG 237
>gi|356549639|ref|XP_003543199.1| PREDICTED: uncharacterized protein LOC100797841 [Glycine max]
Length = 487
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 83/203 (40%), Positives = 122/203 (60%), Gaps = 4/203 (1%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
L E+GY V TV++G+++ + P+ + +L +D+ S ++ PLS+ S + +
Sbjct: 57 LQFENGYVVETVVEGNEIGVVPYRIRVSEEDGELFAVDAINSNIVRITPPLSQYSRGRLV 116
Query: 86 AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
AG G G+ DG+P ARF+ PK VD KGN+YVAD N IRKI + GVTTIAGG S
Sbjct: 117 AGSFQGYTGHVDGKPSDARFNHPKGITVDDKGNVYVADTQNLAIRKIGDAGVTTIAGGKS 176
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
G DGP+++A FSNDF++ +V C+LL+ D GN +RQI+L EDC S S S+
Sbjct: 177 NVAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLDQEDCDYQSNSISST 236
Query: 204 GAVSV--WVLVSVLSCLVSLVIG 224
++V V+V +C++ G
Sbjct: 237 DILTVVGAVIVGYATCMLQQGFG 259
>gi|293332279|ref|NP_001170567.1| uncharacterized protein LOC100384592 precursor [Zea mays]
gi|238006082|gb|ACR34076.1| unknown [Zea mays]
Length = 505
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/186 (47%), Positives = 115/186 (61%), Gaps = 7/186 (3%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
S + E GY V TV DG +L I PH+ I+R G DL++LDS S Y + PLS S
Sbjct: 71 SMVRYEGGYAVDTVFDGSKLGIEPHAAQINRAG--DLLLLDSINSNIYRVQLPLSPYSRP 128
Query: 83 KRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
K LAG +G+ G+ DG AR + PK F VD +GNIYVAD N IRKI++ GVTTIAG
Sbjct: 129 KLLAGSPEGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAG 188
Query: 141 GGSKKEGRADGP--AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
G S + G D P + +A FS DFE+ ++ C+LL+ D GNQ IR+I L +DC+ +
Sbjct: 189 GKSIRGGYIDEPSVSDDAKFSTDFEVQYISSTCSLLVIDRGNQAIREIPLNDDDCAYQYE 248
Query: 199 SGSALG 204
+G LG
Sbjct: 249 AGFPLG 254
>gi|413953662|gb|AFW86311.1| hypothetical protein ZEAMMB73_666053 [Zea mays]
Length = 506
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/186 (47%), Positives = 115/186 (61%), Gaps = 7/186 (3%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
S + E GY V TV DG +L I PH+ I+R G DL++LDS S Y + PLS S
Sbjct: 71 SMVRYEGGYAVDTVFDGSKLGIEPHAAQINRAG--DLLLLDSINSNIYRVQLPLSPYSRP 128
Query: 83 KRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
K LAG +G+ G+ DG AR + PK F VD +GNIYVAD N IRKI++ GVTTIAG
Sbjct: 129 KLLAGSPEGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAG 188
Query: 141 GGSKKEGRADGP--AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
G S + G D P + +A FS DFE+ ++ C+LL+ D GNQ IR+I L +DC+ +
Sbjct: 189 GKSIRGGYIDEPSVSDDAKFSTDFEVQYISSTCSLLVIDRGNQAIREIPLNDDDCAYQYE 248
Query: 199 SGSALG 204
+G LG
Sbjct: 249 AGFPLG 254
>gi|9369390|gb|AAF87138.1|AC002423_3 T23E23.5 [Arabidopsis thaliana]
Length = 493
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 94/242 (38%), Positives = 136/242 (56%), Gaps = 20/242 (8%)
Query: 1 MASHFVYSSLAITLLSLLFSLVSSG-----------LLLEDGYTVTTVIDGHQLEINPHS 49
M+ HF++ + I L S + S + E+GY+V TV+DG +L I P+S
Sbjct: 1 MSRHFLFLGIIILLFSAFVASAPSSTSPATVPTKSMVKFENGYSVETVLDGSKLGIEPYS 60
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPK 107
I + +L++LDS S Y +S LS S + + G +G G+ DG AR + PK
Sbjct: 61 -IQVLSNGELLILDSQNSNIYQISSSLSLYSRPRLVTGSPEGYPGHVDGRLRDARLNNPK 119
Query: 108 SFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS-KKEGRADGPAQNASFSNDFELTF 166
VD +GNIYVAD N+ IRKI+ GVTTIAGG + G DGP+++A FSNDF++ +
Sbjct: 120 GLTVDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVRGGGHVDGPSEDAKFSNDFDVVY 179
Query: 167 VPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAV-----SVWVLVSVLSCLVSL 221
+ C+LL+ D GNQ IR+I L +DC+ SG LG + SVWV +L +S
Sbjct: 180 LGSSCSLLVIDRGNQAIREIQLHFDDCADQYGSGFPLGNIYIPKFSVWVPSCILQSSLSQ 239
Query: 222 VI 223
++
Sbjct: 240 LL 241
>gi|296087695|emb|CBI34951.3| unnamed protein product [Vitis vinifera]
Length = 811
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 96/234 (41%), Positives = 136/234 (58%), Gaps = 14/234 (5%)
Query: 6 VYSSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
+ S++ L+ L+SL S S + EDGYTV TV DG +L I P+S ++ S
Sbjct: 34 IVSNVVSALVKWLWSLKSTTNTAISSRSKMNFEDGYTVETVFDGSKLGIEPYS-LEVSTS 92
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
+L+VLDS S Y +S PLS S K +AG DG G+ DG+ AR + PK +D +
Sbjct: 93 GELLVLDSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDER 152
Query: 115 GNIYVADKSNHVIRKITNLGVTTIA-GGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
GNIY+AD N IRKI++ GVTTIA G + G DGP+++A FSNDF++ ++ C+L
Sbjct: 153 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSL 212
Query: 174 LISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVA 227
L+ D GNQ IR+I L EDC+ LG ++V V +++L+ VA
Sbjct: 213 LVIDRGNQAIREIQLHYEDCAYQYNGSFHLG-IAVLVAAGFFGYMLALLQRRVA 265
>gi|224108107|ref|XP_002314724.1| predicted protein [Populus trichocarpa]
gi|222863764|gb|EEF00895.1| predicted protein [Populus trichocarpa]
Length = 487
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 91/238 (38%), Positives = 137/238 (57%), Gaps = 16/238 (6%)
Query: 6 VYSSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
V+S++ + L+S+ S+ + E GYTV TV DG +L I P+SV P S
Sbjct: 41 VFSNVVPAFMKWLWSMKSTTKTVISGRPMMKFESGYTVETVFDGSKLGIEPYSVQVLP-S 99
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
+L++LDS S Y +S LS S K +AG +G G+ DG+ A+ + PK VD +
Sbjct: 100 GELLILDSVNSNIYRMSSSLSLYSRPKLVAGSHEGYSGHVDGKLREAKMNHPKGLTVDDR 159
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
GNIY+AD N IRKI++ GVTTIAGG + DG +++A FSNDF++ ++ C+LL
Sbjct: 160 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGSHVDGASEDAKFSNDFDVLYIGSSCSLL 219
Query: 175 ISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV---IGFVARP 229
+ D GNQ IR+I L +DC+ SG LG ++V + +++L+ +G + P
Sbjct: 220 VIDRGNQAIREIQLHFDDCAYQYGSGFPLG-IAVLLAAGFFGYMLALLQRRVGMIVSP 276
>gi|449492692|ref|XP_004159073.1| PREDICTED: uncharacterized protein LOC101226879 [Cucumis sativus]
Length = 516
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 136/229 (59%), Gaps = 14/229 (6%)
Query: 6 VYSSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
V S++ +L+ L+S+ SS + E GY+V TV DG +L I+P+SV P S
Sbjct: 34 VLSNVVSSLVKKLWSMKSSAKTAVSSRSMMKFESGYSVETVFDGSKLGIDPYSVEMSP-S 92
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
+L++LD+ S + +S PLS S K ++G +G G+ DG AR + PK +D +
Sbjct: 93 GELLILDAENSNIHKISMPLSRFSRPKLVSGSAEGYSGHVDGHSREARMNHPKGLTLDER 152
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
GNIY+AD N IRKI++ GVTTIAGG ++ G DGP+++A FSNDF++ +V C+L
Sbjct: 153 GNIYIADTMNMAIRKISDTGVTTIAGGRWNQGSGHIDGPSEDAKFSNDFDVVYVGSSCSL 212
Query: 174 LISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV 222
L+ D GN+ IR+I L +DC+ LG V + V + L++L+
Sbjct: 213 LVIDRGNKAIREIELNYDDCNTQYADSLNLGVV-LLVAAGLFGYLLALL 260
>gi|147860395|emb|CAN80443.1| hypothetical protein VITISV_043282 [Vitis vinifera]
Length = 527
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 97/245 (39%), Positives = 136/245 (55%), Gaps = 24/245 (9%)
Query: 6 VYSSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
+ S++ L+ L+SL S S + EDGYTV TV DG +L I P+S ++ S
Sbjct: 17 IVSNVVSALVKWLWSLKSTTNTAISSXSKMNFEDGYTVETVFDGSKLGIEPYS-LEVSTS 75
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
+L+VLDS S Y +S PLS S K +AG DG G+ DG+ AR + PK +D +
Sbjct: 76 GELLVLDSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDER 135
Query: 115 GNIYVADKSNHVIRKITNLGVTTIA-GGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
GNIY+AD N IRKI++ GVTTIA G + G DGP+++A FSNDF++ ++ C+L
Sbjct: 136 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSL 195
Query: 174 LISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIR 233
L+ D GNQ IR+I L EDC+ LG + L + VI + P++ R
Sbjct: 196 LVIDRGNQAIREIQLHYEDCAYQYNGSFHLGKLR----------LANFVIA-ITXPWLWR 244
Query: 234 HTGRL 238
R
Sbjct: 245 FMRRF 249
>gi|225452518|ref|XP_002274991.1| PREDICTED: uncharacterized protein LOC100266244 [Vitis vinifera]
Length = 677
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 96/234 (41%), Positives = 136/234 (58%), Gaps = 14/234 (5%)
Query: 6 VYSSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
+ S++ L+ L+SL S S + EDGYTV TV DG +L I P+S ++ S
Sbjct: 34 IVSNVVSALVKWLWSLKSTTNTAISSRSKMNFEDGYTVETVFDGSKLGIEPYS-LEVSTS 92
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
+L+VLDS S Y +S PLS S K +AG DG G+ DG+ AR + PK +D +
Sbjct: 93 GELLVLDSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDER 152
Query: 115 GNIYVADKSNHVIRKITNLGVTTIA-GGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
GNIY+AD N IRKI++ GVTTIA G + G DGP+++A FSNDF++ ++ C+L
Sbjct: 153 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSL 212
Query: 174 LISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVA 227
L+ D GNQ IR+I L EDC+ LG ++V V +++L+ VA
Sbjct: 213 LVIDRGNQAIREIQLHYEDCAYQYNGSFHLG-IAVLVAAGFFGYMLALLQRRVA 265
>gi|356497860|ref|XP_003517774.1| PREDICTED: uncharacterized protein LOC100799483 [Glycine max]
Length = 508
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/202 (43%), Positives = 125/202 (61%), Gaps = 5/202 (2%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E GYTV TV DG +L I P++V P + +L++LDS+ S Y +S LS S K
Sbjct: 62 SMMKFESGYTVETVFDGSKLGIEPYAVEVLP-NGELLILDSANSNIYRISSSLSLNSRPK 120
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
+AG +G G+ DG AR + PK VD +GNIYVAD N IRKI++ GVTTIAGG
Sbjct: 121 LVAGSAEGYSGHVDGRFREARMNHPKGITVDNRGNIYVADIMNMAIRKISDSGVTTIAGG 180
Query: 142 -GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
S+ G DGP++ A FSNDF++ +V C+LL+ D GNQ IR+I L +DC+ ++G
Sbjct: 181 KWSRGGGHVDGPSEEAKFSNDFDVVYVGSSCSLLVIDRGNQAIREIQLHFDDCAYQYENG 240
Query: 201 SALGAVSVWVLVSVLSCLVSLV 222
LG ++V + +++L+
Sbjct: 241 FPLG-IAVLIGAGFFGYMLALL 261
>gi|449456567|ref|XP_004146020.1| PREDICTED: uncharacterized protein LOC101206392 [Cucumis sativus]
Length = 477
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 85/218 (38%), Positives = 128/218 (58%), Gaps = 16/218 (7%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
+ + E GY V T+++G+++ + P+ + + G +L +DS S +S PLS S
Sbjct: 55 NAIQFESGYLVETIVEGNEIGMVPYKIRVSEDG--ELFAVDSVNSNVVKVSPPLSRYSRA 112
Query: 83 KRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
+ +AG G +G+ DG+P ARF++PK +D KGN+YVAD N IRKI + GVTTIAG
Sbjct: 113 RLVAGSFQGYKGHVDGKPSDARFNQPKGITIDDKGNVYVADTLNLAIRKIVDAGVTTIAG 172
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
G + G +DGP + A FSNDF++ +V C+LL+ D GN +RQI+L EDC
Sbjct: 173 GKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVVDRGNAALRQISLNKEDCD------ 226
Query: 201 SALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRL 238
G+VS V + +L+IG+ Y+++H RL
Sbjct: 227 YQYGSVST---SDVAMFIGALLIGYFT--YMLQHGFRL 259
>gi|449518065|ref|XP_004166064.1| PREDICTED: uncharacterized protein LOC101228723 [Cucumis sativus]
Length = 449
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 85/218 (38%), Positives = 128/218 (58%), Gaps = 16/218 (7%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
+ + E GY V T+++G+++ + P+ + + G +L +DS S +S PLS S
Sbjct: 27 NAIQFESGYLVETIVEGNEIGMVPYKIRVSEDG--ELFAVDSVNSNVVKVSPPLSRYSRA 84
Query: 83 KRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
+ +AG G +G+ DG+P ARF++PK +D KGN+YVAD N IRKI + GVTTIAG
Sbjct: 85 RLVAGSFQGYKGHVDGKPSDARFNQPKGITIDDKGNVYVADTLNLAIRKIVDAGVTTIAG 144
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
G + G +DGP + A FSNDF++ +V C+LL+ D GN +RQI+L EDC
Sbjct: 145 GKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVVDRGNAALRQISLNKEDCD------ 198
Query: 201 SALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRL 238
G+VS V + +L+IG+ Y+++H RL
Sbjct: 199 YQYGSVST---SDVAMFIGALLIGYFT--YMLQHGFRL 231
>gi|449530424|ref|XP_004172195.1| PREDICTED: uncharacterized LOC101210502, partial [Cucumis sativus]
Length = 437
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 88/235 (37%), Positives = 133/235 (56%), Gaps = 10/235 (4%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L E+GY V TV++G+++ + PH + + + G +L V+DS S ++ PLS+ + +
Sbjct: 16 LQFENGYLVGTVVEGNEIGVLPHKIHVSKDG--ELFVVDSVNSNIVKITPPLSKYTRARL 73
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
+AG G+ DG+P ARF+ P+ VD KGN+YVAD N IRKI + GVTTIAGG
Sbjct: 74 VAGSFQSHTGHIDGKPNDARFNHPRGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGK 133
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S G DGP ++A FSNDF++ +V C+LL+ D GN IRQI+L EDC S S
Sbjct: 134 SNVVGYRDGPGEDAKFSNDFDVMYVRSTCSLLVIDRGNAAIRQISLNQEDCEYQDSSIS- 192
Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEALPNQSGETS 257
+ VL+ + + L + R + + + +PPL+ + P + +S
Sbjct: 193 ----NSDVLMIIGAVLAGYATYMIQRGFGTSNVSQTNPPLETEYREKPYKPESSS 243
>gi|449449204|ref|XP_004142355.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101209861 [Cucumis sativus]
Length = 454
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/202 (41%), Positives = 124/202 (61%), Gaps = 5/202 (2%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E GY+V TV DG +L I+P+SV P S +L++LD+ S + +S PLS S K
Sbjct: 13 SMMKFESGYSVETVFDGSKLGIDPYSVEMSP-SGELLILDAENSNIHKISMPLSRFSRPK 71
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
++G +G G+ DG AR + PK +D +GNIY+AD N IRKI++ GVTTIAGG
Sbjct: 72 LVSGSAEGYSGHVDGHSREARMNHPKGLTLDERGNIYIADTMNMAIRKISDTGVTTIAGG 131
Query: 142 -GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
++ G DGP+++A FSNDF++ +V C+LL+ D GN+ IR+I L +DC+
Sbjct: 132 RWNQGSGHIDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIELNYDDCNTQYADS 191
Query: 201 SALGAVSVWVLVSVLSCLVSLV 222
LG V + V + L++L+
Sbjct: 192 LNLGVV-LLVAAGLFGYLLALL 212
>gi|255560011|ref|XP_002521024.1| conserved hypothetical protein [Ricinus communis]
gi|223539861|gb|EEF41441.1| conserved hypothetical protein [Ricinus communis]
Length = 500
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 138/234 (58%), Gaps = 15/234 (6%)
Query: 1 MASHFVYSSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVI 51
+ S F ++L++ + L+SL ++ + E GY V TV DG +L I P+SV
Sbjct: 39 IVSGFFSNALSV-FMKWLWSLKATSKTAISGRPMMKFEGGYNVETVFDGSKLGIEPYSVE 97
Query: 52 DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSF 109
P + +L++LDS+ S Y +S LS S K +AG DG G+ DG+ AR + PK
Sbjct: 98 VLP-NGELLILDSANSNIYRISSSLSLYSRPKLVAGSPDGYSGHVDGKHREARMNHPKGL 156
Query: 110 AVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG-RADGPAQNASFSNDFELTFVP 168
VD +GNIY+AD N IRKI++ GVTTIAGG + G DG +++A FSNDF++ ++
Sbjct: 157 TVDDRGNIYIADAMNMAIRKISDAGVTTIAGGKWGRGGSHVDGASEDAKFSNDFDVVYIG 216
Query: 169 HICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV 222
C+LL+ D GN+ IR+I L +DC+ +SG LG V+V V +++L+
Sbjct: 217 SSCSLLVIDRGNRAIREIQLHFDDCAYQYESGFPLG-VAVLVAAGFFGYMLALL 269
>gi|449438911|ref|XP_004137231.1| PREDICTED: uncharacterized protein LOC101210502 [Cucumis sativus]
Length = 480
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 88/235 (37%), Positives = 133/235 (56%), Gaps = 10/235 (4%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L E+GY V TV++G+++ + PH + + + G +L V+DS S ++ PLS+ + +
Sbjct: 59 LQFENGYLVGTVVEGNEIGVLPHKIHVSKDG--ELFVVDSVNSNIVKITPPLSKYTRARL 116
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
+AG G+ DG+P ARF+ P+ VD KGN+YVAD N IRKI + GVTTIAGG
Sbjct: 117 VAGSFQSHTGHIDGKPNDARFNHPRGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGK 176
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S G DGP ++A FSNDF++ +V C+LL+ D GN IRQI+L EDC S S
Sbjct: 177 SNVVGYRDGPGEDAKFSNDFDVMYVRSTCSLLVIDRGNAAIRQISLNQEDCEYQDSSIS- 235
Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEALPNQSGETS 257
+ VL+ + + L + R + + + +PPL+ + P + +S
Sbjct: 236 ----NSDVLMIIGAVLAGYATYMIQRGFGTSNVSQTNPPLETEYREKPYKPESSS 286
>gi|356501873|ref|XP_003519748.1| PREDICTED: uncharacterized protein LOC100797570 [Glycine max]
Length = 507
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 83/184 (45%), Positives = 117/184 (63%), Gaps = 4/184 (2%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E GYTV TV DG +L I P++V + + +L++LDS+ S Y +S LS + K
Sbjct: 62 SMMKFESGYTVETVFDGSKLGIEPYAV-EVLANGELLILDSANSNIYRISSSLSLNTRPK 120
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
+AG +G G+ DG AR + PK AVD +GNIY+AD N IRKI++ GVTTIAGG
Sbjct: 121 LVAGSAEGYSGHVDGRLREARMNHPKGIAVDNRGNIYIADIMNMAIRKISDSGVTTIAGG 180
Query: 142 -GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
S+ G DGP++ A FSNDF++ ++ C+LL+ D GNQ IR+I L +DC+ ++G
Sbjct: 181 KWSRGGGHIDGPSEEAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHFDDCAYQYENG 240
Query: 201 SALG 204
LG
Sbjct: 241 LTLG 244
>gi|357140648|ref|XP_003571876.1| PREDICTED: uncharacterized protein LOC100824092 [Brachypodium
distachyon]
Length = 491
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/240 (37%), Positives = 135/240 (56%), Gaps = 22/240 (9%)
Query: 25 GLLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
L E GY V T+++G +L + PH++ P +L+ +DS+ S ++ PLSE S +
Sbjct: 65 ALQFESGYFVETLVEGDKLGVTPHTIRVSPLEGGELLAVDSAHSNIVRITPPLSEYSRAR 124
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
+AG G G+ DG+P ARF +P AVD GN+Y+AD +N IRKI + GVTTIAGG
Sbjct: 125 LVAGSFQGHAGHIDGKPIDARFKRPTGVAVDDTGNVYIADTANLAIRKIGDSGVTTIAGG 184
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
S G DGP+++A FS DF++ +V +C+LL+ D GN +R+I L EDC+
Sbjct: 185 KSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKIALPQEDCT------- 237
Query: 202 ALGAVSVWVLVSVLSCLVSLVIGFVARPY---IIRHTGRLDPPLQHDMEALPNQSGETST 258
+ ++LS + LVIG V Y +++H P +EA ++ E+ST
Sbjct: 238 -------YQDAALLSSDIILVIGAVVAGYLFSVVQHG--FGPSSSEKIEAPEDEKQESST 288
>gi|356573394|ref|XP_003554846.1| PREDICTED: uncharacterized protein LOC100793916 [Glycine max]
Length = 528
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 118/199 (59%), Gaps = 5/199 (2%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E GY+V T+ DG QL I PHSV P + + +VLDS S Y +S +S S K LAG
Sbjct: 70 FESGYSVETIFDGSQLGIEPHSVKISP-NGEFLVLDSENSNIYKVSGSMSRYSRPKLLAG 128
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
+G G+ DG P AR + PK VD +GNIY+AD N IRKI++ GVTTIAGG
Sbjct: 129 SAEGNIGHIDGRPREARMNHPKGLTVDDRGNIYIADTLNMAIRKISDEGVTTIAGGKRGY 188
Query: 146 E-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
G DGP+++A FSNDF++ +V C+LL+ D GN IR+I L +DC+ + ++
Sbjct: 189 AGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVVDRGNHAIREIQLHQDDCTSYDEDDNSFN 248
Query: 205 -AVSVWVLVSVLSCLVSLV 222
+ V V + +++L+
Sbjct: 249 LGIVVLVAAAFFGYMLALL 267
>gi|413936635|gb|AFW71186.1| hypothetical protein ZEAMMB73_395034 [Zea mays]
Length = 501
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 123/214 (57%), Gaps = 17/214 (7%)
Query: 22 VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEES 80
V L E GY V T+++G +L + PH+V P +L+ +DS+ S ++ PLSE S
Sbjct: 67 VGVALQFESGYFVETLVEGDKLGVTPHTVRVSPVEGGELLAVDSAHSNIVRITPPLSEYS 126
Query: 81 VVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
+ +AG G G+ DG+P ARF +P AVD GN+YVAD +N IRKI GVTTI
Sbjct: 127 RGRLVAGSFQGHSGHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTI 186
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
AGG S G DGP+++A FS DF++ +V +C+LL+ D GN +R+I+L EDC+
Sbjct: 187 AGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCT---- 242
Query: 199 SGSALGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
+ ++LS + LVIG V YI
Sbjct: 243 ----------YQDSALLSSDLILVIGAVVAGYIF 266
>gi|3176691|gb|AAC18814.1| Contains homology to serine/threonine protein kinase gb|X99618 from
Mycobacterium tuberculosis. ESTs gb|F14403, gb|F14404,
and gb|N96730 come from this gene [Arabidopsis thaliana]
Length = 493
Score = 147 bits (370), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 127/214 (59%), Gaps = 10/214 (4%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E+GY+V TV DG +L I P+S+ P + +L++LDS S Y +S LS S +
Sbjct: 35 SMVKFENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILDSENSNIYKISSSLSLYSRPR 93
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
+ G +G G+ DG A+ + PK VD +GNIYVAD N+ IRKI+ GVTTIAGG
Sbjct: 94 LVTGSPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGG 153
Query: 142 GS-KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
+ + G DGP+++A FSNDF++ +V C+LL+ D GN+ IR+I L +DC+ SG
Sbjct: 154 KTVRNGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDCAYQYGSG 213
Query: 201 SALGAVS------VWVLVSVLSCLVSLVIGFVAR 228
LG ++ +W+ V V + ++ + R
Sbjct: 214 FPLGKLTKVPNFFLWIAVLVAAGFFGYMLALLQR 247
>gi|115445197|ref|NP_001046378.1| Os02g0234200 [Oryza sativa Japonica Group]
gi|50251216|dbj|BAD27660.1| putative NHL repeat-containing protein [Oryza sativa Japonica
Group]
gi|113535909|dbj|BAF08292.1| Os02g0234200 [Oryza sativa Japonica Group]
gi|125538730|gb|EAY85125.1| hypothetical protein OsI_06475 [Oryza sativa Indica Group]
gi|222622487|gb|EEE56619.1| hypothetical protein OsJ_06002 [Oryza sativa Japonica Group]
Length = 493
Score = 147 bits (370), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 85/210 (40%), Positives = 121/210 (57%), Gaps = 17/210 (8%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L E GY V T+++G +L + PH++ P +L+ +DS+ S ++ PLSE S +
Sbjct: 69 LQFESGYFVETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYSRGRL 128
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
+AG G G+ DG+P ARF +P AVD GN+YVAD +N IRKI GVTTIAGG
Sbjct: 129 VAGSFQGHSGHIDGKPSDARFKRPTGVAVDDTGNVYVADTANLAIRKIGESGVTTIAGGK 188
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S G DGP+++A FS DF++ +V +C+LL+ D GN +R+I L EDC+
Sbjct: 189 SNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKIALPQEDCT-------- 240
Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
+ ++LS + LVIG V YI
Sbjct: 241 ------YQDATLLSSDIILVIGAVVAGYIF 264
>gi|226504928|ref|NP_001143593.1| uncharacterized protein LOC100276297 precursor [Zea mays]
gi|195623016|gb|ACG33338.1| hypothetical protein [Zea mays]
Length = 501
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 123/214 (57%), Gaps = 17/214 (7%)
Query: 22 VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEES 80
V L E GY V T+++G +L + PH++ P +L+ +DS+ S ++ PLSE S
Sbjct: 67 VGVALQFESGYFVETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYS 126
Query: 81 VVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
+ +AG G G+ DG+P ARF +P AVD GN+YVAD +N IRKI GVTTI
Sbjct: 127 RGRLVAGSFQGHSGHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTI 186
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
AGG S G DGP+++A FS DF++ +V +C+LL+ D GN +R+I+L EDC+
Sbjct: 187 AGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCT---- 242
Query: 199 SGSALGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
+ ++LS + LVIG V YI
Sbjct: 243 ----------YQDSALLSSDLILVIGAVVAGYIF 266
>gi|30698020|ref|NP_177185.3| NHL domain-containing protein [Arabidopsis thaliana]
gi|332196920|gb|AEE35041.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 509
Score = 146 bits (369), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 94/255 (36%), Positives = 144/255 (56%), Gaps = 20/255 (7%)
Query: 13 TLLSLLFSL---------VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLD 63
+L+ L+SL S + E+GY+V TV DG +L I P+S+ P + +L++LD
Sbjct: 42 SLMKWLWSLKTTTKTTIATRSMVKFENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILD 100
Query: 64 SSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
S S Y +S LS S + + G +G G+ DG A+ + PK VD +GNIYVAD
Sbjct: 101 SENSNIYKISSSLSLYSRPRLVTGSPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVAD 160
Query: 122 KSNHVIRKITNLGVTTIAGGGS-KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN 180
N+ IRKI+ GVTTIAGG + + G DGP+++A FSNDF++ +V C+LL+ D GN
Sbjct: 161 TVNNAIRKISEGGVTTIAGGKTVRNGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGN 220
Query: 181 QLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV---IGFVARPYIIRHTGR 237
+ IR+I L +DC+ SG LG ++V V +++L+ +G + + +
Sbjct: 221 KAIREIQLHFDDCAYQYGSGFPLG-IAVLVAAGFFGYMLALLQRRVGSIVSSHNDQEMFE 279
Query: 238 LDP---PLQHDMEAL 249
DP P++H +L
Sbjct: 280 ADPDQKPMKHSRPSL 294
>gi|242061092|ref|XP_002451835.1| hypothetical protein SORBIDRAFT_04g008420 [Sorghum bicolor]
gi|241931666|gb|EES04811.1| hypothetical protein SORBIDRAFT_04g008420 [Sorghum bicolor]
Length = 495
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 85/210 (40%), Positives = 122/210 (58%), Gaps = 17/210 (8%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L E GY V T+++G +L + PH++ P +L+ +DS+ S ++ PLSE S +
Sbjct: 70 LQFESGYFVETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYSRGRL 129
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
+AG G G+ DG+P ARF +P AVD GN+YVAD +N IRKI GVTTIAGG
Sbjct: 130 VAGSFQGHSGHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTIAGGK 189
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S G DGP+++A FS DF++ +V +C+LL+ D GN +R+I+L EDC+
Sbjct: 190 SNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCT-------- 241
Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
+ ++LS + LVIG V YI
Sbjct: 242 ------YQDSALLSSDLILVIGAVVAGYIF 265
>gi|357452081|ref|XP_003596317.1| hypothetical protein MTR_2g075860 [Medicago truncatula]
gi|355485365|gb|AES66568.1| hypothetical protein MTR_2g075860 [Medicago truncatula]
Length = 493
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 83/204 (40%), Positives = 120/204 (58%), Gaps = 6/204 (2%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
L E+GY V TV++G+++ + P+ + +L +D S ++ PLS+ S + +
Sbjct: 63 LQFENGYVVETVVEGNEIGVIPYRIRVSEEDGELFAVDEINSNIVRITPPLSQYSRGRLV 122
Query: 86 AGDGVQGYSD---GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
AG QGY+D G+P ARF+ PK +D KGN+YVAD N IRKI + GVTTIAGG
Sbjct: 123 AG-SFQGYTDHVDGKPSDARFNHPKGITMDDKGNVYVADTQNLAIRKIGDAGVTTIAGGK 181
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S G DGP+++A FSNDF++ +V C+LL+ D GN +R+I L EDC S S S+
Sbjct: 182 SNVAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRKIILDQEDCDYQSSSISS 241
Query: 203 LGAVSV--WVLVSVLSCLVSLVIG 224
+ V VLV +C++ G
Sbjct: 242 TDILIVVGAVLVGYATCMLQQGFG 265
>gi|22329761|ref|NP_173800.2| NHL domain-containing protein [Arabidopsis thaliana]
gi|18700091|gb|AAL77657.1| At1g23880/T23E23_8 [Arabidopsis thaliana]
gi|20856013|gb|AAM26643.1| At1g23880/T23E23_8 [Arabidopsis thaliana]
gi|332192324|gb|AEE30445.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 545
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 84/198 (42%), Positives = 121/198 (61%), Gaps = 5/198 (2%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E+GY+V TV+DG +L I P+S I + +L++LDS S Y +S LS S + + G
Sbjct: 128 FENGYSVETVLDGSKLGIEPYS-IQVLSNGELLILDSQNSNIYQISSSLSLYSRPRLVTG 186
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS-K 144
+G G+ DG AR + PK VD +GNIYVAD N+ IRKI+ GVTTIAGG +
Sbjct: 187 SPEGYPGHVDGRLRDARLNNPKGLTVDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVR 246
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
G DGP+++A FSNDF++ ++ C+LL+ D GNQ IR+I L +DC+ SG LG
Sbjct: 247 GGGHVDGPSEDAKFSNDFDVVYLGSSCSLLVIDRGNQAIREIQLHFDDCADQYGSGFPLG 306
Query: 205 AVSVWVLVSVLSCLVSLV 222
++V V +++L+
Sbjct: 307 -IAVLVAAVFFGYMLALL 323
>gi|297845436|ref|XP_002890599.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297336441|gb|EFH66858.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 140/258 (54%), Gaps = 25/258 (9%)
Query: 8 SSLAITLLSLLFSLVS-------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLI 60
+SL L SL F + S + E+GY+V TV+DG +L I P+S+ P + +L+
Sbjct: 41 TSLMKWLWSLSFKTTTITAVSTKSMVKFENGYSVETVLDGSKLGIEPYSLQVLP-NGELL 99
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
+LDS S Y +S LS S + + G +G G+ DG AR + PK VD +GNIY
Sbjct: 100 ILDSQNSNIYKISSSLSLYSRPRLITGSPEGYPGHVDGRLRDARLNNPKGLTVDDRGNIY 159
Query: 119 VADKSNHVIRKITNLGVTTIAGGGS-KKEGRADGPAQNASFSNDFELTFVPHICALLISD 177
VAD N+ IRKI+ GVTTIAGG G DGP+++A FSNDF++ ++ C+LL+ D
Sbjct: 160 VADTVNNAIRKISEAGVTTIAGGKMVHGGGHVDGPSEDAKFSNDFDVVYLGSSCSLLVID 219
Query: 178 HGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVAR---PYIIRH 234
GNQ IR+I L +DC+ SG LG + V V + ++ + R I H
Sbjct: 220 RGNQAIREIQLHFDDCADQYGSGFPLG-----IAVLVAAIFFGYMLALLQRRLSSIISYH 274
Query: 235 TGRLDPPLQHDMEALPNQ 252
T Q EA+P+Q
Sbjct: 275 TD------QEVFEAVPDQ 286
>gi|30698018|ref|NP_850974.1| NHL domain-containing protein [Arabidopsis thaliana]
gi|17065224|gb|AAL32766.1| Unknown protein [Arabidopsis thaliana]
gi|21387163|gb|AAM47985.1| unknown protein [Arabidopsis thaliana]
gi|332196919|gb|AEE35040.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 447
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 89/231 (38%), Positives = 135/231 (58%), Gaps = 11/231 (4%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E+GY+V TV DG +L I P+S+ P + +L++LDS S Y +S LS S + + G
Sbjct: 4 FENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILDSENSNIYKISSSLSLYSRPRLVTG 62
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS-K 144
+G G+ DG A+ + PK VD +GNIYVAD N+ IRKI+ GVTTIAGG + +
Sbjct: 63 SPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVR 122
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
G DGP+++A FSNDF++ +V C+LL+ D GN+ IR+I L +DC+ SG LG
Sbjct: 123 NGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDCAYQYGSGFPLG 182
Query: 205 AVSVWVLVSVLSCLVSLV---IGFVARPYIIRHTGRLDP---PLQHDMEAL 249
++V V +++L+ +G + + + DP P++H +L
Sbjct: 183 -IAVLVAAGFFGYMLALLQRRVGSIVSSHNDQEMFEADPDQKPMKHSRPSL 232
>gi|297841763|ref|XP_002888763.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297334604|gb|EFH65022.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 503
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 88/222 (39%), Positives = 131/222 (59%), Gaps = 14/222 (6%)
Query: 13 TLLSLLFSL---------VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLD 63
+L+ L+SL S + E+GY+V TV DG +L I P+S+ P + +L++LD
Sbjct: 42 SLMKWLWSLKTTTKTTIATRSMVKFENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILD 100
Query: 64 SSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
S S Y +S LS S + + G +G G+ DG A+ + PK VD +GNIYVAD
Sbjct: 101 SENSNIYKISSSLSLYSRPRLVTGSPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVAD 160
Query: 122 KSNHVIRKITNLGVTTIAGGGS-KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN 180
N+ IRKI+ GVTTIAGG + + G DGP+++A FSNDF++ +V C+LL+ D GN
Sbjct: 161 TVNNAIRKISEGGVTTIAGGKTVRNGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGN 220
Query: 181 QLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV 222
+ IR+I L +DC+ SG LG ++V V +++L+
Sbjct: 221 KAIREIQLHFDDCAYQYGSGFPLG-IAVLVAAGFFGYMLALL 261
>gi|225424717|ref|XP_002265467.1| PREDICTED: uncharacterized protein LOC100263777 [Vitis vinifera]
gi|296086531|emb|CBI32120.3| unnamed protein product [Vitis vinifera]
Length = 491
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 125/202 (61%), Gaps = 5/202 (2%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E GYTV TV DG +L I P+S I+ GS +L++LDS+ S Y +S LS+ + K
Sbjct: 62 SMMKFESGYTVETVFDGSKLGIEPYS-IEVLGSGELLILDSANSNLYKISSSLSQYTRPK 120
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
+ G +G G+ DG AR + PK VD +GNIYVAD N IRKI++ GVTTIAGG
Sbjct: 121 LVTGSAEGYSGHVDGRLREARMNHPKGLTVDDRGNIYVADTLNMAIRKISDAGVTTIAGG 180
Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
+ G DGP+++A FSNDF++ ++ C+LL+ D GNQ IR+I L +DC+ SG
Sbjct: 181 KLGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHFDDCAYQYGSG 240
Query: 201 SALGAVSVWVLVSVLSCLVSLV 222
LG ++V + +++L+
Sbjct: 241 FPLG-IAVLIAAGFFGYMLALL 261
>gi|11994597|dbj|BAB02652.1| unnamed protein product [Arabidopsis thaliana]
Length = 511
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 120/208 (57%), Gaps = 8/208 (3%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
L E+GY V TV++G+ + + P+ I +L +D S ++ PLS+ S + +
Sbjct: 71 LQFENGYLVETVVEGNDIGVVPYK-IRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLV 129
Query: 86 AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
AG G G++DG+P ARF+ P+ +D KGN+YVAD N IRKI + GVTTIAGG S
Sbjct: 130 AGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKS 189
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
G DGP+++A FSNDF++ +V C+LL+ D GN +RQI+L EDC S +L
Sbjct: 190 NIAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDCDYQDDSSISL 249
Query: 204 GAVSVWVL-----VSVLSCLVSLVIGFV 226
V + VL V + LVIG V
Sbjct: 250 TGVDLRVLTEDHCVLTFGADILLVIGAV 277
>gi|147798773|emb|CAN69794.1| hypothetical protein VITISV_022544 [Vitis vinifera]
Length = 491
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 125/202 (61%), Gaps = 5/202 (2%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E GYTV TV DG +L I P+S I+ GS +L++LDS+ S Y +S LS+ + K
Sbjct: 62 SMMKFESGYTVETVFDGSKLGIEPYS-IEVLGSGELLILDSANSNLYKISSSLSQYTRPK 120
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
+ G +G G+ DG AR + PK VD +GNIYVAD N IRKI++ GVTTIAGG
Sbjct: 121 LVTGSAEGYSGHVDGRLREARMNHPKGLTVDDRGNIYVADTLNMAIRKISDAGVTTIAGG 180
Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
+ G DGP+++A FSNDF++ ++ C+LL+ D GNQ IR+I L +DC+ SG
Sbjct: 181 KLGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHFDDCAYQYGSG 240
Query: 201 SALGAVSVWVLVSVLSCLVSLV 222
LG ++V + +++L+
Sbjct: 241 FPLG-IAVLIAAGFFGYMLALL 261
>gi|224117390|ref|XP_002331700.1| predicted protein [Populus trichocarpa]
gi|222874306|gb|EEF11437.1| predicted protein [Populus trichocarpa]
Length = 495
Score = 143 bits (360), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 78/177 (44%), Positives = 109/177 (61%), Gaps = 7/177 (3%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L EDGY V TV++G+ + + P+ + + G +L +D S ++ PLS+ S R
Sbjct: 61 LQFEDGYLVETVVEGNAMGVVPYKIRVSEDG--ELYAVDEVNSNVVKITPPLSQYSRA-R 117
Query: 85 LAGDGVQGYS---DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
LA QGY+ DG+P ARF+ P+ +D KGNIYVAD N IRKI + GVTTIAGG
Sbjct: 118 LAAGSFQGYTGHIDGKPNEARFNHPRGLTMDDKGNIYVADTLNLAIRKIGDAGVTTIAGG 177
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
S G DGP+++A FSNDF++ ++ C+LL+ D GN +RQI+L EDC S+
Sbjct: 178 KSNVAGFRDGPSEDAKFSNDFDVVYLHSTCSLLVVDRGNAALRQISLNQEDCDYQSK 234
>gi|255567088|ref|XP_002524526.1| conserved hypothetical protein [Ricinus communis]
gi|223536200|gb|EEF37853.1| conserved hypothetical protein [Ricinus communis]
Length = 484
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 80/204 (39%), Positives = 119/204 (58%), Gaps = 7/204 (3%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L EDGY V TV++G+++ + P+ + + G +L +D S ++ PLS+ S +
Sbjct: 60 LQFEDGYLVETVVEGNEIGVVPYKIRVSEDG--ELYAVDEVNSNIVKITPPLSQYSRARL 117
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
+AG G +G+ DG+ ARF+ PK +D KGN+YVAD N IRKI + GVTTIAGG
Sbjct: 118 VAGSFQGYKGHVDGKSNEARFNHPKGITMDDKGNVYVADTLNLAIRKIGDAGVTTIAGGK 177
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S G DGP+++A FS DF++ +V C+LL+ D GN +RQI+L EDC S S +
Sbjct: 178 SNTAGYRDGPSEDAKFSTDFDVVYVHSTCSLLVVDRGNVALRQISLNQEDCDYQSSSITV 237
Query: 203 LGAVSV--WVLVSVLSCLVSLVIG 224
+ V V ++C++ G
Sbjct: 238 TDLLMVVGAVFTGYVTCMLQQGFG 261
>gi|356565931|ref|XP_003551189.1| PREDICTED: uncharacterized protein LOC100781836 [Glycine max]
Length = 493
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 82/184 (44%), Positives = 115/184 (62%), Gaps = 4/184 (2%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E GY V TV DG +L I P++V P + +L++LDS+ S Y +S LS S K
Sbjct: 66 SMMKFESGYNVETVFDGSKLGIEPYAVEVLP-NGELLILDSANSNIYRISSSLSLYSRPK 124
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
+AG +G G+ DG+ AR + PK VD +GNIYVAD +N IRKI++ GVTTIAGG
Sbjct: 125 LVAGSAEGYSGHVDGKLREARMNHPKGITVDDRGNIYVADTTNMAIRKISDSGVTTIAGG 184
Query: 142 -GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
++ G DGP++ A FS+D ++ +V C+LL+ D GN+ IR+I L +DC+ SG
Sbjct: 185 KWNRGGGHVDGPSEEAKFSDDLDVVYVGSSCSLLVIDRGNRAIREIQLHFDDCAYQYGSG 244
Query: 201 SALG 204
LG
Sbjct: 245 FPLG 248
>gi|297834352|ref|XP_002885058.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297330898|gb|EFH61317.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 121/205 (59%), Gaps = 8/205 (3%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
L E+GY V TV++G+++ + P+ + + G +L +D S ++ PLS+ S +
Sbjct: 67 LQFENGYLVETVVEGNEIGVVPYKIRVSHDG--ELYAVDELNSNIMKITPPLSQYSRGRL 124
Query: 85 LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
+AG G G++DG+P ARF+ P+ +D KGN+YV D N IRKI + GVTTIAGG
Sbjct: 125 VAGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVGDTLNLAIRKIGDSGVTTIAGGK 184
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S G DGP+++A FSNDF++ +V C+LL+ D GN +RQI+L EDC S +
Sbjct: 185 SNIAGYRDGPSEDAKFSNDFDVVYVRSTCSLLVIDRGNAALRQISLSDEDCDYQDDSSIS 244
Query: 203 LGAVSV---WVLVSVLSCLVSLVIG 224
L + + VL+ +CL+ G
Sbjct: 245 LTDILLVIGAVLIGYATCLLQQGFG 269
>gi|42572439|ref|NP_974315.1| NHL domain-containing protein [Arabidopsis thaliana]
gi|332642057|gb|AEE75578.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 493
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 80/204 (39%), Positives = 119/204 (58%), Gaps = 6/204 (2%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
L E+GY V TV++G+ + + P+ I +L +D S ++ PLS+ S + +
Sbjct: 71 LQFENGYLVETVVEGNDIGVVPYK-IRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLV 129
Query: 86 AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
AG G G++DG+P ARF+ P+ +D KGN+YVAD N IRKI + GVTTIAGG S
Sbjct: 130 AGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKS 189
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
G DGP+++A FSNDF++ +V C+LL+ D GN +RQI+L EDC S +L
Sbjct: 190 NIAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDCDYQDDSSISL 249
Query: 204 GAVSV---WVLVSVLSCLVSLVIG 224
+ + VL+ +C++ G
Sbjct: 250 TDILLVIGAVLIGYATCMLQQGFG 273
>gi|22331093|ref|NP_188104.2| NHL domain-containing protein [Arabidopsis thaliana]
gi|20260314|gb|AAM13055.1| unknown protein [Arabidopsis thaliana]
gi|25083516|gb|AAN72090.1| unknown protein [Arabidopsis thaliana]
gi|332642056|gb|AEE75577.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 492
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 80/204 (39%), Positives = 119/204 (58%), Gaps = 6/204 (2%)
Query: 26 LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
L E+GY V TV++G+ + + P+ I +L +D S ++ PLS+ S + +
Sbjct: 71 LQFENGYLVETVVEGNDIGVVPYK-IRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLV 129
Query: 86 AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
AG G G++DG+P ARF+ P+ +D KGN+YVAD N IRKI + GVTTIAGG S
Sbjct: 130 AGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKS 189
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
G DGP+++A FSNDF++ +V C+LL+ D GN +RQI+L EDC S +L
Sbjct: 190 NIAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDCDYQDDSSISL 249
Query: 204 GAVSV---WVLVSVLSCLVSLVIG 224
+ + VL+ +C++ G
Sbjct: 250 TDILLVIGAVLIGYATCMLQQGFG 273
>gi|357485963|ref|XP_003613269.1| hypothetical protein MTR_5g034750 [Medicago truncatula]
gi|355514604|gb|AES96227.1| hypothetical protein MTR_5g034750 [Medicago truncatula]
Length = 589
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 85/202 (42%), Positives = 116/202 (57%), Gaps = 22/202 (10%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLS------ 77
S + E GY V TV DG +L I P++V + + +L++LDS S Y +S LS
Sbjct: 76 SMMKFESGYNVETVFDGSKLGIEPYAV-EVLSNGELLILDSENSNIYKISSSLSLCKCAN 134
Query: 78 ------------EESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
S K +AG +G G+ DG+ AR + PK VD +GNIYVAD
Sbjct: 135 VITSPMMSLVAPTNSRPKLVAGSAEGYSGHVDGKLREARMNHPKGITVDDRGNIYVADIM 194
Query: 124 NHVIRKITNLGVTTIAGGG-SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
N IRKI++ GVTTIAGG S+ G DGP++ A FSNDF++ +V C+LL+ D GNQ
Sbjct: 195 NMAIRKISDSGVTTIAGGKLSRGGGHVDGPSEEAKFSNDFDVVYVGSSCSLLVIDRGNQA 254
Query: 183 IRQINLKPEDCSKSSQSGSALG 204
IR+I L+ +DC+ +SG LG
Sbjct: 255 IREIQLRFDDCAYQYESGFPLG 276
>gi|9755656|emb|CAC01808.1| putative protein [Arabidopsis thaliana]
Length = 733
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/169 (43%), Positives = 105/169 (62%), Gaps = 5/169 (2%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GY + TV DG +L I P+++ P +LIVLDS S + +S PLS K L+G
Sbjct: 54 ESGYNMETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYGKPKLLSGS 113
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
+G G+ DG+ AR ++P+ A+D +GNIYVAD N IRKI++ GV+TIA GG
Sbjct: 114 QEGYTGHVDGKLKEARMNRPRGLAMDDRGNIYVADTINMAIRKISDDGVSTIAAGGRWSG 173
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
G + ++ FS+DF+L +V C+LL+ D GNQLI++I L DCS+
Sbjct: 174 GSKE---ESMRFSDDFDLIYVSSSCSLLVIDRGNQLIKEIQLHDHDCSQ 219
>gi|30685100|ref|NP_196993.2| NHL domain-containing protein [Arabidopsis thaliana]
gi|332004703|gb|AED92086.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 754
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/169 (43%), Positives = 105/169 (62%), Gaps = 5/169 (2%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GY + TV DG +L I P+++ P +LIVLDS S + +S PLS K L+G
Sbjct: 75 ESGYNMETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYGKPKLLSGS 134
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
+G G+ DG+ AR ++P+ A+D +GNIYVAD N IRKI++ GV+TIA GG
Sbjct: 135 QEGYTGHVDGKLKEARMNRPRGLAMDDRGNIYVADTINMAIRKISDDGVSTIAAGGRWSG 194
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
G + ++ FS+DF+L +V C+LL+ D GNQLI++I L DCS+
Sbjct: 195 GSKE---ESMRFSDDFDLIYVSSSCSLLVIDRGNQLIKEIQLHDHDCSQ 240
>gi|224140207|ref|XP_002323476.1| predicted protein [Populus trichocarpa]
gi|222868106|gb|EEF05237.1| predicted protein [Populus trichocarpa]
Length = 453
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 85/201 (42%), Positives = 119/201 (59%), Gaps = 5/201 (2%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
S + E GYTV TV DG +L I P+SV P S +L++LDS S Y +S S+ K
Sbjct: 7 SMMKFEGGYTVETVFDGSKLGIEPYSVEVSP-SGELLLLDSENSNIYKISTQFSKYGRPK 65
Query: 84 RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
+ G +G G+ DG+ AR + PK VD GNIYVAD N IRKI T+ GVTTIAG
Sbjct: 66 LITGSFEGYPGHVDGKLRDARMNHPKGLTVDEGGNIYVADTMNMAIRKISTDGGVTTIAG 125
Query: 141 G-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQS 199
G ++ G DGP+++A FSNDF++ + C+LLI D G+Q IR+I L +DC+
Sbjct: 126 GKWARGGGHVDGPSEDAKFSNDFDVVYSASSCSLLIIDRGSQAIREIQLHDDDCNYPHDD 185
Query: 200 GSALGAVSVWVLVSVLSCLVS 220
L ++ + ++ L+ LV+
Sbjct: 186 CFHLDLDNILINIAGLAVLVA 206
>gi|297807503|ref|XP_002871635.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317472|gb|EFH47894.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 704
Score = 134 bits (337), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 70/168 (41%), Positives = 104/168 (61%), Gaps = 5/168 (2%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GY + TV DG +L I P+++ P +LIVLDS S + +S PLS + K ++G
Sbjct: 75 ESGYNIETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYAKPKLVSGS 134
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
+G G+ DG+ A+ ++P+ A+D GNIYVAD +N IRKI++ GV+TI GG
Sbjct: 135 QEGYTGHVDGKLKEAKMNRPRGLAIDDSGNIYVADTNNMAIRKISDDGVSTITAGGRWSG 194
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
G + ++ FS+DF+L +V C+LL+ D GNQ+I++I L DCS
Sbjct: 195 GSKE---ESMRFSDDFDLIYVSSSCSLLVIDRGNQMIKEIQLHDHDCS 239
>gi|317015034|gb|ADU85604.1| NHL repeat-containing protein, partial [Wolffia arrhiza]
Length = 239
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 76/174 (43%), Positives = 109/174 (62%), Gaps = 3/174 (1%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
E GY V T+ DG++L + PH++ P +LI +DS S ++ PLS S + +AG
Sbjct: 60 FESGYLVETIADGNRLGLTPHAIRVSP-DGELIAVDSVNSNIVRITPPLSAFSRGRLVAG 118
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
G G DG+P ARF++P+ AVD +GNIYVAD SN IRKI + GV+TIAGG +
Sbjct: 119 SFQGRSGLIDGKPSEARFNQPRGVAVDRRGNIYVADVSNLAIRKIGDSGVSTIAGGKAGA 178
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQS 199
G DGP++ A FS DF++ +V +C+LL+ D GN +R+I L +DC++ S
Sbjct: 179 AGFRDGPSEEARFSADFDVVYVKSLCSLLVVDRGNAALRKIFLHDDDCTQDFSS 232
>gi|302768697|ref|XP_002967768.1| hypothetical protein SELMODRAFT_67906 [Selaginella moellendorffii]
gi|300164506|gb|EFJ31115.1| hypothetical protein SELMODRAFT_67906 [Selaginella moellendorffii]
Length = 186
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/180 (40%), Positives = 106/180 (58%), Gaps = 6/180 (3%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
L+ + +L+E G+TV T+ D + N ++ P + ++ LDSS + + L PLS+ S
Sbjct: 7 LLKTFVLVEPGFTVKTLFDLGKHSANAFALYPDPRNKFVLALDSSGNRIWKLRLPLSQNS 66
Query: 81 VVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
++ AG G GY DG + F++P+S A+ G ++VAD N IRKI+ G VTT
Sbjct: 67 SLEAFAGSFVGESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKISKDGEVTT 126
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDCSKS 196
IAGG S+K G ADGP A FS++F L +LLI+D GN+LIR+I + P+ C S
Sbjct: 127 IAGGSSRKPGFADGPGDTARFSSEFSLAC--SCGSLLIADRGNRLIREIQIDDPKSCDSS 184
>gi|168018878|ref|XP_001761972.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686689|gb|EDQ73076.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 386
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/167 (43%), Positives = 100/167 (59%), Gaps = 11/167 (6%)
Query: 31 GYTVTTVIDGHQLEINPHSV---IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
GY VTTV++G+ ++ + + DR + IVLDS++S + PLS++SVV+ +AG
Sbjct: 44 GYDVTTVLNGNLRGLSFYCIDEATDR-APAWAIVLDSTKSKVLRVQLPLSQDSVVEHIAG 102
Query: 88 --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTTIAGG 141
+G GY DG G A F+ PK +D GNIYVAD N IR IT G VTTIA G
Sbjct: 103 SLEGKAGYQDGRGGDALFNHPKMLTLDSDGNIYVADVRNTAIRMITTQGKHSFVTTIA-G 161
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G + G DG + +FSNDF +T++ C LLI D GN+++R + L
Sbjct: 162 GMNRTGHNDGEGRVVTFSNDFGVTYLRKNCTLLIVDRGNRMVRAMKL 208
>gi|357516309|ref|XP_003628443.1| hypothetical protein MTR_8g058300 [Medicago truncatula]
gi|355522465|gb|AET02919.1| hypothetical protein MTR_8g058300 [Medicago truncatula]
Length = 160
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/122 (48%), Positives = 76/122 (62%), Gaps = 1/122 (0%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
G G+ DG+ ARF PK A+D KGN+YVAD N IRKI + GVTTIAGG S G
Sbjct: 24 GRTGHVDGKLSDARFHYPKGIALDDKGNVYVADTQNMAIRKIGDAGVTTIAGGKSNVAGY 83
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSV 208
DGP ++A SNDF++ ++ C+LL+ D GN +RQI L EDC+ S S S G+
Sbjct: 84 RDGPGEDAKLSNDFDVVYIRPTCSLLVIDRGNAALRQIFLNQEDCNYQSSSISLTGS-EF 142
Query: 209 WV 210
W+
Sbjct: 143 WI 144
>gi|125561542|gb|EAZ06990.1| hypothetical protein OsI_29235 [Oryza sativa Indica Group]
Length = 498
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 58/100 (58%), Positives = 72/100 (72%), Gaps = 3/100 (3%)
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
TTIAGG SK GR DGPAQNA+FS DFEL +VP +CALLI+D GN+LIRQINLK EDC+
Sbjct: 222 TTTIAGGRSKGPGRKDGPAQNATFSPDFELVYVPKMCALLITDRGNRLIRQINLKREDCA 281
Query: 195 KSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRH 234
+ +Q G LG SV +++VL L+ V+GF R + H
Sbjct: 282 RETQPG--LGTTSV-SIIAVLCALLGSVVGFSVRHFYPAH 318
>gi|357516379|ref|XP_003628478.1| hypothetical protein MTR_8g058630 [Medicago truncatula]
gi|355522500|gb|AET02954.1| hypothetical protein MTR_8g058630 [Medicago truncatula]
Length = 150
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 73/116 (62%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
G G+ DG+ ARF PK A+D KGN+YVAD N IRKI + GVTTIAGG S G
Sbjct: 24 GRTGHVDGKLSDARFHYPKGIALDDKGNVYVADTQNMAIRKIGDAGVTTIAGGKSNVAGY 83
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
DGP ++A SNDF++ ++ C+LL+ D GN +RQI L EDC+ S S S G
Sbjct: 84 RDGPGEDAKLSNDFDVVYIRPTCSLLVIDRGNAALRQIFLNQEDCNYQSSSISLTG 139
>gi|302821461|ref|XP_002992393.1| hypothetical protein SELMODRAFT_47902 [Selaginella moellendorffii]
gi|300139809|gb|EFJ06543.1| hypothetical protein SELMODRAFT_47902 [Selaginella moellendorffii]
Length = 183
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 68/180 (37%), Positives = 102/180 (56%), Gaps = 9/180 (5%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
L+ + +L+E G+TV T+ D + N ++ P ++ LDSS + + L PLS+ S
Sbjct: 7 LLKTFVLVEPGFTVKTLFDLGKHSANAFALYPDPRKKFVLALDSSGNRIWKLRLPLSQNS 66
Query: 81 VVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
++ AG G GY DG + F++P+S ++ G ++VAD N IRKI+ G VTT
Sbjct: 67 SLEAFAGSFVGESGYVDGPAAKSLFNRPQSLSICDNGAVFVADTRNLAIRKISKDGEVTT 126
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDCSKS 196
IAGG S+K G A A FS++F L +LLI+D GN+LIR+I + P+ C S
Sbjct: 127 IAGGSSRKPGFA---GDTARFSSEFSL--ACSCGSLLIADCGNRLIREIQIDDPKSCDSS 181
>gi|302791603|ref|XP_002977568.1| hypothetical protein SELMODRAFT_417419 [Selaginella moellendorffii]
gi|300154938|gb|EFJ21572.1| hypothetical protein SELMODRAFT_417419 [Selaginella moellendorffii]
Length = 288
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 107/201 (53%), Gaps = 25/201 (12%)
Query: 31 GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
G+TV T+ D + +N + P + ++ DSS + + L PLS+ S ++ AG
Sbjct: 38 GFTVKTLFDLGKHSVNAFGLYPDPRNKFMLAPDSSGNRIWKLRLPLSQNSSLEAFAG-WE 96
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
GY DG + F++P+S A+ G I+VAD + IRKI+ VTTIAGG S+K G AD
Sbjct: 97 SGYIDGPAAKSLFNRPQSLAMCGNGAIFVAD-TRMAIRKISK--VTTIAGGSSRKPGIAD 153
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDC--SKSSQSGSA----- 202
G A+ FS++F L +L I+D GN LIR+I + P+ C S S+ SGS
Sbjct: 154 GTAR---FSSEFSL--ACSCGSLPIADRGNWLIREIQIDDPKSCDSSDSTVSGSQKCVVL 208
Query: 203 --------LGAVSVWVLVSVL 215
LGA +W ++ L
Sbjct: 209 LGVCLGIPLGAFIIWKILDQL 229
>gi|238009310|gb|ACR35690.1| unknown [Zea mays]
Length = 340
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 70/118 (59%), Gaps = 14/118 (11%)
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
GN+YVAD +N IRKI GVTTIAGG S G DGP+++A FS DF++ +V +C+LL
Sbjct: 2 GNVYVADTANLAIRKIGESGVTTIAGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLL 61
Query: 175 ISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
+ D GN +R+I+L EDC+ + ++LS + LVIG V YI
Sbjct: 62 VIDRGNAALRKISLPQEDCT--------------YQDSALLSSDLILVIGAVVAGYIF 105
>gi|413944258|gb|AFW76907.1| hypothetical protein ZEAMMB73_849037 [Zea mays]
Length = 187
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 65/108 (60%), Gaps = 3/108 (2%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GY V TV DG +L I PH+V P + +L+VLDS S Y + PLS S K LAG
Sbjct: 73 EGGYAVDTVFDGSKLGIEPHAVEITP-AGNLLVLDSINSNIYRVQLPLSPYSRPKLLAGS 131
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
G+ G+ DG AR + PK F VD +GNIYVAD N IRKI++ G
Sbjct: 132 PKGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADGMNMAIRKISDTG 179
>gi|302768701|ref|XP_002967770.1| hypothetical protein SELMODRAFT_17931 [Selaginella moellendorffii]
gi|300164508|gb|EFJ31117.1| hypothetical protein SELMODRAFT_17931 [Selaginella moellendorffii]
Length = 128
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 54/123 (43%), Positives = 75/123 (60%), Gaps = 7/123 (5%)
Query: 79 ESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--LG 134
+S ++ LAG G GY DG + F++P+S A+ G ++VAD N IRKI+ G
Sbjct: 3 DSSLEALAGSLVGESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKISKDGEG 62
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDC 193
+TTIAGG S+K G ADGP A FS++F L +LLI+D GN+LIR+I + P+ C
Sbjct: 63 MTTIAGGSSRKPGFADGPGDTARFSSEFRLAC--SCGSLLIADRGNRLIREIQIDDPKSC 120
Query: 194 SKS 196
S
Sbjct: 121 DSS 123
>gi|195609352|gb|ACG26506.1| hypothetical protein [Zea mays]
gi|414870563|tpg|DAA49120.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 159
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 73/117 (62%), Gaps = 5/117 (4%)
Query: 27 LLEDGYTVTTVIDGHQLEIN---PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
+LEDGYTVTT D + L + P++V+ R + DL++LDS+ SA YT+S S +
Sbjct: 28 VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISL-SSSPGEPR 86
Query: 84 RLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
RLAG G++DG PG A FD+P+S AVD N+YVAD+ + +RK+ G T A
Sbjct: 87 RLAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGERTRA 143
>gi|373488690|ref|ZP_09579354.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
gi|372005635|gb|EHP06271.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
Length = 438
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 73/138 (52%), Gaps = 3/138 (2%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRL-AG-DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
L V D++R L P S+ +V + L AG DG+ G D SA F+ P+ A+D GN
Sbjct: 132 LYVSDATRHTISKL-VPNSDGTVTQSLLAGSDGISGSEDKSGSSASFNSPEGLALDASGN 190
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
+YVAD NH IRKI LG T G ++ G DG +A F++ L F L ++
Sbjct: 191 LYVADYGNHTIRKIDTLGAVTTFAGQAEASGTVDGDRLSARFNHPIGLAFNASYSVLYVA 250
Query: 177 DHGNQLIRQINLKPEDCS 194
D GN IR IN+K + S
Sbjct: 251 DSGNHTIRAINIKSQTVS 268
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 3/107 (2%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
S + AGD G ++G ARF++P + G + VAD NH IRK++ G +
Sbjct: 321 STLAGYAGD--TGTANGTGNGARFNQPVGITLHTSGYLLVADAYNHAIRKVSTSGSVSTL 378
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G S G D A FS + V ++D+ N LIR I
Sbjct: 379 AGESGVSGNEDESGSEAHFSRPSNIC-VDSSGIAYVTDYKNGLIRTI 424
Score = 40.0 bits (92), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 50/111 (45%), Gaps = 8/111 (7%)
Query: 96 GEPG--SARFDKPKSFAVDMKGNI---YVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
G PG S+ P AV ++ YV D + IRKIT+ G + G + G A+
Sbjct: 275 GSPGTSSSMLSSPNGVAVSTSDDVDTLYVTDYGSSTIRKITSSGGISTLAGYAGDTGTAN 334
Query: 151 GPAQNASFSNDFELTFVPHICA-LLISDHGNQLIRQINLKPEDCSKSSQSG 200
G A F+ +T H LL++D N IR+++ + + +SG
Sbjct: 335 GTGNGARFNQPVGITL--HTSGYLLVADAYNHAIRKVSTSGSVSTLAGESG 383
>gi|255531264|ref|YP_003091636.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
gi|255344248|gb|ACU03574.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
Length = 439
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 78/142 (54%), Gaps = 7/142 (4%)
Query: 64 SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
S+ +T + E +V AGDG G +G A+F P+ AVD +GN+ VAD+
Sbjct: 101 SAEGPVFTYKAAPASEYIVSTFAGDGTAGLFNGAAADAQFRNPEGLAVDAQGNVIVADRQ 160
Query: 124 NHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
NH IRKI+ G VTTIAG G+ G ADG A FS+ ++L P + ++++D N
Sbjct: 161 NHTIRKISPAGQVTTIAGDGT--AGYADGTGTAAKFSSPWKLAIDP-LGNIIVADRDNFK 217
Query: 183 IRQINLKPEDCSKSSQSGSALG 204
IR+I D + ++ +GS G
Sbjct: 218 IRKI---APDGAVTTLAGSTAG 236
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 62/112 (55%), Gaps = 4/112 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+S V LAGDG GY D A+F P VD +GNI VAD+ NH IRKIT G
Sbjct: 274 VSPSGTVTTLAGDGTAGYLDATGVQAKFRNPSGLTVDQQGNIIVADRLNHRIRKITPAGT 333
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V+TIAG G+ G DG A A F++ + + V +L+++ N IR+I
Sbjct: 334 VSTIAGAGT--TGLLDGDALTAKFADPYGVA-VDAGGNILVAELTNARIRKI 382
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 61/112 (54%), Gaps = 5/112 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ + V LAG G++DG +A+F +P D +GNIYVAD + H IRK++ G
Sbjct: 221 IAPDGAVTTLAGS-TAGFADGTGSAAKFMQPLDVVADAQGNIYVADNTAHRIRKVSPSGT 279
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTT+AG G+ G D A F N LT V ++++D N IR+I
Sbjct: 280 VTTLAGDGT--AGYLDATGVQAKFRNPSGLT-VDQQGNIIVADRLNHRIRKI 328
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 63/114 (55%), Gaps = 6/114 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ V +AG G G DG+ +A+F P AVD GNI VA+ +N IRKIT +G
Sbjct: 328 ITPAGTVSTIAGAGTTGLLDGDALTAKFADPYGVAVDAGGNILVAELTNARIRKITPVGQ 387
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSN--DFELTFVPHICALLISDHGNQLIRQI 186
V+T+AG G ADG + NA F+ D ++ +I +++H ++IR I
Sbjct: 388 VSTLAG---SSAGFADGLSVNAKFNQPTDLDIDAKGNIYVAEVTNHRIRMIRLI 438
>gi|414870491|tpg|DAA49048.1| TPA: hypothetical protein ZEAMMB73_516870 [Zea mays]
Length = 271
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 100/200 (50%), Gaps = 42/200 (21%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
E GYTV ++D +L I P+S+ GS +++++DS+ Y+++ PLS +S K +AG
Sbjct: 105 ECGYTVEKILDYSKLGIEPYSMEVTQGS-EILIMDSN---IYSMALPLSRDSRPKLVAGS 160
Query: 88 -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
+G G+ DG R PK VD KG +
Sbjct: 161 LEGFPGHVDGNLREGRIYHPKGVTVDDKG-----------------------------RG 191
Query: 147 GRADGPAQNASFSNDFELTF-VPHICALLISDHGNQLIRQINLKPEDCSKSSQS----GS 201
G+ DGP+ A S DFE+ + + C+LL+ D GNQ IR+I L +DC ++ G
Sbjct: 192 GQVDGPSDEAKSSTDFEVCYYIGSSCSLLVIDRGNQTIREIQLHFDDCVYQHEADFPLGV 251
Query: 202 ALGAVSVWV--LVSVLSCLV 219
AL AV+ ++ ++++L C V
Sbjct: 252 ALLAVAAFLGYMLALLQCQV 271
>gi|431798812|ref|YP_007225716.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
gi|430789577|gb|AGA79706.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
Length = 428
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 61/172 (35%), Positives = 90/172 (52%), Gaps = 15/172 (8%)
Query: 37 VIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
+IDG ++ NP V P LIV D + ++ ++ + V + G G G+
Sbjct: 121 LIDGSGIQAAFRNPEGVTMHPDGY-LIVTDRANNSIR----KVTTDGAVSTVLGTGNSGF 175
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGP 152
+G SA D P VDM+GNIYVAD+ NH+IRKI G V+T+AG G + G ADGP
Sbjct: 176 QNGPVASALLDYPWKSCVDMEGNIYVADRDNHMIRKIDPQGMVSTVAGTG--EAGFADGP 233
Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
A+ A F ++ L ++D+ N IR+I + D + S+ +GS G
Sbjct: 234 AEEAQFDQPLDIAVTAE-GVLYVTDNRNHRIRKIEV---DGTVSTVAGSEQG 281
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 56/105 (53%), Gaps = 5/105 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
V +AGDG QG DG+ +A+F+ P +V G + VAD SNH IR I V TIAG
Sbjct: 326 VSTVAGDGSQGTRDGQVMTAQFNNPYGISVADNGQLVVADLSNHKIRLIQGENVITIAG- 384
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G DG + F N ++T+ H + ++D GN +R+I
Sbjct: 385 --SVAGFLDGVGVTSQFYNPTDVTY--HDGVIYVADLGNHRVRKI 425
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/98 (43%), Positives = 55/98 (56%), Gaps = 5/98 (5%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEGR 148
QG DG A F P VD GNIYVAD+ NH+IRKI N G V+T+AG GS +G
Sbjct: 280 QGNQDGALEEATFRYPSGLDVDDMGNIYVADRINHLIRKIDLNAGQVSTVAGDGS--QGT 337
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
DG A F+N + ++ V L+++D N IR I
Sbjct: 338 RDGQVMTAQFNNPYGIS-VADNGQLVVADLSNHKIRLI 374
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 55/108 (50%), Gaps = 4/108 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
VV LAG G DG A F P+ + G + V D++N+ IRK+T G V+T+
Sbjct: 109 VVSTLAGSSDYGLIDGSGIQAAFRNPEGVTMHPDGYLIVTDRANNSIRKVTTDGAVSTVL 168
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G G+ G +GP +A ++ + V + ++D N +IR+I+
Sbjct: 169 GTGNS--GFQNGPVASALLDYPWK-SCVDMEGNIYVADRDNHMIRKID 213
>gi|302821467|ref|XP_002992396.1| hypothetical protein SELMODRAFT_135182 [Selaginella moellendorffii]
gi|300139812|gb|EFJ06546.1| hypothetical protein SELMODRAFT_135182 [Selaginella moellendorffii]
Length = 161
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 78/143 (54%), Gaps = 12/143 (8%)
Query: 74 FPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
FPL +S ++ AG G GY DG + F++P+S ++ G ++VAD N IRKI+
Sbjct: 14 FPL--DSSLEAFAGSFVGESGYVDGPAAKSLFNRPQSLSICDNGAVFVADTRNLAIRKIS 71
Query: 132 NL--GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL- 188
VTTIAGG S+K G A A FS++F L +LLI+D GN+LIR+I +
Sbjct: 72 KFLPTVTTIAGGSSRKPGFA---GDTARFSSEFSL--ACSCGSLLIADCGNRLIREIQID 126
Query: 189 KPEDCSKSSQSGSALGAVSVWVL 211
P+ C S + S + W+
Sbjct: 127 DPKSCDSSDSAVSGDFSSFTWIF 149
>gi|330824371|ref|YP_004387674.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
gi|329309743|gb|AEB84158.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
Length = 362
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 67/106 (63%), Gaps = 4/106 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V LAG+G G++DG+ +ARF++P + +D +G ++V D+ NH +R++T G VTT+AG
Sbjct: 199 VDTLAGNGAPGFADGQRATARFNQPYNVRIDGQGRLWVPDQLNHAVRRVTPAGEVTTVAG 258
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G K G ADGPA F N + +P+ A++++D N +R +
Sbjct: 259 AG--KAGYADGPAATVRFDNPTGVAPLPN-GAVVVADRNNNRLRLV 301
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 66/114 (57%), Gaps = 3/114 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
L V +AG G G++DG +ARF++P + AV+ G +YVAD++NH IRKI G
Sbjct: 83 LRSGGTVATVAGTGKAGFADGPAHAARFNEPIAVAVERSGAVYVADRNNHRIRKIRPDGT 142
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
V T+AGG S G DG ++A F+ + + L ++D+ N IR+I+L
Sbjct: 143 VITLAGGDS--AGFVDGDLKSARFNQPYGVALDAAQTTLYVADYLNHAIRRIDL 194
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 72/137 (52%), Gaps = 13/137 (9%)
Query: 71 TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
TL FP V+ LAGDG G +DG PG R ++P + G++Y AD+ NH +R +
Sbjct: 31 TLPFPE-----VRTLAGDGQPGAADGSPG--RVNRPHGLSYGNDGSLYFADRGNHQVRVL 83
Query: 131 TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ G V T+AG G K G ADGPA A F+ + V A+ ++D N IR+I +
Sbjct: 84 RSGGTVATVAGTG--KAGFADGPAHAARFNEPIAVA-VERSGAVYVADRNNHRIRKI--R 138
Query: 190 PEDCSKSSQSGSALGAV 206
P+ + G + G V
Sbjct: 139 PDGTVITLAGGDSAGFV 155
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 62/115 (53%), Gaps = 6/115 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V +AG G GY+DG + RFD P A G + VAD++N+ +R +T G V T+AG
Sbjct: 253 VTTVAGAGKAGYADGPAATVRFDNPTGVAPLPNGAVVVADRNNNRLRLVTPDGAVATLAG 312
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
G + G ADG A +A F+ ++ F + +L+S+ +R + P+D +
Sbjct: 313 AG--EAGFADGTAASARFNQPLDVEFDDSMSRVLVSEDKGHRLRVL---PKDTRR 362
>gi|223935789|ref|ZP_03627704.1| NHL repeat containing protein [bacterium Ellin514]
gi|223895390|gb|EEF61836.1| NHL repeat containing protein [bacterium Ellin514]
Length = 755
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 77/137 (56%), Gaps = 8/137 (5%)
Query: 69 FYTLSF-PLSEES--VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH 125
F +L+F PLS V LAG+ QG +DG SARF+ P AVD GN+YVAD +NH
Sbjct: 11 FLSLAFVPLSHSQPLTVSTLAGNAGQGSADGNNSSARFNLPGGVAVDKTGNLYVADTANH 70
Query: 126 VIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
IRKI+ V+T A G + G A+G A F N + V + ++D GN +IR+
Sbjct: 71 TIRKISGGVVSTFA-GLAGVSGSANGKGSAARF-NQPQGVAVDTNGIVYVADTGNHIIRK 128
Query: 186 INLKPEDCSKSSQSGSA 202
I L D + S+ +G+A
Sbjct: 129 IAL---DGTVSTLAGAA 142
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 48/132 (36%), Positives = 70/132 (53%), Gaps = 7/132 (5%)
Query: 61 VLDSSRSAFYTLSF-----PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
+D+S + F + +F ++ V LAG G G ++G G A+F P+ AVD
Sbjct: 377 AVDASGNVFVSDTFNHTIRKITAAGTVSTLAGTAGSSGTNNGVGGGAQFYAPQGIAVDTG 436
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
GN YVAD +N+VIRK+T+ G T G + EG+ DG NA FS + + +
Sbjct: 437 GNAYVADTANNVIRKVTSGGTVTTLAGTAGVEGQGDGTGSNAQFSGP-QAVALDGAANVY 495
Query: 175 ISDHGNQLIRQI 186
+SD GN IR+I
Sbjct: 496 VSDTGNHTIRKI 507
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 56/117 (47%), Gaps = 13/117 (11%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+S V LAG G +DG GSA+F PK AVD GN++V+D NH IRKIT G
Sbjct: 343 ISSGGAVTTLAGSASIGNADGTGGSAKFYWPKGTAVDASGNVFVSDTFNHTIRKITAAGT 402
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL------LISDHGNQLIRQI 186
+ G + G +G A F + P A+ ++D N +IR++
Sbjct: 403 VSTLAGTAGSSGTNNGVGGGAQF-------YAPQGIAVDTGGNAYVADTANNVIRKV 452
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 67/138 (48%), Gaps = 8/138 (5%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKS 108
+D+ G +L V D++ +S VV AG GV G ++G+ +ARF++P+
Sbjct: 55 AVDKTG--NLYVADTANHTIRKIS-----GGVVSTFAGLAGVSGSANGKGSAARFNQPQG 107
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
AVD G +YVAD NH+IRKI G + G + G + NA F +
Sbjct: 108 VAVDTNGIVYVADTGNHIIRKIALDGTVSTLAGAAGNPGTLNATGTNAQFYEPEAVAVNG 167
Query: 169 HICALLISDHGNQLIRQI 186
+ + ++D N IRQ+
Sbjct: 168 NGSLIYVADTWNHEIRQV 185
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 53/111 (47%), Gaps = 13/111 (11%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
V LAG G G ++G +A+F +P AV G +YVAD NH IR +T+ GV T
Sbjct: 240 VTTLAGSPGNYGSTNGTGSAAQFYQPMGVAVAANGTVYVADNLNHTIRAVTSGGVVTTLA 299
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQI 186
G + G DG NA F + P A + + D GN IRQI
Sbjct: 300 GLAGNYGSKDGTGSNARF-------YAPQGVAVSGSTVFVVDTGNGTIRQI 343
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 48/103 (46%), Gaps = 11/103 (10%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF- 158
+ARF P AVD GN+YVAD NH IRKIT G + G G ADG ++A F
Sbjct: 537 TARFYSPSGLAVDSSGNVYVADTGNHTIRKITADGSVSTLAGLPGVWGNADGTNRDARFF 596
Query: 159 ----------SNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
N F + H +LI+ N ++ I +P+
Sbjct: 597 QPEGISIDSQGNLFVMDSGNHTMRMLIASGTNWIVTTIAGQPD 639
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 58/120 (48%), Gaps = 19/120 (15%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---- 130
++ + V LAG GV G +DG ARF +P+ ++D +GN++V D NH +R +
Sbjct: 567 ITADGSVSTLAGLPGVWGNADGTNRDARFFQPEGISIDSQGNLFVMDSGNHTMRMLIASG 626
Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL------LISDHGNQLIR 184
TN VTTIA G G ADG A F + P L ++D GN IR
Sbjct: 627 TNWIVTTIA-GQPDLGGAADGTGNGAQF-------YYPGGLGLNNSGFFAVADSGNNTIR 678
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 56/117 (47%), Gaps = 7/117 (5%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ V LAG GV+G DG +A+F P++ A+D N+YV+D NH IRKI+ G
Sbjct: 452 VTSGGTVTTLAGTAGVEGQGDGTGSNAQFSGPQAVALDGAANVYVSDTGNHTIRKISPGG 511
Query: 135 VTTIAGGGSKKEGRADGPAQN-----ASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T G G D N A F + L V + ++D GN IR+I
Sbjct: 512 AVTTFAGFPGHPGNLDSNMDNNGTNTARFYSPSGLA-VDSSGNVYVADTGNHTIRKI 567
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 44/74 (59%), Gaps = 15/74 (20%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS-------ARFDKPKSFAVDMKGN-IYVADKSNHVI 127
++ + V LAG + G PG+ A+F +P++ AV+ G+ IYVAD NH I
Sbjct: 129 IALDGTVSTLAG------AAGNPGTLNATGTNAQFYEPEAVAVNGNGSLIYVADTWNHEI 182
Query: 128 RKITNLG-VTTIAG 140
R++T+ G VTT+AG
Sbjct: 183 RQVTSAGVVTTLAG 196
Score = 40.4 bits (93), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 40/95 (42%), Gaps = 5/95 (5%)
Query: 96 GEPG----SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G PG + F +P+ AV GNIYVAD N IR I G T G G +G
Sbjct: 196 GTPGVIGTGSLFYQPQGIAVGSDGNIYVADTGNGTIRVIPPGGSVTTLAGSPGNYGSTNG 255
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A F + + + ++D+ N IR +
Sbjct: 256 TGSAAQFYQPMGVAVAAN-GTVYVADNLNHTIRAV 289
>gi|124002516|ref|ZP_01687369.1| hypothetical protein M23134_05219 [Microscilla marina ATCC 23134]
gi|123992345|gb|EAY31713.1| hypothetical protein M23134_05219 [Microscilla marina ATCC 23134]
Length = 2385
Score = 77.4 bits (189), Expect = 6e-12, Method: Composition-based stats.
Identities = 51/140 (36%), Positives = 71/140 (50%), Gaps = 8/140 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-----GVQGYSDGEPGSARFDKPKSFA 110
+ +L V D + A ++ P VV+ +AG+ V G G A+F++P A
Sbjct: 663 AGNLYVADKNNHAIRMITNPSGGSPVVRTIAGNSSYPTAVSGNVTGALAVAKFNEPSGVA 722
Query: 111 VDMKGNIYVADKSNHVIRKITNLGVTTIAG---GGSKKEGRADGPAQNASFSNDFELTFV 167
VD GNIYVADK+NH I+KI N VTT+AG + GR DG A A F +
Sbjct: 723 VDAAGNIYVADKNNHRIKKIANGMVTTLAGPMNDAASIPGRTDGAADAARFFFPTSVALD 782
Query: 168 PHICALLISDHGNQLIRQIN 187
L ++D N +IRQ+N
Sbjct: 783 ITGAQLYVADKLNNIIRQVN 802
Score = 62.0 bits (149), Expect = 3e-07, Method: Composition-based stats.
Identities = 43/113 (38%), Positives = 56/113 (49%), Gaps = 9/113 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-----VT 136
V +AG GV G+ D A+F+ P A+DM GN+YVADK+NH IR ITN V
Sbjct: 631 VSIVAGSGVAGFKDDNGSLAQFNYPSGLALDMAGNLYVADKNNHAIRMITNPSGGSPVVR 690
Query: 137 TIAGGGSKK---EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
TIAG S G G A F N+ V + ++D N I++I
Sbjct: 691 TIAGNSSYPTAVSGNVTGALAVAKF-NEPSGVAVDAAGNIYVADKNNHRIKKI 742
Score = 56.6 bits (135), Expect = 1e-05, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 43/76 (56%), Gaps = 3/76 (3%)
Query: 86 AGD---GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
AGD G+ G+ DG SA+F P V+ G++Y+AD N VIRKI+ V TIAG
Sbjct: 812 AGDVANGIAGHQDGNAASAKFRSPAGITVNAVGDVYIADTHNQVIRKISQGQVITIAGEV 871
Query: 143 SKKEGRADGPAQNASF 158
+ +D A++A F
Sbjct: 872 EVADDDSDVLARDAKF 887
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 37/102 (36%), Positives = 57/102 (55%), Gaps = 5/102 (4%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
+ G +DG +ARF P S A+D+ G +YVADK N++IR++ T+ G G
Sbjct: 760 IPGRTDGAADAARFFFPTSVALDITGAQLYVADKLNNIIRQVNTADGHTLTYAGDVANGI 819
Query: 149 A---DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
A DG A +A F + +T V + + I+D NQ+IR+I+
Sbjct: 820 AGHQDGNAASAKFRSPAGIT-VNAVGDVYIADTHNQVIRKIS 860
>gi|296136309|ref|YP_003643551.1| NHL repeat containing protein [Thiomonas intermedia K12]
gi|295796431|gb|ADG31221.1| NHL repeat containing protein [Thiomonas intermedia K12]
Length = 366
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 66/117 (56%), Gaps = 2/117 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S + VV LAG G G+ DG ARF+ P +D +G IYV++ N+VIRKIT G
Sbjct: 196 ISAKGVVSTLAGSPGDTGWRDGRGAQARFNTPVGLTLDAQGQIYVSEYFNNVIRKITPDG 255
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
T G K G ADG + A F + L+F P +L+++D GN +R+I+ + E
Sbjct: 256 TVTTFAGKPGKGGFADGKVEEAQFLHPQTLSFAPD-GSLIVADTGNNRVRRISPQGE 311
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 57/112 (50%), Gaps = 2/112 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S E V LAG V+G++DG ARF+ P V+ KG +YVAD N +R+I+ G
Sbjct: 141 ISREGRVSTLAGKLEVEGFADGTGQQARFNHPVGLGVNAKGVVYVADAYNSTVRRISAKG 200
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V + G G DG A F+ LT + +S++ N +IR+I
Sbjct: 201 VVSTLAGSPGDTGWRDGRGAQARFNTPVGLTLDAQ-GQIYVSEYFNNVIRKI 251
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP 152
Y+ G+ ARF P+ A+D +GNI+VAD N V+RK+ G +I G ++ +GP
Sbjct: 49 YTGGDFAQARFHDPRGMALDAQGNIFVADYVNSVVRKLGTDGQVSIVAGQVEQRDARNGP 108
Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
A A F + E V L +SD G+ +R I+
Sbjct: 109 ALQARFYSP-ECVAVATDGTLFVSDSGSNTVRCIS 142
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 58/120 (48%), Gaps = 11/120 (9%)
Query: 80 SVVKRLAGDG----VQGY------SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
SVV++L DG V G +G ARF P+ AV G ++V+D ++ +R
Sbjct: 81 SVVRKLGTDGQVSIVAGQVEQRDARNGPALQARFYSPECVAVATDGTLFVSDSGSNTVRC 140
Query: 130 ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
I+ G + G + EG ADG Q A F++ L V + ++D N +R+I+ K
Sbjct: 141 ISREGRVSTLAGKLEVEGFADGTGQQARFNHPVGLG-VNAKGVVYVADAYNSTVRRISAK 199
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 44/73 (60%), Gaps = 2/73 (2%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ + V AG G G++DG+ A+F P++ + G++ VAD N+ +R+I+ G
Sbjct: 251 ITPDGTVTTFAGKPGKGGFADGKVEEAQFLHPQTLSFAPDGSLIVADTGNNRVRRISPQG 310
Query: 135 -VTTIAGGGSKKE 146
V+T+AG G+ ++
Sbjct: 311 EVSTLAGTGASEK 323
>gi|445496654|ref|ZP_21463509.1| NHL repeat containing protein [Janthinobacterium sp. HH01]
gi|444786649|gb|ELX08197.1| NHL repeat containing protein [Janthinobacterium sp. HH01]
Length = 741
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/132 (40%), Positives = 70/132 (53%), Gaps = 8/132 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+ +L V D+ +A ++ E VV LAGDG+ G DG +A+F+ P AVD G
Sbjct: 168 AGNLYVADTGNNAIRKVT----PEGVVSTLAGDGLPGDKDGRGAAAQFNGPVGIAVDAAG 223
Query: 116 NIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
+YVAD N IR+I N VTTIAGG + G+ADG A A F L L
Sbjct: 224 VVYVADTYNDRIRRIAPNGDVTTIAGG--SRAGKADGAAAQALFDTPTGLALSA-AGDLY 280
Query: 175 ISDHGNQLIRQI 186
I+D GN IR++
Sbjct: 281 IADTGNHAIRKL 292
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 69/132 (52%), Gaps = 7/132 (5%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ LAG GV+GY++G +A F+ P A+D GN+YVAD N+ IRK+T G V+T
Sbjct: 134 DGATTTLAG-GVEGYAEGAGKAAAFNTPSGLAIDAAGNLYVADTGNNAIRKVTPEGVVST 192
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI--NLKPEDCSK 195
+AG G G DG A F+ + V + ++D N IR+I N +
Sbjct: 193 LAGDG--LPGDKDGRGAAAQFNGPVGIA-VDAAGVVYVADTYNDRIRRIAPNGDVTTIAG 249
Query: 196 SSQSGSALGAVS 207
S++G A GA +
Sbjct: 250 GSRAGKADGAAA 261
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 59/111 (53%), Gaps = 6/111 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS-NHVIRKITNLGVTTIAG 140
V +AGDG+ G ++G S RF P +D GN+YVAD N+ IRKI G TT
Sbjct: 82 VTSIAGDGLPGANNGHGRSTRFADPFGVVIDGAGNLYVADGGDNNSIRKIALDGATTTLA 141
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
GG EG A+G + A+F+ L + L ++D GN IR++ PE
Sbjct: 142 GGV--EGYAEGAGKAAAFNTPSGLA-IDAAGNLYVADTGNNAIRKVT--PE 187
>gi|302821973|ref|XP_002992647.1| hypothetical protein SELMODRAFT_7079 [Selaginella moellendorffii]
gi|300139611|gb|EFJ06349.1| hypothetical protein SELMODRAFT_7079 [Selaginella moellendorffii]
Length = 124
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/122 (41%), Positives = 69/122 (56%), Gaps = 9/122 (7%)
Query: 79 ESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
+S ++ AG G GY DG + F++P+S A+ G ++VAD N IRKI+ G V
Sbjct: 3 DSSLEAFAGSFVGESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKISKDGEV 62
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDCS 194
TTIAGG S+K G A A FS +F L +LLI+D GN+LIR+I + P C
Sbjct: 63 TTIAGGSSRKPGFA---GDTARFSGEFSLAC--SCGSLLIADRGNRLIREIQIDDPNSCD 117
Query: 195 KS 196
S
Sbjct: 118 SS 119
>gi|325104251|ref|YP_004273905.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
gi|324973099|gb|ADY52083.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
Length = 532
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/214 (33%), Positives = 102/214 (47%), Gaps = 31/214 (14%)
Query: 15 LSLLFSLVSSGLLLEDGYTVTTV--------IDGHQLEI---NPHSV-IDRPGSSDLIVL 62
L+L+ L + + YTVTTV +DG+ L+ NP + +D G D+I+
Sbjct: 6 LTLILMLGTYMVTKSQNYTVTTVAGSTTFGFVDGNGLDARFRNPDGILVDMNG--DIIIT 63
Query: 63 DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK 122
D + + ++ VV LAG GV GY++G+PG +F+ P VD GNI V +K
Sbjct: 64 DRTNHSIR----KMTTAGVVSTLAGTGVSGYANGKPG--QFNTPWQSTVDAAGNIIVVEK 117
Query: 123 SNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
IRKI G V+TIAG GS G +DG A F N + V + I+D N+
Sbjct: 118 DGARIRKIALDGTVSTIAGTGS--AGYSDGAVSVARFDNALD-AVVDSDGNIFIADRNNR 174
Query: 182 LIRQI------NLKPEDCSKSSQSGSALGAVSVW 209
+R+I N S + G+ G V VW
Sbjct: 175 RVRKITPGAGGNWTTATVSTVAGDGTTSGTV-VW 207
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 74/147 (50%), Gaps = 10/147 (6%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
S+ V +AG G+ DG ARF P VDM G+I + D++NH IRK+T G V
Sbjct: 20 SQNYTVTTVAGSTTFGFVDGNGLDARFRNPDGILVDMNGDIIITDRTNHSIRKMTTAGVV 79
Query: 136 TTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP--ED 192
+T+AG G G A+G P Q F+ ++ T V +++ + IR+I L
Sbjct: 80 STLAGTGV--SGYANGKPGQ---FNTPWQST-VDAAGNIIVVEKDGARIRKIALDGTVST 133
Query: 193 CSKSSQSGSALGAVSVWVLVSVLSCLV 219
+ + +G + GAVSV + L +V
Sbjct: 134 IAGTGSAGYSDGAVSVARFDNALDAVV 160
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 28/51 (54%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V +AG G G DG A F++P D GNIYVAD NH IRK+
Sbjct: 299 ATVTTIAGTGSTGRIDGLGNVATFNQPYDVVTDENGNIYVADNVNHSIRKL 349
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 57/121 (47%), Gaps = 14/121 (11%)
Query: 93 YSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--------TNLGVTTIAGG 141
++DGE G +AR D GNI AD S + IRKI T VTTIAG
Sbjct: 248 FTDGESGKPLTARLGDVFGLNFDNDGNIIFADASFNRIRKITPGENGDWTTATVTTIAGT 307
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
GS GR DG A+F+ +++ + ++D+ N IR++ P SS + +
Sbjct: 308 GS--TGRIDGLGNVATFNQPYDV-VTDENGNIYVADNVNHSIRKLVPSPLPVQLSSFTAT 364
Query: 202 A 202
A
Sbjct: 365 A 365
>gi|373955434|ref|ZP_09615394.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
gi|373892034|gb|EHQ27931.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
Length = 459
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 62/112 (55%), Gaps = 4/112 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ V AG G GY DG +A+F PK A D +GNIYVAD N++IRKI+ G
Sbjct: 189 ITAAGVTTTYAGTGTLGYLDGPAATAQFYAPKGVAADAQGNIYVADMGNNMIRKISAAGV 248
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTT+AG GS G ADG +A F + L V + ++D G IR++
Sbjct: 249 VTTLAGKGS--AGYADGTGADAVFKSPAGLA-VDASGNIYVADQGTNTIRKV 297
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 62/113 (54%), Gaps = 5/113 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+S VV LAG G GY+DG A F P AVD GNIYVAD+ + IRK+T+ G
Sbjct: 243 ISAAGVVTTLAGKGSAGYADGTGADAVFKSPAGLAVDASGNIYVADQGTNTIRKVTSAGV 302
Query: 135 VTTIAGGGSKKEGRADGPAQ-NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTT+A G+ G+ D +A FS+ +T V + ++D N IR++
Sbjct: 303 VTTLA--GAAASGQVDATTNTDARFSSPSGVT-VDASGNVYVADLANHAIRKV 352
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 63/125 (50%), Gaps = 15/125 (12%)
Query: 69 FYTLSFPLSEESVVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
F T S + + V AG+ G + +G +A+F P+ D +GN+YVAD N+VI
Sbjct: 127 FKTNSATFAITATVSTFAGNANGGGFVNGTGVNAQFHNPQGICTDAQGNMYVADSYNNVI 186
Query: 128 RKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQ 181
RKIT GVTT G+ G DGPA A F + P A + ++D GN
Sbjct: 187 RKITAAGVTTTY-AGTGTLGYLDGPAATAQF-------YAPKGVAADAQGNIYVADMGNN 238
Query: 182 LIRQI 186
+IR+I
Sbjct: 239 MIRKI 243
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/63 (46%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV LAG G D + ARF P VD GN+YVAD +NH IRK+T+ G
Sbjct: 297 VTSAGVVTTLAGAAASGQVDATTNTDARFSSPSGVTVDASGNVYVADLANHAIRKVTSAG 356
Query: 135 VTT 137
VTT
Sbjct: 357 VTT 359
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
Query: 78 EESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+++ LAG G G+++G +A F+ P++ +D +GNIYV D N++IRKI
Sbjct: 400 TTNIIYSLAGVAGTSGFANGTNINALFNGPQALTLDSQGNIYVVDYYNNMIRKI 453
>gi|255531263|ref|YP_003091635.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
gi|255344247|gb|ACU03573.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
Length = 646
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/107 (42%), Positives = 63/107 (58%), Gaps = 4/107 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
+V AG G G +G SARFD P+ A+D GN+Y+ADK+N+VIRKIT+ G V+T A
Sbjct: 327 LVSNYAGTGTAGLVNGAKASARFDSPEGLAIDASGNMYIADKNNNVIRKITSTGTVSTFA 386
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G G ADG A F+ +++ V + ++D N IR+I
Sbjct: 387 GTGV--AGFADGAGSIAKFNGPWKVA-VDATGNVYVADRDNFKIRKI 430
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 61/112 (54%), Gaps = 5/112 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ V AG GV G++DG A+F+ P AVD GN+YVAD+ N IRKIT G
Sbjct: 376 ITSTGTVSTFAGTGVAGFADGAGSIAKFNGPWKVAVDATGNVYVADRDNFKIRKITPAGI 435
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V+T+AG G ADG A F ++ P ++++D+ + IR+I
Sbjct: 436 VSTLAG---STAGYADGTGSAAKFMQPLDVAIDPS-GNVIVADNTSHRIRKI 483
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 61/112 (54%), Gaps = 5/112 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ +V LAG GY+DG +A+F +P A+D GN+ VAD ++H IRKIT G
Sbjct: 430 ITPAGIVSTLAGS-TAGYADGTGSAAKFMQPLDVAIDPSGNVIVADNTSHRIRKITAAGV 488
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTTIAG G+ G +G A F N + V + ++D N IR+I
Sbjct: 489 VTTIAGNGT--AGYTNGTGTAAQFKNPSGVD-VDASGNIYVADRLNHRIRKI 537
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/60 (48%), Positives = 38/60 (63%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ VV +AG+G GY++G +A+F P VD GNIYVAD+ NH IRKIT GV
Sbjct: 483 ITAAGVVTTIAGNGTAGYTNGTGTAAQFKNPSGVDVDASGNIYVADRLNHRIRKITTSGV 542
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 61/113 (53%), Gaps = 5/113 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ VV LAG G G +DG GSA+F P VD+ GN+YVAD + IRKI++ V
Sbjct: 537 ITTSGVVSSLAGTGTSGTTDGAAGSAKFSDPYGVTVDVSGNVYVADLISSRIRKISSGQV 596
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+T+AG G +G + A F+ +L V + I+DH N IR + L
Sbjct: 597 STLAG---TIPGYQNGTSTIAKFNQPTDL--VIQGSNIYIADHSNNSIRLVKL 644
>gi|431798813|ref|YP_007225717.1| NHL repeat protein [Echinicola vietnamensis DSM 17526]
gi|430789578|gb|AGA79707.1| NHL repeat protein [Echinicola vietnamensis DSM 17526]
Length = 655
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 42/105 (40%), Positives = 69/105 (65%), Gaps = 4/105 (3%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+ +AG G+ G+ +G +A+FD P+ A+D +GNI+VAD+ N+VIRKI++ G V+T AG
Sbjct: 336 ETIAGTGIAGFQNGSSSTAKFDNPEGIAIDQQGNIFVADRDNNVIRKISSSGDVSTFAGT 395
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G DG A A F++ +++ V + ++++D GN IR+I
Sbjct: 396 GIA--GFTDGVAGVAKFNSPWKVA-VDNQGNVIVADRGNHSIRKI 437
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 76/144 (52%), Gaps = 12/144 (8%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
NP + ID+ G ++ V D + +S V AG G+ G++DG G A+F+
Sbjct: 358 NPEGIAIDQQG--NIFVADRDNNVIR----KISSSGDVSTFAGTGIAGFTDGVAGVAKFN 411
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
P AVD +GN+ VAD+ NH IRKIT N V+T+AG G DG A F +
Sbjct: 412 SPWKVAVDNQGNVIVADRGNHSIRKITPNGTVSTLAG---TTNGYQDGSGNQAKFDQPTD 468
Query: 164 LTFVPHICALLISDHGNQLIRQIN 187
+ +P+ ++I+D+ N IR I+
Sbjct: 469 VAVLPN-GNIVIADNRNHCIRMID 491
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 58/107 (54%), Gaps = 4/107 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
V +AG G GY DG A+F P D GN++VAD+ NH IRKI + V+T+AG
Sbjct: 496 VSTIAGTGNGGYVDGAGSQAQFYYPSGIDTDPNGNLFVADRKNHAIRKIDSYHNVSTVAG 555
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G EG +G A F + + + V +L++D N +IR+IN
Sbjct: 556 G--NGEGIQNGGIAVAKFDDPYGVA-VGQNGKVLVADLDNNVIREIN 599
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 77/153 (50%), Gaps = 9/153 (5%)
Query: 40 GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
G Q + S ID + +L V D A + S +V G+G +G +G
Sbjct: 512 GSQAQFYYPSGIDTDPNGNLFVADRKNHAIRKID---SYHNVSTVAGGNG-EGIQNGGIA 567
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
A+FD P AV G + VAD N+VIR+I V+TI GS EG DGP+ + +
Sbjct: 568 VAKFDDPYGVAVGQNGKVLVADLDNNVIREINGDYVSTII--GSNGEGYIDGPSTASKMN 625
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPED 192
+ ++ + + ++ +D+GN L+R++ +K E+
Sbjct: 626 SPTDV--LVNGDEIIFADYGNHLVRKV-VKDEE 655
>gi|402814801|ref|ZP_10864394.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
gi|402507172|gb|EJW17694.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
Length = 571
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 56/98 (57%), Gaps = 4/98 (4%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRA 149
QG+ DGE +FD+P A D KGN+YVAD NHVIRKI G VTT+AG G G
Sbjct: 129 QGWEDGEQAQVQFDQPTGLAADKKGNLYVADAGNHVIRKIDGAGKVTTVAGDGIP--GWK 186
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
DG A A F + V A+ ++D N +IR+I+
Sbjct: 187 DGAAGTARFHTPRAIA-VADDGAIYVTDSLNHVIRRID 223
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 68/134 (50%), Gaps = 11/134 (8%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V +AGDG+ G+ DG G+ARF P++ AV G IYV D NHVIR+I +G VTT+
Sbjct: 174 VTTVAGDGIPGWKDGAAGTARFHTPRAIAVADDGAIYVTDSLNHVIRRIDAMGMVTTLTA 233
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFV----PHICALL------ISDHGNQLIRQINLKP 190
S+ + G A D LT P AL+ ++D GNQ IR I+LK
Sbjct: 234 RSSRIVEYSPGSVTGAGDFKDGNLTEAMFNEPSGIALMNDGKLAVADTGNQRIRLIDLKQ 293
Query: 191 EDCSKSSQSGSALG 204
S + S S G
Sbjct: 294 GSVSTIAGSSSTAG 307
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 84/195 (43%), Gaps = 27/195 (13%)
Query: 14 LLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS 73
L +F+ S L+ DG ++ + ID S + SS +A YTL
Sbjct: 257 LTEAMFNEPSGIALMNDGKLAVADTGNQRIRL-----IDLKQGSVSTIAGSSSTAGYTLP 311
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
V + L G GY DG ++ F+ P A+ + I VAD+ NHV+R I
Sbjct: 312 ------GVQRVLYAPG--GYRDGAASNSIFNSPTGIAITNENGIVVADRWNHVVRYIYKG 363
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC--ALLISDHGNQLIRQIN---- 187
V T++ G + K G DG S++ E T V + ++ +++ N IR I
Sbjct: 364 EVVTLS-GQAGKSGDLDGI---TSYAKLHEPTAVAVLANGSIAVAEGFNNAIRLIRRYEL 419
Query: 188 ----LKPEDCSKSSQ 198
+ P D S++SQ
Sbjct: 420 PECVMGPLDQSRASQ 434
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 36/86 (41%), Gaps = 1/86 (1%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
R P A +I V D NHV+R++ G T G K+G DG F
Sbjct: 85 RLQFPTGIAAGENQSIIVTDTRNHVLRRLDAEGRITATIGMINKQGWEDGEQAQVQFDQP 144
Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
L L ++D GN +IR+I+
Sbjct: 145 TGLA-ADKKGNLYVADAGNHVIRKID 169
>gi|223934698|ref|ZP_03626618.1| NHL repeat containing protein [bacterium Ellin514]
gi|223896653|gb|EEF63094.1| NHL repeat containing protein [bacterium Ellin514]
Length = 847
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 68/127 (53%), Gaps = 4/127 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ VV LAG +G +G SARF P++ AVD + NIYVAD N VIRKIT GV
Sbjct: 358 MTAAGVVTTLAGSPSEGSINGVTSSARFYSPQNVAVDGQNNIYVADTQNSVIRKITPFGV 417
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
++ G + G ADG NA FS + V + ++D GN IR+I S
Sbjct: 418 VSVLAGTTGVFGSADGSGANALFSGPQGIA-VDGGGNIYVADTGNSTIRKIT---PSGST 473
Query: 196 SSQSGSA 202
S+ +GSA
Sbjct: 474 STLAGSA 480
Score = 67.8 bits (164), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 60/112 (53%), Gaps = 2/112 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ + + LAG GVQG +DG A F++P A+D GNIYV+D + IRK+T G
Sbjct: 85 VTASGLCRTLAGTAGVQGSADGMGAQASFNQPSGIALDSDGNIYVSDYGSSTIRKVTQSG 144
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T G + G + NA F + L V + L ++D+GN LIR+I
Sbjct: 145 QVTTLAGMTGVTGSVNNTGTNALFFHPMGLA-VDNATNLYVADYGNHLIRKI 195
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 65/133 (48%), Gaps = 6/133 (4%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
G +++ V D+ S ++ P VV LAG GV G +DG +A F P+ AVD
Sbjct: 395 GQNNIYVADTQNSVIRKIT-PFG---VVSVLAGTTGVFGSADGSGANALFSGPQGIAVDG 450
Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
GNIYVAD N IRKIT G T+ G + G ADG A F + V +
Sbjct: 451 GGNIYVADTGNSTIRKITPSGSTSTLAGSAGNPGNADGAGITAQFYQP-QGVAVDSANNV 509
Query: 174 LISDHGNQLIRQI 186
++D GN +R +
Sbjct: 510 YVADTGNHTVRMV 522
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 64/126 (50%), Gaps = 13/126 (10%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
++ +AG +G +DG GSA+ P+ AVD GN+YVAD N++IR +T G+
Sbjct: 36 LIGTVAGYAGKGSADGVGGSAQLFGPQGVAVDGAGNVYVADTGNNIIRVVTASGLCRTLA 95
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICAL------LISDHGNQLIRQINLKPEDCS 194
G + +G ADG ASF+ P AL +SD+G+ IR++ + +
Sbjct: 96 GTAGVQGSADGMGAQASFNQ-------PSGIALDSDGNIYVSDYGSSTIRKVTQSGQVTT 148
Query: 195 KSSQSG 200
+ +G
Sbjct: 149 LAGMTG 154
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 61/126 (48%), Gaps = 2/126 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ + LAG G G DG ARF+ P AVD GN+YV D +N IRK+T+ G
Sbjct: 522 VTPGGISSTLAGLAGTFGTFDGTNAGARFNGPTGIAVDGAGNLYVTDYNNDTIRKVTSAG 581
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
T G + G DG +A F ++ V + L + D GN +R++ L +
Sbjct: 582 AVTTLAGWTGMWGSIDGAGNSALFFGPSGIS-VDALGNLYVIDSGNSTLRKLTLSGGTWT 640
Query: 195 KSSQSG 200
S+ +G
Sbjct: 641 VSTVAG 646
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 61/119 (51%), Gaps = 17/119 (14%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S VV LAG G G +DG SARF P+ AVD G +++AD +N IR +T G
Sbjct: 303 ISPAGVVTTLAGLSGTAGSADGTNSSARFLGPQGVAVDSTGTVFIADTANSTIRVMTAAG 362
Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
VTT+A GS EG +G +A F + P A + ++D N +IR+I
Sbjct: 363 VVTTLA--GSPSEGSINGVTSSARF-------YSPQNVAVDGQNNIYVADTQNSVIRKI 412
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 65/132 (49%), Gaps = 4/132 (3%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
G ++ V D+ S ++ P S + AG+ G +DG +A+F +P+ AVD
Sbjct: 450 GGGNIYVADTGNSTIRKIT-PSGSTSTLAGSAGN--PGNADGAGITAQFYQPQGVAVDSA 506
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
N+YVAD NH +R +T G+++ G + G DG A F+ + V L
Sbjct: 507 NNVYVADTGNHTVRMVTPGGISSTLAGLAGTFGTFDGTNAGARFNGPTGIA-VDGAGNLY 565
Query: 175 ISDHGNQLIRQI 186
++D+ N IR++
Sbjct: 566 VTDYNNDTIRKV 577
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 68/148 (45%), Gaps = 8/148 (5%)
Query: 39 DGHQLEIN-PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG 96
DG + N P +V +D+ G ++ V D+ +A + + SV G G +D
Sbjct: 215 DGLGGQFNEPEAVAVDQAG--NVYVADTGNAAIRMI---MPGGSVTTLAGAAGFVGSADA 269
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNA 156
+A F +P ++ GN+YVAD N+ IR+I+ GV T G S G ADG +A
Sbjct: 270 SGTNALFHQPAGIGINSAGNLYVADYFNNTIRQISPAGVVTTLAGLSGTAGSADGTNSSA 329
Query: 157 SFSNDFELTFVPHICALLISDHGNQLIR 184
F + V + I+D N IR
Sbjct: 330 RFLGP-QGVAVDSTGTVFIADTANSTIR 356
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 54/112 (48%), Gaps = 4/112 (3%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+++ V LAG GV G + +A F P AVD N+YVAD NH+IRKIT
Sbjct: 140 VTQSGQVTTLAGMTGVTGSVNNTGTNALFFHPMGLAVDNATNLYVADYGNHLIRKITPSN 199
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V + G + G ADG F N+ E V + ++D GN IR I
Sbjct: 200 VVSTLAGVTGVPGSADG--LGGQF-NEPEAVAVDQAGNVYVADTGNAAIRMI 248
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 30/72 (41%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
+L V+DS S L+ +V GV G DG A F P V G +
Sbjct: 618 NLYVIDSGNSTLRKLTLSGGTWTVSTVAGMPGVNGGIDGSGAGAEFYYPAGVTVSAAGYV 677
Query: 118 YVADKSNHVIRK 129
YVAD N+ IR
Sbjct: 678 YVADAGNNTIRS 689
>gi|182412380|ref|YP_001817446.1| immunoglobulin I-set domain-containing protein [Opitutus terrae
PB90-1]
gi|177839594|gb|ACB73846.1| Immunoglobulin I-set domain protein [Opitutus terrae PB90-1]
Length = 1130
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 57/103 (55%), Gaps = 3/103 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
G G +DG ARF P+ A+D GN+YVAD +NH IRKI G T G + + G
Sbjct: 220 GSAGSADGPGDQARFRDPEGLAIDAAGNVYVADINNHTIRKINPAGEVTTLAGAAGESGF 279
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
ADGPA NA F L P A+ ++D N+ IR+I+ PE
Sbjct: 280 ADGPAANARFFCPTSLAIDPA-GAIWVNDAINRAIRKIS--PE 319
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 88/169 (52%), Gaps = 14/169 (8%)
Query: 38 IDG---HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GY 93
IDG + L NP +++ P S DL V DS+ A ++ VV LAG ++ GY
Sbjct: 115 IDGVGSNALFHNPSALVVGP-SGDLYVADSNGHAIRKVT----PAGVVTTLAGGPLRYGY 169
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG-GGSKKEGRADG 151
DG A+F P+ AV+ G I+V+D+S H IR++ LG V+T AG GGS G ADG
Sbjct: 170 MDGPGTEAQFSYPRGIAVNATGVIFVSDRSAHTIRRVDQLGNVSTWAGHGGSA--GSADG 227
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
P A F D E + + ++D N IR+IN E + + +G
Sbjct: 228 PGDQARF-RDPEGLAIDAAGNVYVADINNHTIRKINPAGEVTTLAGAAG 275
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 64/133 (48%), Gaps = 7/133 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMK 114
S ++ V+D S +S + +V AG G G DG +ARF P +D
Sbjct: 499 SGNIFVVDRGASTIRKIS-----QGIVTTFAGMPGETGQDDGAGAAARFRDPMGIVIDGA 553
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
N+YVAD +N IRK+T GV T G + +G DGP ASF N + L P+ AL
Sbjct: 554 DNLYVADTNNWKIRKVTPAGVVTTFAGHTSTQGANDGPIGIASFFNPYGLAIGPN-GALY 612
Query: 175 ISDHGNQLIRQIN 187
+ D +R I+
Sbjct: 613 VVDLAGDTLRMIS 625
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 60/128 (46%), Gaps = 2/128 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV LAG V G DG +A F P + V G++YVAD + H IRK+T G
Sbjct: 96 ITPAGVVTTLAGVANVHGCIDGVGSNALFHNPSALVVGPSGDLYVADSNGHAIRKVTPAG 155
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
V T GG + G DGP A FS + V + +SD IR+++ +
Sbjct: 156 VVTTLAGGPLRYGYMDGPGTEAQFSYPRGIA-VNATGVIFVSDRSAHTIRRVDQLGNVST 214
Query: 195 KSSQSGSA 202
+ GSA
Sbjct: 215 WAGHGGSA 222
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 48/102 (47%), Gaps = 1/102 (0%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
LAG+ +G DG ARF+ A+ G +YVAD +N IR IT GV T G +
Sbjct: 51 LAGEWNKGSRDGAAAVARFNGANGVAIAPNGLVYVADLANSTIRAITPAGVVTTLAGVAN 110
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G DG NA F N L P L ++D IR++
Sbjct: 111 VHGCIDGVGSNALFHNPSALVVGPS-GDLYVADSNGHAIRKV 151
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 3/106 (2%)
Query: 82 VKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
V +AG DG G +DG SARF AVD GNI+V D+ IRKI+ VTT A
Sbjct: 466 VTTIAGADGEDGDTDGPAASARFRAVDGLAVDSSGNIFVVDRGASTIRKISQGIVTTFA- 524
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + G+ DG A F + + + L ++D N IR++
Sbjct: 525 GMPGETGQDDGAGAAARFRDPMGI-VIDGADNLYVADTNNWKIRKV 569
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 3/66 (4%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
P E + + AG+ G++DG +ARF P S A+D G I+V D N IRKI+ G
Sbjct: 263 PAGEVTTLAGAAGE--SGFADGPAANARFFCPTSLAIDPAGAIWVNDAINRAIRKISPEG 320
Query: 135 -VTTIA 139
VTT+A
Sbjct: 321 TVTTVA 326
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 46/99 (46%), Gaps = 3/99 (3%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
G G+ DG +A+F + D +GN+ V D IRK+T G T G ++G
Sbjct: 421 GYPGFRDGSGYAAQFRDLRGITPDKEGNLLVGD--GRTIRKVTLAGAVTTIAGADGEDGD 478
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
DGPA +A F L V + + D G IR+I+
Sbjct: 479 TDGPAASARFRAVDGLA-VDSSGNIFVVDRGASTIRKIS 516
Score = 40.8 bits (94), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 49/111 (44%), Gaps = 8/111 (7%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKS 108
VID G+ +L V D++ ++ VV AG QG +DG G A F P
Sbjct: 549 VID--GADNLYVADTNNWKIRKVT----PAGVVTTFAGHTSTQGANDGPIGIASFFNPYG 602
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR-ADGPAQNASF 158
A+ G +YV D + +R I+ G T GG S G ADG A F
Sbjct: 603 LAIGPNGALYVVDLAGDTLRMISPDGFVTTLGGSSAHRGETADGIGTAARF 653
Score = 40.0 bits (92), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 16/111 (14%)
Query: 33 TVTTVID--GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
TVTTV D G + I+P+ V+ + D I S S ++ P +S R
Sbjct: 321 TVTTVADTAGEGITIDPNGVL-YIAADDRIKRLESGSVLSVVAGP--TDSYTSR------ 371
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
+G +ARF +P A+ + GN+YV D + IR++T G V+T+AG
Sbjct: 372 ----NGVGANARFVQPIGSALAVDGNLYVTDSGGYAIRRVTRSGEVSTLAG 418
>gi|348676369|gb|EGZ16187.1| hypothetical protein PHYSODRAFT_506757 [Phytophthora sojae]
Length = 1981
Score = 74.7 bits (182), Expect = 4e-11, Method: Composition-based stats.
Identities = 51/126 (40%), Positives = 71/126 (56%), Gaps = 11/126 (8%)
Query: 70 YTLSFPLS--EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
Y+ +F S + V +AG+G G+ DG +ARF+ P+ AVD G +YVAD +NH I
Sbjct: 190 YSAAFGASGRAQGRVTTVAGEGTAGFLDGPAATARFNHPRGVAVDSNGVVYVADTANHRI 249
Query: 128 RKI--TNLGVTTIAGGGSKKEGRADGPAQNA---SFSNDFEL--TFVPHICALLISDHGN 180
RKI T V+T+AG G EG ADG A +A S+ +D + T + ++D GN
Sbjct: 250 RKINPTTKMVSTLAGDGI--EGFADGAALSAARFSYPSDVAVLETNGGATVTVFVADTGN 307
Query: 181 QLIRQI 186
IRQI
Sbjct: 308 HRIRQI 313
Score = 55.5 bits (132), Expect = 3e-05, Method: Composition-based stats.
Identities = 27/62 (43%), Positives = 33/62 (53%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G +DG+P ARFD P AVD G ++VAD NH+IR+I G T G AD
Sbjct: 344 GLADGDPLGARFDSPMGVAVDADGVVFVADTGNHLIRRIEPNGTTHTLAGSVVPSEDADT 403
Query: 152 PA 153
P
Sbjct: 404 PG 405
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 41/126 (32%), Positives = 61/126 (48%), Gaps = 21/126 (16%)
Query: 81 VVKRLAGDGVQGYSDGEP-GSARFDKPKSFAV-DMKG----NIYVADKSNHVIRKITNLG 134
+V LAGDG++G++DG +ARF P AV + G ++VAD NH IR+I +
Sbjct: 258 MVSTLAGDGIEGFADGAALSAARFSYPSDVAVLETNGGATVTVFVADTGNHRIRQIKSGV 317
Query: 135 VTTIAG--------------GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN 180
V+ ++G S G ADG A F + + V + ++D GN
Sbjct: 318 VSCLSGLCGAGVETVRLSQQPESPHAGLADGDPLGARFDSPMGVA-VDADGVVFVADTGN 376
Query: 181 QLIRQI 186
LIR+I
Sbjct: 377 HLIRRI 382
Score = 41.2 bits (95), Expect = 0.46, Method: Composition-based stats.
Identities = 29/107 (27%), Positives = 48/107 (44%), Gaps = 5/107 (4%)
Query: 66 RSAFYTLSFPLSEESV-----VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
R Y F + E+V V LAG V G+ DGE A F+ P A G +YVA
Sbjct: 450 RRVTYDGGFASTIETVTSTNRVVTLAGSNVPGHIDGEGNEATFNAPAGVAFAADGRVYVA 509
Query: 121 DKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
++ +R+I+ + + + S + + P+ AS+ + F+
Sbjct: 510 SSTDCTVRQISPASLVSRSVTCSTRATQVLRPSGCASYEQPVDELFL 556
Score = 38.9 bits (89), Expect = 2.4, Method: Composition-based stats.
Identities = 27/85 (31%), Positives = 40/85 (47%), Gaps = 18/85 (21%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-------------- 134
G +G+ DG A+FD P++ AV + + VAD H +R++T G
Sbjct: 413 GERGFRDGNLTYAQFDSPRAIAVGAERTVVVAD--GHRVRRVTYDGGFASTIETVTSTNR 470
Query: 135 VTTIAGGGSKKEGRADGPAQNASFS 159
V T+A GS G DG A+F+
Sbjct: 471 VVTLA--GSNVPGHIDGEGNEATFN 493
>gi|110639161|ref|YP_679370.1| hypothetical protein CHU_2779 [Cytophaga hutchinsonii ATCC 33406]
gi|110281842|gb|ABG60028.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 1051
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 59/111 (53%), Gaps = 2/111 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ +V +AG+G G D +ARF+ P + +D NIY+AD NH IRKIT GV
Sbjct: 81 VNPSGIVSTIAGNGTSGLIDATGSAARFNLPAAVVLDAAQNIYIADNGNHCIRKITPAGV 140
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
I GS G DG A F+N + + + L ++D GN LIR+I
Sbjct: 141 -VITFAGSGTAGSNDGTGTAAQFNNPYGMA-IDASGNLYVADSGNNLIRKI 189
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 76/143 (53%), Gaps = 11/143 (7%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
NP+ + ID G +L V DS + L ++ VV +AG+ GY DG +ARF
Sbjct: 164 NPYGMAIDASG--NLYVADSGNN----LIRKITSSGVVTTIAGNTTPGYVDGTGTAARFY 217
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFE 163
P + +D+ GN ++ D NH IRK+T+ G V+T+AG GS G +G A F+ +
Sbjct: 218 LPVNITLDVSGNFFITDNRNHRIRKMTSAGVVSTVAGSGS--AGYMNGTGVTAQFNRPYG 275
Query: 164 LTFVPHICALLISDHGNQLIRQI 186
+ V L ++D N +IR+I
Sbjct: 276 IV-VDAFSNLYVTDTNNGVIRKI 297
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 52/84 (61%), Gaps = 3/84 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ VV +AG G GY +G +A+F++P VD N+YV D +N VIRKIT+ G
Sbjct: 243 MTSAGVVSTVAGSGSAGYMNGTGVTAQFNRPYGIVVDAFSNLYVTDTNNGVIRKITSSGV 302
Query: 135 VTTIAGGGSKKEGRADGPAQNASF 158
V+T AG G+ G ADGPA NA F
Sbjct: 303 VSTYAGTGT--PGFADGPAANAQF 324
Score = 67.4 bits (163), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 60/112 (53%), Gaps = 4/112 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ VV AG G G +DG +A+F+ P A+D GN+YVAD N++IRKIT+ G
Sbjct: 135 ITPAGVVITFAGSGTAGSNDGTGTAAQFNNPYGMAIDASGNLYVADSGNNLIRKITSSGV 194
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTTIA G+ G DG A F +T I+D+ N IR++
Sbjct: 195 VTTIA--GNTTPGYVDGTGTAARFYLPVNITLDVS-GNFFITDNRNHRIRKM 243
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 13/91 (14%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ VV AG G G++DG +A+F P ++ G++Y AD H +RK+T G
Sbjct: 297 ITSSGVVSTYAGTGTPGFADGPAANAQFQWPTGLTINASGDLYEADNETHRVRKVTPAGI 356
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
V+T AG G NA F+N L+
Sbjct: 357 VSTFAGNG------------NAGFANTAALS 375
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 42/88 (47%), Gaps = 4/88 (4%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
+A F A D GN+ VA +H IRK+ G V+TIAG G+ A G A +
Sbjct: 51 TAEFYGLSGLAFDNSGNLIVAGYQDHSIRKVNPSGIVSTIAGNGTSGLIDATGSAARFNL 110
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
L +I I+D+GN IR+I
Sbjct: 111 PAAVVLDAAQNI---YIADNGNHCIRKI 135
>gi|194364186|ref|YP_002026796.1| SMP-30/gluconolaconase/LRE domain-containing protein
[Stenotrophomonas maltophilia R551-3]
gi|194346990|gb|ACF50113.1| SMP-30/Gluconolaconase/LRE domain protein [Stenotrophomonas
maltophilia R551-3]
Length = 693
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 76/152 (50%), Gaps = 11/152 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
IDG L+ N S I +L V D+ A + + V LAG G QGY+D
Sbjct: 114 IDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGYAD 168
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQ 154
G ARFD P AVD +G +YVAD N IR I T+ V T+AGG ++ G ADG
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGG--ERPGLADGLGV 226
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A F L F H ALL++D N +R++
Sbjct: 227 AARFDTPVALAFDAH-GALLVADLFNNAVRRV 257
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 43/106 (40%), Positives = 58/106 (54%), Gaps = 6/106 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
++ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I G VTT+AG
Sbjct: 103 IETVAGQG-EGRIDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGTDGQVTTLAG 161
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++G ADGPA A F + V + ++D N IR I
Sbjct: 162 G---EQGYADGPAAQARFDAPMGIA-VDAQGQVYVADTFNDRIRVI 203
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 58/114 (50%), Gaps = 6/114 (5%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD--KSNHVIRKITN 132
PL+ + ++ LAGDG G DG ARF P + G+IY D +N + R+ +
Sbjct: 41 PLAWTAQIEMLAGDGHPGDRDGASAQARFADPYALLRSADGSIYFTDAGDNNRIRRRQPD 100
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ T+AG G EGR DGPA ASF+ + L ++D GN IR+I
Sbjct: 101 GRIETVAGQG---EGRIDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRI 150
>gi|409099442|ref|ZP_11219466.1| NHL repeat containing protein [Pedobacter agri PB92]
Length = 754
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 60/112 (53%), Gaps = 4/112 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+S V +AG G G DG SA F+ P AVD GNIYVAD N+ IRKIT G
Sbjct: 193 ISSTGFVTTVAGSGATGSLDGNAASATFNNPAGVAVDRAGNIYVADAGNNKIRKITPAGV 252
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V+T+AG G+ G DG NA +T P + L I+D+ N IR+I
Sbjct: 253 VSTLAGNGT--AGSTDGTGTNARLYYPTGITIDP-LDNLYIADYRNNRIRKI 301
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 82/162 (50%), Gaps = 11/162 (6%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
NP V +DR G ++ V D+ + ++ VV LAG+G G +DG +AR
Sbjct: 222 NPAGVAVDRAG--NIYVADAGNNKIR----KITPAGVVSTLAGNGTAGSTDGTGTNARLY 275
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFE 163
P +D N+Y+AD N+ IRKI+ G VTT+AG GS G +G N+SF+
Sbjct: 276 YPTGITIDPLDNLYIADYRNNRIRKISPSGVVTTLAGNGS--AGNLNGVGTNSSFNGPIM 333
Query: 164 LTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
L F L I+D N LIR+I+L K+ +G A A
Sbjct: 334 LAFDVQ-GNLFITDFQNNLIRKISLTGYTIDKNLPAGLAFDA 374
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 52/88 (59%), Gaps = 4/88 (4%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
S F P D+ G++Y+AD+ N+ I+KI++ G VTT+AG G+ G DG A +A+F
Sbjct: 163 SGNFSSPGGLNFDLAGDLYLADQKNNKIKKISSTGFVTTVAGSGA--TGSLDGNAASATF 220
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
+N + V + ++D GN IR+I
Sbjct: 221 NNPAGVA-VDRAGNIYVADAGNNKIRKI 247
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 33/79 (41%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
V AG G +G S+ F+ P D + NIYV D + IRKIT G+ + G
Sbjct: 63 VSTFAGSGAASSYNGTGLSSGFNMPSGIGSDAQSNIYVCDYGSGAIRKITPSGLVSTIGY 122
Query: 142 GSKKEGRADGPAQNASFSN 160
G A N SN
Sbjct: 123 VGTPTGLTADAAGNTYISN 141
>gi|426402696|ref|YP_007021667.1| hypothetical protein Bdt_0693 [Bdellovibrio bacteriovorus str.
Tiberius]
gi|425859364|gb|AFY00400.1| hypothetical protein Bdt_0693 [Bdellovibrio bacteriovorus str.
Tiberius]
Length = 692
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 66/132 (50%), Gaps = 7/132 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
+ D+ ++DS+ S +S VV AG GV ++DG SARFD P +D
Sbjct: 69 AGDIFIVDSNASVIRKIS-----NGVVSTFAGKFGVFDHADGTGDSARFDYPTGITIDGS 123
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
GN++V + +NH IRKIT V T G G ADG A F+N ++T
Sbjct: 124 GNLFVTEGNNHTIRKITPAAVVTTVAGSPGNAGTADGTGSAARFNNPEDITLAAD-GNFY 182
Query: 175 ISDHGNQLIRQI 186
I+D N +IR++
Sbjct: 183 ITDKNNNMIRKM 194
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/139 (37%), Positives = 66/139 (47%), Gaps = 18/139 (12%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
G+ L V DS YT+ ++ VV LAG G+QG DG A F + V
Sbjct: 506 GAGSLYVSDSGN---YTVR-KVTLAGVVTTLAGQVGIQGSDDGTGTGATFSRVAGITVTP 561
Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA- 172
GNI+VAD N+VIRKIT GV T G + + G DG NA FS PH A
Sbjct: 562 SGNIFVADTDNNVIRKITVAGVVTTFAGAAGQGGNDDGMGSNARFSQ-------PHFVAT 614
Query: 173 -----LLISDHGNQLIRQI 186
L +++ G IR+I
Sbjct: 615 DSSGNLYVAEWGEATIRKI 633
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 58/114 (50%), Gaps = 2/114 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
L+ VV LAGD G +DG SARF A D G++YV+D N+ +RK+T G
Sbjct: 468 LTPGGVVTSLAGDYDNSGSADGTGTSARFGGVAGIASDGAGSLYVSDSGNYTVRKVTLAG 527
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
V T G +G DG A+FS +T P + ++D N +IR+I +
Sbjct: 528 VVTTLAGQVGIQGSDDGTGTGATFSRVAGITVTPS-GNIFVADTDNNVIRKITV 580
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 1/111 (0%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ VV AGDG G +DG +A F+ P D GN++V S IRKIT GV
Sbjct: 194 MTPAGVVTTFAGDGTYGCTDGTGAAAHFNYPTGIVGDSAGNLFVVCSSCSTIRKITPAGV 253
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T G + G DG A FS +T + L ++D+GN IR++
Sbjct: 254 VTTFAGLANATGALDGTGTAARFSWPIGIT-IDSSDNLYVADYGNSAIRKV 303
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/111 (42%), Positives = 57/111 (51%), Gaps = 3/111 (2%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S DL V DS ++ ++ V LAGD G +DG +A F P+ A D G
Sbjct: 342 SGDLFVTDSDNASIRKITPARVVTLVAGSLAGD-SDGSADGTGTAASFFSPEGVAADPAG 400
Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
N+YVAD N IRKIT G VTTIA G + G ADG A FS +LT
Sbjct: 401 NLYVADTMNRTIRKITPSGNVTTIA-GSPGQIGSADGTGAAARFSYPTKLT 450
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 62/140 (44%), Gaps = 20/140 (14%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
S +L V D SA ++ +VV AG G G DG +ARF P +D
Sbjct: 287 SDNLYVADYGNSAIR----KVTSSAVVSNFAGSYGDYGAVDGTGTAARFAGPAGIGIDAS 342
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGG--GSKKEGRADGPAQNASFSNDFELTFVPHICA 172
G+++V D N IRKIT V T+ G +G ADG ASF F P A
Sbjct: 343 GDLFVTDSDNASIRKITPARVVTLVAGSLAGDSDGSADGTGTAASF-------FSPEGVA 395
Query: 173 ------LLISDHGNQLIRQI 186
L ++D N+ IR+I
Sbjct: 396 ADPAGNLYVADTMNRTIRKI 415
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 53/108 (49%), Gaps = 4/108 (3%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
V +AG G G +DG +ARF P V GNIY+AD+ + IRK+T GV T
Sbjct: 421 VTTIAGSPGQIGSADGTGAAARFSYPTKLTVAEDGNIYIADE--YRIRKLTPGGVVTSLA 478
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G G ADG +A F + +L +SD GN +R++ L
Sbjct: 479 GDYDNSGSADGTGTSARFGGVAGIA-SDGAGSLYVSDSGNYTVRKVTL 525
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 50/86 (58%), Gaps = 4/86 (4%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ +VV +AG G G +DG +ARF+ P+ + GN Y+ DK+N++IRK+T G
Sbjct: 139 ITPAAVVTTVAGSPGNAGTADGTGSAARFNNPEDITLAADGNFYITDKNNNMIRKMTPAG 198
Query: 135 -VTTIAGGGSKKEGRADGPAQNASFS 159
VTT AG G+ G DG A F+
Sbjct: 199 VVTTFAGDGT--YGCTDGTGAAAHFN 222
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 49/112 (43%), Gaps = 2/112 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV AG G DG +ARF P +D N+YVAD N IRK+T+
Sbjct: 248 ITPAGVVTTFAGLANATGALDGTGTAARFSWPIGITIDSSDNLYVADYGNSAIRKVTSSA 307
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V + G G DG A F+ + + L ++D N IR+I
Sbjct: 308 VVSNFAGSYGDYGAVDGTGTAARFAGPAGIG-IDASGDLFVTDSDNASIRKI 358
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 31/67 (46%), Positives = 38/67 (56%), Gaps = 2/67 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NL 133
++ VV AG QG +D GS ARF +P A D GN+YVA+ IRKIT +
Sbjct: 578 ITVAGVVTTFAGAAGQGGNDDGMGSNARFSQPHFVATDSSGNLYVAEWGEATIRKITPSA 637
Query: 134 GVTTIAG 140
VTTIAG
Sbjct: 638 VVTTIAG 644
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 47/89 (52%), Gaps = 8/89 (8%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG---RADGPAQNAS 157
ARF +P+ V+ G+I++ D + VIRKI+N V+T AG K G ADG +A
Sbjct: 56 ARFREPRDAVVNAAGDIFIVDSNASVIRKISNGVVSTFAG----KFGVFDHADGTGDSAR 111
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQI 186
F +T + L +++ N IR+I
Sbjct: 112 FDYPTGIT-IDGSGNLFVTEGNNHTIRKI 139
>gi|410694120|ref|YP_003624742.1| putative Soluble quinoprotein glucose dehydrogenase [Thiomonas sp.
3As]
gi|294340545|emb|CAZ88930.1| putative Soluble quinoprotein glucose dehydrogenase [Thiomonas sp.
3As]
Length = 366
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 65/117 (55%), Gaps = 2/117 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S + VV LAG G G+ DG ARF+ P +D +G IYV++ N+VIRKIT G
Sbjct: 196 ISAKGVVSTLAGSPGDTGWRDGRGAQARFNTPVGLTLDAQGQIYVSEYFNNVIRKITPDG 255
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
T G K G ADG A F + L+F P +L+++D GN +R+I+ + E
Sbjct: 256 TVTTFAGKPGKGGFADGNVGEALFLHPQTLSFAPD-GSLIVADTGNNRVRRISPQGE 311
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 52/95 (54%), Gaps = 1/95 (1%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP 152
Y+ G+ ARF P+ A+D +GNI+VAD N V+RK+ G +I G ++ +GP
Sbjct: 49 YNGGDFAQARFHDPRGMALDAQGNIFVADYVNSVVRKLGTDGQVSIVAGQVEQRDARNGP 108
Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
A A F + E V L +SD G+ +R+I+
Sbjct: 109 ALQARFYSP-ECVAVATDGTLFVSDSGSNTVRRIS 142
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 11/121 (9%)
Query: 79 ESVVKRLAGDG----VQGY------SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
SVV++L DG V G +G ARF P+ AV G ++V+D ++ +R
Sbjct: 80 NSVVRKLGTDGQVSIVAGQVEQRDARNGPALQARFYSPECVAVATDGTLFVSDSGSNTVR 139
Query: 129 KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+I+ G + G + EG ADG Q A F++ L V + ++D N +R+I+
Sbjct: 140 RISREGRVSTLAGKLEVEGFADGTGQQARFNHPVGLD-VNAKGVVYVADAYNSTVRRISA 198
Query: 189 K 189
K
Sbjct: 199 K 199
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 43/73 (58%), Gaps = 2/73 (2%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ + V AG G G++DG G A F P++ + G++ VAD N+ +R+I+ G
Sbjct: 251 ITPDGTVTTFAGKPGKGGFADGNVGEALFLHPQTLSFAPDGSLIVADTGNNRVRRISPQG 310
Query: 135 -VTTIAGGGSKKE 146
V+T+AG G+ ++
Sbjct: 311 EVSTLAGTGAGEK 323
>gi|42522312|ref|NP_967692.1| hypothetical protein Bd0727 [Bdellovibrio bacteriovorus HD100]
gi|39574843|emb|CAE78685.1| hypothetical protein with NHL repeat [Bdellovibrio bacteriovorus
HD100]
Length = 709
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 66/132 (50%), Gaps = 7/132 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
+ D+ ++DS+ S +S VV AG GV ++DG SARFD P +D
Sbjct: 86 AGDIFIVDSNASVIRKIS-----NGVVSTFAGKFGVFDHADGTGDSARFDYPTGITIDGS 140
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
GN++V + +NH IRKIT V T G G ADG A F+N ++T
Sbjct: 141 GNLFVTEGNNHTIRKITPAAVVTTVAGSPGNAGTADGTGSAARFNNPEDITLAAD-GNFY 199
Query: 175 ISDHGNQLIRQI 186
I+D N +IR++
Sbjct: 200 ITDKNNNMIRKM 211
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/139 (37%), Positives = 66/139 (47%), Gaps = 18/139 (12%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
G+ L V DS YT+ ++ VV LAG G+QG DG A F + V
Sbjct: 523 GAGSLYVSDSGN---YTVR-KVTLAGVVTTLAGQVGIQGSDDGTGTGATFSRVAGITVTP 578
Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA- 172
GNI+VAD N+VIRKIT GV T G + + G DG NA FS PH A
Sbjct: 579 SGNIFVADTDNNVIRKITVAGVVTTFAGAAGQGGNDDGMGSNARFSQ-------PHFVAT 631
Query: 173 -----LLISDHGNQLIRQI 186
L +++ G IR+I
Sbjct: 632 DSSGNLYVAEWGEATIRKI 650
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 58/114 (50%), Gaps = 2/114 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
L+ VV LAGD G +DG SARF A D G++YV+D N+ +RK+T G
Sbjct: 485 LTPGGVVTSLAGDYDNSGSADGTGTSARFGGVAGIASDGAGSLYVSDSGNYTVRKVTLAG 544
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
V T G +G DG A+FS +T P + ++D N +IR+I +
Sbjct: 545 VVTTLAGQVGIQGSDDGTGTGATFSRVAGITVTPS-GNIFVADTDNNVIRKITV 597
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 53/111 (47%), Gaps = 1/111 (0%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ VV AGDG G +DG +A F+ P D GN++V S IRKIT GV
Sbjct: 211 MTPAGVVTTFAGDGTYGCTDGTGAAAHFNYPTGIVGDSAGNLFVVCSSCSTIRKITPAGV 270
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T G + G DG A FS +T + L ++D+ N IR++
Sbjct: 271 VTTFAGQAYTTGAVDGTGTAARFSWPVGIT-IDSSDNLYVADYSNSAIRKV 320
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/111 (41%), Positives = 57/111 (51%), Gaps = 3/111 (2%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +L V DS ++ ++ V LAGD G +DG +A F P+ A D G
Sbjct: 359 SGNLFVTDSDNASIRKVTPARVVTLVAGSLAGD-SDGSADGTGTAASFHSPEGVAADPAG 417
Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
N+YVAD N IRKIT G VTTIA G + G ADG A FS +LT
Sbjct: 418 NLYVADTMNRTIRKITPSGNVTTIA-GSPGQIGSADGTGAAARFSYPTKLT 467
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 63/134 (47%), Gaps = 8/134 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
S +L V D S SA ++ +VV AG G G DG +ARF P +D
Sbjct: 304 SDNLYVADYSNSAIR----KVTSSAVVSNFAGSYGDYGAVDGTGTAARFAGPAGVGIDAS 359
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGG--GSKKEGRADGPAQNASFSNDFELTFVPHICA 172
GN++V D N IRK+T V T+ G +G ADG ASF + + P
Sbjct: 360 GNLFVTDSDNASIRKVTPARVVTLVAGSLAGDSDGSADGTGTAASFHSPEGVAADPA-GN 418
Query: 173 LLISDHGNQLIRQI 186
L ++D N+ IR+I
Sbjct: 419 LYVADTMNRTIRKI 432
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 53/108 (49%), Gaps = 4/108 (3%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
V +AG G G +DG +ARF P V GNIYVAD+ + IRK+T GV T
Sbjct: 438 VTTIAGSPGQIGSADGTGAAARFSYPTKLTVAEDGNIYVADE--YRIRKLTPGGVVTSLA 495
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G G ADG +A F + +L +SD GN +R++ L
Sbjct: 496 GDYDNSGSADGTGTSARFGGVAGIA-SDGAGSLYVSDSGNYTVRKVTL 542
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 50/86 (58%), Gaps = 4/86 (4%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ +VV +AG G G +DG +ARF+ P+ + GN Y+ DK+N++IRK+T G
Sbjct: 156 ITPAAVVTTVAGSPGNAGTADGTGSAARFNNPEDITLAADGNFYITDKNNNMIRKMTPAG 215
Query: 135 -VTTIAGGGSKKEGRADGPAQNASFS 159
VTT AG G+ G DG A F+
Sbjct: 216 VVTTFAGDGT--YGCTDGTGAAAHFN 239
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 50/112 (44%), Gaps = 2/112 (1%)
Query: 76 LSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV AG G DG +ARF P +D N+YVAD SN IRK+T+
Sbjct: 265 ITPAGVVTTFAGQAYTTGAVDGTGTAARFSWPVGITIDSSDNLYVADYSNSAIRKVTSSA 324
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V + G G DG A F+ + + L ++D N IR++
Sbjct: 325 VVSNFAGSYGDYGAVDGTGTAARFAGPAGVG-IDASGNLFVTDSDNASIRKV 375
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 31/67 (46%), Positives = 38/67 (56%), Gaps = 2/67 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV AG QG +D GS ARF +P A D GN+YVA+ IRKIT+
Sbjct: 595 ITVAGVVTTFAGAAGQGGNDDGMGSNARFSQPHFVATDSSGNLYVAEWGEATIRKITSGA 654
Query: 135 -VTTIAG 140
VTTIAG
Sbjct: 655 VVTTIAG 661
Score = 44.3 bits (103), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 47/89 (52%), Gaps = 8/89 (8%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG---RADGPAQNAS 157
ARF +P+ V+ G+I++ D + VIRKI+N V+T AG K G ADG +A
Sbjct: 73 ARFREPRDAVVNAAGDIFIVDSNASVIRKISNGVVSTFAG----KFGVFDHADGTGDSAR 128
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQI 186
F +T + L +++ N IR+I
Sbjct: 129 FDYPTGIT-IDGSGNLFVTEGNNHTIRKI 156
>gi|255532869|ref|YP_003093241.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
gi|255345853|gb|ACU05179.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
Length = 2296
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/107 (42%), Positives = 60/107 (56%), Gaps = 4/107 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
VV AG GV+G+ DG +A+F P A D GN+YVAD+ NH IRKIT G VTT+A
Sbjct: 642 VVSTFAGSGVKGFKDGTGTAAQFAFPTGLAADAAGNVYVADRDNHRIRKITAAGVVTTLA 701
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G+ ADG NA F+ + V + ++D N IR++
Sbjct: 702 GNGTAD--FADGTQANARFNMPVGVA-VDANGNVFVADRNNFRIRKV 745
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 61/106 (57%), Gaps = 3/106 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ VV LAG+G ++DG +ARF+ P AVD GN++VAD++N IRK+T G
Sbjct: 691 ITAAGVVTTLAGNGTADFADGTQANARFNMPVGVAVDANGNVFVADRNNFRIRKVTPAGV 750
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN 180
V+T AG G+ G A+G + +A F F L L ISD GN
Sbjct: 751 VSTYAGKGT--SGFAEGLSGDAQFKLVFGLAMSSISGNLYISDQGN 794
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/61 (50%), Positives = 40/61 (65%), Gaps = 2/61 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
VVKR G + GY++G SA F+ P A+D GN Y+AD+ NH IRKI+ G VTT+A
Sbjct: 481 VVKRFVGKDL-GYANGTGTSAMFNSPAGIAMDAAGNTYLADRGNHRIRKISPAGVVTTLA 539
Query: 140 G 140
G
Sbjct: 540 G 540
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 52/107 (48%), Gaps = 6/107 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V AG G GY++G +A F A+D GN+YV D+ + IRKIT G V+T AG
Sbjct: 589 VSTFAGTGTYGYANGTTTTATFTYLLGIAIDNSGNLYVTDQDVNSIRKITPGGVVSTFAG 648
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICA-LLISDHGNQLIRQI 186
G K G DG A F+ F + ++D N IR+I
Sbjct: 649 SGVK--GFKDGTGTAAQFA--FPTGLAADAAGNVYVADRDNHRIRKI 691
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 47/77 (61%), Gaps = 3/77 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+S VV LAG G G++DG +A+F+ P AVD GN+YV+D N +RKIT G
Sbjct: 529 ISPAGVVTTLAGSGTAGFADGAGAAAQFNDPSGVAVDAAGNVYVSDTRNFRVRKITPAGQ 588
Query: 135 VTTIAGGGSKKEGRADG 151
V+T AG G+ G A+G
Sbjct: 589 VSTFAGTGT--YGYANG 603
Score = 41.2 bits (95), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 4/68 (5%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
S+ V AG GV GY +G+ +A+F D N+YV D N+ IRKI
Sbjct: 33 SQAQTVSPYAGTGVDGYYEGDNNDAITAQFSYIGGITTDANNNLYVTDSRNYCIRKIDQN 92
Query: 134 G-VTTIAG 140
G V+ IAG
Sbjct: 93 GKVSVIAG 100
>gi|373953102|ref|ZP_09613062.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
gi|373889702|gb|EHQ25599.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
Length = 3731
Score = 73.9 bits (180), Expect = 7e-11, Method: Composition-based stats.
Identities = 47/112 (41%), Positives = 69/112 (61%), Gaps = 4/112 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+S VV AG GV G ++G +A F+ P A D++GN+YV+D ++++IRKIT G
Sbjct: 109 ISPVGVVSTFAGSGVAGSANGTGTAASFNNPFGIATDVQGNLYVSDVNSNLIRKITPGGV 168
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTT+AG GS G +G ASF+ + LT + ++D+GNQLIR+I
Sbjct: 169 VTTLAGSGSA--GSVNGTGTAASFNTPYSLT-TDMQGNVYVADYGNQLIRKI 217
Score = 70.9 bits (172), Expect = 5e-10, Method: Composition-based stats.
Identities = 45/112 (40%), Positives = 63/112 (56%), Gaps = 4/112 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ +V AG G G +G +A F+ P D +GNIYV+D +N+ IRKIT G
Sbjct: 1086 ITPSGLVTTFAGSGAAGSENGNGAAASFNNPFGLTTDAQGNIYVSDANNNTIRKITPSGV 1145
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTT AG GS G ADG ASF++ + L + ++D GNQ+IR+I
Sbjct: 1146 VTTFAGSGS--SGAADGIGMAASFNSPYGLATDAQ-GNIYVADFGNQVIRKI 1194
Score = 69.7 bits (169), Expect = 1e-09, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 60/112 (53%), Gaps = 4/112 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ VV AG GV G +G +A F P +D +GN+YVAD NH IRKIT G
Sbjct: 272 ITPAGVVTTFAGSGVPGALNGTGTAATFYNPTGVTMDAQGNVYVADSQNHSIRKITPAGV 331
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTT+AG GS G A+G NASF + L I+D N LIR+I
Sbjct: 332 VTTLAGTGSM--GSANGAGTNASFYYP-NAVVADALGNLYIADTNNHLIRKI 380
Score = 65.5 bits (158), Expect = 2e-08, Method: Composition-based stats.
Identities = 41/111 (36%), Positives = 54/111 (48%), Gaps = 1/111 (0%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ VV LAG G G +DG +A F P D +GN+YVAD N IRKIT GV
Sbjct: 657 ITPAGVVTTLAGSGDSGSADGTGTAASFHTPYDLTTDAQGNVYVADNFNQTIRKITREGV 716
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G S G +G A A F N + + ++D+GN IR+I
Sbjct: 717 VNTFAGTSGSSGFVNGTAAAAKFKNPIGIATDTQ-GNVYVADNGNLAIRKI 766
Score = 63.5 bits (153), Expect = 9e-08, Method: Composition-based stats.
Identities = 40/106 (37%), Positives = 53/106 (50%), Gaps = 1/106 (0%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
VV LAG G G +G +A F+ P S DM+GN+YVAD N +IRKIT GV T
Sbjct: 168 VVTTLAGSGSAGSVNGTGTAASFNTPYSLTTDMQGNVYVADYGNQLIRKITPAGVVTTLA 227
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G +G A F+ + + ++D NQ IR+I
Sbjct: 228 GTVGSSGFVNGTGTAAKFNYPRSVA-TDAAGNVYVADQVNQAIRKI 272
Score = 62.0 bits (149), Expect = 2e-07, Method: Composition-based stats.
Identities = 45/112 (40%), Positives = 60/112 (53%), Gaps = 4/112 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ +V AG G G +G +A F P D +GN+YVAD N+ IRKIT G
Sbjct: 1249 ITPAGLVTTFAGSGGIGALNGTGTAASFHNPTGITTDAQGNVYVADLYNNAIRKITPGGV 1308
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTT+AG GS G ADG +ASF N + + + + D NQLIR+I
Sbjct: 1309 VTTLAGTGSI--GSADGVGTSASFYNPNAVA-TDAVGNIYVVDTYNQLIRKI 1357
Score = 61.6 bits (148), Expect = 4e-07, Method: Composition-based stats.
Identities = 40/113 (35%), Positives = 61/113 (53%), Gaps = 12/113 (10%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ ++VV LAG+G G ++G +A F+ P D GN+YVAD N++IRK+T+ G
Sbjct: 1690 ITPDAVVTTLAGNGAAGSANGTGNAASFNHPWGIVADAAGNLYVADTYNNLIRKVTSAGS 1749
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
VTT AG G+ DG ASF+ + A+ I GN + ++N
Sbjct: 1750 VTTFAGSGAASS--VDGTGTAASFN---------YPSAISIDASGNLYVAELN 1791
Score = 61.6 bits (148), Expect = 4e-07, Method: Composition-based stats.
Identities = 43/111 (38%), Positives = 61/111 (54%), Gaps = 1/111 (0%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ VV AG G G +DG +A F+ P A D +GNIYVAD N VIRKIT GV
Sbjct: 1140 ITPSGVVTTFAGSGSSGAADGIGMAASFNSPYGLATDAQGNIYVADFGNQVIRKITPDGV 1199
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T G + G +G A A F++ +++ V + ++D NQ+IR+I
Sbjct: 1200 VTTFAGTTGVAGNVNGAAAAAKFNSPYDVA-VDVTGNVYVADELNQVIRKI 1249
Score = 60.8 bits (146), Expect = 5e-07, Method: Composition-based stats.
Identities = 44/113 (38%), Positives = 61/113 (53%), Gaps = 5/113 (4%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV LAG G G+ +G +A+F+ P+S A D GN+YVAD+ N IRKIT G
Sbjct: 217 ITPAGVVTTLAGTVGSSGFVNGTGTAAKFNYPRSVATDAAGNVYVADQVNQAIRKITPAG 276
Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTT AG G G +G A+F N +T + ++D N IR+I
Sbjct: 277 VVTTFAGSGVP--GALNGTGTAATFYNPTGVTMDAQ-GNVYVADSQNHSIRKI 326
Score = 60.8 bits (146), Expect = 6e-07, Method: Composition-based stats.
Identities = 45/113 (39%), Positives = 60/113 (53%), Gaps = 4/113 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ VV LAG G +DG +A F+ P AVD GN+YVAD N ++RKIT G
Sbjct: 2130 ITAAGVVTTLAGKLTAGSADGVGAAASFNLPAGVAVDASGNVYVADLLNSMVRKITPDGT 2189
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
VTT+A GS G ADG A F L V ++++D N IR+I+
Sbjct: 2190 VTTLA--GSTSAGSADGTGAAAGFHYPTNLQ-VDDQGNIIVADQLNNKIRKIS 2239
Score = 59.3 bits (142), Expect = 2e-06, Method: Composition-based stats.
Identities = 43/107 (40%), Positives = 60/107 (56%), Gaps = 5/107 (4%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
V AG+ G+ G ++ +A F P AVD GN+YVAD N++IRKI+ +G V+T A
Sbjct: 60 VSTFAGNAGIAGNTNATGTAATFHSPFGVAVDASGNVYVADAGNNLIRKISPVGVVSTFA 119
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G G A+G ASF+N F + L +SD + LIR+I
Sbjct: 120 GSGVA--GSANGTGTAASFNNPFGIATDVQ-GNLYVSDVNSNLIRKI 163
Score = 59.3 bits (142), Expect = 2e-06, Method: Composition-based stats.
Identities = 43/107 (40%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
V +AG G+ G ++G +A F P D GN+Y+AD +N +IRKIT G VTT A
Sbjct: 1037 VSTVAGSVGIAGKANGIGTAATFSGPSGVTTDASGNLYIADFNNRLIRKITPSGLVTTFA 1096
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G+ G +G ASF+N F LT + +SD N IR+I
Sbjct: 1097 GSGAA--GSENGNGAAASFNNPFGLTTDAQ-GNIYVSDANNNTIRKI 1140
Score = 58.2 bits (139), Expect = 3e-06, Method: Composition-based stats.
Identities = 36/89 (40%), Positives = 47/89 (52%), Gaps = 2/89 (2%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ E VV AG G G+ +G +A+F P A D +GN+YVAD N IRKIT G
Sbjct: 711 ITREGVVNTFAGTSGSSGFVNGTAAAAKFKNPIGIATDTQGNVYVADNGNLAIRKITPAG 770
Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDF 162
VTT+AG G K AQ + D+
Sbjct: 771 VVTTLAGSGFKDPFSVATDAQGNVYVMDY 799
Score = 57.0 bits (136), Expect = 9e-06, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 50/84 (59%), Gaps = 3/84 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V AG G DG +A F+ P + ++D GN+YVA+ + +VIRKI+ G VTTIAG
Sbjct: 1750 VTTFAGSGAASSVDGTGTAASFNYPSAISIDASGNLYVAELNGNVIRKISPAGVVTTIAG 1809
Query: 141 GGSKKEGRADGPAQNASFSNDFEL 164
G+ G A+G + A+F N + +
Sbjct: 1810 SGA--SGIANGIGKAATFGNLYSI 1831
Score = 54.7 bits (130), Expect = 4e-05, Method: Composition-based stats.
Identities = 41/121 (33%), Positives = 66/121 (54%), Gaps = 7/121 (5%)
Query: 69 FYTLSFPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
FY ++ + +V AG G G +DG +A F P+ A+D GN++V + N++
Sbjct: 1628 FYNQIRKITPDGLVTTFAGSLTGAPGATDGTGAAATFRSPRGMAIDALGNLFVVE-DNYL 1686
Query: 127 IRKIT-NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
IRKIT + VTT+AG G+ G A+G ASF++ + + L ++D N LIR+
Sbjct: 1687 IRKITPDAVVTTLAGNGAA--GSANGTGNAASFNHPWGIV-ADAAGNLYVADTYNNLIRK 1743
Query: 186 I 186
+
Sbjct: 1744 V 1744
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 30/60 (50%), Positives = 36/60 (60%), Gaps = 1/60 (1%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
+A F+ P AVD GNIYVAD+ N++IRKIT GV T G G ADG ASF+
Sbjct: 2100 AATFNHPTGLAVDAAGNIYVADQGNNMIRKITAAGVVTTL-AGKLTAGSADGVGAAASFN 2158
Score = 52.0 bits (123), Expect = 3e-04, Method: Composition-based stats.
Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 3/112 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ +VV G G G ++G+ +A F P A+D GN++VAD+ + IRKIT G
Sbjct: 1581 ITPATVVSTFVGTGSPGSTNGKGTAASFYVPYGMAIDAAGNLFVADQFYNQIRKITPDGL 1640
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTT AG + G DG A+F + + ++ D N LIR+I
Sbjct: 1641 VTTFAGSLTGAPGATDGTGAAATFRSPRGMAIDALGNLFVVED--NYLIRKI 1690
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 37/86 (43%), Positives = 48/86 (55%), Gaps = 4/86 (4%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSN 160
+F+ P AVD+ GN+YVAD+ N VIRKIT G VTT AG G G +G ASF N
Sbjct: 1221 KFNSPYDVAVDVTGNVYVADELNQVIRKITPAGLVTTFAGSGGI--GALNGTGTAASFHN 1278
Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
+T + ++D N IR+I
Sbjct: 1279 PTGITTDAQ-GNVYVADLYNNAIRKI 1303
Score = 50.8 bits (120), Expect = 5e-04, Method: Composition-based stats.
Identities = 43/123 (34%), Positives = 61/123 (49%), Gaps = 6/123 (4%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
VV AG G +G +A F D + N+YV D ++IRKIT G VTT+A
Sbjct: 610 VVSTFAGSGTAASVNGTGVAASFLHAYRLTTDAQSNVYVID--GNMIRKITPAGVVTTLA 667
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQS 199
G G G ADG ASF ++LT + ++D+ NQ IR+I + + + S
Sbjct: 668 GSGD--SGSADGTGTAASFHTPYDLTTDAQ-GNVYVADNFNQTIRKITREGVVNTFAGTS 724
Query: 200 GSA 202
GS+
Sbjct: 725 GSS 727
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 36/86 (41%), Positives = 47/86 (54%), Gaps = 4/86 (4%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
V LAG G G ++G +A F P A D GNIYV+D +N++IRKI G V+T A
Sbjct: 556 VSTLAGKAGSAGNANGTGTAATFSSPTGVATDPSGNIYVSDYNNNLIRKINLAGVVSTFA 615
Query: 140 GGGSKKEGRADGPAQNASFSNDFELT 165
G G+ G A ASF + + LT
Sbjct: 616 GSGTAASVNGTGVA--ASFLHAYRLT 639
Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats.
Identities = 41/108 (37%), Positives = 59/108 (54%), Gaps = 4/108 (3%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+VV AG G G + SA F+ P AVD +GN YV D+ N+++RKIT G V+T+
Sbjct: 2026 AVVTTFAGSGAAGSVNSTGTSATFNGPLDVAVDAEGNTYVLDQLNNLVRKITPAGVVSTL 2085
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG GS A A+F++ L V + ++D GN +IR+I
Sbjct: 2086 AGSGSSGSANGAATA--ATFNHPTGLA-VDAAGNIYVADQGNNMIRKI 2130
Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ + V LAG G +DG +A F P + VD +GNI VAD+ N+ IRKI+ G
Sbjct: 2184 ITPDGTVTTLAGSTSAGSADGTGAAAGFHYPTNLQVDDQGNIIVADQLNNKIRKISPAGV 2243
Query: 135 VTTIAG 140
VTTIAG
Sbjct: 2244 VTTIAG 2249
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 41/118 (34%), Positives = 57/118 (48%), Gaps = 29/118 (24%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ VV LAG G F P S A D +GN+YV D S ++RKI G
Sbjct: 766 ITPAGVVTTLAGSG-------------FKDPFSVATDAQGNVYVMDYSTPILRKILPTGT 812
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
VT +AG GS G A+G ++F +VP+ A + ++D GN LIR+I
Sbjct: 813 VTILAGDGSA--GSANGAGTVSNF-------YVPNALATDALGNIYVADAGNNLIRKI 861
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 27/51 (52%), Positives = 30/51 (58%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
VV LAG G G +DG SA F P + A D GNIYV D N +IRKIT
Sbjct: 1308 VVTTLAGTGSIGSADGVGTSASFYNPNAVATDAVGNIYVVDTYNQLIRKIT 1358
Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats.
Identities = 33/89 (37%), Positives = 45/89 (50%), Gaps = 5/89 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIA 139
V LAGDG G ++G + F P + A D GNIYVAD N++IRKIT N +T +
Sbjct: 813 VTILAGDGSAGSANGAGTVSNFYVPNALATDALGNIYVADAGNNLIRKITTGNYSITPML 872
Query: 140 GGG---SKKEGRADGPAQNASFSNDFELT 165
G + G G AS + + +T
Sbjct: 873 PAGLNFDQSTGTISGTPTVASPATTYTIT 901
Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats.
Identities = 44/139 (31%), Positives = 63/139 (45%), Gaps = 17/139 (12%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+ P S+V + G+G G +G +A + D+ GN++VAD NH+IRKIT
Sbjct: 1524 AVPAKTYSLVSTIVGNGSSGAVNGTGTAASLNLCDGLVFDLLGNMFVADFGNHMIRKITP 1583
Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN-----QLIRQI 186
V+T G GS G +G ASF +VP+ + I GN Q QI
Sbjct: 1584 ATVVSTFVGTGSP--GSTNGKGTAASF-------YVPY--GMAIDAAGNLFVADQFYNQI 1632
Query: 187 NLKPEDCSKSSQSGSALGA 205
D ++ +GS GA
Sbjct: 1633 RKITPDGLVTTFAGSLTGA 1651
Score = 44.3 bits (103), Expect = 0.056, Method: Composition-based stats.
Identities = 23/61 (37%), Positives = 34/61 (55%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+S VV +AG G G ++G +A F S A D G++YVAD+ ++IRKI G
Sbjct: 1798 ISPAGVVTTIAGSGASGIANGIGKAATFGNLYSIATDASGDVYVADQYKYIIRKIVGTGY 1857
Query: 136 T 136
+
Sbjct: 1858 S 1858
>gi|329908045|ref|ZP_08274805.1| hypothetical protein IMCC9480_3485 [Oxalobacteraceae bacterium
IMCC9480]
gi|327546778|gb|EGF31712.1| hypothetical protein IMCC9480_3485 [Oxalobacteraceae bacterium
IMCC9480]
Length = 685
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 86/180 (47%), Gaps = 25/180 (13%)
Query: 25 GLLLEDGYTVTTVIDGHQLE---INPHSVIDRPGSSDLIVLDSSRSAFYTLSFP------ 75
G+ + G T + G +LE ++ + R GS + D+S + +Y + P
Sbjct: 351 GVHVITGGTTLKTLAGKELERGMVDGNGAKARFGSLQGVTSDASGN-WYVVDTPHHMIRK 409
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ VV +LAG+G G DG A F+ P+ D GN++VAD N IRKIT G
Sbjct: 410 ITPAGVVSQLAGNGKPGGVDGTGSGASFNYPRGIVADALGNLFVADTFNSRIRKITPAGV 469
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINL 188
VTTIAG GS G DGP A F F P A L ++D GN +R+I +
Sbjct: 470 VTTIAGAGS-SSGSTDGPGNIARF-------FEPEAIAIDASRNLYVADTGNHTVRKITV 521
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 71/142 (50%), Gaps = 9/142 (6%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFD 104
P S+ IDR G+ L V D++ ++ + VV +AG GV G +DG +ARF
Sbjct: 120 PQSIAIDRAGT--LYVADTNNQTIRKIT----PQGVVTTIAGRVGVDGSTDGRGNAARFS 173
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
P+ AVD+ G + V+D NH +R I+ GV G + G DG A FSN L
Sbjct: 174 YPQGIAVDVAGTVLVSDTYNHTVRTISPGGVVGTLAGSAGNFGVLDGVRSAARFSNPQGL 233
Query: 165 TFVPHICALLISDHGNQLIRQI 186
+ ++D GN ++R++
Sbjct: 234 -VTDAARNIYVADAGNGVLRKV 254
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 58/113 (51%), Gaps = 2/113 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S + V LAG G +DG +ARF+ P+S A+D G +YVAD +N IRKIT G
Sbjct: 89 VSADGSVSTLAGAMSTPGAADGTGTAARFNYPQSIAIDRAGTLYVADTNNQTIRKITPQG 148
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
V T G +G DG A FS + V +L+SD N +R I+
Sbjct: 149 VVTTIAGRVGVDGSTDGRGNAARFSYPQGIA-VDVAGTVLVSDTYNHTVRTIS 200
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 53/105 (50%), Gaps = 6/105 (5%)
Query: 60 IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDM 113
IV D+ + F +F ++ VV +AG G G +DG ARF +P++ A+D
Sbjct: 443 IVADALGNLFVADTFNSRIRKITPAGVVTTIAGAGSSSGSTDGPGNIARFFEPEAIAIDA 502
Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
N+YVAD NH +RKIT GV + G K G DG A F
Sbjct: 503 SRNLYVADTGNHTVRKITVAGVVSTLAGSPGKYGSDDGTGAAARF 547
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 65/135 (48%), Gaps = 7/135 (5%)
Query: 58 DLIVLDSSRSAFYTLSF-----PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAV 111
+ I +D+SR+ + + ++ VV LAG G G DG +ARF KP + AV
Sbjct: 496 EAIAIDASRNLYVADTGNHTVRKITVAGVVSTLAGSPGKYGSDDGTGAAARFLKPPAIAV 555
Query: 112 DMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC 171
D GN+ ++ + IRKIT GV T G + G DG A F N L +
Sbjct: 556 DPSGNVVLSQPAYGTIRKITPGGVVTTLAGRVLETGALDGAGDAARFFNPQGLA-ADNAG 614
Query: 172 ALLISDHGNQLIRQI 186
+ ++D GN IR+I
Sbjct: 615 NVYVADTGNNTIRRI 629
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 44/86 (51%), Gaps = 1/86 (1%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
ARF+ P AVD GN YVAD NHVIRK++ G + G G ADG A F N
Sbjct: 60 ARFNFPIGIAVDGVGNRYVADWQNHVIRKVSADGSVSTLAGAMSTPGAADGTGTAARF-N 118
Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
+ + L ++D NQ IR+I
Sbjct: 119 YPQSIAIDRAGTLYVADTNNQTIRKI 144
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 40/84 (47%), Gaps = 1/84 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S VV LAG G G DG +ARF P+ D NIYVAD N V+RK+T G
Sbjct: 199 ISPGGVVGTLAGSAGNFGVLDGVRSAARFSNPQGLVTDAARNIYVADAGNGVLRKVTPAG 258
Query: 135 VTTIAGGGSKKEGRADGPAQNASF 158
+ T G G DG A F
Sbjct: 259 IVTTLAGSLANYGLKDGTGAAAGF 282
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 3/78 (3%)
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
S+ A+ T+ ++ VV LAG ++ G DG +ARF P+ A D GN+YVAD
Sbjct: 564 SQPAYGTIR-KITPGGVVTTLAGRVLETGALDGAGDAARFFNPQGLAADNAGNVYVADTG 622
Query: 124 NHVIRKITNLG-VTTIAG 140
N+ IR+IT G V+T+AG
Sbjct: 623 NNTIRRITPSGQVSTVAG 640
>gi|408823744|ref|ZP_11208634.1| NHL repeat-containing protein [Pseudomonas geniculata N1]
Length = 681
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 75/152 (49%), Gaps = 11/152 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
+DG L+ N S I +L V D+ A + + V LAG G QGY+D
Sbjct: 102 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGYAD 156
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQ 154
G ARFD P AVD +G +YVAD N IR I T+ V T+AGG ++ G ADG
Sbjct: 157 GPAAQARFDAPMGIAVDAQGQVYVADTYNDRIRVIGTDGNVRTLAGG--ERPGMADGVGA 214
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A F L F ALL++D N +R+I
Sbjct: 215 AARFDTPVALAFDAQ-GALLVADLFNNAVRRI 245
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 58/106 (54%), Gaps = 6/106 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V+ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I G VTT+AG
Sbjct: 91 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGTDGQVTTLAG 149
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++G ADGPA A F + V + ++D N IR I
Sbjct: 150 G---EQGYADGPAAQARFDAPMGIA-VDAQGQVYVADTYNDRIRVI 191
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 59/114 (51%), Gaps = 6/114 (5%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK--SNHVIRKITN 132
PL+ + ++ LAGDG G DG ARF P + G++Y D +N + R++ +
Sbjct: 29 PLAWTAQIEPLAGDGHPGNRDGAAAQARFADPYALLRSADGSVYFTDAGDNNRIRRRLPD 88
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V T+AG G EGR DGPA ASF+ + L ++D GN IR+I
Sbjct: 89 GRVETVAGQG---EGRVDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRI 138
>gi|225872818|ref|YP_002754275.1| hypothetical protein ACP_1177 [Acidobacterium capsulatum ATCC
51196]
gi|225794459|gb|ACO34549.1| hypothetical protein ACP_1177 [Acidobacterium capsulatum ATCC
51196]
Length = 855
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 73/140 (52%), Gaps = 7/140 (5%)
Query: 49 SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKP 106
S + GS +L + D++ + +S VV +AG+G +GYS DG SA D P
Sbjct: 82 SAVAYDGSGNLYIADTNNNVIR----EVSTTGVVTTIAGNGEEGYSGDGAAATSAMLDTP 137
Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
AVD GNIY+AD N+ IR+++N + T+AG G+ DG A ++ +D
Sbjct: 138 TGIAVDSNGNIYIADSHNNRIREVSNGIINTVAGNGTAGY-SGDGAAATSAMLDDPTAVA 196
Query: 167 VPHICALLISDHGNQLIRQI 186
V + I+D GNQ IR +
Sbjct: 197 VDASGNIYIADTGNQRIRNV 216
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 59/107 (55%), Gaps = 6/107 (5%)
Query: 85 LAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG G GYS G SA F P + A D GN+Y+AD +N+VIR+++ G VTTIAG
Sbjct: 58 VAGTGSAGYSGNGGAATSAAFASPSAVAYDGSGNLYIADTNNNVIREVSTTGVVTTIAGN 117
Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G +EG DG A ++ + V + I+D N IR+++
Sbjct: 118 G--EEGYSGDGAAATSAMLDTPTGIAVDSNGNIYIADSHNNRIREVS 162
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 2/66 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
++ +AG+G GYS DG SA D P + AVD GNIY+AD N IR + + T+
Sbjct: 165 IINTVAGNGTAGYSGDGAAATSAMLDDPTAVAVDASGNIYIADTGNQRIRNVAAGTIHTV 224
Query: 139 AGGGSK 144
AG G +
Sbjct: 225 AGNGEE 230
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 60/111 (54%), Gaps = 5/111 (4%)
Query: 81 VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
+ +AG+G +GYS DG ++ D P AVD GNIY+AD N+ IR+++ + T+
Sbjct: 220 TIHTVAGNGEEGYSGDGAAAASAELDTPTGIAVDSSGNIYIADSHNNRIREVSGGVINTV 279
Query: 139 AGGGSKK-EGRADGPAQNASFSNDFELTFVPHICA--LLISDHGNQLIRQI 186
AG G+ G G +A+ + + T V A + I+D N+ +R+I
Sbjct: 280 AGSGAVTFPGSYSGDGGSATAATLAKPTGVALDAAGHVYIADTNNERLREI 330
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 9/124 (7%)
Query: 81 VVKRLAGDGV---QGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
V+ +AG G G G+ GSA KP A+D G++Y+AD +N +R+I N
Sbjct: 275 VINTVAGSGAVTFPGSYSGDGGSATAATLAKPTGVALDAAGHVYIADTNNERLREIANGV 334
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
+ T+AG G + DG A ++ ND V ++ ++D N+ +R L P +
Sbjct: 335 IATVAGNGQQGY-SGDGAAATSAALNDPRNASVNASGSVAVADTLNERVR--GLTPPTLT 391
Query: 195 KSSQ 198
+SQ
Sbjct: 392 YASQ 395
>gi|223939820|ref|ZP_03631690.1| NHL repeat containing protein [bacterium Ellin514]
gi|223891508|gb|EEF57999.1| NHL repeat containing protein [bacterium Ellin514]
Length = 805
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/144 (37%), Positives = 78/144 (54%), Gaps = 16/144 (11%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSF 109
ID+ G +L+V D+ S ++ P+ VV LAG G +DG ARF+ P+
Sbjct: 358 IDKAG--NLLVADTQNSEIRKIA-PVGTNYVVTTLAGFAQNVGGADGTNAVARFNSPRGI 414
Query: 110 AVDMKGNIYVADKSNHVIRKI----TNLGVTTIAGGGSKKEGRA---DGPAQNASFSNDF 162
AVD GN++VAD++N+ IRK+ TN VTTIAG + G A DG NA F+
Sbjct: 415 AVDAAGNVFVADQNNNTIRKLTPVGTNWAVTTIAG----QAGMAYYGDGNGTNAYFNYPA 470
Query: 163 ELTFVPHICALLISDHGNQLIRQI 186
+ V L ++D GN +IR++
Sbjct: 471 GIA-VDASGNLFVTDAGNHVIRKL 493
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 73/145 (50%), Gaps = 26/145 (17%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMK 114
S L V DSS ++ ++ P+ VV +AG V G +DG G A F++P AVD
Sbjct: 187 SGHLFVGDSSNNSIRKMT-PVGTNWVVTTIAGSSPVSGSNDGTNGFAHFNQPCGLAVDAA 245
Query: 115 GNIYVADKSNHVIRKITNLG----VTTIAGGGSKKEGRAD---GPAQNASFSNDFELTFV 167
G+I+VAD N IRKIT+ G VTTIAG K G AD G NA F+
Sbjct: 246 GSIFVADYFNSTIRKITSAGTNWLVTTIAG----KVGVADSAEGTGTNAVFN-------Y 294
Query: 168 PHICA------LLISDHGNQLIRQI 186
PH A + ++D N IR++
Sbjct: 295 PHAVAVDTNGNVFVADSENYTIRKL 319
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 56/112 (50%), Gaps = 18/112 (16%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI----TNLGVTTIAG 140
+AG G +DG ARF+ P++ A D GNIYVAD NHVIRK+ TN VTTIA
Sbjct: 42 IAGVASIGSNDGTNRVARFNNPEATASDSAGNIYVADSGNHVIRKMAAVGTNWVVTTIA- 100
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
G G ADG A F + P A + ++D N IR+I
Sbjct: 101 GLPGNPGSADGTNSAAQF-------YYPAAVAVDNSGNVFVADSANYTIRKI 145
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 69/136 (50%), Gaps = 8/136 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMK 114
S ++ V DS+ ++ PL VV +AG G G SDG A F P+ A D
Sbjct: 129 SGNVFVADSANYTIRKIA-PLGTNWVVTTIAGVPGYHGSSDGTNTDALFFYPEGLAFDSS 187
Query: 115 GNIYVADKSNHVIRKITNLG----VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
G+++V D SN+ IRK+T +G VTTIA G S G DG A F+ L V
Sbjct: 188 GHLFVGDSSNNSIRKMTPVGTNWVVTTIA-GSSPVSGSNDGTNGFAHFNQPCGLA-VDAA 245
Query: 171 CALLISDHGNQLIRQI 186
++ ++D+ N IR+I
Sbjct: 246 GSIFVADYFNSTIRKI 261
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/117 (38%), Positives = 59/117 (50%), Gaps = 19/117 (16%)
Query: 81 VVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----V 135
VV +AG G G +DG +A+F P + AVD GN++VAD +N+ IRKI LG V
Sbjct: 95 VVTTIAGLPGNPGSADGTNSAAQFYYPAAVAVDNSGNVFVADSANYTIRKIAPLGTNWVV 154
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
TTIA G G +DG +A F F P A L + D N IR++
Sbjct: 155 TTIA-GVPGYHGSSDGTNTDALF-------FYPEGLAFDSSGHLFVGDSSNNSIRKM 203
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 49/98 (50%), Gaps = 6/98 (6%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTTIAGGGSKKEG 147
G+ DG ARF P A+D +YV D+ N+ IRK+T LG VTT+A G G
Sbjct: 630 GFMDGTNSDARFAYPTGIAIDTNDTLYVTDQGNNTIRKVTPLGTNWMVTTLA-GIHASTG 688
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
ADG A F+ F + + L ++D N IR+
Sbjct: 689 SADGAGSAALFNGPFGIA-IDKTGNLFVADLQNSSIRK 725
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 7/111 (6%)
Query: 81 VVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
+V +AG GV ++G +A F+ P + AVD GN++VAD N+ IRK+T G +
Sbjct: 269 LVTTIAGKVGVADSAEGTGTNAVFNYPHAVAVDTNGNVFVADSENYTIRKLTPSGTNYVV 328
Query: 140 G---GGSKKEGRADGPAQNASFSNDFELTF-VPHICALLISDHGNQLIRQI 186
G ++ DG A F F L + LL++D N IR+I
Sbjct: 329 STPIGQTRASNSTDGTNNAARFW--FLLGISIDKAGNLLVADTQNSEIRKI 377
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 75/162 (46%), Gaps = 11/162 (6%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFD 104
PH+V +D G ++ V DS L+ P VV G +DG +ARF
Sbjct: 295 PHAVAVDTNG--NVFVADSENYTIRKLT-PSGTNYVVSTPIGQTRASNSTDGTNNAARFW 351
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKI----TNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
++D GN+ VAD N IRKI TN VTT+A G ++ G ADG A F++
Sbjct: 352 FLLGISIDKAGNLLVADTQNSEIRKIAPVGTNYVVTTLA-GFAQNVGGADGTNAVARFNS 410
Query: 161 DFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
+ V + ++D N IR++ + + ++ +G A
Sbjct: 411 PRGIA-VDAAGNVFVADQNNNTIRKLTPVGTNWAVTTIAGQA 451
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 8/114 (7%)
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI----TNLGVTTIAGGGSKKEGRA 149
+DG +ARF VD GN++VAD +N +IRKI TN TTIAG + + A
Sbjct: 516 TDGTNANARFFIVDGITVDAAGNLFVADNNNCLIRKIAPVGTNWITTTIAGKLNSYD-FA 574
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
DG N F+ + V + + D GN ++R+ L P + S + +A
Sbjct: 575 DGVGTNILFNQPTGIA-VGKGGVVYVVDMGNNMVRK--LTPNGTNYVSSTVAAF 625
>gi|254522386|ref|ZP_05134441.1| NHL repeat containing protein [Stenotrophomonas sp. SKA14]
gi|219719977|gb|EED38502.1| NHL repeat containing protein [Stenotrophomonas sp. SKA14]
Length = 643
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/153 (37%), Positives = 76/153 (49%), Gaps = 11/153 (7%)
Query: 37 VIDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS 94
+IDG L+ N S I +L V D+ A + + V LAG GVQG+
Sbjct: 63 LIDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGI----DGQVTTLAG-GVQGHG 117
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPA 153
DG ARFD P AVD +G +YVAD N IR I T+ V T+AGG ++ G ADG
Sbjct: 118 DGPVAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGG--ERPGLADGAG 175
Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A F L F ALL++D N +R++
Sbjct: 176 DAARFDTPVALAFDAQ-GALLVADLFNNAVRRV 207
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 55/106 (51%), Gaps = 6/106 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V+ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I G VTT+AG
Sbjct: 53 VETVAGQG-EGLIDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGIDGQVTTLAG 111
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +G DGP A F + V + ++D N IR I
Sbjct: 112 G---VQGHGDGPVAQARFDAPMGIA-VDAQGQVYVADTFNDRIRVI 153
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 53/106 (50%), Gaps = 6/106 (5%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD--KSNHVIRKITNLGVTTIAGGG 142
+AGDG G DG ARF P + G++Y D +N + R++ + V T+AG G
Sbjct: 1 MAGDGHPGDRDGASAQARFADPYALLRGADGSVYFTDAGDNNRIRRRLPDGRVETVAGQG 60
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
EG DGPA ASF+ + L ++D GN IR+I +
Sbjct: 61 ---EGLIDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRIGI 102
>gi|171913873|ref|ZP_02929343.1| NHL repeat containing protein [Verrucomicrobium spinosum DSM 4136]
Length = 370
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/130 (36%), Positives = 71/130 (54%), Gaps = 7/130 (5%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI--- 130
++ + ++ +AG+G +GY+ DG P +A FD P D GN Y+AD SNH IRK+
Sbjct: 84 VTPDGKIETMAGNGKKGYTGDGGPATAASFDLPHEIRFDKAGNYYIADMSNHAIRKVDAK 143
Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
T + +TT+AG G DGP++ A + F P L I D GN +IR+I++K
Sbjct: 144 TGI-ITTLAGTGRGGYTGDDGPSEKAELKQPHSIQFGPD-GDLYICDIGNHVIRKIDMKT 201
Query: 191 EDCSKSSQSG 200
S + +G
Sbjct: 202 GKISTFAGTG 211
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 58/117 (49%), Gaps = 6/117 (5%)
Query: 77 SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+ E + AG GV+G+S DG P + A+ D P G I+ + + IR++T G
Sbjct: 29 AAEWTISTFAGTGVKGFSGDGGPATQAQMDNPFGVVRGPDGAIWYTEYTGQRIRRVTPDG 88
Query: 135 -VTTIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ T+AG G KK D GPA ASF E+ F I+D N IR+++ K
Sbjct: 89 KIETMAGNG-KKGYTGDGGPATAASFDLPHEIRF-DKAGNYYIADMSNHAIRKVDAK 143
Score = 37.4 bits (85), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 4/74 (5%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+ ++ +AG G G++ +G P A PK A+D +GN ++AD +H +R I
Sbjct: 257 KAGIIHHIAGTGASGFTGNGGPAKLATLKGPKGIALDAEGNAWLADTESHTVRVIMAKTG 316
Query: 136 T--TIAGGGSKKEG 147
T +AG G K +G
Sbjct: 317 TLELLAGTGEKGDG 330
>gi|374606289|ref|ZP_09679172.1| copper amine oxidase domain-containing protein [Paenibacillus
dendritiformis C454]
gi|374388103|gb|EHQ59542.1| copper amine oxidase domain-containing protein [Paenibacillus
dendritiformis C454]
Length = 564
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 73/138 (52%), Gaps = 13/138 (9%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
L + V +AG G+ G+ DG+ ARF++P+ AV G++YVAD NHVIR+I N
Sbjct: 165 LDKSGRVTTVAGSGLAGWKDGKGAEARFNEPRDVAVAEDGSLYVADALNHVIRRIDANGN 224
Query: 135 VTTI----------AGGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
VTT+ A G G ADG + F+ L F P L++SD GNQ +
Sbjct: 225 VTTLNARSKRIVEYAPGAVTAAGDYADGKLMVSKFNEPSSLAFTPS-GDLVVSDTGNQRL 283
Query: 184 RQINLKPEDCSKSSQSGS 201
R ++LK + + + +GS
Sbjct: 284 RLVDLKQKRVTTLAGAGS 301
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 58/96 (60%), Gaps = 4/96 (4%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADG 151
++DG+ ARF++P A D +GN+Y+AD +NHVIRK+ G VTT+AG G G DG
Sbjct: 128 WADGKGTEARFNEPMGMAEDRQGNLYIADAANHVIRKLDKSGRVTTVAGSGLA--GWKDG 185
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
A F+ ++ V +L ++D N +IR+I+
Sbjct: 186 KGAEARFNEPRDVA-VAEDGSLYVADALNHVIRRID 220
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 73/142 (51%), Gaps = 14/142 (9%)
Query: 68 AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
A Y+ FP + RL G GY DG+ A F+ P A+ +G I VAD+ NH I
Sbjct: 303 ASYSYKFPDT------RLYAAG--GYRDGKASEALFNGPAGIALTGEGGIVVADRWNHAI 354
Query: 128 RKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN---QLIR 184
R + + V T++GGG + G +G A+ A+F ++ + + + ++D N +LIR
Sbjct: 355 RYLYDGKVYTLSGGG--RTGHQNGWAEQATFREPMDVAVLSN-GTIAVADGFNNSIRLIR 411
Query: 185 QINLKPEDCSKSSQSGSALGAV 206
+ L E S ++ S SA AV
Sbjct: 412 RYTLPEEVRSPAATSPSAQDAV 433
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 55/121 (45%), Gaps = 11/121 (9%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
PL E++ AG GV G +DG A+F P G + +AD NH+IR+ G
Sbjct: 49 PLWYETI--DWAGSGVSGSADGRAEQAQFRYPAGLLAGKNGALLIADTYNHLIRRADAAG 106
Query: 135 -VTTIAG----GGSKKE---GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V+T+AG +E ADG A F+ + L I+D N +IR++
Sbjct: 107 QVSTLAGQVRFAAETREPNGSWADGKGTEARFNEPMGMA-EDRQGNLYIADAANHVIRKL 165
Query: 187 N 187
+
Sbjct: 166 D 166
>gi|409204160|ref|ZP_11232357.1| putative outer membrane adhesin-like protein [Pseudoalteromonas
flavipulchra JG1]
Length = 2380
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 68/131 (51%), Gaps = 5/131 (3%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
++DL + D + A ++ SVV AG G G ++G A F+KP A+D
Sbjct: 735 NNDLFIADRNNHAIR----KVTSASVVTTFAGTGSAGSANGTGTQASFNKPYDIALDSAD 790
Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
N+YV++++ H IRKIT+ GV T G + G DG A A FS F + V + +
Sbjct: 791 NLYVSEQTGHTIRKITSAGVVTTFAGSAGVSGSTDGLASVARFSQPFGIA-VDSNGIVYV 849
Query: 176 SDHGNQLIRQI 186
+D GN IR+I
Sbjct: 850 ADTGNHRIRKI 860
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/113 (40%), Positives = 60/113 (53%), Gaps = 12/113 (10%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ E VV LAG G G +G +A F+ PK+ VD GN+YVAD SN+ IRKIT G
Sbjct: 588 ITPEGVVTTLAGSGTAGSDEGTGSAASFNFPKAVTVDASGNVYVADSSNNKIRKITPAGV 647
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
VTT AG GS G DG A+F+ P + I +GN + + N
Sbjct: 648 VTTFAGSGSP--GSTDGTGTAATFA-------AP--TGITIDSNGNLFVVETN 689
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 63/116 (54%), Gaps = 5/116 (4%)
Query: 72 LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LS S E+ V LAG GY+DG +A F P A D N+YVAD SNHVIRKIT
Sbjct: 531 LSLNTSTEATVSTLAGQS-SGYADGTGTAASFKSPYDVATDSNNNVYVADYSNHVIRKIT 589
Query: 132 NLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G VTT+AG G+ G +G ASF+ +T V + ++D N IR+I
Sbjct: 590 PEGVVTTLAGSGTA--GSDEGTGSAASFNFPKAVT-VDASGNVYVADSSNNKIRKI 642
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 60/131 (45%), Gaps = 5/131 (3%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S ++ V DSS + ++ VV AG G G +DG +A F P +D G
Sbjct: 626 SGNVYVADSSNNKIR----KITPAGVVTTFAGSGSPGSTDGTGTAATFAAPTGITIDSNG 681
Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
N++V + + H++RKIT GV T G G D +A F+ + + L I
Sbjct: 682 NLFVVETNPHIVRKITPAGVVTTFAGSKGSSGFTDATGTSAKFNFPYNGGSSSN-NDLFI 740
Query: 176 SDHGNQLIRQI 186
+D N IR++
Sbjct: 741 ADRNNHAIRKV 751
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 70/134 (52%), Gaps = 10/134 (7%)
Query: 60 IVLDSSRSAFYTLSFP-----LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
I +DS+ + F + P ++ VV AG G G++D SA+F+ P +
Sbjct: 675 ITIDSNGNLFVVETNPHIVRKITPAGVVTTFAGSKGSSGFTDATGTSAKFNFPYNGGSSS 734
Query: 114 KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
++++AD++NH IRK+T+ VTT AG GS G A+G ASF+ +++ +
Sbjct: 735 NNDLFIADRNNHAIRKVTSASVVTTFAGTGSA--GSANGTGTQASFNKPYDIA-LDSADN 791
Query: 173 LLISDHGNQLIRQI 186
L +S+ IR+I
Sbjct: 792 LYVSEQTGHTIRKI 805
>gi|329908044|ref|ZP_08274804.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
IMCC9480]
gi|327546777|gb|EGF31711.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
IMCC9480]
Length = 642
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 53/112 (47%), Gaps = 1/112 (0%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ VV +LAG G G DG A F PK D GN++VAD N IRKIT GV
Sbjct: 369 ITPAGVVSQLAGSGKWGSVDGTGAGASFSAPKGIVADPLGNVFVADTYNSTIRKITPAGV 428
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
T G G DGP A FS E + L + D GN IR+I+
Sbjct: 429 VTTIAGAPSSTGSTDGPGNLARFSGP-EAIAIDAQRNLYVGDTGNHTIRKIS 479
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/144 (36%), Positives = 71/144 (49%), Gaps = 11/144 (7%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARF 103
+P S+ IDR G L V DS +S + VV LAG G G +DG +ARF
Sbjct: 78 SPESIAIDRAGM--LYVADSVNHTIRKVS----PQGVVTTLAGRAGEPGSADGRGSAARF 131
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
PK AVD+ GN+ V+D +NH IRKI+ G T G ++ DG A F+ F
Sbjct: 132 FDPKGVAVDVAGNVVVSDNANHTIRKISPDGNVTTLAGSARNASALDGIGSAARFA--FP 189
Query: 164 LTFVPHICA-LLISDHGNQLIRQI 186
V + + D GN L+R+I
Sbjct: 190 QALVTDAARNVYVVDRGNGLLRKI 213
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 62/114 (54%), Gaps = 4/114 (3%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G DG +ARF P+S A+D G +YVAD NH IRK++ GV T G + + G ADG
Sbjct: 65 GAVDGRGAAARFYSPESIAIDRAGMLYVADSVNHTIRKVSPQGVVTTLAGRAGEPGSADG 124
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
A F D + V +++SD+ N IR+I+ D + ++ +GSA A
Sbjct: 125 RGSAARFF-DPKGVAVDVAGNVVVSDNANHTIRKIS---PDGNVTTLAGSARNA 174
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 58/114 (50%), Gaps = 3/114 (2%)
Query: 74 FPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+ ++ S KRLAG + +G ARF+ P + D GN YVAD N++IRKIT
Sbjct: 312 YVVTNGSTFKRLAGLAPELALVNGNGAQARFNLPWAVTSDATGNWYVADAGNYMIRKITP 371
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
GV + GS K G DG ASFS + P + + ++D N IR+I
Sbjct: 372 AGVVSQL-AGSGKWGSVDGTGAGASFSAPKGIVADP-LGNVFVADTYNSTIRKI 423
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 53/113 (46%), Gaps = 4/113 (3%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S VV LAG G G DG +AR P+S +VD GN+YV S +RKIT G
Sbjct: 478 ISTSGVVSTLAGSPGRYGSEDGTGAAARLASPRSMSVDQAGNVYVI--SYRAVRKITPAG 535
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
V T G + G D ++A F LT + +SD IR+I+
Sbjct: 536 VVTTWAGQALAYGNVDAVGEDARFGYLLALT-ADAAGNVYVSDTAATTIRKID 587
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 1/77 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV +AG G +DG ARF P++ A+D + N+YV D NH IRKI+ G
Sbjct: 423 ITPAGVVTTIAGAPSSTGSTDGPGNLARFSGPEAIAIDAQRNLYVGDTGNHTIRKISTSG 482
Query: 135 VTTIAGGGSKKEGRADG 151
V + G + G DG
Sbjct: 483 VVSTLAGSPGRYGSEDG 499
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 47/88 (53%), Gaps = 2/88 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
ARF P S VD GN Y+AD++N+ IRK+ G V+T+AG + G DG A F
Sbjct: 18 ARFAYPGSMVVDSAGNRYIADQNNNQIRKVRPDGSVSTLAGASASTPGAVDGRGAAARFY 77
Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
+ E + L ++D N IR+++
Sbjct: 78 SP-ESIAIDRAGMLYVADSVNHTIRKVS 104
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S + V LAG + DG +ARF P++ D N+YV D+ N ++RKIT G
Sbjct: 158 ISPDGNVTTLAGSARNASALDGIGSAARFAFPQALVTDAARNVYVVDRGNGLLRKITPAG 217
Query: 135 -VTTIAGG 141
VTT+A G
Sbjct: 218 IVTTLASG 225
>gi|392540748|ref|ZP_10287885.1| Ig family protein [Pseudoalteromonas piscicida JCM 20779]
Length = 2384
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 61/116 (52%), Gaps = 5/116 (4%)
Query: 72 LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LS S E+ V LAG GY+DG +A F P D GN+YVAD NHVIRKIT
Sbjct: 535 LSLNTSTEATVSTLAGQS-SGYADGTGTAASFKSPYDLVTDSNGNVYVADYGNHVIRKIT 593
Query: 132 NLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G VTT+AG GS G DG ASF+ +T + ++D N IR++
Sbjct: 594 PEGVVTTLAGSGSA--GSDDGTGSAASFNFPKAVTL-DSSGNVYVADSSNNNIRKV 646
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 65/131 (49%), Gaps = 5/131 (3%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
++DL + D + A ++ SVV AG G G ++G A F+KP A D
Sbjct: 739 NNDLFIADRNNHAIR----KVTSASVVTTFAGTGSAGSTNGTGTQASFNKPYDVAADSAD 794
Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
N+YV +++ H IRKIT+ GV T G + G DG A FS + + V + +
Sbjct: 795 NLYVTEQAAHTIRKITSTGVVTTYAGSAGASGNTDGLVSVARFSQPYGIA-VDSNDVVYV 853
Query: 176 SDHGNQLIRQI 186
+D GN IR+I
Sbjct: 854 ADTGNHRIRKI 864
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 60/113 (53%), Gaps = 12/113 (10%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ E VV LAG G G DG +A F+ PK+ +D GN+YVAD SN+ IRK+T G
Sbjct: 592 ITPEGVVTTLAGSGSAGSDDGTGSAASFNFPKAVTLDSSGNVYVADSSNNNIRKVTPAGV 651
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
VTT AG G+ G DG A+F+ P + I +GN + + N
Sbjct: 652 VTTFAGSGTY--GSDDGTGTAATFA-------AP--TGITIDSNGNLYVVETN 693
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 1/106 (0%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
VV AG G G DG +A F P +D GN+YV + + H++RKIT GV T
Sbjct: 651 VVTTFAGSGTYGSDDGTGTAATFAAPTGITIDSNGNLYVVETNPHIVRKITPAGVVTTFA 710
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G D +A+F+ + + L I+D N IR++
Sbjct: 711 GSKNSSGFTDATGTSATFNFPYNGGSNSN-NDLFIADRNNHAIRKV 755
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 67/134 (50%), Gaps = 10/134 (7%)
Query: 60 IVLDSSRSAFYTLSFP-----LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
I +DS+ + + + P ++ VV AG G++D SA F+ P + +
Sbjct: 679 ITIDSNGNLYVVETNPHIVRKITPAGVVTTFAGSKNSSGFTDATGTSATFNFPYNGGSNS 738
Query: 114 KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
++++AD++NH IRK+T+ VTT AG GS G +G ASF+ +++
Sbjct: 739 NNDLFIADRNNHAIRKVTSASVVTTFAGTGSA--GSTNGTGTQASFNKPYDVA-ADSADN 795
Query: 173 LLISDHGNQLIRQI 186
L +++ IR+I
Sbjct: 796 LYVTEQAAHTIRKI 809
>gi|428179226|gb|EKX48098.1| hypothetical protein GUITHDRAFT_69001 [Guillardia theta CCMP2712]
Length = 334
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 60/175 (34%), Positives = 80/175 (45%), Gaps = 18/175 (10%)
Query: 14 LLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS 73
++LL GL EDG G Q N I DL + D
Sbjct: 25 WITLLAGCEEDGL--EDGV-------GDQARFNSPVGIVESKEGDLYLCDQENHCIR--- 72
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + V LAG+G +GY DG+ AR + P +D +GNI AD N IRK++
Sbjct: 73 -KVTRKGEVTTLAGNGEEGYHDGKGKDARLNIPTGLCMDAQGNIIFADSGNQRIRKVSPD 131
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G VTTIAG K+G DGPA A F N V ++ +SD GN IR+I+
Sbjct: 132 GTVTTIAG---SKKGFKDGPAGKALF-NYPAYVAVDSKGSIFVSDFGNHCIRKID 182
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 67/117 (57%), Gaps = 5/117 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+S + V +AG +G+ DG G A F+ P AVD KG+I+V+D NH IRKI G
Sbjct: 128 VSPDGTVTTIAGS-KKGFKDGPAGKALFNYPAYVAVDSKGSIFVSDFGNHCIRKIDGEGM 186
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
VTT+AG G K G ADG A F N + + + I+D+GNQ IR+++ + E
Sbjct: 187 VTTVAGNG--KMGWADGKGAKARF-NSPQGMCIDKEDTVYIADYGNQRIRKMSKEGE 240
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 61/118 (51%), Gaps = 14/118 (11%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+ +E + LAG G DG ARF+ P +G++Y+ D+ NH IRK+T G
Sbjct: 20 IDKEGWITLLAGCEEDGLEDGVGDQARFNSPVGIVESKEGDLYLCDQENHCIRKVTRKGE 79
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQIN 187
VTT+AG G +EG DG ++A L +C ++ +D GNQ IR+++
Sbjct: 80 VTTLAGNG--EEGYHDGKGKDA------RLNIPTGLCMDAQGNIIFADSGNQRIRKVS 129
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 60/110 (54%), Gaps = 4/110 (3%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
E +V +AG+G G++DG+ ARF+ P+ +D + +Y+AD N IRK++ G V T
Sbjct: 184 EGMVTTVAGNGKMGWADGKGAKARFNSPQGMCIDKEDTVYIADYGNQRIRKMSKEGEVVT 243
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+AG G + G A G Q A F ++ V + + D N +R+I+
Sbjct: 244 VAGSG--EPGFAHGHGQLARFRGPRSVS-VSQDGIVYVGDRENFRVRKIS 290
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+S+E V +AG G G++ G ARF P+S +V G +YV D+ N +RKI+ G
Sbjct: 235 MSKEGEVVTVAGSGEPGFAHGHGQLARFRGPRSVSVSQDGIVYVGDRENFRVRKISEDGY 294
Query: 135 VTTIAGGG 142
V T AG G
Sbjct: 295 VWTFAGMG 302
>gi|223937730|ref|ZP_03629631.1| NHL repeat containing protein [bacterium Ellin514]
gi|223893523|gb|EEF59983.1| NHL repeat containing protein [bacterium Ellin514]
Length = 1474
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 89/181 (49%), Gaps = 34/181 (18%)
Query: 43 LEINPHSVIDRPGSSDLIVL----DSSRSAFYTLSFPL--SEESVVKRLAGDGVQGYSDG 96
L NP + G+S L+V + + SA T ++ L E +VV+ AG GV G+++G
Sbjct: 1002 LYTNPFPI---SGNSTLVVKAWLSNYAPSALATATYSLLDYEPTVVQTYAGTGVPGFTNG 1058
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQN 155
A+F+ P+ VD G ++V+D N+VIRKI TN VTT AG G G DG N
Sbjct: 1059 PNLKAQFNAPQGICVDGLGTLFVSDTGNNVIRKISTNGVVTTFAGSG--VAGTHDGVGTN 1116
Query: 156 ASFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDC-------SKSSQSGSA 202
ASF P A L ++D GN LIR++ P+ S SQ G A
Sbjct: 1117 ASF-------LAPTGIALDSSNNLYVADSGNSLIRKV--TPDGLVTTLTTFSDPSQQGPA 1167
Query: 203 L 203
L
Sbjct: 1168 L 1168
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 54/104 (51%), Gaps = 7/104 (6%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK 144
AGD + GY DG ARF +PK+ A G+I V+D + IRKI + G VTT+AG G
Sbjct: 1252 AGD-LSGYMDGPRLLARFQRPKALATASDGSIIVSDWTR--IRKIHSDGRVTTLAGAG-- 1306
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G +G A+F N V + +D N IR+I++
Sbjct: 1307 DLGLRNGSGLFAAF-NQLGAVTVDSAGNIYAADAANHSIRKISV 1349
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 29/50 (58%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG G G +G A F++ + VD GNIY AD +NH IRKI+
Sbjct: 1299 VTTLAGAGDLGLRNGSGLFAAFNQLGAVTVDSAGNIYAADAANHSIRKIS 1348
>gi|344205849|ref|YP_004790990.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
gi|343777211|gb|AEM49764.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
Length = 693
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 76/152 (50%), Gaps = 11/152 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
+DG L+ N S I +L V D+ A +S + V LAG G QG++D
Sbjct: 114 VDGPALQASFNTPSGIAADVQGNLYVADTGNHAIRRIS----TDGQVTTLAG-GEQGHAD 168
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQ 154
G ARFD P AVD +G +YVAD N IR I T+ V T+AGG + G ADG
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGG--DRPGFADGVGT 226
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A F L F H ALL++D N +R++
Sbjct: 227 AARFDTPVALAFDAH-GALLVADLFNNAVRRV 257
Score = 66.6 bits (161), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 60/106 (56%), Gaps = 6/106 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V+ +AG G +G DG A F+ P A D++GN+YVAD NH IR+I+ G VTT+AG
Sbjct: 103 VETVAGQG-EGRVDGPALQASFNTPSGIAADVQGNLYVADTGNHAIRRISTDGQVTTLAG 161
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++G ADGPA A F + V + ++D N IR I
Sbjct: 162 G---EQGHADGPAAQARFDAPMGIA-VDAQGQVYVADTFNDRIRVI 203
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 60/115 (52%), Gaps = 6/115 (5%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK--SNHVIRKITN 132
PL+ + ++ LAGDG G DG ARF P + G+IY D +N + R++ +
Sbjct: 41 PLAWTAQIEPLAGDGHPGDRDGGSAQARFADPYALLRGADGSIYFTDAGDNNRIRRRLPD 100
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
V T+AG G EGR DGPA ASF+ + L ++D GN IR+I+
Sbjct: 101 GRVETVAGQG---EGRVDGPALQASFNTPSGIAADVQ-GNLYVADTGNHAIRRIS 151
>gi|255589882|ref|XP_002535117.1| nhl repeat-containing protein, putative [Ricinus communis]
gi|223523999|gb|EEF27264.1| nhl repeat-containing protein, putative [Ricinus communis]
Length = 435
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 67/133 (50%), Gaps = 8/133 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+ +L V D+ +A + + V LAGDG+ G DG+ A+F+ P AVD G
Sbjct: 130 AGNLYVADTGNNAIRKIG----PDGTVSTLAGDGLAGDKDGKGAGAQFNGPIGVAVDAAG 185
Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
+YV D N IR+I G VTT+AGG K+ G ADG A F + AL
Sbjct: 186 VVYVTDTYNDRIRRIAPDGTVTTVAGG--KRAGMADGAGAQALFDTPTGIALGAS-GALY 242
Query: 175 ISDHGNQLIRQIN 187
I+D GN IR+I
Sbjct: 243 IADTGNSAIRKIG 255
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 68/130 (52%), Gaps = 7/130 (5%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS-NHVIRKITNL 133
P + + +AGDG+ G S+G RF P A+D +GN+YVAD N+ IRKI
Sbjct: 37 PAGWHARITTMAGDGLPGSSNGSGKRTRFADPFGVALDRQGNLYVADGGDNNSIRKIDLD 96
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
GVTT GG+ EG A+G A+F+ L + L ++D GN IR+I D
Sbjct: 97 GVTTTLAGGT--EGYAEGAPTAAAFNTPSGLA-IDAAGNLYVADTGNNAIRKIG---PDG 150
Query: 194 SKSSQSGSAL 203
+ S+ +G L
Sbjct: 151 TVSTLAGDGL 160
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 59/109 (54%), Gaps = 5/109 (4%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ V LAG G +GY++G P +A F+ P A+D GN+YVAD N+ IRKI G V+T
Sbjct: 96 DGVTTTLAG-GTEGYAEGAPTAAAFNTPSGLAIDAAGNLYVADTGNNAIRKIGPDGTVST 154
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+AG G G DG A F+ + V + ++D N IR+I
Sbjct: 155 LAGDG--LAGDKDGKGAGAQFNGPIGVA-VDAAGVVYVTDTYNDRIRRI 200
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ + V +AG G +DG A FD P A+ G +Y+AD N IRKI G
Sbjct: 200 IAPDGTVTTVAGGKRAGMADGAGAQALFDTPTGIALGASGALYIADTGNSAIRKIGKDGT 259
Query: 135 VTTIA 139
V+T+A
Sbjct: 260 VSTVA 264
>gi|299473428|emb|CBN77825.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 2120
Score = 70.9 bits (172), Expect = 5e-10, Method: Composition-based stats.
Identities = 38/75 (50%), Positives = 47/75 (62%), Gaps = 3/75 (4%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK 144
G G GY+DG +A F+ P+ AVD N+YVAD NH IR+I+ G VTT+AG G
Sbjct: 205 GGSGEAGYADGSGTAALFNDPQDVAVDAHANVYVADTGNHRIRRISPEGVVTTVAGDG-- 262
Query: 145 KEGRADGPAQNASFS 159
+EG DG A ASFS
Sbjct: 263 EEGSDDGDAMEASFS 277
Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats.
Identities = 39/135 (28%), Positives = 55/135 (40%), Gaps = 25/135 (18%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN----IYVADKSNHVIRKIT 131
+S E VV +AGDG +G DG+ A F P A+ + +YVAD +NH +RKI+
Sbjct: 249 ISPEGVVTTVAGDGEEGSDDGDAMEASFSFPGGIALYYDSSEGLVLYVADTNNHRLRKIS 308
Query: 132 --------------------NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC 171
T +A + + G ADG A F L
Sbjct: 309 GDIANGAGTVTCHAGRCGNGTESATRMAAEATPEAGFADGDGSYARFDGPSGLA-AAEDG 367
Query: 172 ALLISDHGNQLIRQI 186
L ++D N LIR +
Sbjct: 368 TLFVADTNNHLIRMV 382
Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats.
Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 3/71 (4%)
Query: 84 RLAGDGV--QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
R+A + G++DG+ ARFD P A G ++VAD +NH+IR + N V T+ G
Sbjct: 334 RMAAEATPEAGFADGDGSYARFDGPSGLAAAEDGTLFVADTNNHLIRMVLANSTVFTLTG 393
Query: 141 GGSKKEGRADG 151
G E A G
Sbjct: 394 GLEGAEVEAGG 404
Score = 44.3 bits (103), Expect = 0.052, Method: Composition-based stats.
Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 17/84 (20%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------LG------V 135
GV G++DG +ARF+ P ++ G ++VAD H +R+I+ LG V
Sbjct: 415 GVAGHTDGNLTAARFNYPADVSLGTNGTLFVADL--HSLRRISMPENPTIVLGVGFDGRV 472
Query: 136 TTIAGGGSKKEGRADGPAQNASFS 159
TT AGG + G ADG A FS
Sbjct: 473 TTAAGGA--EPGEADGTGPEARFS 494
>gi|380692925|ref|ZP_09857784.1| hypothetical protein BfaeM_02978 [Bacteroides faecis MAJ27]
Length = 454
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 69/131 (52%), Gaps = 5/131 (3%)
Query: 59 LIVLDSSRSAFYTLSFPLSEES--VVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKG 115
L ++ + Y +++ + + + + AGD G GY+ G+ ARF++P + +D +G
Sbjct: 323 LYIIARKKHCIYKVAYNAATHTFGIPELFAGDYGESGYASGKGTGARFNQPSTPCLDPEG 382
Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
N+ + DK NH IRKIT G T+ G + G DG A F +TF + AL++
Sbjct: 383 NLLIPDKMNHCIRKITPEGEVTLYAGQPQTSGHTDGLPDKAKFYEPEAVTFSGN--ALIV 440
Query: 176 SDHGNQLIRQI 186
+D GN +R +
Sbjct: 441 ADRGNHCVRNV 451
>gi|347754243|ref|YP_004861807.1| gluconolactonase [Candidatus Chloracidobacterium thermophilum B]
gi|347586761|gb|AEP11291.1| Gluconolactonase [Candidatus Chloracidobacterium thermophilum B]
Length = 1222
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
L+ + V LAGDGV+G DG ARF P++ AVD G +YVAD + +R+I+ G
Sbjct: 333 LTLDGAVTTLAGDGVEGTRDGVGKQARFKAPRAVAVDASGTVYVADDAR--VRRISPGGM 390
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
V TIAGG + G DGPA+ A F L + L ++D GN+ +R+++
Sbjct: 391 VVTIAGG---EPGCVDGPAEAARFDTLSGLAL-DRVGNLYLADAGNRRLRKLS 439
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 58/110 (52%), Gaps = 4/110 (3%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
E VV AGD G+ DG SAR + P A+D +G +Y+AD NH IRK+ G + T
Sbjct: 616 ELVVTVTAGDSTPGFLDGVGTSARLNHPVGLALDAQGVLYIADHFNHAIRKLLPDGRLVT 675
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+AGGG + G DG A F+ L V L ++DH N IR++
Sbjct: 676 LAGGGQR--GFQDGYGPEAQFNGPLGLA-VGRDGELYVADHLNMRIRKVT 722
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 44/76 (57%), Gaps = 5/76 (6%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG- 142
LAG G +G+ DG A+F+ P AV G +YVAD N IRK+T G V+T+AG G
Sbjct: 676 LAGGGQRGFQDGYGPEAQFNGPLGLAVGRDGELYVADHLNMRIRKVTPDGYVSTLAGTGI 735
Query: 143 SKKEGRADGPAQNASF 158
SK E DG ASF
Sbjct: 736 SKIE---DGSVATASF 748
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 5/98 (5%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRAD 150
GY+DG A+F +P A D +GN+YVAD H IR+I+ G V T+AG G + G D
Sbjct: 242 GYADGPLTVAKFLRPNGIACDPQGNLYVADFGGHRIRQISVDGLVRTLAGSG--QPGNRD 299
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
A F+ + + L ++D N +R++ L
Sbjct: 300 DLGLLAEFNGPRGIAYAAGY--LYVADLNNASVRRLTL 335
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 54/113 (47%), Gaps = 9/113 (7%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ + V LAG G+ DG +A F+ PK AVDM G +YV D +R IT G
Sbjct: 721 VTPDGYVSTLAGTGISKIEDGSVATASFEGPKGVAVDMHGVVYVTDGVT--VRTITPDGE 778
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTF-VPHICALLISDHGNQLIRQI 186
V T+AG + G DG A F + + V +C ++D N +R I
Sbjct: 779 VRTLAG---QVRGFRDGIGTRAMFGWAYAIAVDVSGLC--FVTDAANHAVRCI 826
Score = 44.7 bits (104), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 14/151 (9%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ + V+ LAG V+G+ DG A F + AVD+ G +V D +NH +R I G
Sbjct: 773 ITPDGEVRTLAGQ-VRGFRDGIGTRAMFGWAYAIAVDVSGLCFVTDAANHAVRCIFPDGT 831
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
V T+ GGG A+ +F N + H L ++D N I ++ +
Sbjct: 832 VKTVFGGGE---------ARQLNFPNGLAVDVFGH---LYVADTNNHRILRLTPNGNGYT 879
Query: 195 KSSQSGSALGAVSVWVLVSVLSCLVSLVIGF 225
S G G + L + L +V+GF
Sbjct: 880 ASLVCGVRRGRQTGSALEAELDSPRGIVVGF 910
Score = 40.4 bits (93), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 39/71 (54%), Gaps = 2/71 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+S +V +AG G G DG +ARFD A+D GN+Y+AD N +RK++ G
Sbjct: 385 ISPGGMVVTIAG-GEPGCVDGPAEAARFDTLSGLALDRVGNLYLADAGNRRLRKLSRDGQ 443
Query: 135 VTTIAGGGSKK 145
V+T+ G K
Sbjct: 444 VSTLPVGPDDK 454
>gi|386716891|ref|YP_006183217.1| NHL repeat protein [Stenotrophomonas maltophilia D457]
gi|384076453|emb|CCH11034.1| NHL repeat protein [Stenotrophomonas maltophilia D457]
Length = 652
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 57/152 (37%), Positives = 75/152 (49%), Gaps = 11/152 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
+DG L+ N S I +L V D+ A +S + V LAG G QG++D
Sbjct: 73 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIS----TDGQVTTLAG-GEQGHAD 127
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQ 154
G ARFD P AVD +G +YVAD N IR I T+ V T+AGG + G ADG
Sbjct: 128 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGG--DRPGLADGVGA 185
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A F L F ALL++D N +R++
Sbjct: 186 AARFDTPVALAFDAQ-GALLVADLFNNAVRRV 216
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 59/106 (55%), Gaps = 6/106 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V+ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I+ G VTT+AG
Sbjct: 62 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRISTDGQVTTLAG 120
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++G ADGPA A F + V + ++D N IR I
Sbjct: 121 G---EQGHADGPAAQARFDAPMGIA-VDAQGQVYVADTFNDRIRVI 162
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 59/114 (51%), Gaps = 6/114 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD--KSNHVIRKITNL 133
++ + ++ LAGDG G DG ARF P + G++Y D +N + R++ +
Sbjct: 1 MAWTAQIEPLAGDGHPGDRDGASAQARFADPYALLRSADGSVYFTDAGDNNRIRRRLPDG 60
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
V T+AG G EGR DGPA ASF+ + L ++D GN IR+I+
Sbjct: 61 RVETVAGQG---EGRVDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRIS 110
>gi|456737758|gb|EMF62435.1| Hypothetical protein EPM1_0109 [Stenotrophomonas maltophilia EPM1]
Length = 693
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 74/152 (48%), Gaps = 11/152 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
+DG L+ N S I +L V D+ A + + V LAG G QG++D
Sbjct: 114 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGHAD 168
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQ 154
G ARFD P AVD +G +YVAD N IR I T+ V T+AGG + G ADG
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGG--DRPGLADGVGA 226
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A F L F ALL++D N +R++
Sbjct: 227 TARFDTPVALAFDAQ-GALLVADLFNNAVRRV 257
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 58/106 (54%), Gaps = 6/106 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V+ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I G VTT+AG
Sbjct: 103 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGTDGQVTTLAG 161
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++G ADGPA A F + V + ++D N IR I
Sbjct: 162 G---EQGHADGPAAQARFDAPMGIA-VDAQGQVYVADTFNDRIRVI 203
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 59/114 (51%), Gaps = 6/114 (5%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK--SNHVIRKITN 132
PL+ + ++ LAGDG G DG ARF P + G++Y D +N + R++ +
Sbjct: 41 PLAWTAQIEPLAGDGHPGDRDGASAQARFADPYALLRSADGSVYFTDAGDNNRIRRRLPD 100
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V T+AG G EGR DGPA ASF+ + L ++D GN IR+I
Sbjct: 101 GRVETVAGQG---EGRVDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRI 150
>gi|424666862|ref|ZP_18103887.1| hypothetical protein A1OC_00420 [Stenotrophomonas maltophilia
Ab55555]
gi|401069531|gb|EJP78052.1| hypothetical protein A1OC_00420 [Stenotrophomonas maltophilia
Ab55555]
Length = 693
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 74/152 (48%), Gaps = 11/152 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
+DG L+ N S I +L V D+ A + + V LAG G QG++D
Sbjct: 114 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGI----DGQVTTLAG-GEQGHAD 168
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQ 154
G ARFD P AVD +G +YVAD N IR I T+ V T+AGG + G ADG
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGG--DRPGLADGVGA 226
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A F L F ALL++D N +R++
Sbjct: 227 AARFDTPVALAFDAQ-GALLVADLFNNAVRRV 257
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 58/106 (54%), Gaps = 6/106 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V+ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I G VTT+AG
Sbjct: 103 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGIDGQVTTLAG 161
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++G ADGPA A F + V + ++D N IR I
Sbjct: 162 G---EQGHADGPAAQARFDAPMGIA-VDAQGQVYVADTFNDRIRVI 203
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 60/116 (51%), Gaps = 6/116 (5%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK--SNHVIRKITN 132
PL+ + ++ LAGDG G DG ARF P + G++Y D +N + R++ +
Sbjct: 41 PLAWTAQIEPLAGDGHPGDRDGASAQARFADPYALLRSADGSVYFTDAGDNNRIRRRLPD 100
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
V T+AG G EGR DGPA ASF+ + L ++D GN IR+I +
Sbjct: 101 GRVETVAGQG---EGRVDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRIGI 152
>gi|421861496|ref|ZP_16293498.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
gi|410828922|dbj|GAC43935.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
Length = 575
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/142 (34%), Positives = 71/142 (50%), Gaps = 21/142 (14%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
L + V +AG G+ G+ DG ARF++P+ AV G++YVAD NHV+R+I N
Sbjct: 176 LDKSGRVTTVAGSGLAGWRDGTGSEARFNEPRDVAVAEDGSLYVADALNHVLRRIDANGN 235
Query: 135 VTT---------------IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHG 179
VTT +A GG ADG + F+ L F L++SD G
Sbjct: 236 VTTLNARSKRIVEYAPGAVAAGGDY----ADGKLGESKFNEPSSLAFTSS-GDLVVSDTG 290
Query: 180 NQLIRQINLKPEDCSKSSQSGS 201
NQ +R +NLK + + + +GS
Sbjct: 291 NQRLRLVNLKQKYVTTLAGAGS 312
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 57/96 (59%), Gaps = 4/96 (4%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADG 151
++DG+ ARF++P A D +GN+Y+AD NHVIRK+ G VTT+AG G G DG
Sbjct: 139 WTDGKGTEARFNQPMGMAEDRQGNLYIADAGNHVIRKLDKSGRVTTVAGSG--LAGWRDG 196
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
A F+ ++ V +L ++D N ++R+I+
Sbjct: 197 TGSEARFNEPRDVA-VAEDGSLYVADALNHVLRRID 231
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 49/108 (45%), Gaps = 6/108 (5%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
AG GV G DG A F P G + +AD NH+IR+ G V+T+AG +
Sbjct: 71 WAGSGVSGSVDGRAERAEFRYPSGLLAGKNGELLIADTYNHLIRRADAAGQVSTLAGQVA 130
Query: 144 KKEGR----ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
K + DG A F+ + L I+D GN +IR+++
Sbjct: 131 KMRQQYGSWTDGKGTEARFNQPMGMA-EDRQGNLYIADAGNHVIRKLD 177
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 57/118 (48%), Gaps = 6/118 (5%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY D E A F+ P A+ +G I VAD+ NH IR + N V T+ G G +G
Sbjct: 330 GYRDSEASEALFNGPAGIAITAEGGIIVADRWNHAIRYLFNGKVYTLG--GGGGTGHQNG 387
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGN---QLIRQINLKPEDCSKSSQSGSALGAV 206
A+ A+F + + + + ++D N +LIR+ L E S ++ S A AV
Sbjct: 388 WAEQATFREPVNVAVLSN-GTIAVADGFNNSIRLIRRYTLPEEARSSTAASPYARDAV 444
>gi|301122083|ref|XP_002908768.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099530|gb|EEY57582.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1981
Score = 69.3 bits (168), Expect = 1e-09, Method: Composition-based stats.
Identities = 44/108 (40%), Positives = 61/108 (56%), Gaps = 8/108 (7%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
+AGDG G+ DG +++F+ P+ AVD G +YVAD +NH IR I T V+T+AG G
Sbjct: 208 VAGDGSAGFQDGLAATSQFNHPRGVAVDSNGVVYVADTANHRIRIIDPTTKRVSTLAGDG 267
Query: 143 SKKEGRADGPAQNA---SFSNDFELTFV-PHICALLISDHGNQLIRQI 186
EG DGPA A S+ +D + + ++D GN IRQI
Sbjct: 268 V--EGFVDGPALTAARFSYPSDVAVRESNSGTITVFVADTGNHRIRQI 313
Score = 57.4 bits (137), Expect = 7e-06, Method: Composition-based stats.
Identities = 28/62 (45%), Positives = 34/62 (54%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G +DG+P ARFD P AVD G ++VAD NH+IR+I G T GG AD
Sbjct: 344 GMADGDPLGARFDSPMGVAVDADGVVFVADTGNHLIRRIDLDGTTHTVAGGVVPSEDADT 403
Query: 152 PA 153
P
Sbjct: 404 PG 405
Score = 55.8 bits (133), Expect = 2e-05, Method: Composition-based stats.
Identities = 45/126 (35%), Positives = 62/126 (49%), Gaps = 20/126 (15%)
Query: 82 VKRLAGDGVQGYSDGEP-GSARFDKPKSFAVDMKGN----IYVADKSNHVIRKITNLGVT 136
V LAGDGV+G+ DG +ARF P AV + ++VAD NH IR+I N VT
Sbjct: 260 VSTLAGDGVEGFVDGPALTAARFSYPSDVAVRESNSGTITVFVADTGNHRIRQIRNGVVT 319
Query: 137 TIAG-GGSKKE-------------GRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
++G G+ E G ADG A F + + V + ++D GN L
Sbjct: 320 CLSGLCGAGVETVRLSQQPEKPHAGMADGDPLGARFDSPMGVA-VDADGVVFVADTGNHL 378
Query: 183 IRQINL 188
IR+I+L
Sbjct: 379 IRRIDL 384
Score = 39.7 bits (91), Expect = 1.4, Method: Composition-based stats.
Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 14/90 (15%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-------NLGVTTIAG- 140
G +G+ DG ARF+ P++ A+ + + V+++ H IR++T +G T A
Sbjct: 413 GERGFRDGNLTFARFESPRAIAIGAQRTLIVSER--HRIRRVTYDGGSASTIGTVTAANR 470
Query: 141 ----GGSKKEGRADGPAQNASFSNDFELTF 166
GS G DG A+F+ +TF
Sbjct: 471 VVTLAGSNVPGHIDGEGNEATFNAPAGVTF 500
>gi|451980854|ref|ZP_21929239.1| conserved hypothetical protein, contains NHL repeats [Nitrospina
gracilis 3/211]
gi|451761976|emb|CCQ90480.1| conserved hypothetical protein, contains NHL repeats [Nitrospina
gracilis 3/211]
Length = 729
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 63/115 (54%), Gaps = 4/115 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
+ + ++ + G GV GYS DG P + A P D +GN+YVAD+ NHV+RK+ TN
Sbjct: 68 IDQNGMLTNVVGTGVSGYSGDGGPATEATLKVPAGLTFDKEGNLYVADRENHVVRKVDTN 127
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+TT AG G GPA A + ++T V H L ISD N +IR+++
Sbjct: 128 GIITTFAGTGKAGYSGDKGPATQAKLNLPSDMT-VDHKGNLFISDRSNNVIRKVD 181
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 64/114 (56%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ E + +AG+G Y+ DG P A F P+S +D +GN+Y+ D +N+VIRKI
Sbjct: 528 VTPEGKIITIAGNGSSDYTGDGGPAKDASFRDPQSLTMDKEGNLYIGDTANNVIRKIDKN 587
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G VTT AG G+ + GPA A F + ++ F P L + + GN +R+I
Sbjct: 588 GIVTTYAGNGNHEHSGDGGPALKAGFMSIGDIVFSPS-GELHVVEPGNHTVRKI 640
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 60/115 (52%), Gaps = 4/115 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ ++ AG G GYS G A+ + P VD KGN++++D+SN+VIRK+
Sbjct: 124 VDTNGIITTFAGTGKAGYSGDKGPATQAKLNLPSDMTVDHKGNLFISDRSNNVIRKVDPS 183
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G +TT AG G++ + PA + F L H L I+D GN IR+++
Sbjct: 184 GTITTYAGTGNEGYNGDNMPALRTNLDKPFGLAVDKH-GNLYIADRGNNRIRKVD 237
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 47/80 (58%), Gaps = 2/80 (2%)
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
A D KGN+YV+D+ NH IRK+T G + TIAG GS GPA++ASF + LT
Sbjct: 507 LAFDKKGNLYVSDRINHQIRKVTPEGKIITIAGNGSSDYTGDGGPAKDASFRDPQSLTMD 566
Query: 168 PHICALLISDHGNQLIRQIN 187
L I D N +IR+I+
Sbjct: 567 KE-GNLYIGDTANNVIRKID 585
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 61/122 (50%), Gaps = 7/122 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDM 113
+ DL+V+D S Y + + S V+ +AG+GV+ ++ +G A + P VD
Sbjct: 333 NGDLLVVDRSH---YRIRKMHQKGSRVETIAGNGVKNFAGDNGPATGANLEFPHGIVVDK 389
Query: 114 KGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNAS-FSNDFELTFVPHIC 171
+ N+ ADK ++ IR+I G+ TT+ G G + +GPA AS F EL I
Sbjct: 390 QDNVIFADKGHYRIRQIDPEGIITTVVGNGIRGNIGDNGPALEASVFPTILELNHKDEIF 449
Query: 172 AL 173
L
Sbjct: 450 FL 451
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 50/89 (56%), Gaps = 5/89 (5%)
Query: 76 LSEESVVKRLAGD-GVQG-YSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-T 131
++ + V+ +AG GVQG + DG + A +P A D KGN+Y+ D N+ IRK+ T
Sbjct: 640 ITRDGKVELVAGRPGVQGLFGDGGKATEAMLKQPACIAFDSKGNMYITDMGNNRIRKVDT 699
Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSN 160
N +TT+AG GS G +G N F N
Sbjct: 700 NGIITTLAGRGSFGWGY-EGEEVNIYFQN 727
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 51/88 (57%), Gaps = 8/88 (9%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSN---D 161
P AVD +GN+YVAD++N+ IRKI LG + T+ G G + +G A+ A +N
Sbjct: 269 PTDVAVDDEGNVYVADRNNNRIRKINTLGMIRTVMGTGQQD---YNGDAEVARETNLHLP 325
Query: 162 FELTFVPHICALLISDHGNQLIRQINLK 189
F +T + + LL+ D + IR+++ K
Sbjct: 326 FAIT-LDNNGDLLVVDRSHYRIRKMHQK 352
Score = 40.0 bits (92), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 59/122 (48%), Gaps = 21/122 (17%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLG--V 135
+++ + G G Q Y +G+ AR P + +D G++ V D+S++ IRK+ G V
Sbjct: 298 MIRTVMGTGQQDY-NGDAEVARETNLHLPFAITLDNNGDLLVVDRSHYRIRKMHQKGSRV 356
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLK 189
TIAG G K +GPA A+ PH ++ +D G+ IRQI+
Sbjct: 357 ETIAGNGVKNFAGDNGPATGANLE-------FPHGIVVDKQDNVIFADKGHYRIRQID-- 407
Query: 190 PE 191
PE
Sbjct: 408 PE 409
>gi|290974765|ref|XP_002670115.1| predicted protein [Naegleria gruberi]
gi|284083670|gb|EFC37371.1| predicted protein [Naegleria gruberi]
Length = 784
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/148 (35%), Positives = 77/148 (52%), Gaps = 7/148 (4%)
Query: 43 LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGS 100
LE P V P S+DL++ D L+ +++V + GD GYS +G
Sbjct: 95 LEFQPGGVALYPNSNDLLISDPVGGVIVRLN-SKGIQTIVAGMKGD--LGYSGDNGLATR 151
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPAQNASFS 159
AR + P AV G +Y+ADKSNHVIRKI+ N ++TIAG G + G A+ A +
Sbjct: 152 ARLNTPTGVAVASNGEVYIADKSNHVIRKISLNGNISTIAGNGEEGFSGDGGNAKTAQLN 211
Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
+ L+ + L I+D N +IR+I+
Sbjct: 212 SPIGLS-ISSTGELYIADSKNHVIRKID 238
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 58/109 (53%), Gaps = 4/109 (3%)
Query: 82 VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ +AG+G +G+S G +A+ + P ++ G +Y+AD NHVIRKI G ++T
Sbjct: 187 ISTIAGNGEEGFSGDGGNAKTAQLNSPIGLSISSTGELYIADSKNHVIRKIDANGIISTF 246
Query: 139 AGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG G+ DG A+ A + + + F + I+D N LIR++
Sbjct: 247 AGNGTINGYGGDGSQAKQALLNTPYGVFFYESTGEVYIADTLNSLIRKV 295
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 70/166 (42%), Gaps = 46/166 (27%)
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYV--- 119
+ SA++ + +S ++ + AG G GYS DG A+F++P A G+I++
Sbjct: 405 AESAYHKIR-KISPNGIISQYAGTGDHGYSGDGSDAKLAKFNEPTRIACSKNGDIFIADL 463
Query: 120 --------------------------------------ADKSNHVIRKITNLGVTTIAGG 141
AD+S+HVIRKI+ GV +I G
Sbjct: 464 FNGAIRRIFASNGTVDTVVSGLGSPQAVILTESGELLVADRSSHVIRKISTNGVMSIIAG 523
Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ G DG A FS ++ P+ L I+D+ N LIR++
Sbjct: 524 VLEDGGFNGDGLATKTKFSGPQDIALAPN-GDLYIADYDNYLIRKL 568
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 58/122 (47%), Gaps = 7/122 (5%)
Query: 33 TVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
TV TV+ G +P +VI S +L+V D S +S + L G G
Sbjct: 477 TVDTVVSGLG---SPQAVILTE-SGELLVADRSSHVIRKISTNGVMSIIAGVLEDGGFNG 532
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADG 151
DG +F P+ A+ G++Y+AD N++IRK++ G +TT+AG G G G
Sbjct: 533 --DGLATKTKFSGPQDIALAPNGDLYIADYDNYLIRKLSKNGNITTVAGNGDSTVGEDFG 590
Query: 152 PA 153
A
Sbjct: 591 IA 592
Score = 40.4 bits (93), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 59/122 (48%), Gaps = 9/122 (7%)
Query: 76 LSEESVVKRLAGDG-VQGYS-DG-EPGSARFDKPKS-FAVDMKGNIYVADKSNHVIRKIT 131
+ ++ AG+G + GY DG + A + P F + G +Y+AD N +IRK++
Sbjct: 237 IDANGIISTFAGNGTINGYGGDGSQAKQALLNTPYGVFFYESTGEVYIADTLNSLIRKVS 296
Query: 132 NLGVTTIAGGGSKKEG--RADG--PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G+ + G G R D PA +A S + + + + I+D+GN IR+++
Sbjct: 297 KSGIISTVAGVPNSSGYSREDENVPATSALLSAPTSVA-LSSLGEMFIADNGNLYIRKVD 355
Query: 188 LK 189
K
Sbjct: 356 TK 357
>gi|323454387|gb|EGB10257.1| hypothetical protein AURANDRAFT_62887 [Aureococcus anophagefferens]
Length = 434
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 71/138 (51%), Gaps = 17/138 (12%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRL----------AGDGVQGYSDGEPGSARFDKPKSF 109
+ LD++R+ Y S V+RL AGDG +GY+DG+ G ARFD+P +
Sbjct: 247 VCLDAARNCLYVAD---SGNHCVRRLDLASGAVTTVAGDGTRGYADGDAGGARFDEPTAV 303
Query: 110 AVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
A+D G +YVAD+ N +R + V+T+AGGG+ G A ASFS L P
Sbjct: 304 ALDADGALYVADQENRRVRCLRGRTVSTLAGGGAGDSADGVGAA--ASFSAPNSLHLCPE 361
Query: 170 ICALLISDHGNQLIRQIN 187
I L + D Q +R +
Sbjct: 362 IRRLYVGD--KQCVRVVT 377
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 65/134 (48%), Gaps = 4/134 (2%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV-DMK 114
SS L+V DS R + + V++ LAG G+ G DG A F P A
Sbjct: 141 SSRLVVTDSRRCRVVEVCG--ASGRVLRVLAGCGMSGGRDGRADRATFAGPWGVAAHPAT 198
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
G++YV++ IR I VTT+AG G + GRADG ASF+ + L
Sbjct: 199 GDVYVSEYFGCKIRVIRRGDVTTLAGSGPRG-GRADGVGPAASFAGPRGVCLDAARNCLY 257
Query: 175 ISDHGNQLIRQINL 188
++D GN +R+++L
Sbjct: 258 VADSGNHCVRRLDL 271
>gi|455647406|gb|EMF26379.1| hypothetical protein H114_24527 [Streptomyces gancidicus BKS 13-15]
Length = 627
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 70/123 (56%), Gaps = 7/123 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ P S +V D++R L+ E+V++R G G +G DG PG ARF +P
Sbjct: 186 PGKVLRLPSSGTFLVTDTTRHQVVELAE--DGETVLRRF-GSGGRGLEDGPPGRARFQEP 242
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGRA-DGPAQNASFSNDF 162
+ A+ G + VAD NH +R++ + VTT+AG GS + G A DGPA+ + S+ +
Sbjct: 243 QGLALLPDGAVVVADTVNHALRRLDPESGEVTTLAGTGSPWRPGEATDGPARAVNLSSPW 302
Query: 163 ELT 165
++T
Sbjct: 303 DVT 305
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 3/107 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V R AG G + DG A F +P AV +G ++VAD + +R++ G V T G
Sbjct: 331 VAREAGTGGEQLVDGSAKEALFAQPSGLAVTAEG-LWVADSESSALRRLGPDGTVRTAVG 389
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G G DGPA A F + +T +P +L +SD N +R+ +
Sbjct: 390 AGLFAFGYQDGPADGALFQHPLGVTALPD-GSLAVSDTYNHALRRYD 435
>gi|325102830|ref|YP_004272484.1| cell surface receptor IPT/TIG domain protein [Pedobacter saltans
DSM 12145]
gi|324971678|gb|ADY50662.1| cell surface receptor IPT/TIG domain protein [Pedobacter saltans
DSM 12145]
Length = 491
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/173 (32%), Positives = 82/173 (47%), Gaps = 13/173 (7%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P+ V P + DLIV D + A +S + +V ++AG G G DG+ A F+ P
Sbjct: 170 PNGVSFDPTTGDLIVADRTAQAIKRIS----KAGIVTKIAGTGTAGRVDGDISIASFNNP 225
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
AVD GNIYVAD NH IRKI TN T G G+ G DG N +
Sbjct: 226 YKTAVDKYGNIYVADNGNHRIRKIDLSTNTVSTIAGGAGAATSGYTDGIGANGLLNTPTG 285
Query: 164 LTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVS--VWVLVSV 214
L V + ++D N +R+I D S+ +G+ + ++ +W V+V
Sbjct: 286 LA-VDDDLNVYVADAANHAVRKIT---PDGRVSTLAGNGIAGIADGIWPNVTV 334
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 30/43 (69%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
GY +G PG +RF P AVD +GN+++AD +N+VIRKI +
Sbjct: 443 NGYVNGLPGISRFYNPTDVAVDEEGNVFIADMNNYVIRKIVKI 485
Score = 40.0 bits (92), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 44/90 (48%), Gaps = 5/90 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
++ + V LAG+G+ G +DG + ++P + + G +Y AD+ IRKI G
Sbjct: 307 ITPDGRVSTLAGNGIAGIADGIWPNVTVNRPTAVCMGKDGFLYSADRYGQRIRKIDVRTG 366
Query: 135 VT-TIAGGGSKKEGRA---DGPAQNASFSN 160
T TIAG G G DG A F+N
Sbjct: 367 KTVTIAGSGGNAAGTGGHVDGEVLKARFNN 396
Score = 40.0 bits (92), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 52/112 (46%), Gaps = 7/112 (6%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD-----KSNHVIRKITNLGVTTIAGGGS 143
G G+ DGE ARF+ +D G IYV++ +H IR I N V+TI G +
Sbjct: 380 GTGGHVDGEVLKARFNNIWGMDIDKDGTIYVSELEGTAGKSHTIRMIKNGQVSTIGGPDA 439
Query: 144 -KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
G +G + F N ++ V + I+D N +IR+I +D S
Sbjct: 440 FDNNGYVNGLPGISRFYNPTDVA-VDEEGNVFIADMNNYVIRKIVKIKKDTS 490
>gi|428179990|gb|EKX48859.1| hypothetical protein GUITHDRAFT_105483 [Guillardia theta CCMP2712]
Length = 388
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 74/164 (45%), Gaps = 17/164 (10%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR---------LAGDGVQGYSD 95
+ P+ I + + VLD L F +S + VK LAG G GY D
Sbjct: 17 VGPNIYISDEKGNKIYVLDRFTGTVMIL-FTMSSNTTVKGNDVVSQVTLLAGTGEAGYKD 75
Query: 96 GEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITNLG------VTTIAGGGSKKEGR 148
G SA+F P AV ++ + V+D+ NH IR IT V T+AG G
Sbjct: 76 GIGSSAKFKSPSGLAVQSLRTGLCVSDRGNHAIRIITRESPDKPAVVDTVAGSDDSGAGF 135
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
ADG NA F++ L P +L++D GN IR++ L ED
Sbjct: 136 ADGEGPNALFNHPEGLAMSPDGSFILVADSGNHRIRKVLLSKED 179
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 68/124 (54%), Gaps = 16/124 (12%)
Query: 80 SVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI------ 130
+VV +AG D G++DGE +A F+ P+ A+ G+ I VAD NH IRK+
Sbjct: 120 AVVDTVAGSDDSGAGFADGEGPNALFNHPEGLAMSPDGSFILVADSGNHRIRKVLLSKED 179
Query: 131 TNLGVTTIAGGGSKK-----EGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIR 184
NL V+TIAGG + +G D N+SF+ ++ F+P+ +L+ D GN IR
Sbjct: 180 KNL-VSTIAGGNQNRNVTYSQGFNDADNGVNSSFNRPTDVAFLPNGEEVLVVDAGNHAIR 238
Query: 185 QINL 188
++ L
Sbjct: 239 KLTL 242
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 74/167 (44%), Gaps = 13/167 (7%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-------VQGYSDGEPG 99
P + P S ++V DS + ++++V +AG QG++D + G
Sbjct: 148 PEGLAMSPDGSFILVADSGNHRIRKVLLSKEDKNLVSTIAGGNQNRNVTYSQGFNDADNG 207
Query: 100 -SARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRADGPAQN 155
++ F++P A G + V D NH IRK+T VTTIAG G+ G +G N
Sbjct: 208 VNSSFNRPTDVAFLPNGEEVLVVDAGNHAIRKLTLSTGAVTTIAGNGNS--GSDNGFGTN 265
Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
A+F + I++ Q IRQ++L + + +G A
Sbjct: 266 ATFEFPMSIVISAGGKYAFIAEMEGQRIRQLHLSTGQVTTLAGTGDA 312
>gi|116749588|ref|YP_846275.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
gi|116698652|gb|ABK17840.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
Length = 1750
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/185 (31%), Positives = 85/185 (45%), Gaps = 17/185 (9%)
Query: 12 ITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPG------SSDLIVLDSS 65
I L LLF SS + D DG LE N +D P S ++ + D++
Sbjct: 2 IFLAILLFPGPSSAERVVDTVAGLQSDDGPALEAN----LDSPSGVAVDTSGNVYLSDTN 57
Query: 66 RSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKS 123
L + ++ LAG GYS G A ++P+ AVD GN+Y AD +
Sbjct: 58 NHRIRKLDVAAGQ---IQTLAGGQSPGYSGDGGTAAKAGLNRPRGIAVDAAGNVYFADSN 114
Query: 124 NHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
NH IRKI G +TT+AG GS GPA +A + F + P + ++D GN
Sbjct: 115 NHCIRKIDTSGIITTVAGTGSAGSNGDGGPAASARLAYPFGIAVDPS-GNIYVADLGNHK 173
Query: 183 IRQIN 187
+R+I+
Sbjct: 174 VRRID 178
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 56/106 (52%), Gaps = 4/106 (3%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG GV GYS DG AR + P + AVD +IY+AD +NH IRK+ G +TT+AG
Sbjct: 465 VAGKGVAGYSGDDGPAAEARLNNPSAIAVDGSESIYIADTNNHRIRKVDGGGTITTVAGN 524
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G+ DG + A+ N V + I+D N +R ++
Sbjct: 525 GTPGY-SGDGASATAASLNFPNGVAVDADGNVFIADTSNHRVRMVD 569
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 60/116 (51%), Gaps = 4/116 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ ++ +AG G G + DG P SAR P AVD GNIYVAD NH +R+I
Sbjct: 121 IDTSGIITTVAGTGSAGSNGDGGPAASARLAYPFGIAVDPSGNIYVADLGNHKVRRIDAA 180
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G ++T+AG G GPA A ++ + V L ISD G +IR++++
Sbjct: 181 GNISTVAGTGLLSRLGDGGPATEAGLTSPTGVA-VDGSGNLFISDSGRHVIRRVDV 235
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 58/108 (53%), Gaps = 3/108 (2%)
Query: 81 VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
+ +AG+G GYS DG +A + P AVD GN+++AD SNH +R + + +TT+
Sbjct: 517 TITTVAGNGTPGYSGDGASATAASLNFPNGVAVDADGNVFIADTSNHRVRMVDSGVITTV 576
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG G+ G A +AS + +V AL I+D N +R++
Sbjct: 577 AGNGTPGYSGDGGAAVSASLKAPHGV-WVDATGALYIADAHNYRVRKV 623
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 63/134 (47%), Gaps = 9/134 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG-YSDGEPGS-ARFDKPKSFAVDM 113
++DL + D+ Y + V+ +AG G G DG P + AR P + AVD
Sbjct: 273 NNDLYIADT-----YNQRIRKVTDGVINTVAGTGYGGSLEDGIPATGARLKSPVALAVDS 327
Query: 114 KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
+ NIY+AD +H IR++ G + T+AG G G A A + L P +
Sbjct: 328 QNNIYIADTYSHRIRRVDAAGNIVTVAGKGVPGNAGDGGQAVAAILKSPHGLALGPD-NS 386
Query: 173 LLISDHGNQLIRQI 186
L I+D + +R++
Sbjct: 387 LYIADRTDHRVRKV 400
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 55/114 (48%), Gaps = 5/114 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+ + V+ +AG+G GYS G+ G SA P VD G +Y+AD N+ +RK+
Sbjct: 567 MVDSGVITTVAGNGTPGYS-GDGGAAVSASLKAPHGVWVDATGALYIADAHNYRVRKVAG 625
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ T+AG G+ G A A F + L V L ++D N +R++
Sbjct: 626 GNIVTVAGTGTPGYSGDGGLAAAADFRSVHGL-VVDGSGNLFVADMENSRVRKV 678
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 47/94 (50%), Gaps = 3/94 (3%)
Query: 95 DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD-GP 152
DG P + A P AVD GN++++D HVIR++ G G ++ D GP
Sbjct: 197 DGGPATEAGLTSPTGVAVDGSGNLFISDSGRHVIRRVDVGGTIDRVAGDYEQRYHGDGGP 256
Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A +A N + + P+ L I+D NQ IR++
Sbjct: 257 ALSAGLMNAYGVALGPN-NDLYIADTYNQRIRKV 289
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 58/115 (50%), Gaps = 4/115 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ V+ LAG G +G S +A D P + AV G++Y +D ++ +RKI
Sbjct: 400 VTAAGVISTLAGTGEEGLSADGAAAAFANLDGPCAVAVGPSGSVYFSDSGSNRVRKIGLD 459
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G ++T+AG G DGPA A +N + V ++ I+D N IR+++
Sbjct: 460 GNLSTVAGKGVAGYSGDDGPAAEARLNNPSAIA-VDGSESIYIADTNNHRIRKVD 513
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 64/138 (46%), Gaps = 13/138 (9%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVD 112
GS +L + DS R + + R+AGD Q Y DG P SA A+
Sbjct: 216 GSGNLFISDSGRHVIRRVDV----GGTIDRVAGDYEQRYHGDGGPALSAGLMNAYGVALG 271
Query: 113 MKGNIYVADKSNHVIRKITNLGVTTIAG---GGSKKEGRADGPAQNASFSNDFELTFVPH 169
++Y+AD N IRK+T+ + T+AG GGS ++G PA A + L V
Sbjct: 272 PNNDLYIADTYNQRIRKVTDGVINTVAGTGYGGSLEDGI---PATGARLKSPVALA-VDS 327
Query: 170 ICALLISDHGNQLIRQIN 187
+ I+D + IR+++
Sbjct: 328 QNNIYIADTYSHRIRRVD 345
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 54/107 (50%), Gaps = 4/107 (3%)
Query: 85 LAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGG 141
+AG GV G + G+ +A P A+ ++Y+AD+++H +RK+T GV +T+AG
Sbjct: 353 VAGKGVPGNAGDGGQAVAAILKSPHGLALGPDNSLYIADRTDHRVRKVTAAGVISTLAGT 412
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G + A A+ + P ++ SD G+ +R+I L
Sbjct: 413 GEEGLSADGAAAAFANLDGPCAVAVGPS-GSVYFSDSGSNRVRKIGL 458
>gi|338536093|ref|YP_004669427.1| putative lipoprotein [Myxococcus fulvus HW-1]
gi|337262189|gb|AEI68349.1| putative lipoprotein [Myxococcus fulvus HW-1]
Length = 855
Score = 68.6 bits (166), Expect = 3e-09, Method: Composition-based stats.
Identities = 41/112 (36%), Positives = 62/112 (55%), Gaps = 7/112 (6%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
+ VKR+ +G + E + + P AVD GN+YVAD ++VIR+I G +
Sbjct: 598 NAVKRVTREG-----EVETVATGLNGPMGIAVDAAGNVYVADTDHYVIRRIDVEGKVEVF 652
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
GG+ G DGPA+ A+F+ +T P ALL++D N +IR+I+L E
Sbjct: 653 AGGTP--GLQDGPAKQAAFNQPTGMTVTPDGTALLVADMNNGVIRRIDLVAE 702
Score = 57.0 bits (136), Expect = 7e-06, Method: Composition-based stats.
Identities = 59/166 (35%), Positives = 79/166 (47%), Gaps = 17/166 (10%)
Query: 27 LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLA 86
L+ +G+ VTT + G L P V S L V++S S + L+ SVV A
Sbjct: 699 LVAEGHPVTT-LQGDWL-YRPSGVAVSADGSTLYVVESGMSRVVRIRDGLT--SVV---A 751
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI------TNLGVTTIAG 140
G GY DG P SA+F AV G++ VAD N+ +R++ T VTT+AG
Sbjct: 752 GT-TPGYRDGAPTSAQFLPYLGIAVLKDGSLAVADPGNYRVRRVVLEADGTARKVTTLAG 810
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + G ADGP A LT P L ++D GN L+R I
Sbjct: 811 NG--RYGHADGPGDKAELVLPAGLTVGPD-GRLYVADAGNGLVRAI 853
Score = 39.3 bits (90), Expect = 1.9, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 26/50 (52%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG+G G++DG A P V G +YVAD N ++R IT
Sbjct: 805 VTTLAGNGRYGHADGPGDKAELVLPAGLTVGPDGRLYVADAGNGLVRAIT 854
>gi|333383538|ref|ZP_08475196.1| hypothetical protein HMPREF9455_03362 [Dysgonomonas gadei ATCC
BAA-286]
gi|332827477|gb|EGK00223.1| hypothetical protein HMPREF9455_03362 [Dysgonomonas gadei ATCC
BAA-286]
Length = 434
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 3/131 (2%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
L + + R YT + +E + G G+SDG A F+ P+ +D++GN+Y
Sbjct: 307 LYIAYTQRHCIYTYNLATNEHLLFA--GAPGQPGWSDGRVSDAEFNLPRQMTLDLEGNLY 364
Query: 119 VADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDH 178
+AD+ NH IR I G+ T G +EG DG + A F + V + I+D+
Sbjct: 365 IADEGNHCIRMIDKDGIVTTPIGQPGQEGYMDGSPEIALFKKPRGVA-VDKNGDVYIADY 423
Query: 179 GNQLIRQINLK 189
GN+ +R++ L+
Sbjct: 424 GNRCLRKLTLQ 434
>gi|312198348|ref|YP_004018409.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311229684|gb|ADP82539.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 782
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 55/102 (53%), Gaps = 3/102 (2%)
Query: 88 DGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKK 145
DG+ DG P + A P AVD +GN+Y+AD N +R++ G +TT+AG G +
Sbjct: 505 DGLTYAGDGVPATKAMLRSPNDLAVDGRGNLYIADGENERVRRVDTHGIITTVAGSGVEA 564
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G DGPA +A F LT P +L + D GN IR+I
Sbjct: 565 GGDGDGPATDAFFKYTEGLTIGPD-GSLYVGDQGNSRIRRIT 605
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 8/86 (9%)
Query: 60 IVLDSSRSAFYTLSF------PLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAV 111
I LD R FY + + ++ +AG+G+ G++ G+ SA+ D P AV
Sbjct: 400 ITLDLKRGYFYIADYYNERVRRVDHNGIITTVAGNGIAGFAGDGGQATSAQLDSPTDVAV 459
Query: 112 DMKGNIYVADKSNHVIRKITNLGVTT 137
G+IY+AD +N+ +R + G+ T
Sbjct: 460 AADGSIYIADSANNRVRHVDTHGIIT 485
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 41/81 (50%), Gaps = 3/81 (3%)
Query: 81 VVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
++ +AG GV+ G DG A F + + G++YV D+ N IR+IT G + T
Sbjct: 553 IITTVAGSGVEAGGDGDGPATDAFFKYTEGLTIGPDGSLYVGDQGNSRIRRITPQGRIIT 612
Query: 138 IAGGGSKKEGRADGPAQNASF 158
IAG G++ GPA A
Sbjct: 613 IAGTGTRGYSGDGGPATKAQI 633
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 48/95 (50%), Gaps = 6/95 (6%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKP-KSFAVDMKGNIYVADKSNHVIRKITN 132
++ + + +AG G +GYS DG P + A+ D + AVD GN+Y +D+S +R++
Sbjct: 604 ITPQGRIITIAGTGTRGYSGDGGPATKAQIDSTGTAIAVDSAGNVYFSDQSR--VRRVDT 661
Query: 133 LGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
GV TT+ G G GPA A + L
Sbjct: 662 HGVITTVVGTGESGLAGDGGPAVKAKIGLAYGLVL 696
>gi|325916326|ref|ZP_08178604.1| gluconolactonase [Xanthomonas vesicatoria ATCC 35937]
gi|325537497|gb|EGD09215.1| gluconolactonase [Xanthomonas vesicatoria ATCC 35937]
Length = 699
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 57/112 (50%), Gaps = 4/112 (3%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
+ + VV LAGDG G+ DG ARF+ P AVD +G +YVAD N IR I G V
Sbjct: 153 TPQGVVTTLAGDGTAGFGDGAASQARFNGPMGVAVDAQGRVYVADTYNDRIRVIERDGQV 212
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
T+A GS G ADG + A +L H + ++D N IR+I
Sbjct: 213 RTLA--GSALPGMADGVGEQARLDTPTDLKVDAH-GVVWVADMRNDAIRRIT 261
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/123 (39%), Positives = 66/123 (53%), Gaps = 8/123 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V+ LAG G +G++DG +A F+ P A+D GN+Y+AD NH IRK T G VTT+AG
Sbjct: 105 VQTLAG-GREGFADGIGAAAAFNTPSGIALDTAGNLYIADTGNHAIRKRTPQGVVTTLAG 163
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
G+ G DG A A F+ + V + ++D N IR I D + +G
Sbjct: 164 DGTA--GFGDGAASQARFNGPMGVA-VDAQGRVYVADTYNDRIRVIE---RDGQVRTLAG 217
Query: 201 SAL 203
SAL
Sbjct: 218 SAL 220
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 62/116 (53%), Gaps = 6/116 (5%)
Query: 72 LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS-NHVIR-K 129
++ PL + ++ +AG+GV+G D A+F +P A+D G +Y+AD N+ IR
Sbjct: 40 VATPLGWAAQLQWVAGNGVRGAQDARAADAQFAEPYGLAIDTHGALYIADAGDNNRIRVM 99
Query: 130 ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
+ N V T+AGG +EG ADG A+F+ + L I+D GN IR+
Sbjct: 100 LPNGSVQTLAGG---REGFADGIGAAAAFNTPSGIAL-DTAGNLYIADTGNHAIRK 151
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+ + V+ LAG + G +DG AR D P VD G ++VAD N IR+IT G
Sbjct: 206 IERDGQVRTLAGSALPGMADGVGEQARLDTPTDLKVDAHGVVWVADMRNDAIRRITPDGS 265
Query: 135 VTTIAGG 141
V T+ GG
Sbjct: 266 VATLVGG 272
>gi|190572588|ref|YP_001970433.1| NHL repeat-containing protein [Stenotrophomonas maltophilia K279a]
gi|190010510|emb|CAQ44119.1| putative NHL repeat protein [Stenotrophomonas maltophilia K279a]
Length = 1267
Score = 68.2 bits (165), Expect = 4e-09, Method: Composition-based stats.
Identities = 56/152 (36%), Positives = 73/152 (48%), Gaps = 11/152 (7%)
Query: 38 IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
+DG L+ N S I +L V D+ A + + V LAG G QG++D
Sbjct: 688 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGHAD 742
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQ 154
G ARFD P AVD +G +YVAD N IR I G V T+AGG + G ADG
Sbjct: 743 GPAVQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGMVRTLAGG--DRPGLADGVGA 800
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A F L F ALL++D N +R++
Sbjct: 801 AARFDTPVALAFDAQ-GALLVADLFNNAVRRV 831
Score = 66.6 bits (161), Expect = 9e-09, Method: Composition-based stats.
Identities = 38/78 (48%), Positives = 49/78 (62%), Gaps = 5/78 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
V+ +AG G +G DG A F+ P A D +GN+YVAD NH IR+I T+ VTT+AG
Sbjct: 677 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGTDGQVTTLAG 735
Query: 141 GGSKKEGRADGPAQNASF 158
G ++G ADGPA A F
Sbjct: 736 G---EQGHADGPAVQARF 750
Score = 62.4 bits (150), Expect = 2e-07, Method: Composition-based stats.
Identities = 41/114 (35%), Positives = 59/114 (51%), Gaps = 6/114 (5%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD--KSNHVIRKITN 132
PL+ + ++ LAGDG G DG ARF P + G++Y D +N + R++ +
Sbjct: 615 PLAWTAQIEPLAGDGHPGDRDGASAQARFADPYALLRGADGSVYFTDAGDNNRIRRRLPD 674
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V T+AG G EGR DGPA ASF+ + L ++D GN IR+I
Sbjct: 675 GRVETVAGQG---EGRVDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRI 724
>gi|182414082|ref|YP_001819148.1| NHL repeat-containing protein [Opitutus terrae PB90-1]
gi|177841296|gb|ACB75548.1| NHL repeat containing protein [Opitutus terrae PB90-1]
Length = 963
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 61/118 (51%), Gaps = 8/118 (6%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--- 131
++ VV LAGD + G +DG G ARF P AVD GNI+VAD N+ IR++T
Sbjct: 253 ITPTGVVLTLAGDPDLVGSADGTGGDARFSSPAGLAVDRDGNIFVADSLNNTIRRVTPLN 312
Query: 132 ---NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
LGV T G + G ADG A F+ + + V + ++D GN IR+I
Sbjct: 313 GPAPLGVVTTVAGQAGVTGSADGVGSQARFNLPYGIA-VDAAGNIFVADLGNTTIRKI 369
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 2/111 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ V LAG+ G +DG ARF+ P AVD+ GN YVAD N IRKIT GV
Sbjct: 369 IAPSGAVTTLAGEASVGTADGPGPMARFNYPNGVAVDLAGNTYVADTFNATIRKITPAGV 428
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ G + + G ADG A F +F L + + + +R+I
Sbjct: 429 VSTLAGAAGQIGSADGTGSAARF--EFPLGIAVDRAGNVYTTANSATVRKI 477
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 46/109 (42%), Positives = 56/109 (51%), Gaps = 15/109 (13%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
LAG G DG+ +A F P S AVD GN++VAD +NH IRKIT G V+T AG G
Sbjct: 44 LAGRPFFGGLDGQGRAAGFTTPSSAAVDQAGNLFVADTTNHTIRKITPSGTVSTFAGMGG 103
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
+ G DG A F PH A L ++D GN IR+I
Sbjct: 104 QP-GSVDGTGNAARF-------LSPHGVALDEAGNLYVADSGNNTIRKI 144
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 55/112 (49%), Gaps = 3/112 (2%)
Query: 76 LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV LAG Q G +DG +ARF+ P AVD GN+Y S +RKIT G
Sbjct: 423 ITPAGVVSTLAGAAGQIGSADGTGSAARFEFPLGIAVDRAGNVYTTANS-ATVRKITPAG 481
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V T G S G ADGP A F+ L V L ++D N IRQI
Sbjct: 482 VVTTIAGVSGNFGSADGPGLAARFAFPNGLA-VATDGTLYVADEENSTIRQI 532
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 63/131 (48%), Gaps = 2/131 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV +AG G G +DG +ARF P AV G +YVAD+ N IR+IT G
Sbjct: 477 ITPAGVVTTIAGVSGNFGSADGPGLAARFAFPNGLAVATDGTLYVADEENSTIRQITPDG 536
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
+ + G + G DG A F LT + L +SD G+ +R+I E +
Sbjct: 537 MVSTLAGSPAQRGGIDGTGTAARFVQPAGLT-IDAAGNLYVSDRGDFTVRKITPAGEVTT 595
Query: 195 KSSQSGSALGA 205
+ Q G A GA
Sbjct: 596 VAGQHGIAGGA 606
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 73/143 (51%), Gaps = 15/143 (10%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEES---VVKRLAGD-GVQGYSDGEPGSARFDKP 106
+DR G ++ V DS + ++ PL+ + VV +AG GV G +DG ARF+ P
Sbjct: 289 VDRDG--NIFVADSLNNTIRRVT-PLNGPAPLGVVTTVAGQAGVTGSADGVGSQARFNLP 345
Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS--NDFE 163
AVD GNI+VAD N IRKI G VTT+A G G ADGP A F+ N
Sbjct: 346 YGIAVDAAGNIFVADLGNTTIRKIAPSGAVTTLA--GEASVGTADGPGPMARFNYPNGVA 403
Query: 164 LTFVPHICALLISDHGNQLIRQI 186
+ + ++D N IR+I
Sbjct: 404 VDLAGNT---YVADTFNATIRKI 423
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 56/112 (50%), Gaps = 3/112 (2%)
Query: 76 LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ + +V LAG Q G DG +ARF +P +D GN+YV+D+ + +RKIT G
Sbjct: 532 ITPDGMVSTLAGSPAQRGGIDGTGTAARFVQPAGLTIDAAGNLYVSDRGDFTVRKITPAG 591
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T G G ADG A F+ + + L ++D N+ IRQI
Sbjct: 592 EVTTVAGQHGIAGGADGTGSAAQFAYAGGIA-IDRRGTLYVADSNNR-IRQI 641
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 57/112 (50%), Gaps = 2/112 (1%)
Query: 76 LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ V AG G Q G DG +ARF P A+D GN+YVAD N+ IRKIT G
Sbjct: 89 ITPSGTVSTFAGMGGQPGSVDGTGNAARFLSPHGVALDEAGNLYVADSGNNTIRKITPTG 148
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V + G + G ADG A F++ +T P L ++D N +IR I
Sbjct: 149 VVSTLAGQAGAAGSADGDGSAARFNHPTGVTAYPD-GTLFVADTQNHVIRTI 199
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 54/106 (50%), Gaps = 3/106 (2%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
V AG G++G ++G +A F P++ AV +GN+YV ++ + IR IT GV
Sbjct: 205 VSTFAGKTGIRGNTNGTVDTALFALPRNIAV-FRGNLYVTEQESAAIRWITPTGVVLTLA 263
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G ADG +A FS+ L V + ++D N IR++
Sbjct: 264 GDPDLVGSADGTGGDARFSSPAGLA-VDRDGNIFVADSLNNTIRRV 308
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 27/58 (46%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
RF+ P G ++VAD NHVIR IT G + G + G +G A F+
Sbjct: 171 RFNHPTGVTAYPDGTLFVADTQNHVIRTITPAGRVSTFAGKTGIRGNTNGTVDTALFA 228
>gi|444912221|ref|ZP_21232386.1| putative hemolysin [Cystobacter fuscus DSM 2262]
gi|444717129|gb|ELW57964.1| putative hemolysin [Cystobacter fuscus DSM 2262]
Length = 844
Score = 67.8 bits (164), Expect = 4e-09, Method: Composition-based stats.
Identities = 42/107 (39%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
VKR++ DG S R + P A D +GN+YV+D N+ IR+IT G TT+ G
Sbjct: 591 VKRVSPDG----SVSTLSHVRLNGPLGIAADAQGNVYVSDSDNYCIRRITPDGTTTVFAG 646
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ G DG A A F+ L FV LL++D GN +IR+I+L
Sbjct: 647 AEMQPGGMDGSALQARFNQPAGL-FVTPAQELLVADLGNGVIRRIDL 692
Score = 40.8 bits (94), Expect = 0.66, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 27/56 (48%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S V LAG G G DG A F P AV G +YVAD N ++R IT
Sbjct: 788 ISPAGEVTTLAGSGRFGARDGRGADADFVVPGGLAVGTDGTLYVADSGNALLRAIT 843
>gi|29348037|ref|NP_811540.1| cell surface protein [Bacteroides thetaiotaomicron VPI-5482]
gi|29339939|gb|AAO77734.1| putative cell surface protein, have conserved domain [Bacteroides
thetaiotaomicron VPI-5482]
Length = 434
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 68/132 (51%), Gaps = 5/132 (3%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
L + +++ YT + E K AG G G++DG A F+ P+ ++DM+GNI
Sbjct: 307 LYIAYAAKHVIYTYNIETGEH---KLFAGAFGQSGWNDGIATDAEFNSPRQMSLDMEGNI 363
Query: 118 YVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISD 177
Y+AD NH IR I G+ T G + G ADG + + + ND V + I+D
Sbjct: 364 YIADSGNHCIRMIDKNGIVTTPIGQPGEAGYADG-SPDMALLNDPRGVAVNSEGDVYIAD 422
Query: 178 HGNQLIRQINLK 189
GN+ IR++ L+
Sbjct: 423 LGNRCIRKLTLQ 434
>gi|430809992|ref|ZP_19437107.1| NHL repeat containing protein [Cupriavidus sp. HMR-1]
gi|429497580|gb|EKZ96110.1| NHL repeat containing protein [Cupriavidus sp. HMR-1]
Length = 536
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 43/106 (40%), Positives = 59/106 (55%), Gaps = 8/106 (7%)
Query: 60 IVLDSSRSAFYTLSFP-----LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
IV+D + + F ++ ++ VV LAG G G++DG +A F++P AVD
Sbjct: 350 IVVDVAGNVFVADTYNHLVRMVTPTGVVTTLAGTGTAGFADGPAQAAAFNRPAGIAVDAA 409
Query: 115 GNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
GNIYV+D N IRKI+ VTT+A GS G ADG ASF+
Sbjct: 410 GNIYVSDLGNARIRKISPANVVTTLA--GSTTRGSADGTGAAASFT 453
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 53/88 (60%), Gaps = 4/88 (4%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
+A F P VD+ GN++VAD NH++R +T G VTT+AG G+ G ADGPAQ A+F
Sbjct: 341 AASFYYPSDIVVDVAGNVFVADTYNHLVRMVTPTGVVTTLAGTGT--AGFADGPAQAAAF 398
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
+ + V + +SD GN IR+I
Sbjct: 399 NRPAGIA-VDAAGNIYVSDLGNARIRKI 425
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 38/60 (63%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+S V LAG GV G +DG +A F+ P AVD GN+YVAD NH+IRKIT+ GV
Sbjct: 263 ISPYGAVTTLAGSGVAGSTDGTGTAASFNLPSGVAVDGAGNVYVADTYNHLIRKITSAGV 322
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 44/82 (53%), Gaps = 4/82 (4%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
P AVD GN+YVAD NH IRKI+ G VTT+AG G G DG ASF N
Sbjct: 239 PIDVAVDGTGNVYVADYGNHRIRKISPYGAVTTLAGSG--VAGSTDGTGTAASF-NLPSG 295
Query: 165 TFVPHICALLISDHGNQLIRQI 186
V + ++D N LIR+I
Sbjct: 296 VAVDGAGNVYVADTYNHLIRKI 317
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 33/62 (53%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+S +VV LAG +G +DG +A F A D GN+Y D ++ +RK+T GV
Sbjct: 425 ISPANVVTTLAGSTTRGSADGTGAAASFTSLLRIATDAAGNVYAVDAGSNAVRKVTPAGV 484
Query: 136 TT 137
T
Sbjct: 485 VT 486
>gi|325104746|ref|YP_004274400.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
gi|324973594|gb|ADY52578.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
Length = 1769
Score = 67.8 bits (164), Expect = 5e-09, Method: Composition-based stats.
Identities = 36/84 (42%), Positives = 50/84 (59%), Gaps = 3/84 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ V AG+G GY+DG SA F+ P++ +D NIYV++ SNH IRKIT G
Sbjct: 1079 ITPNGTVSTFAGNGTAGYADGPGTSAMFNSPQAMVMDASDNIYVSESSNHRIRKITPAGE 1138
Query: 135 VTTIAGGGSKKEGRADGPAQNASF 158
V+T+AG G +G DG + A F
Sbjct: 1139 VSTVAGKGV--QGNRDGTKEEAQF 1160
Score = 66.2 bits (160), Expect = 2e-08, Method: Composition-based stats.
Identities = 43/112 (38%), Positives = 62/112 (55%), Gaps = 5/112 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG 134
++ V AG+G GY+DG ARF++P D GN+YV++ SN+ IRKIT N
Sbjct: 1026 ITPNGTVSTFAGNGTAGYADGVGVLARFNRPTGIITDALGNVYVSESSNY-IRKITPNGT 1084
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V+T AG G+ G ADGP +A F N + + + +S+ N IR+I
Sbjct: 1085 VSTFAGNGTA--GYADGPGTSAMF-NSPQAMVMDASDNIYVSESSNHRIRKI 1133
Score = 64.7 bits (156), Expect = 4e-08, Method: Composition-based stats.
Identities = 38/85 (44%), Positives = 51/85 (60%), Gaps = 3/85 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG 134
++ + +V +AG G G S+G +A F+KP A+D GNIY+AD NH IRKIT N
Sbjct: 972 ITPQGLVSTIAGSGTNGNSNGVGTAASFNKPTGIAIDPWGNIYIADDLNHSIRKITPNGT 1031
Query: 135 VTTIAGGGSKKEGRADGPAQNASFS 159
V+T AG G+ G ADG A F+
Sbjct: 1032 VSTFAGNGTA--GYADGVGVLARFN 1054
Score = 61.6 bits (148), Expect = 4e-07, Method: Composition-based stats.
Identities = 41/116 (35%), Positives = 64/116 (55%), Gaps = 12/116 (10%)
Query: 80 SVVKRLAGDG--------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
S+++++ DG GY+DG +A+F+ P+ A+D GNIY+A++ IRKIT
Sbjct: 914 SLIRKITADGQVSTFAGSTFGYADGIGTAAKFNGPQGMAIDASGNIYIAERGASRIRKIT 973
Query: 132 NLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G V+TIAG G+ G ++G ASF+ + P + I+D N IR+I
Sbjct: 974 PQGLVSTIAGSGTN--GNSNGVGTAASFNKPTGIAIDPW-GNIYIADDLNHSIRKI 1026
Score = 58.9 bits (141), Expect = 2e-06, Method: Composition-based stats.
Identities = 40/102 (39%), Positives = 55/102 (53%), Gaps = 12/102 (11%)
Query: 52 DRPGSSDL------IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGS 100
D PG+S + +V+D+S + + + S ++ V +AG GVQG DG
Sbjct: 1098 DGPGTSAMFNSPQAMVMDASDNIYVSESSNHRIRKITPAGEVSTVAGKGVQGNRDGTKEE 1157
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
A+F P A+DM GNIYVA+ SNH IRKIT G+T G
Sbjct: 1158 AQFWGPVGIALDMSGNIYVAEWSNHRIRKITQ-GITLTGTPG 1198
>gi|322434821|ref|YP_004217033.1| hypothetical protein AciX9_1190 [Granulicella tundricola MP5ACTX9]
gi|321162548|gb|ADW68253.1| NHL repeat containing protein [Granulicella tundricola MP5ACTX9]
Length = 1631
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 59/110 (53%), Gaps = 4/110 (3%)
Query: 81 VVKRLAGDGVQ--GYSDGEPGSAR-FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
V+ AG+GV GY DG P +A + P AV KG IY+AD+ N +IRK+ N ++T
Sbjct: 313 VITTFAGNGVTTGGYGDGGPAAAAMLNAPWGIAVSSKGEIYIADQGNSLIRKVVNGTIST 372
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+AG + ++ PA + N V L ISD GN LIR++N
Sbjct: 373 VAGTHDAADTGSNPPAIHTQL-NSPAGVVVDVAGNLYISDSGNNLIRKVN 421
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 4/91 (4%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRAD-GPAQNA 156
SA P S A+D GN++++D N+VIRK+ + ++TIAG K D GPA +A
Sbjct: 219 SATLSSPSSVALDGAGNVFISDTGNNVIRKVNVFDGTISTIAGQMGKNGYVGDGGPASSA 278
Query: 157 SFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ L F L D N ++R+I+
Sbjct: 279 KLNGPNGLVFDAQ-GNLYFCDTNNNVVRRID 308
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 58/110 (52%), Gaps = 4/110 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVT-TI 138
+ +AG V Y+ +G + F P +D GN+Y+ D SN IRK+ + G+T TI
Sbjct: 143 ITTVAGSSVWIYNGEGVATQSPFFLPFGVTLDAAGNLYITDTSNTRIRKVDAVTGMTSTI 202
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
AG G+ GPA +A+ S+ + + ISD GN +IR++N+
Sbjct: 203 AGNGTIGGTGDGGPATSATLSSPSSVAL-DGAGNVFISDTGNNVIRKVNV 251
>gi|290972390|ref|XP_002668936.1| predicted protein [Naegleria gruberi]
gi|284082474|gb|EFC36192.1| predicted protein [Naegleria gruberi]
Length = 727
Score = 67.0 bits (162), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 82/176 (46%), Gaps = 23/176 (13%)
Query: 27 LLEDGYTVTTVIDGHQ-------LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF----- 74
+LE+G VT +G Q L +N V S V SS++ Y +
Sbjct: 35 ILENGNIVTIAGNGQQGYSGDNGLAVNAQLV------SPSYVFVSSKNEVYIADYQNNRI 88
Query: 75 -PLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI- 130
+ E + +AG+G QGY+ +G +A+ P S V +K +Y+AD NH IRKI
Sbjct: 89 RKILENGNIITIAGNGQQGYNGDNGLAVNAQLASPSSVCVSLKNEVYIADSGNHSIRKIL 148
Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
N + TIAG G + +GPA NA D FV + I+D GN IR+I
Sbjct: 149 ENGNIVTIAGNGQQGYNGDNGPAVNAPLF-DPSSVFVTPNDEVYIADTGNHRIRKI 203
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 61/114 (53%), Gaps = 8/114 (7%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSA----RFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
E + +AG+G QGY +G+ G A FD P S V +Y+AD NH IRKI N
Sbjct: 149 ENGNIVTIAGNGQQGY-NGDNGPAVNAPLFD-PSSVFVTPNDEVYIADTGNHRIRKILRN 206
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G + +GPA NA S+ ++ FV + I+D+ N IR+I
Sbjct: 207 GNIITIAGNGQQGYSGDNGPAVNAKLSSPVDV-FVSSKNEVYIADYQNTRIRKI 259
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 54/105 (51%), Gaps = 4/105 (3%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
+AG+G QGYS +G +A+ P V K +Y+AD N IRKI N + TIAG
Sbjct: 212 IAGNGQQGYSGDNGPAVNAKLSSPVDVFVSSKNEVYIADYQNTRIRKILENGNIITIAGN 271
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G +G A NA + + FV + I+D+ N IR+I
Sbjct: 272 GKLGYGGDNGLAVNAKLYSPVGV-FVSQDNEVYIADYQNNRIRKI 315
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 3/89 (3%)
Query: 100 SARFDKPKS-FAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNAS 157
+A+ P+S F +Y+AD NH +RKI N + TIAG G + +G A NA
Sbjct: 4 NAQLFNPRSIFVSSSNDEVYIADTGNHTVRKILENGNIVTIAGNGQQGYSGDNGLAVNAQ 63
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQI 186
+ FV + I+D+ N IR+I
Sbjct: 64 LVSP-SYVFVSSKNEVYIADYQNNRIRKI 91
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 74/169 (43%), Gaps = 20/169 (11%)
Query: 27 LLEDGYTVTTVIDGHQ-LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF------PLSEE 79
+L +G +T +G Q + ++ SS + V SS++ Y + + E
Sbjct: 203 ILRNGNIITIAGNGQQGYSGDNGPAVNAKLSSPVDVFVSSKNEVYIADYQNTRIRKILEN 262
Query: 80 SVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
+ +AG+G GY +G +A+ P V +Y+AD N+ IRKI N +
Sbjct: 263 GNIITIAGNGKLGYGGDNGLAVNAKLYSPVGVFVSQDNEVYIADYQNNRIRKILENGNII 322
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
TIAG G E DG + F+ + PHI + S +G +++
Sbjct: 323 TIAGNG---ELGYDGDSV-------FDFSKYPHIGPIYSSTYGKAKLKK 361
>gi|86605024|ref|YP_473787.1| NHL repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86553566|gb|ABC98524.1| NHL repeat domain protein [Synechococcus sp. JA-3-3Ab]
Length = 637
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 50/84 (59%), Gaps = 3/84 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+S + V AG V GY DGE A+F +P A+ + G++YVAD++N IR+IT G
Sbjct: 408 ISRQRQVTTFAGTCVAGYRDGERDEAQFREPFGLALGLDGSLYVADRANRRIRRITPTGK 467
Query: 135 VTTIAGGGSKKEGRADGPAQNASF 158
VTT AG G + G ADGPA A
Sbjct: 468 VTTAAGTG--QPGSADGPADQAQL 489
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 65/134 (48%), Gaps = 6/134 (4%)
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-N 132
F LS E ++ AG G G+ DG A+FD P+ D KG ++VAD NH +R I+
Sbjct: 352 FRLSPEGELEVWAGSGKAGHRDGAADQAQFDSPQGLLWDPKGGLWVADSGNHCLRHISRQ 411
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL--KP 190
VTT A G+ G DG A F F L +L ++D N+ IR+I K
Sbjct: 412 RQVTTFA--GTCVAGYRDGERDEAQFREPFGLALG-LDGSLYVADRANRRIRRITPTGKV 468
Query: 191 EDCSKSSQSGSALG 204
+ + Q GSA G
Sbjct: 469 TTAAGTGQPGSADG 482
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 62/121 (51%), Gaps = 12/121 (9%)
Query: 76 LSEESVVKRLAGDGV--------QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
+++ ++RL+ DG GY DG ARF A D G +++AD+ NH +
Sbjct: 505 IADRHRLRRLSADGQLTTLSRAEAGYRDGPLAEARFQTLAGLAFDSAGILWLADRDNHRL 564
Query: 128 RKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
R++ G V+T+A G + G DGPA A F +L +P ++++ D G +R++
Sbjct: 565 RRLQPNGQVSTLA--GQDEPGWQDGPASVARFEQPGDLLVLPD-GSVVVVDAGLPGLRRL 621
Query: 187 N 187
+
Sbjct: 622 S 622
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 55/107 (51%), Gaps = 7/107 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V AG G G +DG A+ +P + AVD +GN+++AD+ H +R+++ G +TT++
Sbjct: 468 VTTAAGTGQPGSADGPADQAQLLQPTALAVDREGNLWIADR--HRLRRLSADGQLTTLS- 524
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ G DGP A F L F L ++D N +R++
Sbjct: 525 --RAEAGYRDGPLAEARFQTLAGLAFD-SAGILWLADRDNHRLRRLQ 568
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 2/79 (2%)
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
AVD +G Y+AD + H I +++ G + GS K G DG A A F + L + P
Sbjct: 333 IAVDPEGFFYLADPAQHRIFRLSPEGELEVW-AGSGKAGHRDGAADQAQFDSPQGLLWDP 391
Query: 169 HICALLISDHGNQLIRQIN 187
L ++D GN +R I+
Sbjct: 392 K-GGLWVADSGNHCLRHIS 409
>gi|380693545|ref|ZP_09858404.1| hypothetical protein BfaeM_06134 [Bacteroides faecis MAJ27]
Length = 454
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 57/98 (58%), Gaps = 1/98 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY +G + R +KP+ A D GN++V +++NH+IRKIT G T+ G ++ G DG
Sbjct: 358 GYVNGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKITPAGSATLYAGRPEQSGFGDG 417
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ A F+ +T P ++ ++D N +IR++ ++
Sbjct: 418 LPEEAKFNQPECVTVYPD-NSIYVADRDNHVIRRVTVE 454
>gi|302768699|ref|XP_002967769.1| hypothetical protein SELMODRAFT_409048 [Selaginella moellendorffii]
gi|300164507|gb|EFJ31116.1| hypothetical protein SELMODRAFT_409048 [Selaginella moellendorffii]
Length = 199
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 69/140 (49%), Gaps = 13/140 (9%)
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
VTTIAGG S+K G ADGP A FS++F L +LLI+D GN+LIR+I + +D
Sbjct: 68 VTTIAGGSSRKPGFADGPGDTARFSSEFSLAC--SCGSLLIADRGNRLIREIQI--DDPK 123
Query: 195 KSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEALPNQSG 254
S SA+ W L+ V+ L+ + + +II LD + + L N
Sbjct: 124 SRDSSDSAVSGPQKWALIPVV--LLGICLEMPLGAFIIWKI--LDQRCRQAGDQLGNTER 179
Query: 255 -----ETSTDLLLRHQKRNC 269
DL +R QKR C
Sbjct: 180 LLAGVPEDDDLQVRQQKRTC 199
>gi|427400231|ref|ZP_18891469.1| hypothetical protein HMPREF9710_01065 [Massilia timonae CCUG 45783]
gi|425720971|gb|EKU83886.1| hypothetical protein HMPREF9710_01065 [Massilia timonae CCUG 45783]
Length = 367
Score = 66.6 bits (161), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 44/77 (57%), Gaps = 1/77 (1%)
Query: 85 LAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
LAGD ++G DG +ARF P+ A+D N+YVAD N VIRKIT GV T G +
Sbjct: 55 LAGDPTIEGTMDGAGTAARFKWPQGIAIDADDNLYVADTGNFVIRKITPAGVVTTVAGAA 114
Query: 144 KKEGRADGPAQNASFSN 160
G DG A NA F N
Sbjct: 115 GTSGFVDGDAGNARFGN 131
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 44/78 (56%), Gaps = 7/78 (8%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV +AG G G+ DG+ G+ARF P + AV+ +G IYVAD N IR IT+ G
Sbjct: 101 ITPAGVVTTVAGAAGTSGFVDGDAGNARFGNPVAVAVNRRGTIYVAD--NLRIRSITSAG 158
Query: 135 ----VTTIAGGGSKKEGR 148
TI G + +GR
Sbjct: 159 RVSTTATIPIGNNVSDGR 176
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 35/54 (64%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
+G ++G +ARF++ + VD +GN+Y AD N+++R+IT GV T G ++
Sbjct: 273 RGSNNGVGTAARFEQVVALTVDPRGNVYAADAINNLVRRITPEGVVTTVAGSTR 326
>gi|298386893|ref|ZP_06996448.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
gi|298260567|gb|EFI03436.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
Length = 454
Score = 66.6 bits (161), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 57/98 (58%), Gaps = 1/98 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY +G + R +KP+ A D GN++V +++NH+IRKIT G T+ G ++ G DG
Sbjct: 358 GYVNGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKITPAGSATLYAGRPEQSGFGDG 417
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ A F+ +T P ++ ++D N +IR++ ++
Sbjct: 418 LPEEAKFNQPECVTVYPD-NSIYVADRDNHVIRRVTVE 454
>gi|124006033|ref|ZP_01690870.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
gi|123988440|gb|EAY28086.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
Length = 789
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 63/184 (34%), Positives = 94/184 (51%), Gaps = 22/184 (11%)
Query: 12 ITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFY 70
ITLL ++ SL +SG ++ Y + TV+ + N V +D +++ V D+ R+ Y
Sbjct: 9 ITLLCIV-SLEASG---QNQYKINTVLSNYD---NIWGVAVD--AQNNVYVTDAERNRIY 59
Query: 71 TLSFPLSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIR 128
L +++ AG G +GYS DG P +A D P VD GN+Y AD SN V+R
Sbjct: 60 RLDAITGVRTII---AGTGTEGYSGDGGPATAAMLDYPTGITVDAAGNVYFADGSNDVVR 116
Query: 129 KI--TNLGVTTIAGGGSKKEGRADGPAQNAS--FSNDFELTFVPHICALLISDHGNQLIR 184
KI T ++TIAG G++ G A A F +D L +I I DH N IR
Sbjct: 117 KIDATTHVISTIAGNGNRGFAGDGGQATAAQLHFPSDVALDTAGNI---YIVDHRNDRIR 173
Query: 185 QINL 188
++++
Sbjct: 174 KVDI 177
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 53/106 (50%), Gaps = 7/106 (6%)
Query: 85 LAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAG 140
AGDG +DG P A PK A+D GN+Y+AD + IRK+ T + +TTIAG
Sbjct: 225 FAGDG-NALNDGGPAHLASLRNPKGLAIDAAGNVYIADVLDDRIRKVDARTGI-ITTIAG 282
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G+ G A +A + F + V + + +D N +R++
Sbjct: 283 TGAGGYSGDGGVATSARINYPFRVA-VDALGNVYFTDWDNDALRKL 327
>gi|428175994|gb|EKX44881.1| hypothetical protein GUITHDRAFT_60229, partial [Guillardia theta
CCMP2712]
Length = 181
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 5/109 (4%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLG--VTTIAG- 140
LAG GY DG+ RF+ P A G+ + VAD +NH IR+I + VTTIAG
Sbjct: 1 LAGPSSAGYQDGQAKIVRFNSPLDIAFSKDGSWLAVADTNNHAIRRIDAMTGVVTTIAGC 60
Query: 141 -GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
S G DGPA ASF++ + P + ++D N +IRQI+L
Sbjct: 61 PRSSSCLGSQDGPASAASFNSPTSIALDPQDKFIFVADTSNNMIRQIDL 109
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 62/116 (53%), Gaps = 9/116 (7%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGR 148
G DG +A F+ P S A+D + I+VAD SN++IR+I T + VTTIA G+ + G
Sbjct: 68 GSQDGPASAASFNSPTSIALDPQDKFIFVADTSNNMIRQIDLTAMTVTTIA--GATRSGY 125
Query: 149 ADGPAQNASFSNDFELTFVPHICA----LLISDHGNQLIRQINLKPEDCSKSSQSG 200
ADG + F N +T P ++++D N IR+I+L S + SG
Sbjct: 126 ADGLLAASLFYNPTGITAHPDFTNENKLVMVADTQNHAIRRIDLFQGRVSTVAGSG 181
>gi|403382146|ref|ZP_10924203.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
JC66]
Length = 531
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 81/191 (42%), Gaps = 25/191 (13%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEINPHS-----VIDRPGSSDLIVLDSSRSAFYTLSFP 75
L S L EDG V + DG + E++ S +D G +L V D A +
Sbjct: 98 LAGSSFLQEDGQVVDALGDG-KGELSSFSEPAGLALDHNG--NLFVADKGNHAVRKVD-- 152
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
E V AG GV G+ DG + F P+ V G +YVAD NHVIRKI G
Sbjct: 153 --AEGNVTTYAGQGVLGHKDGTAEESLFYAPEDVVVASDGTVYVADTLNHVIRKIDPEGK 210
Query: 135 -----------VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
+ G DGP Q A F+ L + H+ L ISD GN++I
Sbjct: 211 VSTLNALPQRYIEVFPGEAVLAGDYKDGPLQEAKFNEPTGLA-IDHLGNLYISDTGNRVI 269
Query: 184 RQINLKPEDCS 194
R ++L + S
Sbjct: 270 RYMDLANDRVS 280
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 62/125 (49%), Gaps = 7/125 (5%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
S V+ +AG G G DG+ A F P A G +YV+D +H++R++ + GV+ +A
Sbjct: 40 SSVQTIAGTGSFGARDGDKAEASFRHPSGLAAAPDGTLYVSDTKSHLLRRLDHSGVSLLA 99
Query: 140 GG------GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
G G + DG + +SFS L + H L ++D GN +R+++ +
Sbjct: 100 GSSFLQEDGQVVDALGDGKGELSSFSEPAGLA-LDHNGNLFVADKGNHAVRKVDAEGNVT 158
Query: 194 SKSSQ 198
+ + Q
Sbjct: 159 TYAGQ 163
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 33/61 (54%), Gaps = 1/61 (1%)
Query: 92 GYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
G+SDG A F P+ A+ +G + +AD NH IR + V T+AGG + G+ D
Sbjct: 302 GFSDGHATEEALFMAPRGIAITEEGGLVIADSLNHAIRYLFEGRVITLAGGHEAEHGQQD 361
Query: 151 G 151
G
Sbjct: 362 G 362
>gi|340788705|ref|YP_004754170.1| hypothetical protein CFU_3523 [Collimonas fungivorans Ter331]
gi|340553972|gb|AEK63347.1| hypothetical protein CFU_3523 [Collimonas fungivorans Ter331]
Length = 394
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 50/86 (58%), Gaps = 1/86 (1%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
ARF++P AVD GN+Y+AD NH IR IT+ GV T G + + G+ DG A +A F+
Sbjct: 108 ARFNQPGGIAVDAGGNLYLADTQNHTIRMITSAGVVTTIAGSAGQAGQNDGTAGDARFNQ 167
Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
+ + L ++D GN +R+I
Sbjct: 168 PWGVAR-DGAGNLYVTDTGNATVRKI 192
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 52/93 (55%), Gaps = 6/93 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
+ ++ V DS T+S VV LAG G G +DG ARF++P+ A D
Sbjct: 297 AGNIRVSDSGNQLIRTVSL----TGVVSTLAGAAGTAGSTDGSGNKARFNQPEGIAADAA 352
Query: 115 GNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKE 146
NIYVAD SN++IRK+T + V+T+ GGG+
Sbjct: 353 NNIYVADTSNNLIRKVTPDAQVSTLFGGGNSNS 385
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV +AG G G +DG G ARF++P A D GN+YV D N +RKIT G
Sbjct: 137 ITSAGVVTTIAGSAGQAGQNDGTAGDARFNQPWGVARDGAGNLYVTDTGNATVRKITAAG 196
Query: 135 VTT 137
V T
Sbjct: 197 VVT 199
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 43/86 (50%), Gaps = 1/86 (1%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
ARF++P A+D GNI V+D N +IR ++ GV + G + G DG A F N
Sbjct: 284 ARFNQPYGIALDSAGNIRVSDSGNQLIRTVSLTGVVSTLAGAAGTAGSTDGSGNKARF-N 342
Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
E + ++D N LIR++
Sbjct: 343 QPEGIAADAANNIYVADTSNNLIRKV 368
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 29/43 (67%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
G SDG + RF +P + AVD GN++VAD +N+ IRKIT G
Sbjct: 44 GSSDGSGSNVRFSQPGAIAVDAGGNLFVADTANNTIRKITAAG 86
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 49/99 (49%), Gaps = 12/99 (12%)
Query: 101 ARFDKPKSFAVDMKGN-----------IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
A+F+ P+ +D GN IY+AD +N+ IRK+ G + G + G A
Sbjct: 218 AQFNLPRGITLDNAGNSTGTAVNLAVNIYIADSNNNTIRKLDQNGNVSTLAGTAGSSGSA 277
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
DG Q A F+ + + + +SD GNQLIR ++L
Sbjct: 278 DGSGQRARFNQPYGIAL-DSAGNIRVSDSGNQLIRTVSL 315
>gi|344924046|ref|ZP_08777507.1| NHL repeat containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 368
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 69/141 (48%), Gaps = 10/141 (7%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
DRP S + +DSS + + F ++ V LAG GY +G G+A+F
Sbjct: 162 FDRPYS---VAVDSSGTVYVADFFNSRIRRITSAGVTSTLAGSSTGGYLEGTGGAAQFGT 218
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P AVD G +YV D +RKIT+ GVT++ GS G A+G +A FS+ + +
Sbjct: 219 PIDVAVDSSGTVYVTDTYTQRVRKITSGGVTSLL-AGSNTIGYAEGTGASARFSSPYGIA 277
Query: 166 FVPHICALLISDHGNQLIRQI 186
V ++D N IR+I
Sbjct: 278 -VDSSGTAYVADSDNHRIRKI 297
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 60/112 (53%), Gaps = 5/112 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ V LAG QGY++G ARFD+P S AVD G +YVAD N IR+IT+ GV
Sbjct: 136 ITSAGVTSLLAGS-AQGYAEGTGSGARFDRPYSVAVDSSGTVYVADFFNSRIRRITSAGV 194
Query: 136 T-TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T T+A GS G +G A F ++ V + ++D Q +R+I
Sbjct: 195 TSTLA--GSSTGGYLEGTGGAAQFGTPIDVA-VDSSGTVYVTDTYTQRVRKI 243
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 59/106 (55%), Gaps = 3/106 (2%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
+V LAG GY++G SARF+ P AV G IYVAD +NH IR I++ G T++
Sbjct: 34 IVSTLAGS-TGGYAEGTGASARFNYPYGIAVHSSGTIYVADSANHRIRSISSAGTTSVF- 91
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
GS G +G +A F+N + + V L +S++ N IR+I
Sbjct: 92 AGSGTAGTTEGTGASAQFNNPYGVA-VDSSGTLYVSEYTNHRIRKI 136
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 46/84 (54%), Gaps = 5/84 (5%)
Query: 60 IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
+ +DSS + + T ++ ++ V LAG GY++G SARF P AVD
Sbjct: 222 VAVDSSGTVYVTDTYTQRVRKITSGGVTSLLAGSNTIGYAEGTGASARFSSPYGIAVDSS 281
Query: 115 GNIYVADKSNHVIRKITNLGVTTI 138
G YVAD NH IRKIT+ G T++
Sbjct: 282 GTAYVADSDNHRIRKITSGGTTSL 305
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 49/87 (56%), Gaps = 3/87 (3%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
SA+F+ P AVD G +YV++ +NH IRKIT+ GVT++ G +G A+G A F
Sbjct: 106 SAQFNNPYGVAVDSSGTLYVSEYTNHRIRKITSAGVTSLLAG--SAQGYAEGTGSGARFD 163
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
+ + V + ++D N IR+I
Sbjct: 164 RPYSVA-VDSSGTVYVADFFNSRIRRI 189
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+F+ P VD G++Y+AD SNH+IRKI L
Sbjct: 323 QFNYPAGIDVDSSGSLYIADSSNHLIRKIDAL 354
>gi|298387997|ref|ZP_06997545.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
gi|298259263|gb|EFI02139.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
Length = 439
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 73/134 (54%), Gaps = 14/134 (10%)
Query: 65 SRSAFYTLS-------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
S+SA Y +P + + + + G G G+ DG+ A+F++P+ A+D +GN+
Sbjct: 311 SKSAVYRFKLTRDADGWPALDGDIEEYIPGAGA-GFRDGDVQEAQFNEPRGIAIDKEGNL 369
Query: 118 YVADKSNHVIRKI-TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
Y+AD +NH IRK+ T L VTTIAG G +G DG A F + + ++ L I
Sbjct: 370 YIADVNNHRIRKVDTKLNIVTTIAGSG---KGYKDGDPLEAQFDQPWGV-YLDKNEFLYI 425
Query: 176 SDHGNQLIRQINLK 189
+D N IR++ ++
Sbjct: 426 ADQNNHCIRKLAIE 439
>gi|405345633|ref|ZP_11022426.1| hypothetical protein A176_0011 [Chondromyces apiculatus DSM 436]
gi|397093682|gb|EJJ24379.1| hypothetical protein A176_0011 [Myxococcus sp. (contaminant ex DSM
436)]
Length = 870
Score = 65.9 bits (159), Expect = 2e-08, Method: Composition-based stats.
Identities = 52/148 (35%), Positives = 79/148 (53%), Gaps = 9/148 (6%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFD 104
+P +V+ P + +L V DS L E V+ AG G V G++DG P ARF
Sbjct: 611 SPTAVVAGP-TGELYVADSGNHVIRRLDRG-EEGWQVRTWAGQGFVAGFADGGPARARFS 668
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRADG-PAQNASFSND 161
+P + AVD GN+YVAD+ NH IR + V T+AG G+ G AD ++ASF+
Sbjct: 669 RPMALAVDAAGNVYVADQDNHRIRMVRAGTREVVTLAGTGTL--GTADAVRGRDASFAAP 726
Query: 162 FELTFVPHICALLISDHGNQLIRQINLK 189
L + + L + D +Q +R+++L+
Sbjct: 727 SALA-LGGVGTLYVLDTVSQRLRRVSLQ 753
Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats.
Identities = 38/121 (31%), Positives = 54/121 (44%), Gaps = 7/121 (5%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTTIAGGGSK 144
G G+ DG +A F P AV G + VAD N+ IR I G V+TIAG G
Sbjct: 538 GASGWQDGPADTAMFQSPTGLAVTHAGEVVVADTRNNRIRLIQQEGAGRTVSTIAGSG-- 595
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
+ G DG A + + P L ++D GN +IR+++ E + +G
Sbjct: 596 ELGHRDGAGSQALLRSPTAVVAGP-TGELYVADSGNHVIRRLDRGEEGWQVRTWAGQGFV 654
Query: 205 A 205
A
Sbjct: 655 A 655
Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats.
Identities = 44/155 (28%), Positives = 65/155 (41%), Gaps = 6/155 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI----TNLGVTT 137
V +AG G G+ DG A P + G +YVAD NHVIR++ V T
Sbjct: 588 VSTIAGSGELGHRDGAGSQALLRSPTAVVAGPTGELYVADSGNHVIRRLDRGEEGWQVRT 647
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS 197
AG G G ADG A FS L V + ++D N IR + + +
Sbjct: 648 WAGQGFVA-GFADGGPARARFSRPMALA-VDAAGNVYVADQDNHRIRMVRAGTREVVTLA 705
Query: 198 QSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
+G+ A +V + + +L +G V Y++
Sbjct: 706 GTGTLGTADAVRGRDASFAAPSALALGGVGTLYVL 740
Score = 40.8 bits (94), Expect = 0.67, Method: Composition-based stats.
Identities = 44/139 (31%), Positives = 58/139 (41%), Gaps = 10/139 (7%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ---GYSDGEPGSARFDKPKSFAV 111
G L VLD+ +S L V LAG G G+ DG ARF +
Sbjct: 733 GVGTLYVLDTVSQRLRRVS--LQGSRAVVTLAGTGAGTPFGFQDGPGSDARFRAQLGMVM 790
Query: 112 DMKGNIYVADKSNHVIRKIT---NLGVTTI-AGGGSKKEGRADGPAQNASFSNDFELTFV 167
+G + +AD +N +RKI N T + GS + G A G A A S L F
Sbjct: 791 GPQGELLLADTANLRLRKIIPGENAAATRVFTFAGSGRVGTALGRADAADLSAPVGLAFD 850
Query: 168 PHICALLISDHGNQLIRQI 186
L +SD NQ+IR +
Sbjct: 851 AG-GLLYVSDAFNQVIRVV 868
>gi|380692433|ref|ZP_09857292.1| NHL repeat-containing protein [Bacteroides faecis MAJ27]
Length = 439
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 73/134 (54%), Gaps = 14/134 (10%)
Query: 65 SRSAFYTLS-------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
S+SA Y +P + + + + G G G+ DG+ A+F++P+ A+D +GN+
Sbjct: 311 SKSAVYRFKLTRDADGWPALDGDIEEYIPGAGA-GFRDGDVQEAQFNEPRGIAIDKEGNL 369
Query: 118 YVADKSNHVIRKI-TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
Y+AD +NH IRK+ T L VTTIAG G +G DG A F + + ++ L I
Sbjct: 370 YIADVNNHRIRKVDTKLNIVTTIAGSG---KGYKDGDPLEAQFDQPWGV-YLDKNEFLYI 425
Query: 176 SDHGNQLIRQINLK 189
+D N IR++ ++
Sbjct: 426 ADQNNHCIRKLAIE 439
>gi|115376708|ref|ZP_01463935.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
gi|310822897|ref|YP_003955255.1| hypothetical protein STAUR_5665 [Stigmatella aurantiaca DW4/3-1]
gi|115366257|gb|EAU65265.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
gi|309395969|gb|ADO73428.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
Length = 856
Score = 65.9 bits (159), Expect = 2e-08, Method: Composition-based stats.
Identities = 35/86 (40%), Positives = 48/86 (55%), Gaps = 1/86 (1%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
+ P A D GNIYVAD NH IR+I G T+ G + G+ DG A+ A F+
Sbjct: 616 LNGPMGIAADASGNIYVADSDNHCIRRIAPDGTATVFAGAVMEPGQGDGTAKAARFNQPA 675
Query: 163 ELTFVPHICALLISDHGNQLIRQINL 188
L F P LL++D GN +IR+++L
Sbjct: 676 GLAFGPG-GELLVADLGNGVIRRVDL 700
Score = 47.4 bits (111), Expect = 0.007, Method: Composition-based stats.
Identities = 40/112 (35%), Positives = 55/112 (49%), Gaps = 10/112 (8%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI------TNLG 134
VV +AG GY+DG P S++F AV G++ VAD N+ IR+I
Sbjct: 747 VVSIIAGSS-PGYADGTPTSSQFLPYLGLAVLKDGSLAVADPGNYRIRRILFSAQGKATE 805
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTT+AG G + G DG Q+A L V L ++D GN L+R +
Sbjct: 806 VTTLAGSG--RFGSRDGEGQSADLVLPAGLA-VGEDGTLYVADAGNALLRAV 854
Score = 38.1 bits (87), Expect = 4.3, Method: Composition-based stats.
Identities = 21/64 (32%), Positives = 31/64 (48%), Gaps = 10/64 (15%)
Query: 77 SEESVVKRLAGDGVQ----------GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
S+ ++R+A DG G DG +ARF++P A G + VAD N V
Sbjct: 635 SDNHCIRRIAPDGTATVFAGAVMEPGQGDGTAKAARFNQPAGLAFGPGGELLVADLGNGV 694
Query: 127 IRKI 130
IR++
Sbjct: 695 IRRV 698
>gi|223939666|ref|ZP_03631540.1| NHL repeat containing protein [bacterium Ellin514]
gi|223891717|gb|EEF58204.1| NHL repeat containing protein [bacterium Ellin514]
Length = 452
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 65/133 (48%), Gaps = 6/133 (4%)
Query: 75 PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--- 130
P VV +AG G+ G+ DG A F P A+D GN+YVAD N+ IRK+
Sbjct: 203 PDGTNWVVTTIAGAAGIHGFKDGTNNDALFYSPIGLALDSAGNMYVADTGNNAIRKLTLE 262
Query: 131 -TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
TN VTTIAG +++ G DG A F+ E V + ++D N IR++ +
Sbjct: 263 GTNWVVTTIAGSTNQQNGSLDGTNNQALFTWP-ESPAVDSAGNVYVADSYNYTIRKVTVV 321
Query: 190 PEDCSKSSQSGSA 202
D S+ G A
Sbjct: 322 GTDYVVSTVGGRA 334
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 61/122 (50%), Gaps = 11/122 (9%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---- 130
+ ++ VV +AG + G DG ARF P AVD GN++VA+ + IRK+
Sbjct: 88 IGKDWVVTTIAGTNQAYGTKDGTNAEARFTGPTGLAVDASGNVFVAEGYANTIRKLSPIG 147
Query: 131 TNLGVTTIAG-GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
TN VTTIAG GS G ADG +A F + L ++D G IRQ L+
Sbjct: 148 TNWIVTTIAGLAGS--SGSADGTNSDARFYLPYGNMACDTNGNLFVTD-GYDTIRQ--LR 202
Query: 190 PE 191
P+
Sbjct: 203 PD 204
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 39/69 (56%), Gaps = 5/69 (7%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTTIAGGGSKKEGRAD 150
DG +A F P+S A D GN+YVAD + VIRK+ +G VTTIA G ++ G D
Sbjct: 50 DGTNTAAHFYSPQSVACDASGNVYVADYNGRVIRKVEVIGKDWVVTTIA-GTNQAYGTKD 108
Query: 151 GPAQNASFS 159
G A F+
Sbjct: 109 GTNAEARFT 117
>gi|153868931|ref|ZP_01998653.1| receptor protein kinase [Beggiatoa sp. PS]
gi|152074497|gb|EDN71345.1| receptor protein kinase [Beggiatoa sp. PS]
Length = 3115
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 60/112 (53%), Gaps = 4/112 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
+ +AG+G +GYS DG+P +A + + P VD GN+Y+ADK+NH IRK+ G+ T
Sbjct: 2335 TITTVAGNGNKGYSGDGDPATAAQINTPTGLEVDSTGNLYIADKNNHRIRKVDTEGIITT 2394
Query: 139 AGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G K DG A A S ++ + L I+D GN IR+I K
Sbjct: 2395 FTGTGKPGTATDGIIASVAQISQPTDVA-LDQYGNLYIADKGNDTIRKIGEK 2445
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 59/116 (50%), Gaps = 4/116 (3%)
Query: 77 SEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
S S++ +AG+G GY +G AR P AVD + N+Y+AD NH IRK+ G
Sbjct: 2275 SANSIITTVAGNGRSGYQGDNGPATGARLSNPTGLAVDSQNNLYIADTDNHRIRKVDLTG 2334
Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+TT+AG G+K PA A + L V L I+D N IR+++ +
Sbjct: 2335 TITTVAGNGNKGYSGDGDPATAAQINTPTGLE-VDSTGNLYIADKNNHRIRKVDTE 2389
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 57/105 (54%), Gaps = 4/105 (3%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG G +G + DG A+ P+ A+D +GN+Y+AD NH IRK+ + G +TT+AG
Sbjct: 2163 VAGTGAKGSTGDDGPAIEAKLKNPQGTAIDHEGNLYIADTLNHRIRKVDSNGIITTVAGI 2222
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +G A A N + F + L I+D GN IR++
Sbjct: 2223 GKAGNTGDNGLATAAKLRNPTAIVF-DNNGHLYIADSGNHRIRKV 2266
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 63/132 (47%), Gaps = 19/132 (14%)
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
SD+++ + SA +TL EE+ G A + PKS + D GN
Sbjct: 2098 SDIVIQPNDNSAIWTLV----EEAAAIEQTG-------------ANLNSPKSMSFDASGN 2140
Query: 117 IYVADKSNH-VIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
Y+AD NH ++++ T +T +AG G+K DGPA A N + T + H L I
Sbjct: 2141 AYIADSLNHRILKRDTQGNLTVVAGTGAKGSTGDDGPAIEAKLKNP-QGTAIDHEGNLYI 2199
Query: 176 SDHGNQLIRQIN 187
+D N IR+++
Sbjct: 2200 ADTLNHRIRKVD 2211
>gi|427400920|ref|ZP_18892158.1| hypothetical protein HMPREF9710_01754 [Massilia timonae CCUG 45783]
gi|425720099|gb|EKU83025.1| hypothetical protein HMPREF9710_01754 [Massilia timonae CCUG 45783]
Length = 717
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 74/151 (49%), Gaps = 17/151 (11%)
Query: 49 SVIDRPGSSDLI-----VLDSSRSAFYTLSFP-------LSEESVVKRLAGDGVQGYSDG 96
VID PGS+ V +R A Y + + V LAG G +G++DG
Sbjct: 63 GVIDGPGSASRFSDPYGVAIGARGAVYVADGGEANRIRLIQPDGAVSTLAG-GKEGFADG 121
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQN 155
+A F P + A+D +GN+YVAD NH IRK+ G VTT+AG GS G DG +
Sbjct: 122 IGAAAAFHTPSALALDHEGNLYVADTGNHAIRKVAPDGTVTTVAGSGSP--GYLDGIGRA 179
Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A F+ + V + ++D N IR+I
Sbjct: 180 AQFNGPVGIA-VDDAGIVYVADTYNDRIRRI 209
Score = 60.1 bits (144), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 49/137 (35%), Positives = 67/137 (48%), Gaps = 10/137 (7%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+L V D+ A ++ + V +AG G GY DG +A+F+ P AVD G
Sbjct: 139 EGNLYVADTGNHAIRKVA----PDGTVTTVAGSGSPGYLDGIGRAAQFNGPVGIAVDDAG 194
Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
+YVAD N IR+I G VTT+AG G K G DG +A F L L
Sbjct: 195 IVYVADTYNDRIRRIAPDGMVTTLAGNG--KPGLLDGALLDAGFDTPSALA-AGRDGTLY 251
Query: 175 ISDHGNQLIRQINLKPE 191
++D GN +R+I KP+
Sbjct: 252 VADTGNHAVRRI--KPD 266
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 58/108 (53%), Gaps = 8/108 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSN-HVIRKITNLG-VTTI 138
V LAGDG G DG PGSA RF P A+ +G +YVAD + IR I G V+T+
Sbjct: 53 VSLLAGDGRDGVIDG-PGSASRFSDPYGVAIGARGAVYVADGGEANRIRLIQPDGAVSTL 111
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AGG KEG ADG A+F L H L ++D GN IR++
Sbjct: 112 AGG---KEGFADGIGAAAAFHTPSALAL-DHEGNLYVADTGNHAIRKV 155
>gi|320105416|ref|YP_004181006.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319923937|gb|ADV81012.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 822
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 60/111 (54%), Gaps = 5/111 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
+ +AG GV G+S G+ G+A P AVD GN+YVAD NH IRKI+ +TT+
Sbjct: 149 IATIAGTGVAGFS-GDGGAAVLATLSHPTGLAVDTGGNLYVADTDNHRIRKISGTTITTV 207
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
AG G + GPA A++ + + V L I+D NQ IR ++ +
Sbjct: 208 AGSGEQGFAGDGGPA-TAAWLDSPDGVAVDATGNLYIADTHNQRIRVVSAE 257
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 59/110 (53%), Gaps = 4/110 (3%)
Query: 80 SVVKRLAGDGVQGYS-DGEPGSARF-DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VT 136
+ + +AG G QG++ DG P +A + D P AVD GN+Y+AD N IR ++ G ++
Sbjct: 202 TTITTVAGSGEQGFAGDGGPATAAWLDSPDGVAVDATGNLYIADTHNQRIRVVSAEGTIS 261
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
TIAG GS+ G A AS + L+ V + + +D N IR I
Sbjct: 262 TIAGNGSRAYAGDGGSAVAASLARPRGLS-VDALGNIYFADSDNNRIRLI 310
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 57/114 (50%), Gaps = 3/114 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S ++ +AG G QG++ G SA D P AVD GN+Y++D N IR+++
Sbjct: 88 VSAAGIITTVAGTGEQGFAGDGGSATSAWLDSPVGVAVDRVGNLYISDSHNQRIRRVSGG 147
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ TIAG G G A A+ S+ L V L ++D N IR+I+
Sbjct: 148 TIATIAGTGVAGFSGDGGAAVLATLSHPTGLA-VDTGGNLYVADTDNHRIRKIS 200
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 49/94 (52%)
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPA 153
S+ E +A P A D GN+Y AD +NHV+RK++ G+ T G ++ DG +
Sbjct: 52 SNVEATTAPLASPAGIAYDASGNMYFADLNNHVVRKVSAAGIITTVAGTGEQGFAGDGGS 111
Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+++ + V + L ISD NQ IR+++
Sbjct: 112 ATSAWLDSPVGVAVDRVGNLYISDSHNQRIRRVS 145
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 64/118 (54%), Gaps = 6/118 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+S E + +AG+G + Y+ G+ GSA +P+ +VD GNIY AD N+ IR I
Sbjct: 254 VSAEGTISTIAGNGSRAYA-GDGGSAVAASLARPRGLSVDALGNIYFADSDNNRIRLIAT 312
Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G +TT+AG GS+ +G G A +A+ T V + +SD N +IR++ L
Sbjct: 313 TGIITTVAGNGSQGDGGDGGSALDATLDTP-RATAVRALGIFDLSDTHNDIIREVGLN 369
>gi|383124185|ref|ZP_09944852.1| hypothetical protein BSIG_3780 [Bacteroides sp. 1_1_6]
gi|251839311|gb|EES67395.1| hypothetical protein BSIG_3780 [Bacteroides sp. 1_1_6]
Length = 454
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 57/98 (58%), Gaps = 1/98 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY +G + R +KP+ A D GN++V +++NH+IRKIT G ++ G ++ G DG
Sbjct: 358 GYINGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKITPAGSASLYAGQPEQSGFGDG 417
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ A F+ +T P ++ ++D N +IR++ ++
Sbjct: 418 LPEEAKFNQPECVTVYPD-NSVYVADRDNHVIRRVTVE 454
>gi|302848538|ref|XP_002955801.1| hypothetical protein VOLCADRAFT_119185 [Volvox carteri f.
nagariensis]
gi|300258994|gb|EFJ43226.1| hypothetical protein VOLCADRAFT_119185 [Volvox carteri f.
nagariensis]
Length = 1149
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 65/126 (51%), Gaps = 12/126 (9%)
Query: 73 SFPLSEESVVKRLAGDG---------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
S +S++ ++R+A DG G++DG GSARF + AVD GNI+VAD S
Sbjct: 241 SLVVSDQHCLRRVASDGFVTTIAGGTTPGHADGPSGSARFYNLRGVAVDGDGNIFVADSS 300
Query: 124 NHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
NH +R++T V+T+ G G DG +A F N L+++D GN
Sbjct: 301 NHCVRQLTAADSIVSTLV-GSPGNAGFRDGTGTDARFRNPCGAAVNLQDGTLVVADAGNN 359
Query: 182 LIRQIN 187
+R+++
Sbjct: 360 RLRKVD 365
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 63/113 (55%), Gaps = 9/113 (7%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKITNLG- 134
S E+VV LAG G G+ DG A+F P A+D + G++ V+D+ H +R++ + G
Sbjct: 203 SGEAVV--LAGGGGYGHKDGPGRKAKFSCPMYLALDHRDGSLVVSDQ--HCLRRVASDGF 258
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
VTTIAGG + G ADGP+ +A F N V + ++D N +RQ+
Sbjct: 259 VTTIAGGTTP--GHADGPSGSARFYN-LRGVAVDGDGNIFVADSSNHCVRQLT 308
Score = 44.3 bits (103), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 49/92 (53%), Gaps = 6/92 (6%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
G ++ V DSS L+ + +S+V L G G G+ DG ARF P AV++
Sbjct: 290 GDGNIFVADSSNHCVRQLT---AADSIVSTLVGSPGNAGFRDGTGTDARFRNPCGAAVNL 346
Query: 114 K-GNIYVADKSNHVIRKI-TNLGVTTIAGGGS 143
+ G + VAD N+ +RK+ + TTIAG G+
Sbjct: 347 QDGTLVVADAGNNRLRKVDRDRLTTTIAGNGT 378
>gi|223934991|ref|ZP_03626910.1| NHL repeat containing protein [bacterium Ellin514]
gi|223896444|gb|EEF62886.1| NHL repeat containing protein [bacterium Ellin514]
Length = 1064
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 56/108 (51%), Gaps = 13/108 (12%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
LAG +G ++G S +F P++ A+D+ N++VAD NHVIRKI+ G+ T G
Sbjct: 52 LAGYAGRGSANGISSSVQFKAPQAIAIDISNNVFVADTENHVIRKISCTGIITTLAGSLG 111
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
G DG NA F F P A +L++D GN +R+I
Sbjct: 112 THGSRDGSGTNALF-------FRPAGIAVDASGNVLVADTGNNTVRKI 152
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 59/115 (51%), Gaps = 7/115 (6%)
Query: 75 PLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--- 130
P +V +AG G G DG SARF +P+S +VD G +YVAD NH IRKI
Sbjct: 593 PSGTNWIVNTVAGLAGFWGSIDGTNTSARFFQPRSLSVDASGALYVADSGNHAIRKITPS 652
Query: 131 -TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
TN VTT+A G + G DG NA FS+ ++ + ++D N IR
Sbjct: 653 GTNWVVTTVA-GLAGAAGSVDGTGINAEFSHPAGISLT-SAGIVYVADSDNNTIR 705
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 63/132 (47%), Gaps = 6/132 (4%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
S +L V+D+ ++ VV AG G G ++G +A F P+ +D+
Sbjct: 246 SGNLYVVDTGNGTIR----KITSSGVVTTFAGSAGNYGATNGIGANALFYAPQGITIDLF 301
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
G +YVAD NH IRKIT+ G T G + G AD +ASF N +T L
Sbjct: 302 GCVYVADTGNHTIRKITSDGTVTTLAGLAGNYGSADSVNSSASFWNPQGIT-SDATGNLY 360
Query: 175 ISDHGNQLIRQI 186
I+D GN IR I
Sbjct: 361 IADTGNNTIRTI 372
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 52/106 (49%), Gaps = 2/106 (1%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
V +AG GV G D +A F P+ VD GN+YV D N IRKIT+ GV T
Sbjct: 213 VNTMAGSAGVYGNLDNSGANALFSGPQGLTVDSSGNLYVVDTGNGTIRKITSSGVVTTFA 272
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + G +G NA F +T C + ++D GN IR+I
Sbjct: 273 GSAGNYGATNGIGANALFYAPQGITIDLFGC-VYVADTGNHTIRKI 317
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 78/159 (49%), Gaps = 11/159 (6%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDM 113
G+ ++ V D+ F L ++ V LAG+ G SDG +ARF P AVD
Sbjct: 519 GTGNVFVADT----FNNLIRKITPGGAVTTLAGNFENFGSSDGTNSNARFYWPSGVAVDN 574
Query: 114 KGNIYVADKSNHVIRKI----TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
GN++VAD NH IR++ TN V T+A G + G DG +A F L+ V
Sbjct: 575 AGNVFVADYMNHTIRELIPSGTNWIVNTVA-GLAGFWGSIDGTNTSARFFQPRSLS-VDA 632
Query: 170 ICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSV 208
AL ++D GN IR+I + ++ +G A A SV
Sbjct: 633 SGALYVADSGNHAIRKITPSGTNWVVTTVAGLAGAAGSV 671
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 54/112 (48%), Gaps = 2/112 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S ++ LAG G G DG +A F +P AVD GN+ VAD N+ +RKIT G
Sbjct: 97 ISCTGIITTLAGSLGTHGSRDGSGTNALFFRPAGIAVDASGNVLVADTGNNTVRKITATG 156
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T G + G D NA F + + + + ++D GN IR+I
Sbjct: 157 DVTTFAGSAGNYGSTDNLGTNALFYRPTGIA-IDNFNNIFVADTGNNTIRKI 207
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 2/112 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ + AG GV G ++G A+F P+ AVD GN++VAD N++IRKIT G
Sbjct: 481 ITPDGAATTFAGSAGVSGTANGTNTDAQFYAPQGLAVDGTGNVFVADTFNNLIRKITPGG 540
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T G + G +DG NA F + V + + ++D+ N IR++
Sbjct: 541 AVTTLAGNFENFGSSDGTNSNARFYWPSGVA-VDNAGNVFVADYMNHTIREL 591
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 56/118 (47%), Gaps = 14/118 (11%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS-------ARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+S +V LAG S G PGS A F P+ VD GNIYVAD NH+IR
Sbjct: 426 ISPSGLVCTLAG------SIGHPGSVNNIGTNALFSGPQGITVDGVGNIYVADTLNHIIR 479
Query: 129 KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+IT G T G + G A+G +A F L V + ++D N LIR+I
Sbjct: 480 RITPDGAATTFAGSAGVSGTANGTNTDAQFYAPQGLA-VDGTGNVFVADTFNNLIRKI 536
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/157 (29%), Positives = 73/157 (46%), Gaps = 9/157 (5%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP I + +L + D+ + T++ V AG G +DG ARF
Sbjct: 347 NPQG-ITSDATGNLYIADTGNNTIRTIT----PGGSVTTFAGLPSIGSADGLSSDARFRF 401
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P++ AVD N+YVAD +N IRKI+ G+ G G + NA FS +T
Sbjct: 402 PQAVAVDAATNVYVADTANQTIRKISPSGLVCTLAGSIGHPGSVNNIGTNALFSGPQGIT 461
Query: 166 FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
V + + ++D N +IR+I D + ++ +GSA
Sbjct: 462 -VDGVGNIYVADTLNHIIRRIT---PDGAATTFAGSA 494
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 56/113 (49%), Gaps = 5/113 (4%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ + V LAG G G +D SA F P+ D GN+Y+AD N+ IR IT G
Sbjct: 317 ITSDGTVTTLAGLAGNYGSADSVNSSASFWNPQGITSDATGNLYIADTGNNTIRTITPGG 376
Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTT A G G ADG + +A F + V + ++D NQ IR+I
Sbjct: 377 SVTTFA--GLPSIGSADGLSSDARFRFP-QAVAVDAATNVYVADTANQTIRKI 426
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 49/106 (46%), Gaps = 2/106 (1%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
V AG G G +D +A F +P A+D NI+VAD N+ IRKIT G
Sbjct: 158 VTTFAGSAGNYGSTDNLGTNALFYRPTGIAIDNFNNIFVADTGNNTIRKITPSGNVNTMA 217
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + G D NA FS LT V L + D GN IR+I
Sbjct: 218 GSAGVYGNLDNSGANALFSGPQGLT-VDSSGNLYVVDTGNGTIRKI 262
>gi|114568006|ref|YP_755160.1| leucine-rich repeat-containing protein [Syntrophomonas wolfei
subsp. wolfei str. Goettingen]
gi|114338941|gb|ABI69789.1| Leucine-rich repeat (LRR) protein-like protein [Syntrophomonas
wolfei subsp. wolfei str. Goettingen]
Length = 1351
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 62/110 (56%), Gaps = 4/110 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
++ +AG+G GYS DG P SA+ P A D GN+Y+ DK N+ IRK+ N G ++T
Sbjct: 506 IISTVAGNGTAGYSGDGGPATSAQLKDPVGVACDKNGNLYIVDKDNNRIRKVDNTGKIST 565
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+AG G+ PA +A + +TF + + I+D N+ IR+++
Sbjct: 566 VAGNGTGGYAGDGSPATSAQIWVPYGVTF-DNNGNMYIADMNNKRIRKVD 614
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ +AG+G GY+ DG P SA+ P D GN+Y+AD +N IRK+ G +TT+
Sbjct: 563 ISTVAGNGTGGYAGDGSPATSAQIWVPYGVTFDNNGNMYIADMNNKRIRKVDPSGIITTV 622
Query: 139 AGGGSKKEGRADGPAQNASFSN 160
AG GS K GPA+ A N
Sbjct: 623 AGNGSWKYSGDGGPAEAAGLCN 644
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 58/109 (53%), Gaps = 7/109 (6%)
Query: 85 LAGDGVQGYS--DGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+AG QG++ +G+ G SAR + P A D GN+Y+AD+ NH IRK+ G ++T+
Sbjct: 451 VAGIVTQGWATYEGDGGLATSARLNYPFGVACDGSGNLYIADRGNHRIRKVDTSGIISTV 510
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
AG G+ GPA +A + + + L I D N IR+++
Sbjct: 511 AGNGTAGYSGDGGPATSAQLKDPVGVACDKN-GNLYIVDKDNNRIRKVD 558
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 43/81 (53%), Gaps = 3/81 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
++ +AG+G GYS DG P SA+ + P+ D N+Y+ D N IRK+ G ++T
Sbjct: 335 IISTVAGNGTAGYSGDGGPATSAQLNTPRGVVCDGADNLYIVDGDNQRIRKVDTSGKIST 394
Query: 138 IAGGGSKKEGRADGPAQNASF 158
+AG G+ GPA A
Sbjct: 395 VAGNGTHWFEGDGGPATAAGL 415
Score = 40.4 bits (93), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 60/114 (52%), Gaps = 11/114 (9%)
Query: 82 VKRLAGDGVQGYS-DGEPGSAR--FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ +AG+G + DG P +A +D P A D GN+Y+A+ +++ IRK+ + G ++
Sbjct: 392 ISTVAGNGTHWFEGDGGPATAAGLYD-PIGVACDSSGNLYIAESNSNCIRKVDSSGKISV 450
Query: 138 IAG----GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+AG G + EG G A +A + F + L I+D GN IR+++
Sbjct: 451 VAGIVTQGWATYEGDG-GLATSARLNYPFGVA-CDGSGNLYIADRGNHRIRKVD 502
>gi|182416423|ref|YP_001821489.1| NHL repeat-containing protein [Opitutus terrae PB90-1]
gi|177843637|gb|ACB77889.1| NHL repeat containing protein [Opitutus terrae PB90-1]
Length = 1026
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 63/112 (56%), Gaps = 1/112 (0%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+S + V LAG G G +DG+ SARF AVD GN+YVAD NH IRKIT GV
Sbjct: 187 VSPDGNVTILAGAGKAGSADGDSASARFSGSDGLAVDGTGNVYVADLFNHTIRKITPDGV 246
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
T G +++ G ADG A F EL+ + +L++D GN IR+++
Sbjct: 247 VTTLAGVARESGFADGAGAAARFYYPRELS-IDAYGNILVADEGNCAIRKVS 297
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 68/137 (49%), Gaps = 12/137 (8%)
Query: 58 DLIVLDSSRSAFYTLSF------PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFA 110
D + +D+ R Y + F ++ E VV LAG G G G G ARFD P + A
Sbjct: 109 DGLAIDA-RGNLYAVDFTDHTVRKITPEGVVTTLAGSAGDHGTQVGHGGEARFDSPMAVA 167
Query: 111 VDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
VD N+YV + IRK++ G VT +AG G K G ADG + +A FS L V
Sbjct: 168 VDRWDNLYVGQMGDGAIRKVSPDGNVTILAGAG--KAGSADGDSASARFSGSDGLA-VDG 224
Query: 170 ICALLISDHGNQLIRQI 186
+ ++D N IR+I
Sbjct: 225 TGNVYVADLFNHTIRKI 241
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 61/119 (51%), Gaps = 10/119 (8%)
Query: 78 EESVVKRLAG---DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
E V R AG D G +DG SARF P+S AVD G +YVAD ++ VIRKIT G
Sbjct: 22 ESYVFTRYAGNPNDTPYGSTDGFGDSARFAAPRSIAVDASGTLYVADAASSVIRKITAEG 81
Query: 135 VTTIAGGGSKKEGRADGPAQNASFS--NDFELTFVPHICALLISDHGNQLIRQINLKPE 191
+ T G + + G ADG A F + + ++ A+ +DH +R+I PE
Sbjct: 82 MVTTFVGTAGQRGSADGIGAAARFQGIDGLAIDARGNLYAVDFTDH---TVRKIT--PE 135
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 57/115 (49%), Gaps = 10/115 (8%)
Query: 77 SEESVVKRLAGDGV------QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
S S V+++ DGV Q Y DGEP R P A G++Y+AD N IR+I
Sbjct: 563 SWSSAVRKITPDGVVTTVVRQPY-DGEP--ERLYLPFGIAAGHDGSLYIADTGNSTIRQI 619
Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
G GGG ++EG+ DG A F N + + V L ++D GN L+R+
Sbjct: 620 RPDGSMVTIGGGMRQEGKQDGRGGEARFLNPYGVA-VDAAGHLYVADSGNNLVRK 673
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 55/117 (47%), Gaps = 12/117 (10%)
Query: 81 VVKRLAGDGVQGYSDGEPG-----------SARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
V+ ++A DG + G PG ARF+ P AV G IYVAD NH IR+
Sbjct: 456 VIAKIAPDGAVTFFAGSPGLSGSTDGNARTEARFNGPTGIAVGPSGTIYVADFDNHTIRQ 515
Query: 130 ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
I+ G+ + G + + G ADG A F +T V + ++D + +R+I
Sbjct: 516 ISPAGMVSTLAGAAGQPGTADGTGSAARFYAPAAVT-VDRAGMIYVADSWSSAVRKI 571
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 66/135 (48%), Gaps = 8/135 (5%)
Query: 60 IVLDSSRSAFYTLSF-----PLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDM 113
I +D +R+ F S+ ++ VV +AG GV G ++G +ARF P + A+D
Sbjct: 384 IAIDQNRNVFVADSYNNVIRKITPGGVVTTVAGLGGVFGSAEGSGAAARFGVPAAVAIDA 443
Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQN-ASFSNDFELTFVPHICA 172
N+YVA++ HVI KI G T G G DG A+ A F+ + P
Sbjct: 444 AANLYVANRQTHVIAKIAPDGAVTFFAGSPGLSGSTDGNARTEARFNGPTGIAVGPS-GT 502
Query: 173 LLISDHGNQLIRQIN 187
+ ++D N IRQI+
Sbjct: 503 IYVADFDNHTIRQIS 517
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 56/121 (46%), Gaps = 5/121 (4%)
Query: 38 IDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE 97
+D + + + R G D+ V DS S ++ + + R G GY++G
Sbjct: 318 VDAARFSLPRGVAVSRTG--DIYVADSGNSTVRRIAVGGAVTTFAGRPGG---PGYANGS 372
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNAS 157
+A+F P A+D N++VAD N+VIRKIT GV T G G A+G A
Sbjct: 373 SETAQFYFPTGIAIDQNRNVFVADSYNNVIRKITPGGVVTTVAGLGGVFGSAEGSGAAAR 432
Query: 158 F 158
F
Sbjct: 433 F 433
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 67/142 (47%), Gaps = 7/142 (4%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
P S+ +D G+ L V D++ S ++ +E V + G +G +DG +ARF
Sbjct: 53 PRSIAVDASGT--LYVADAASSVIRKIT---AEGMVTTFVGTAGQRGSADGIGAAARFQG 107
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
A+D +GN+Y D ++H +RKIT GV T G + G G A F + +
Sbjct: 108 IDGLAIDARGNLYAVDFTDHTVRKITPEGVVTTLAGSAGDHGTQVGHGGEARFDSPMAVA 167
Query: 166 FVPHICALLISDHGNQLIRQIN 187
V L + G+ IR+++
Sbjct: 168 -VDRWDNLYVGQMGDGAIRKVS 188
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 53/118 (44%), Gaps = 21/118 (17%)
Query: 76 LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S +V LAG Q G +DG +ARF P + VD G IYVAD + +RKIT G
Sbjct: 516 ISPAGMVSTLAGAAGQPGTADGTGSAARFYAPAAVTVDRAGMIYVADSWSSAVRKITPDG 575
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP------HICALLISDHGNQLIRQI 186
V T + + E ++P H +L I+D GN IRQI
Sbjct: 576 VVTT--------------VVRQPYDGEPERLYLPFGIAAGHDGSLYIADTGNSTIRQI 619
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 56/118 (47%), Gaps = 15/118 (12%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S VV +AG G+ G DG +ARF P+ AV G+IYVAD N +R+I G
Sbjct: 296 VSPSGVVSTVAGKTGLSGSDDGVD-AARFSLPRGVAVSRTGDIYVADSGNSTVRRIAVGG 354
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL------LISDHGNQLIRQI 186
T G G A+G ++ A F + P A+ ++D N +IR+I
Sbjct: 355 AVTTFAGRPGGPGYANGSSETAQF-------YFPTGIAIDQNRNVFVADSYNNVIRKI 405
Score = 45.1 bits (105), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 43/85 (50%), Gaps = 2/85 (2%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
RF P+ ++D GNI VAD+ N IRK++ GV + G + G DG A FS
Sbjct: 268 RFYYPRELSIDAYGNILVADEGNCAIRKVSPSGVVSTVAGKTGLSGSDDG-VDAARFSLP 326
Query: 162 FELTFVPHICALLISDHGNQLIRQI 186
+ V + ++D GN +R+I
Sbjct: 327 RGVA-VSRTGDIYVADSGNSTVRRI 350
>gi|158319004|ref|YP_001511512.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158114409|gb|ABW16606.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 892
Score = 64.7 bits (156), Expect = 4e-08, Method: Composition-based stats.
Identities = 52/146 (35%), Positives = 77/146 (52%), Gaps = 11/146 (7%)
Query: 45 INPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSA 101
++P+SV ID G+ + L S R T + +S+ LAG G GYS +G +A
Sbjct: 576 LSPYSVAIDPQGTLFITSLSSDRIQKVTRTGEVSD------LAGTGADGYSGDNGPATAA 629
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSN 160
+ + P S D GNIY+ D N+ IRKIT G +TTIAG G+ GPA A ++
Sbjct: 630 KLNGPGSAVPDKNGNIYIPDAQNYRIRKITPDGIITTIAGTGTAGFSGDGGPATAAQINS 689
Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
++ P ++ I+D+ N IR+I
Sbjct: 690 AEKVAIGPD-GSIYIADYDNHRIRKI 714
Score = 60.8 bits (146), Expect = 6e-07, Method: Composition-based stats.
Identities = 40/120 (33%), Positives = 65/120 (54%), Gaps = 15/120 (12%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + ++ +AG G+QGYS DG P +A + D P + G +Y+A+ ++ I+KIT
Sbjct: 714 ITPDGIINTIAGTGLQGYSGDGGPATAAKLDGPNDVELGDDGTLYIANLGSNTIQKITKD 773
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQIN 187
G VTT+AG G K GPA A S VP + + I+D+GN +R+++
Sbjct: 774 GIVTTVAGNGQKGFSGDGGPATAAQLS-------VPSVSLGNGGEIYIADYGNNRVRKVD 826
Score = 54.3 bits (129), Expect = 6e-05, Method: Composition-based stats.
Identities = 36/94 (38%), Positives = 52/94 (55%), Gaps = 5/94 (5%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + ++ +AG G G+S DG P +A + + + A+ G+IY+AD NH IRKIT
Sbjct: 658 ITPDGIITTIAGTGTAGFSGDGGPATAAQINSAEKVAIGPDGSIYIADYDNHRIRKITPD 717
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFS--NDFEL 164
G + TIAG G + GPA A ND EL
Sbjct: 718 GIINTIAGTGLQGYSGDGGPATAAKLDGPNDVEL 751
Score = 48.5 bits (114), Expect = 0.003, Method: Composition-based stats.
Identities = 40/113 (35%), Positives = 63/113 (55%), Gaps = 3/113 (2%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NL 133
++++ +V +AG+G +G+S DG P +A S ++ G IY+AD N+ +RK+ N
Sbjct: 770 ITKDGIVTTVAGNGQKGFSGDGGPATAAQLSVPSVSLGNGGEIYIADYGNNRVRKVDPNG 829
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+TTIAG G++ G G A A F N+ AL I+D GN +R+I
Sbjct: 830 TITTIAGTGAEGSGGDGGQATAAQF-NEPSSVAEDADGALYIADSGNNRLRRI 881
>gi|108761192|ref|YP_633033.1| lipoprotein [Myxococcus xanthus DK 1622]
gi|108465072|gb|ABF90257.1| putative lipoprotein [Myxococcus xanthus DK 1622]
Length = 868
Score = 64.7 bits (156), Expect = 4e-08, Method: Composition-based stats.
Identities = 35/89 (39%), Positives = 50/89 (56%), Gaps = 2/89 (2%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
+ P A D GN+YVAD ++VIR+I G + GG+ G DGPA+ A+F+
Sbjct: 629 LNGPMGIAADAAGNVYVADTDHYVIRRIDPEGKVEVFAGGTP--GLMDGPAKQAAFNQPT 686
Query: 163 ELTFVPHICALLISDHGNQLIRQINLKPE 191
L P ALL++D N +IR+I+L E
Sbjct: 687 GLAVTPDGTALLVADMNNGVIRRIDLVAE 715
Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats.
Identities = 49/166 (29%), Positives = 76/166 (45%), Gaps = 17/166 (10%)
Query: 27 LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLA 86
L+ +G+ VTT + G L P V + L V++S S + + + +A
Sbjct: 712 LVAEGHPVTT-LQGDWL-YRPSGVAVSADGNTLFVVESGMSRVVRI-----RDGLTSVVA 764
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI------TNLGVTTIAG 140
G G+ DG P S++F AV G++ V+D N+ +R++ VTT+AG
Sbjct: 765 GT-TPGFRDGAPESSQFLPYLGIAVLKDGSLAVSDPGNYRVRRVVLNADGNARKVTTLAG 823
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + G +DGP A LT P L ++D GN L+R I
Sbjct: 824 SG--RYGHSDGPGDAADLVLPAGLTVGPD-GRLYVADAGNSLVRAI 866
Score = 40.0 bits (92), Expect = 1.1, Method: Composition-based stats.
Identities = 21/50 (42%), Positives = 26/50 (52%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG G G+SDG +A P V G +YVAD N ++R IT
Sbjct: 818 VTTLAGSGRYGHSDGPGDAADLVLPAGLTVGPDGRLYVADAGNSLVRAIT 867
>gi|290995422|ref|XP_002680294.1| predicted protein [Naegleria gruberi]
gi|284093914|gb|EFC47550.1| predicted protein [Naegleria gruberi]
Length = 1170
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 85/168 (50%), Gaps = 30/168 (17%)
Query: 48 HSVIDRPGSSDLIVLDS-----SRSAFYT-LSFPLSEESV---------VKRLAGD---- 88
+S++DR + +I LD+ + ++ YT +S SE S +K+L
Sbjct: 281 NSLVDRVAGTGIIELDTDGKLGNLTSLYTPISVTQSESSSDIYIGTTIGIKKLTDSNKLI 340
Query: 89 ----GVQGYS--DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAG 140
G Q YS DG P +A R P+ A+ G ++++DK +H IRKI + GV +T+AG
Sbjct: 341 SSFAGSQAYSSGDGYPATAARLSSPEGIAISTSGEVFISDKGSHTIRKIDSKGVISTVAG 400
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
GS G DGPA A + L F P+ LLI++ N IR+++
Sbjct: 401 TGS--AGYVDGPALKAKLNGPGFLAFTPN-GDLLITESSNNRIRKLSF 445
>gi|312200909|ref|YP_004020970.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311232245|gb|ADP85100.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 847
Score = 64.7 bits (156), Expect = 4e-08, Method: Composition-based stats.
Identities = 44/114 (38%), Positives = 64/114 (56%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S++ V +AG+G G+S DG P SA + P + VD GNIYV D +N+ IRKIT
Sbjct: 554 ISKDGTVTPIAGNGTAGFSGDGGPATSAELNGPGTAVVDKNGNIYVPDTANNRIRKITPD 613
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +TT+ G G+ GPA A N E V +L ++D+ N+ IR++
Sbjct: 614 GKITTVVGNGTAGFSGDGGPATQAEI-NSVEGIAVGPDGSLYLADYSNERIRKV 666
Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats.
Identities = 31/88 (35%), Positives = 50/88 (56%), Gaps = 3/88 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + + G+G G+S DG P + A + + AV G++Y+AD SN IRK+T
Sbjct: 610 ITPDGKITTVVGNGTAGFSGDGGPATQAEINSVEGIAVGPDGSLYLADYSNERIRKVTPD 669
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSN 160
G ++TIAG G+K PA +A S+
Sbjct: 670 GIISTIAGTGTKGYTSTPTPALSAQISD 697
Score = 45.1 bits (105), Expect = 0.034, Method: Composition-based stats.
Identities = 33/115 (28%), Positives = 61/115 (53%), Gaps = 5/115 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ ++ ++ AG+G G + G+ +A P A+ G +Y+++ + +RK+T
Sbjct: 722 IGKDGILSPFAGNGKTGRTGDGGDAANATLSIP-DLALGPDGTVYISNYGSDTVRKVTPD 780
Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
GV TTIAG G++ GPA A + + V A+ I+D+GN+ IR+++
Sbjct: 781 GVITTIAGTGAEGNTGDGGPATAAQLKSPSSVV-VDASGAVYIADNGNKEIRRVD 834
Score = 40.4 bits (93), Expect = 0.75, Method: Composition-based stats.
Identities = 21/65 (32%), Positives = 36/65 (55%), Gaps = 2/65 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + V+ +AG G +G + DG P +A + P S VD G +Y+AD N IR++
Sbjct: 777 VTPDGVITTIAGTGAEGNTGDGGPATAAQLKSPSSVVVDASGAVYIADNGNKEIRRVDPN 836
Query: 134 GVTTI 138
G ++
Sbjct: 837 GTISL 841
Score = 38.9 bits (89), Expect = 2.2, Method: Composition-based stats.
Identities = 31/114 (27%), Positives = 56/114 (49%), Gaps = 5/114 (4%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + ++ +AG G +GY S P SA+ P S + G IY+ + + ++KI
Sbjct: 666 VTPDGIISTIAGTGTKGYTSTPTPALSAQISDPNSVVIADDGTIYIGNLGSDSVQKIGKD 725
Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G+ + AG G G A NA+ S +L P + IS++G+ +R++
Sbjct: 726 GILSPFAGNGKTGRTGDGGDAANATLSIP-DLALGPD-GTVYISNYGSDTVRKV 777
>gi|124008176|ref|ZP_01692873.1| hypothetical protein M23134_05970 [Microscilla marina ATCC 23134]
gi|123986275|gb|EAY26097.1| hypothetical protein M23134_05970 [Microscilla marina ATCC 23134]
Length = 2548
Score = 64.7 bits (156), Expect = 4e-08, Method: Composition-based stats.
Identities = 46/119 (38%), Positives = 59/119 (49%), Gaps = 20/119 (16%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI- 138
+ V +AGDG G + G SARF +P AVD GN+YVADK+NH I+K+ +L T+
Sbjct: 715 ATVTTVAGDGTSGDAIGASTSARFREPSDVAVDFSGNLYVADKNNHKIKKV-DLNTNTVS 773
Query: 139 -----AGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
A G G DG A A F F P A L ++D N LIRQ+
Sbjct: 774 ILSGPAVGTVFPAGATDGTASIARF-------FFPTSIALDRSGNLFVADRHNNLIRQV 825
Score = 64.3 bits (155), Expect = 6e-08, Method: Composition-based stats.
Identities = 45/109 (41%), Positives = 57/109 (52%), Gaps = 17/109 (15%)
Query: 86 AGDGVQG---YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
AGD Q Y DG SARF+ P VDM G++YVAD N VIRKI+ VTTIAG
Sbjct: 836 AGDISQTNALYVDGAAASARFNHPTGITVDMVGDVYVADTRNQVIRKISEGQVTTIAGIA 895
Query: 143 SKKEGRADGPAQNASFS------NDFELTFVPHICALLISDHGNQLIRQ 185
+ + G +G AQ A F+ D E + + D NQL+R+
Sbjct: 896 NDR-GLTNGTAQAAKFNYPGSVYADLEQN-------IYVGDKVNQLVRR 936
Score = 60.1 bits (144), Expect = 1e-06, Method: Composition-based stats.
Identities = 50/136 (36%), Positives = 72/136 (52%), Gaps = 15/136 (11%)
Query: 79 ESVVKRLAGD-GVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---- 132
+S V +AG G G + G+ ARF+ P A+D G +YVADK NHVIR I N
Sbjct: 655 DSTVSIVAGSVGASGLVNDPVGTNARFNHPSGLAIDDAGILYVADKDNHVIRAIANPDGA 714
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFS--NDFELTFVPHICALLISDHGNQLIRQINLKP 190
VTT+AG G+ G A G + +A F +D + F + L ++D N I++++L
Sbjct: 715 ATVTTVAGDGT--SGDAIGASTSARFREPSDVAVDFSGN---LYVADKNNHKIKKVDLNT 769
Query: 191 EDCSKSSQSGSALGAV 206
+ S SG A+G V
Sbjct: 770 N--TVSILSGPAVGTV 783
Score = 54.7 bits (130), Expect = 4e-05, Method: Composition-based stats.
Identities = 38/99 (38%), Positives = 60/99 (60%), Gaps = 4/99 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT-TIAGGGSKKEG-R 148
G +DG ARF P S A+D GN++VAD+ N++IR++ T+ G T T AG S+
Sbjct: 787 GATDGTASIARFFFPTSIALDRSGNLFVADRHNNLIRQVATSSGATSTYAGDISQTNALY 846
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
DG A +A F++ +T V + + ++D NQ+IR+I+
Sbjct: 847 VDGAAASARFNHPTGIT-VDMVGDVYVADTRNQVIRKIS 884
>gi|223937279|ref|ZP_03629185.1| NHL repeat containing protein [bacterium Ellin514]
gi|223894064|gb|EEF60519.1| NHL repeat containing protein [bacterium Ellin514]
Length = 429
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 58/112 (51%), Gaps = 2/112 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV LAG G DG +A F +P+S AVD GN+YVAD NH IRKIT G
Sbjct: 133 ITPAGVVSTLAGSAGNINSFDGTGINANFYQPRSLAVDNGGNVYVADTWNHTIRKITPAG 192
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ + G + G ADG A F+ + V + L ++D N IR+I
Sbjct: 193 LVSTLAGLAGNPGSADGTNSKARFNRPSGIA-VDNATNLFVTDFHNHTIRKI 243
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 58/118 (49%), Gaps = 14/118 (11%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ +V LAG G G +DG ARF++P AVD N++V D NH IRKIT G
Sbjct: 188 ITPAGLVSTLAGLAGNPGSADGTNSKARFNRPSGIAVDNATNLFVTDFHNHTIRKITPGG 247
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH-ICA-----LLISDHGNQLIRQI 186
T G G ADG +N+ F P I A L ++D GNQ IR+I
Sbjct: 248 TVTTIAGLPGVWGNADG-------TNNVARFFQPQGIVADNAGNLFVADSGNQTIRKI 298
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 52/116 (44%), Gaps = 13/116 (11%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT 136
++ + AG G DG SARF P S A D GNIYVAD N IRKIT G
Sbjct: 25 AQSLTINTFAGHDAPGSKDGLSSSARFRHPNSIAADSAGNIYVADTENSTIRKITPNGSV 84
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
+ G + G ADG NA F + P A + ++D N IR+I
Sbjct: 85 STFAGFAGTFGSADGVGTNALF-------YAPQGIAVDSAGFIYVADTANATIRKI 133
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 60/126 (47%), Gaps = 19/126 (15%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---- 130
++ V +AG GV G +DG ARF +P+ D GN++VAD N IRKI
Sbjct: 243 ITPGGTVTTIAGLPGVWGNADGTNNVARFFQPQGIVADNAGNLFVADSGNQTIRKISPSG 302
Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIR 184
TN V+T+AG G A NA+ +N+ + P A + ++D GN IR
Sbjct: 303 TNWIVSTVAG--------LSGIAGNANGTNNTARFYFPADVAQDIAGYIYVADLGNNAIR 354
Query: 185 QINLKP 190
+ P
Sbjct: 355 TERIVP 360
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 79/194 (40%), Gaps = 32/194 (16%)
Query: 15 LSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD------------LIVL 62
LS V GLL V L IN + D PGS D I
Sbjct: 5 LSFKNLFVGGGLLAS-----MAVAHAQSLTINTFAGHDAPGSKDGLSSSARFRHPNSIAA 59
Query: 63 DSSRSAFYTLSFPLSEESVVKRLAGDG----------VQGYSDGEPGSARFDKPKSFAVD 112
DS+ + + + E S ++++ +G G +DG +A F P+ AVD
Sbjct: 60 DSAGNIYVADT----ENSTIRKITPNGSVSTFAGFAGTFGSADGVGTNALFYAPQGIAVD 115
Query: 113 MKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
G IYVAD +N IRKIT GV + G + DG NA+F L V +
Sbjct: 116 SAGFIYVADTANATIRKITPAGVVSTLAGSAGNINSFDGTGINANFYQPRSLA-VDNGGN 174
Query: 173 LLISDHGNQLIRQI 186
+ ++D N IR+I
Sbjct: 175 VYVADTWNHTIRKI 188
>gi|196230675|ref|ZP_03129536.1| Peptidase S53 propeptide [Chthoniobacter flavus Ellin428]
gi|196225016|gb|EDY19525.1| Peptidase S53 propeptide [Chthoniobacter flavus Ellin428]
Length = 1120
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 74/156 (47%), Gaps = 20/156 (12%)
Query: 34 VTTVIDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV- 90
VT + G Q +N ++ P S ++++DS PL V+ LAG +
Sbjct: 631 VTMAMSGDQFRCVMNAGALTSAP--SSVLIVDS----------PL----VISTLAGQVLT 674
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
QG +DG +A+F P AVD GNIYVAD +N IRKIT G T G G +
Sbjct: 675 QGTADGTGTAAQFGYPSGVAVDSSGNIYVADFNNDTIRKITPGGTVTTPYGQPGVIGATN 734
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G NA+F N + + ++D GN IR+I
Sbjct: 735 GTGTNATF-NTPNGVAIDSANNIYVADSGNSAIRKI 769
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 51/87 (58%), Gaps = 7/87 (8%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
++++ V DS SA ++ VV LAG G+ G +DG A F+ P+ AVD
Sbjct: 753 ANNIYVADSGNSAIRKIT----PGRVVSTLAGQPGLSGSADGTT-KALFNNPQGVAVDSA 807
Query: 115 GNIYVADKSNHVIRKITNLG-VTTIAG 140
GN+YVAD +N IRKIT G V+T+AG
Sbjct: 808 GNVYVADTTNETIRKITPAGVVSTLAG 834
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 47/95 (49%), Gaps = 10/95 (10%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF---- 158
F+ P S AVD GN+YVAD N V+RKIT GV T G + GR DG A F
Sbjct: 851 FNGPSSVAVDSAGNVYVADLYNFVVRKITPGGVVTTPYGQAGMPGRLDGIGTAALFNAPI 910
Query: 159 ------SNDFELTFVPHICALLISDHGNQLIRQIN 187
+N+ +T L + GN L+R++N
Sbjct: 911 GVAVDANNNLYITDSQIPPDLTSTSTGNNLVRRVN 945
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 49/98 (50%), Gaps = 2/98 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
GV G ++G +A F+ P A+D NIYVAD N IRKIT V + G G
Sbjct: 728 GVIGATNGTGTNATFNTPNGVAIDSANNIYVADSGNSAIRKITPGRVVSTLAGQPGLSGS 787
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
ADG + + N+ + V + ++D N+ IR+I
Sbjct: 788 ADGTTK--ALFNNPQGVAVDSAGNVYVADTTNETIRKI 823
>gi|302768705|ref|XP_002967772.1| hypothetical protein SELMODRAFT_409051 [Selaginella moellendorffii]
gi|302826179|ref|XP_002994615.1| hypothetical protein SELMODRAFT_432523 [Selaginella moellendorffii]
gi|300137312|gb|EFJ04320.1| hypothetical protein SELMODRAFT_432523 [Selaginella moellendorffii]
gi|300164510|gb|EFJ31119.1| hypothetical protein SELMODRAFT_409051 [Selaginella moellendorffii]
Length = 172
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 68/140 (48%), Gaps = 13/140 (9%)
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
VTTIAGG S+K G ADGP A FS++F L +LLI+D GN+LI +I + +D
Sbjct: 41 VTTIAGGSSRKPGFADGPGDTARFSSEFSLAC--SFGSLLIADRGNRLIHEIQI--DDPK 96
Query: 195 KSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEALPNQSG 254
S SA+ W L+ V+ L+ + + +II LD + + L N
Sbjct: 97 SRDSSDSAVSGPQKWALIPVV--LLGICLEMPLGAFIIWKI--LDQRCRQAGDQLGNTER 152
Query: 255 -----ETSTDLLLRHQKRNC 269
DL +R QKR C
Sbjct: 153 LLAGVPEDDDLQVRQQKRTC 172
>gi|380692993|ref|ZP_09857852.1| cell surface protein [Bacteroides faecis MAJ27]
Length = 457
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 61/114 (53%), Gaps = 2/114 (1%)
Query: 77 SEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+ E + AGD G G++DG+ A+F+ P+ +D + N+Y+AD NH IRK+T GV
Sbjct: 345 TREGNFRIYAGDKGRAGHADGQETEAQFNSPRQLVLDEEENLYIADSGNHCIRKVTPEGV 404
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ G G DG + A F+ + L + + I D N+ +R+++++
Sbjct: 405 VSTVIGNPNSSGYKDGTPEIALFTEPWGLA-IDSEGTIYIGDKDNRCVRKLSIE 457
>gi|153868997|ref|ZP_01998705.1| protein kinase-like protein [Beggiatoa sp. PS]
gi|152074439|gb|EDN71293.1| protein kinase-like protein [Beggiatoa sp. PS]
Length = 1432
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 59/120 (49%), Gaps = 4/120 (3%)
Query: 74 FPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
F L E + AG G +G+ +G A P+ A+D +GN+Y+AD +NH IRKI
Sbjct: 436 FKLDPEGNLTVFAGTGTKGFEGDNGPAIEANLSNPEGLAIDAQGNLYIADTNNHRIRKID 495
Query: 132 NLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
+ G +TT+ G G + A A + F H I+D GN IR+IN +P
Sbjct: 496 SDGIITTVVGTGEAGYAGDNEFAIAAQLKKPTAIVF-DHNGHFYIADSGNNSIRKINYQP 554
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 66/135 (48%), Gaps = 9/135 (6%)
Query: 61 VLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDM 113
+ DS ++ +++ PL+ S++ +AGDG GYS +G AR P S VD
Sbjct: 539 IADSGNNSIRKINYQPGTSPLNANSLITTIAGDGRSGYSGDNGPAIQARLGNPSSLVVDN 598
Query: 114 KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
+ N+Y+AD NH IRKI G +TT AG G K A A + L V
Sbjct: 599 ENNLYIADTDNHRIRKIDIRGNITTFAGSGYKGYSGDGNLAITARLNMPTGLA-VDGTGN 657
Query: 173 LLISDHGNQLIRQIN 187
+ I+D N IR+I+
Sbjct: 658 IFIADQNNHRIRKID 672
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 62/128 (48%), Gaps = 9/128 (7%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
+ AG G +GYS DG +AR + P AVD GNI++AD++NH IRKI G+
Sbjct: 621 ITTFAGSGYKGYSGDGNLAITARLNMPTGLAVDGTGNIFIADQNNHRIRKIDGEGIIRTF 680
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED------- 192
G + DG + + N + L +++ GN IR+I K +D
Sbjct: 681 TGTGVRGTATDGILASVAEINQPTDIALDQYGNLYLAEKGNHFIRKIGEKDQDGDEGAPH 740
Query: 193 CSKSSQSG 200
CS +++ G
Sbjct: 741 CSTNNEMG 748
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 45/89 (50%), Gaps = 2/89 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
+A PK D GN+Y+AD NH I K+ G +T AG G+K +GPA A+
Sbjct: 408 TAELKVPKDLIFDATGNLYIADILNHRIFKLDPEGNLTVFAGTGTKGFEGDNGPAIEANL 467
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
SN L + L I+D N IR+I+
Sbjct: 468 SNPEGLA-IDAQGNLYIADTNNHRIRKID 495
>gi|344924044|ref|ZP_08777505.1| NHL repeat containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 383
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 56/106 (52%), Gaps = 7/106 (6%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
LAG G GY +G +A+F P S AVD G +YV D +N+ IRKIT+ G T++ G
Sbjct: 201 LAGSGATGYVEGTGSAAQFSTPTSVAVDSSGTVYVIDANNYRIRKITSGGTTSLFAG--S 258
Query: 145 KEGRADGPAQNASFSNDFEL----TFVPHICALLISDHGNQLIRQI 186
+G A+G A F N F L V + + ++D N IR I
Sbjct: 259 TQGYAEGTGSAARF-NFFNLIPSGITVDNAGTVYVADTFNYRIRTI 303
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 3/97 (3%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
+GY +G +ARF+ P + +D G +YVAD N+ IRKIT+ G T++ GS G A+
Sbjct: 45 RGYQEGTGSAARFENPYAITIDSSGTLYVADNGNYRIRKITSSGTTSLL-AGSGTTGYAE 103
Query: 151 GPAQNASFSNDFELTFVPHICA-LLISDHGNQLIRQI 186
G +A F N F+ + +SD N +R+I
Sbjct: 104 GTGASAQF-NTFQWGIAADNSGNVYVSDTTNNRVRKI 139
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 60/110 (54%), Gaps = 10/110 (9%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
NP+++ ID G+ L V D+ Y + ++ LAG G GY++G SA+F+
Sbjct: 59 NPYAITIDSSGT--LYVADNGN---YRIR-KITSSGTTSLLAGSGTTGYAEGTGASAQFN 112
Query: 105 KPK-SFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK--KEGRADG 151
+ A D GN+YV+D +N+ +RKIT+ G T++ G + +EG G
Sbjct: 113 TFQWGIAADNSGNVYVSDTTNNRVRKITSGGTTSLLAGSTSGYQEGTGAG 162
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 48/102 (47%), Gaps = 3/102 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
LAG GY +G AR P+ AV+ G +YVA ++ IR IT+ G T++ GS
Sbjct: 148 LAGS-TSGYQEGTGAGARLSSPRGLAVNSAGTVYVATATSERIRAITSGGTTSLL-AGSG 205
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +G A FS + V + + D N IR+I
Sbjct: 206 ATGYVEGTGSAAQFSTPTSVA-VDSSGTVYVIDANNYRIRKI 246
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 5/56 (8%)
Query: 90 VQGYSDGEPGSARFD----KPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAG 140
QGY++G +ARF+ P VD G +YVAD N+ IR IT GVT T+AG
Sbjct: 259 TQGYAEGTGSAARFNFFNLIPSGITVDNAGTVYVADTFNYRIRTITPGGVTSTLAG 314
>gi|182414669|ref|YP_001819735.1| SMP-30/gluconolaconase/LRE domain-containing protein [Opitutus
terrae PB90-1]
gi|177841883|gb|ACB76135.1| SMP-30/Gluconolaconase/LRE domain protein [Opitutus terrae PB90-1]
Length = 1292
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 52/106 (49%), Gaps = 4/106 (3%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQ 154
DG ARF+ PK AVD G +YVAD SNH+IRK+T GV T G G +DG
Sbjct: 209 DGAGSDARFNGPKGIAVDANGTVYVADTSNHIIRKVTPSGVVTTLAGSPGISGNSDGAGD 268
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
A F+ ++ V + + D L R+I PE S SG
Sbjct: 269 AARFNAPTDIA-VDDAGTIYVVDQSGSL-RKIT--PEGVVTSLASG 310
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 55/111 (49%), Gaps = 2/111 (1%)
Query: 76 LSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S E VV +AG G DG +A F + K AVD GN++VA+ + HVIR+IT G
Sbjct: 558 ISPEGVVTTVAGTANATGSIDGVGAAATFGEIKGLAVDRAGNVFVAENTTHVIRRITPDG 617
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
GG + G ADG A F+ + L L I D GN IR+
Sbjct: 618 TVVTIGGLAGSIGTADGVGSAARFNEPWGLAL-DRFGHLYIVDSGNNTIRK 667
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 41/78 (52%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
G+ G DGE +ARF+ P AVD GN+YVAD NH IRKI G T G S EG
Sbjct: 464 GLDGSLDGEKTAARFNAPHGLAVDQHGNLYVADTFNHSIRKIDAAGQVTTPYGVSGVEGT 523
Query: 149 ADGPAQNASFSNDFELTF 166
DG A F + L F
Sbjct: 524 VDGIGNAARFGSPTALAF 541
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 47/101 (46%), Gaps = 13/101 (12%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY DGE ARF +P A D GN+YV D ++IRKIT + G G DG
Sbjct: 43 GYVDGEGTQARFHEPAGLACDADGNLYVVDPGTNLIRKITPAASVSTFAGTPTGWGLVDG 102
Query: 152 PAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
PA +A F +P A + I+D GN IR I
Sbjct: 103 PAASARFG-------LPQGVAVGADGTVYIADTGNAAIRII 136
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 46/95 (48%), Gaps = 3/95 (3%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G DG SARF P+ AV G +Y+AD N IR IT G I GG + G DG
Sbjct: 98 GLVDGPAASARFGLPQGVAVGADGTVYIADTGNAAIRIITPDGSVRILAGG--RSGSQDG 155
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A+F N E V + ++D GN +R+I
Sbjct: 156 YGTGATF-NLPEAVAVNAAGVVYVADSGNNTVRRI 189
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 51/98 (52%), Gaps = 3/98 (3%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
GV+G DG +ARF P + A D G+++VAD H +R+I+ GV T G + G
Sbjct: 519 GVEGTVDGIGNAARFGSPTALAFDRDGSLFVAD--GHRVRRISPEGVVTTVAGTANATGS 576
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
DG A+F + + V + ++++ +IR+I
Sbjct: 577 IDGVGAAATF-GEIKGLAVDRAGNVFVAENTTHVIRRI 613
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 44/76 (57%), Gaps = 3/76 (3%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ VV LAG G+ G SDG +ARF+ P AVD G IYV D+S +RKIT G
Sbjct: 244 VTPSGVVTTLAGSPGISGNSDGAGDAARFNAPTDIAVDDAGTIYVVDQSGS-LRKITPEG 302
Query: 135 -VTTIAGGGSKKEGRA 149
VT++A G S G A
Sbjct: 303 VVTSLASGFSYPRGVA 318
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 47/106 (44%), Gaps = 2/106 (1%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQ 154
DG+ +A F P AVD GN++VAD IRKI G + G +G DG
Sbjct: 416 DGQGSNAGFGNPTGIAVDAAGNLFVADF-KATIRKIDATGYVSTVAGAHGLDGSLDGEKT 474
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
A F+ L H L ++D N IR+I+ + + SG
Sbjct: 475 AARFNAPHGLAVDQH-GNLYVADTFNHSIRKIDAAGQVTTPYGVSG 519
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 46/91 (50%), Gaps = 15/91 (16%)
Query: 81 VVKRLAGDGVQ----------GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V++R+ DG G +DG +ARF++P A+D G++Y+ D N+ IRK
Sbjct: 609 VIRRITPDGTVVTIGGLAGSIGTADGVGSAARFNEPWGLALDRFGHLYIVDSGNNTIRKG 668
Query: 131 TNLGVTTIAGGGSKKE--GRADGPAQNASFS 159
+G T A K+ +A G + ASF+
Sbjct: 669 VIVGATAPA---ITKDPVAKAAGAGRQASFT 696
Score = 40.4 bits (93), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 48/96 (50%), Gaps = 4/96 (4%)
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKK-EGRADG 151
+DG +A+F P+ V G ++VAD + +VIRK+T L VTT AG + DG
Sbjct: 358 ADGVGLNAQFTSPRGLTVAADGTVFVADSNAYVIRKVTPGLLVTTWAGSLVRPIYQTVDG 417
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
NA F N + V L ++D IR+I+
Sbjct: 418 QGSNAGFGNPTGIA-VDAAGNLFVADF-KATIRKID 451
>gi|365864816|ref|ZP_09404493.1| NHL repeat-containing protein [Streptomyces sp. W007]
gi|364005745|gb|EHM26808.1| NHL repeat-containing protein [Streptomyces sp. W007]
Length = 292
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 72/136 (52%), Gaps = 6/136 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + +AG G GYS G SAR + P VD N+Y++D +NH +RK+T
Sbjct: 20 VTADGTISTVAGTGTAGYSGDSGAATSARLNLPLGVVVDSADNLYISDYNNHRVRKVTPD 79
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
G +TTIAG GS G GP+ A SN F L V + L I+DH N IR+I E
Sbjct: 80 GEITTIAGTGSAGFGGDGGPSPVAQLSNPFGLA-VDCVDTLYIADHLNNRIRKI--ASER 136
Query: 193 CSKSSQSGSALGAVSV 208
+ QSG+ + +V
Sbjct: 137 MAGLPQSGTVVSWANV 152
>gi|405375498|ref|ZP_11029528.1| putative hemolysin [Chondromyces apiculatus DSM 436]
gi|397086225|gb|EJJ17355.1| putative hemolysin [Myxococcus sp. (contaminant ex DSM 436)]
Length = 855
Score = 63.2 bits (152), Expect = 1e-07, Method: Composition-based stats.
Identities = 42/113 (37%), Positives = 60/113 (53%), Gaps = 9/113 (7%)
Query: 80 SVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
+ VKR+A G V + G G P A D GN+YV+D ++VIR+I G +
Sbjct: 598 NAVKRVAAGGEVTTLASGLNG------PMGIATDAAGNVYVSDTDHYVIRRIDPEGKVEL 651
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
G + G DGPA+ A+F+ LT P ALL++D N +IR+I+L E
Sbjct: 652 FAGSTP--GLQDGPAKQAAFNQPAGLTVTPDGSALLVADLNNGVIRRIDLVAE 702
Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats.
Identities = 52/166 (31%), Positives = 77/166 (46%), Gaps = 17/166 (10%)
Query: 27 LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLA 86
L+ +G+ VTT + G L P V L V++S S + + V +A
Sbjct: 699 LVAEGHPVTT-LQGDWL-YRPSGVTVSEDGGTLYVVESGMSRVVRI-----RDGVTSVVA 751
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG------VTTIAG 140
G G+ DG P S++F AV G++ V+D N+ +R+I G VTT+AG
Sbjct: 752 GT-TPGFRDGAPESSQFLPYLGIAVLKDGSLAVSDPGNYRVRRIHLDGNGQARKVTTLAG 810
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G+ G ADGP ++A L P L ++D GN L+R I
Sbjct: 811 SGTY--GHADGPGEDAQLVLPAGLVVGPD-GRLYVADSGNALVRTI 853
Score = 39.7 bits (91), Expect = 1.4, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 26/50 (52%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V LAG G G++DG A+ P V G +YVAD N ++R IT
Sbjct: 805 VTTLAGSGTYGHADGPGEDAQLVLPAGLVVGPDGRLYVADSGNALVRTIT 854
>gi|291223251|ref|XP_002731624.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 434
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 83/180 (46%), Gaps = 17/180 (9%)
Query: 33 TVTTVIDGHQLEINPHSVIDRPG-------SSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
T+ + G+Q + ++ P + L V DS+ S ++ E V
Sbjct: 107 TLAGSLQGYQDGVGEKAMFHHPAGMSMHRKTRTLFVADSANSRIRAINVVTGE---VTTF 163
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVD--MKGNIYVADKSNHVIRKIT--NLGVTTIAGG 141
AG G + DG A F P++ AVD K +VAD NH IR+++ + VTTIAGG
Sbjct: 164 AGSGKEELKDGLKTIASFFNPQAVAVDHVYKDRFFVADTDNHAIREVSLPDGEVTTIAGG 223
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
++G DG A+F + +T P L I+DH N IR I ++ + + + SG
Sbjct: 224 ---EKGFKDGKGTGATFYHPAGVTIDPIRNILFIADHYNHAIRMIGVESKIVTTLAGSGK 280
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 57/112 (50%), Gaps = 5/112 (4%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG-V 135
E +V LAG G G+ +G A F+ P+ A D + + YV + N+ +R + + G V
Sbjct: 268 ESKIVTTLAGSGKPGFVNGMGNQAMFNYPEGMAYDTENKVLYVVEFDNNCVRIVDDEGEV 327
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ GG +EG++DG + A F + LTF + I+D N IR I+
Sbjct: 328 RSFVGG---REGKSDGLGEEAKFFHPTGLTFDEKEKIIYITDQYNHQIRGIS 376
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/127 (29%), Positives = 52/127 (40%), Gaps = 17/127 (13%)
Query: 60 IVLDSSRSAFYTLSFP------LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
+ D+ Y + F + +E V+ G G +G SDG A+F P D
Sbjct: 299 MAYDTENKVLYVVEFDNNCVRIVDDEGEVRSFVG-GREGKSDGLGEEAKFFHPTGLTFDE 357
Query: 114 KGNI-YVADKSNHVIRKITNLGVTTIAGGG--------SKKEGRADGPAQNASFSNDFEL 164
K I Y+ D+ NH IR I+ +G T G S KEG + +F F +
Sbjct: 358 KEKIIYITDQYNHQIRGISGIGAKTSPPGKITIEKIMTSVKEGYTGTAMMSLTFILFFAM 417
Query: 165 TFVPHIC 171
FV IC
Sbjct: 418 VFVC-IC 423
>gi|298386946|ref|ZP_06996500.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
gi|298260096|gb|EFI02966.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
Length = 457
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 60/114 (52%), Gaps = 2/114 (1%)
Query: 77 SEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+ E + AGD G G++DG+ A+F P+ +D + N+Y+AD NH IRK+T GV
Sbjct: 345 TREGNFRIYAGDKGRAGHADGQETEAQFSSPRQLVLDEEENLYIADSGNHCIRKVTPEGV 404
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ G G DG + A F+ + L + + I D N+ +R+++++
Sbjct: 405 VSTVIGNPNTSGYKDGTPEIALFTEPWGLA-IDSEGTIYIGDKDNRCVRKLSIE 457
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ E VV + G+ GY DG P A F +P A+D +G IY+ DK N +RK++
Sbjct: 399 VTPEGVVSTVIGNPNTSGYKDGTPEIALFTEPWGLAIDSEGTIYIGDKDNRCVRKLS 455
>gi|320106842|ref|YP_004182432.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319925363|gb|ADV82438.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 382
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 63/116 (54%), Gaps = 5/116 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLG 134
L+ + V LAG+G GY+DG+ SARF+KP + G Y+AD N+ IR++ + G
Sbjct: 261 LNGANQVTTLAGNGTAGYADGQGNSARFNKPTQVVYNSGDGAYYIADTFNNCIRRMDSAG 320
Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
VTT AG G + G DG + +A F + L ++D GN IR++++
Sbjct: 321 NVTTYAGIGG-QSGLVDGASTSAKFDKPTGIAIANGY--LYVADSGNNAIRRVDMN 373
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 63/165 (38%), Gaps = 44/165 (26%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-----A 110
+ +L + D+ + + L SV LAG G GY+DG P SA+F+ P A
Sbjct: 54 NGNLYIADTGNNVIKMFNPSLGTLSV---LAGSGTAGYNDGSPASAQFNTPTGLSAVYTA 110
Query: 111 VDMKGN------------IYVADKSNHVIRKIT-----------------NLGVTTIAGG 141
D++ I + D N+ IR++ N V T+ GG
Sbjct: 111 QDIQSGPGFPPTRLRLIEIRINDTGNNAIRRMCIQAPGDQVTCQSVNGLPNGQVVTMGGG 170
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ADG + F++ LLI D GN IR +
Sbjct: 171 ---SLGDADGNGSASRFNSPTGFA----TDGLLIGDTGNNSIRTM 208
>gi|302821389|ref|XP_002992357.1| hypothetical protein SELMODRAFT_430570 [Selaginella moellendorffii]
gi|300139773|gb|EFJ06507.1| hypothetical protein SELMODRAFT_430570 [Selaginella moellendorffii]
Length = 220
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 56/99 (56%), Gaps = 8/99 (8%)
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDC 193
VTTI GG S+K G ADGP A FS+ E + +LLI+D GN+LIR+I + P+ C
Sbjct: 23 VTTIVGGSSRKPGFADGPGDTARFSS--ESSLACSCGSLLIADRGNRLIREIQIDDPKSC 80
Query: 194 SKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
S SA+ W L+ V+ L+ + +G P+II
Sbjct: 81 DSSD---SAVSGSQKWALIPVV--LLGVCLGMPLGPFII 114
>gi|297568314|ref|YP_003689658.1| hypothetical protein [Desulfurivibrio alkaliphilus AHT2]
gi|296924229|gb|ADH85039.1| YD repeat protein [Desulfurivibrio alkaliphilus AHT2]
Length = 2439
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 57/103 (55%), Gaps = 3/103 (2%)
Query: 79 ESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
+ ++ +AG G +G+S DG P AR P + A+ G+IY+AD NH IR++ G +
Sbjct: 1244 DGIITTIAGTGYRGFSGDGGPADEARLFGPIAVAIGPDGSIYIADTYNHRIRRVGTDGII 1303
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDH 178
TT+AG GS GPA ASFS +TF P + ++H
Sbjct: 1304 TTVAGTGSLGYSGDGGPATEASFSTPSGITFGPDGSLYIAANH 1346
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 60/114 (52%), Gaps = 9/114 (7%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
+++ +AG G G+S G +AR +P ++D G IY+AD NH IR + G +TT A
Sbjct: 1068 IIETIAGTGSWGHSGGLAVNARLYRPSCVSIDPDGTIYIADTGNHRIRSVGTDGIITTHA 1127
Query: 140 GGGSKKEGRA-------DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G + G DGPA NA ++ ++ P ++ I+D N IR++
Sbjct: 1128 GSGFRAGGLGDSGYSGDDGPAVNARLNSPTDIALGPD-GSIYIADSINHRIRRV 1180
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
AR D+P + A+ G+IY+AD NH IR++ G +TTIAG G + GPA A
Sbjct: 1212 ARLDRPHAVAIGPDGSIYIADTYNHRIRRVGTDGIITTIAGTGYRGFSGDGGPADEARLF 1271
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
+ P ++ I+D N IR++
Sbjct: 1272 GPIAVAIGPD-GSIYIADTYNHRIRRV 1297
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 84 RLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
R G G GYS DG +AR + P A+ G+IY+AD NH IR++ G +TT+AG
Sbjct: 1132 RAGGLGDSGYSGDDGPAVNARLNSPTDIALGPDGSIYIADSINHRIRRVGTDGIITTVAG 1191
Query: 141 GG 142
G
Sbjct: 1192 TG 1193
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 7/111 (6%)
Query: 79 ESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV- 135
+ ++ +AG G GYS DG P + A F P G++Y+A +NH IR++ G+
Sbjct: 1300 DGIITTVAGTGSLGYSGDGGPATEASFSTPSGITFGPDGSLYIA--ANHRIRRVGTDGII 1357
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
TTIAG G + GPA A ++ P ++ ++D N IR I
Sbjct: 1358 TTIAGTGYRGFSGDGGPADEARLG-PRGVSLGPD-GSIYVADSNNHRIRHI 1406
>gi|291452573|ref|ZP_06591963.1| NHL repeat containing protein [Streptomyces albus J1074]
gi|291355522|gb|EFE82424.1| NHL repeat containing protein [Streptomyces albus J1074]
Length = 613
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 65/116 (56%), Gaps = 7/116 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S + +V D++R L+ ESVV+R+ G+G +G +DG P ARF +P+ A+ +G
Sbjct: 185 SGNFLVSDTTRHQLVELAE--DGESVVRRIGGEGERGLADGGPAEARFSEPQGLALTPEG 242
Query: 116 N-IYVADKSNHVIRKITNLG--VTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
+ + VAD NH +R++ VTT AG G + + GPA+ S+ +++ +
Sbjct: 243 DAVIVADTVNHALRRVELASGVVTTPAGTGRQWWQGSPTSGPAREVDLSSPWDVAW 298
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 47/103 (45%), Gaps = 2/103 (1%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK 144
AG +G DG A F +P A D ++VAD +R + G V T+ G G
Sbjct: 327 AGTTNEGLVDGPGAEAWFAQPSGLAADGDARLWVADSETSALRWVDPEGTVHTVVGTGLF 386
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G DG A A F + +T +P ++ +SD N +R+ +
Sbjct: 387 DFGHRDGEAAQALFQHPLAVTVLPD-GSVAVSDTYNHALRRYD 428
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 49/105 (46%), Gaps = 7/105 (6%)
Query: 87 GDGVQGYSDGEP--GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG-GGS 143
GDG Y EP G+ RF P + GN V+D + H + ++ G + + GG
Sbjct: 160 GDGP--YVAPEPVAGALRF--PGRAVLLPSGNFLVSDTTRHQLVELAEDGESVVRRIGGE 215
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ G ADG A FS L P A++++D N +R++ L
Sbjct: 216 GERGLADGGPAEARFSEPQGLALTPEGDAVIVADTVNHALRRVEL 260
>gi|260821256|ref|XP_002605949.1| hypothetical protein BRAFLDRAFT_132236 [Branchiostoma floridae]
gi|229291286|gb|EEN61959.1| hypothetical protein BRAFLDRAFT_132236 [Branchiostoma floridae]
Length = 446
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 59/100 (59%), Gaps = 4/100 (4%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRA 149
+ DG+ ARF P+ +D K N ++VAD NHVIR+++ + V T+AG K++G
Sbjct: 177 WKDGKGLEARFYHPQGLYLDAKQNRMFVADTDNHVIREMSMPDAVVKTVAGT-PKEKGLV 235
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+G + A F + ++ + P+ L +SDH N IR INL+
Sbjct: 236 NGQGRAAKFYHPTQMAYDPYTDILYVSDHFNHAIRTINLQ 275
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 69/149 (46%), Gaps = 16/149 (10%)
Query: 42 QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPG 99
Q+ +P++ I L V D A T++ E + LAG G G+ DG+
Sbjct: 249 QMAYDPYTDI-------LYVSDHFNHAIRTINLQGYEVDI---LAGSPKGEPGFKDGKGT 298
Query: 100 SARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNAS 157
+AR + P+ A D + YV + N+ IR +T G V T+AGG + G DG A
Sbjct: 299 AARLNYPEGIAFDTTHRVLYVVEFGNNCIRMVTPAGIVKTLAGG--PEPGYKDGSGSAAR 356
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQI 186
F + LT P + ++D N L+R +
Sbjct: 357 FFHPTGLTLDPTNKVIYVTDQYNHLVRSV 385
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 40/82 (48%), Gaps = 7/82 (8%)
Query: 60 IVLDSSRSAFYTLSFP------LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
I D++ Y + F ++ +VK LAG GY DG +ARF P +D
Sbjct: 308 IAFDTTHRVLYVVEFGNNCIRMVTPAGIVKTLAGGPEPGYKDGSGSAARFFHPTGLTLDP 367
Query: 114 KGN-IYVADKSNHVIRKITNLG 134
IYV D+ NH++R +T +G
Sbjct: 368 TNKVIYVTDQYNHLVRSVTAVG 389
Score = 37.0 bits (84), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 47/102 (46%), Gaps = 13/102 (12%)
Query: 101 ARFDKPKSFAVDMKGN-IYVAD-------KSNHVIRKIT--NLGVTTIAGGGSKKEGRAD 150
AR + P D N IYVAD +N IR+I+ + +TT+AG +G D
Sbjct: 64 ARLNYPWGLVFDKSDNSIYVADCGCPNSPHTNERIRRISLSDSWITTLAG---SSQGNQD 120
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
G ++A F + + L + D N +IRQ++ + D
Sbjct: 121 GLGKDAHFHHTAGMVVDEDNAMLYVCDSSNNVIRQVSTRTGD 162
>gi|383454552|ref|YP_005368541.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
gi|380728713|gb|AFE04715.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
Length = 861
Score = 62.4 bits (150), Expect = 2e-07, Method: Composition-based stats.
Identities = 40/110 (36%), Positives = 61/110 (55%), Gaps = 9/110 (8%)
Query: 80 SVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
+ VKR++ DG ++ G G P A D GN+YV+D N VIR+I+ G +
Sbjct: 605 NTVKRVSPDGKIRTVLTGLYG------PMGIAADALGNVYVSDTENAVIRRISPEGKAEV 658
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G + G DGPA +A+F+ L+F P ALL++D N +IR+I++
Sbjct: 659 FAGTT--WGYQDGPALSAAFNQPAGLSFTPDGTALLVADLNNSVIRRIDM 706
>gi|374311754|ref|YP_005058184.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
gi|358753764|gb|AEU37154.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
Length = 1456
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 76/152 (50%), Gaps = 10/152 (6%)
Query: 40 GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GE 97
G + EI ++DR G D+ ++D + ++ + ++ +AG+G +G+S G
Sbjct: 82 GAKFEIPFGILVDRAG--DIFIVDPGNNNVRKIA---ASTGIITTVAGNGTRGFSGDGGA 136
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQN 155
SA ++P++ A+D GN+Y+ D N +RK+ T +TT+AG GS GPA N
Sbjct: 137 ATSAELNQPQAVALDAAGNMYITDTINSRVRKVEATTGIITTVAGNGSVTSSGDGGPATN 196
Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
A + + ++D G +R+I+
Sbjct: 197 AGLVQPAGIAL-DSANNIFVTDSGEPGVREIS 227
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 61/111 (54%), Gaps = 5/111 (4%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
++ +AG+G GY+ DG P SA +P+ A+D G++Y AD +HV+R+I +T
Sbjct: 503 IITTIAGNGTSGYTGDGGPATSAGMYEPQGVALDNAGDVYFADFYHHVVRRIDAASGIIT 562
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
T+AG G++ G A +A N E + L I+D GN +R+++
Sbjct: 563 TVAGNGTEGFSGDGGSATSAEL-NYPEGISLDSAANLYIADSGNYRVRKVD 612
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 54/98 (55%), Gaps = 6/98 (6%)
Query: 77 SEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---T 131
+ ++ +AG+G +G+S G SA + P+ ++D N+Y+AD N+ +RK+ T
Sbjct: 556 AASGIITTVAGNGTEGFSGDGGSATSAELNYPEGISLDSAANLYIADSGNYRVRKVDAAT 615
Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
+ +TT+AG G+K + G A NA F ++ F H
Sbjct: 616 GI-ITTVAGDGAKTDSGDGGLAVNAGFRAISDVKFDSH 652
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 81/154 (52%), Gaps = 17/154 (11%)
Query: 41 HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEP 98
HQL S I GS ++ ++D+ R A + L+ SVV AG+G +G+S G
Sbjct: 249 HQL-----SGIAIDGSGNVFIVDTDRYAVFKLTTGTGTVSVV---AGNGTRGFSGDGGAA 300
Query: 99 GSARFDKPKSFAVDMKGNIYVAD---KSNHVIRKITNLG--VTTIAGGGSKKEGRADGPA 153
SA P++ A+D GN+Y++D +SN+ IRK+T ++T+AG G+ E DG A
Sbjct: 301 TSAELAYPQAVALDTAGNLYISDGDYRSNN-IRKVTAATGIISTVAGDGT-AEFSGDGGA 358
Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ N+ + L I+D N ++R+++
Sbjct: 359 AVDARLNEPGGVALDTAGNLYIADVNNSVVREVS 392
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 37/56 (66%), Gaps = 4/56 (7%)
Query: 81 VVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ +AGDG +S G+ G+A R ++P A+D GN+Y+AD +N V+R+++ L
Sbjct: 340 IISTVAGDGTAEFS-GDGGAAVDARLNEPGGVALDTAGNLYIADVNNSVVREVSPL 394
>gi|116619658|ref|YP_821814.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116222820|gb|ABJ81529.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 585
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 68/137 (49%), Gaps = 8/137 (5%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG--SARFDKPKSFAV 111
P + +L + DS+ +S S + +AG+G G++ + SA + P A+
Sbjct: 60 PKTGNLYIADSANHRIRMIS-----GSTISTVAGNGTAGFAGDKAAATSANLNTPSGVAL 114
Query: 112 DMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG-RADGPAQNASFSNDFELTFVPHI 170
D GN Y+AD N VIRK+T +TT+AG ++ G + DG N + N+ V
Sbjct: 115 DSSGNFYIADSLNSVIRKVTGGTITTVAGDYTQFPGDQGDGGQANVAVLNNPTSVMVDPA 174
Query: 171 CALLISDHGNQLIRQIN 187
I+D GN IR+++
Sbjct: 175 GNYYIADSGNNRIRKVD 191
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 45/136 (33%), Positives = 68/136 (50%), Gaps = 12/136 (8%)
Query: 59 LIVLDSS--RSAFYTLSFPLSEESVVKRL---AGDGVQGYS-DGEPGS-ARFDKPKSFAV 111
L + D+S R A Y P + +V L AG+ G++ DG + ++ +KP AV
Sbjct: 222 LYIADTSNNRIAKYA---PYTANNVAADLTNFAGNLTAGFAGDGNTATLSQLNKPVGIAV 278
Query: 112 DMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
D GN+Y+AD +N IRK+ G +TTIAG G GPA +A S + +
Sbjct: 279 DSAGNVYIADSNNGRIRKVGTDGIITTIAGKGGSGYSGDGGPATSAVLSFPRGIAVAAN- 337
Query: 171 CALLISDHGNQLIRQI 186
+ I+D N +IR +
Sbjct: 338 GTVYIADTNNHVIRAL 353
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 55/138 (39%), Gaps = 27/138 (19%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRL--------AGDGVQGYSD----GEPGSARFDKPK 107
+ LDSS + FY S SV++++ AGD Q D G+ A + P
Sbjct: 112 VALDSSGN-FYIAD---SLNSVIRKVTGGTITTVAGDYTQFPGDQGDGGQANVAVLNNPT 167
Query: 108 SFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
S VD GN Y+AD N+ IRK+ G G GR P A F N
Sbjct: 168 SVMVDPAGNYYIADSGNNRIRKVDTTGTINAYLGTLATGGRLRNPYALALFGN------- 220
Query: 168 PHICALLISDHGNQLIRQ 185
L I+D N I +
Sbjct: 221 ----VLYIADTSNNRIAK 234
>gi|421740691|ref|ZP_16178929.1| NHL repeat protein [Streptomyces sp. SM8]
gi|406690906|gb|EKC94689.1| NHL repeat protein [Streptomyces sp. SM8]
Length = 444
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 65/116 (56%), Gaps = 7/116 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S + +V D++R L+ ESVV+R+ G+G +G +DG P ARF +P+ A+ +G
Sbjct: 16 SGNFLVSDTTRHQLVELAE--DGESVVRRIGGEGERGLADGGPAEARFSEPQGLALTPEG 73
Query: 116 N-IYVADKSNHVIRKITNLG--VTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
+ + VAD NH +R++ VTT AG G + + GPA+ S+ +++ +
Sbjct: 74 DAVIVADTVNHALRRVELASGVVTTPAGTGRQWWQGSPTSGPAREVDLSSPWDVAW 129
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 46/103 (44%), Gaps = 2/103 (1%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK 144
AG +G DG A F +P A D ++VAD +R + G V T G G
Sbjct: 158 AGTTNEGLVDGPGAEAWFAQPSGLAADGDARLWVADSETSALRWVDPEGTVHTAVGTGLF 217
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G DG A A F + +T +P ++ +SD N +R+ +
Sbjct: 218 DFGHRDGEAAQALFQHPLAVTVLPD-GSVAVSDTYNHALRRYD 259
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 3/91 (3%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG-GGSKKEGRADGPAQNAS 157
G+ RF P + GN V+D + H + ++ G + + GG + G ADG A
Sbjct: 3 GALRF--PGRAVLLPSGNFLVSDTTRHQLVELAEDGESVVRRIGGEGERGLADGGPAEAR 60
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINL 188
FS L P A++++D N +R++ L
Sbjct: 61 FSEPQGLALTPEGDAVIVADTVNHALRRVEL 91
>gi|291612976|ref|YP_003523133.1| NHL repeat-containing protein [Sideroxydans lithotrophicus ES-1]
gi|291583088|gb|ADE10746.1| NHL repeat-containing protein [Sideroxydans lithotrophicus ES-1]
Length = 385
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGS 143
LA GV G SDG +ARF++P D N+YVAD +NH IRKI G T G +
Sbjct: 285 LASSGV-GSSDGIGANARFNQPMGITTDGS-NLYVADTNNHTIRKIVIATGNVTTPAGAA 342
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G ADG A F++ F + ++ AL ++D+ N IR++ L
Sbjct: 343 LVPGTADGAGPTARFNHPFGIIYIN--GALYVADYTNGSIRKVQL 385
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 51/112 (45%), Gaps = 7/112 (6%)
Query: 81 VVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTT 137
+V AG G +DG +ARF P A D N+YV D + IRKI VTT
Sbjct: 43 LVSTFAGTSGSHAITDGIGATARFQIPAYIASDGT-NLYVTDMYANNIRKIVIATGQVTT 101
Query: 138 IAGGGSKKEGRAD---GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG + G D GP +A F+N + + L ++D N IRQI
Sbjct: 102 FAGSITGASGVTDAPSGPGTSALFNNPYGIVIDATNTKLYVADGYNHTIRQI 153
>gi|325186506|emb|CCA21046.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1881
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 63/124 (50%), Gaps = 11/124 (8%)
Query: 71 TLSFPLSE-----ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH 125
T SFP S + V+ AG G +G++DGE ++ FD P A+D G YV D NH
Sbjct: 214 TWSFPKSNSKTKPQGAVETFAG-GSRGHNDGEAVASLFDNPNDLAIDSTGVTYVVDTGNH 272
Query: 126 VIRKITNLG-VTTIAGGGSKKEGRADGPAQNA--SFSNDFELTFVPHICALLISDHGNQL 182
IR+I VTT A G++ G DGP A +F + + + + ++D GN
Sbjct: 273 CIRRIDQRSRVTTFA--GNRTRGFRDGPLDIAQYNFPSGIAVVNEQNRVTVYVADTGNHR 330
Query: 183 IRQI 186
IR+I
Sbjct: 331 IRRI 334
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 58/117 (49%), Gaps = 6/117 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN---IYVADKSNHVIRKITN 132
+ + S V AG+ +G+ DG A+++ P AV + N +YVAD NH IR+I +
Sbjct: 277 IDQRSRVTTFAGNRTRGFRDGPLDIAQYNFPSGIAVVNEQNRVTVYVADTGNHRIRRIRD 336
Query: 133 LGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
V IAG G +DG A + F L + L+++D GN LIR I+
Sbjct: 337 GQVACIAGRCDSIPHPGFSDGNASESRFDTPLGLA-IDVDGNLIVADSGNNLIRLID 392
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 34/50 (68%), Gaps = 1/50 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAG 140
G+SDG +RFD P A+D+ GN+ VAD N++IR I +G+T T+AG
Sbjct: 353 GFSDGNASESRFDTPLGLAIDVDGNLIVADSGNNLIRLIDAVGLTRTLAG 402
>gi|383636240|ref|ZP_09950646.1| redoxin domain-containing protein [Streptomyces chartreusis NRRL
12338]
Length = 605
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 74/139 (53%), Gaps = 14/139 (10%)
Query: 39 DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
DG + P + R PG S +V D++R L+ ESVV+R+ G G +
Sbjct: 165 DGPYVAPEPEPTVLRFPGKALLLPSGTFLVSDTTRHQLVELAE--DGESVVRRI-GSGTR 221
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSK--KEG 147
G++DG G+A F++P+ A+ +G++ VAD NH +R+ +T VTT+AG G + +
Sbjct: 222 GFADGPTGTAAFNEPQGLALLDEGSVVVADTVNHALRRLDLTTGEVTTLAGTGRQWWQGS 281
Query: 148 RADGPAQNASFSNDFELTF 166
GPA+ S+ +++
Sbjct: 282 PTSGPAREVDLSSPWDVAL 300
>gi|222054715|ref|YP_002537077.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
gi|221564004|gb|ACM19976.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
Length = 2393
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 68/121 (56%), Gaps = 7/121 (5%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRK 129
S L+E+ ++ +AG+GV G++ G+ G A + P+ AVD+ GNIY+AD +N+ IR+
Sbjct: 1554 SITLAEQGIINTVAGNGVAGFT-GDGGIAVGSSLNYPEDAAVDLDGNIYIADSANNRIRR 1612
Query: 130 I--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ + + TIAG G + GPA A + F + + ISD+ N +R+I+
Sbjct: 1613 VDKASGSIATIAGNGLAEFSGDGGPAYMAGLNRPFAVALDTS-GNIYISDNLNYRVRKID 1671
Query: 188 L 188
L
Sbjct: 1672 L 1672
>gi|375146530|ref|YP_005008971.1| NHL repeat containing protein [Niastella koreensis GR20-10]
gi|361060576|gb|AEV99567.1| NHL repeat containing protein [Niastella koreensis GR20-10]
Length = 434
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 66/139 (47%), Gaps = 15/139 (10%)
Query: 53 RPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK---SF 109
+ G+ ++V + T F VV LAG G G++DG+ +A F+
Sbjct: 94 KTGTGPVVVTIGKNTGASTTDFKYLFSYVVTTLAGSGNAGFADGKGANASFNFAGVRCQL 153
Query: 110 AVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
+VD GN+YV D N IRKI G VTTIAG G + G +GPA A F+N
Sbjct: 154 SVDNIGNVYVPDGGNQRIRKIAPDGTVTTIAGTG--ENGYNEGPASGAKFNNP------- 204
Query: 169 HICALLISDHGNQLIRQIN 187
CA + +GN + + N
Sbjct: 205 --CATAMDANGNMYVAERN 221
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 32/50 (64%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V +AG G GY++G S RF+ P AVD GN+Y+ D +N+ +RKIT
Sbjct: 383 VTTIAGAGDSGYAEGTGSSVRFNGPTGIAVDKNGNMYILDMANNRVRKIT 432
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 36/56 (64%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + V +AG G GY++G A+F+ P + A+D GN+YVA+++ IRKIT
Sbjct: 174 IAPDGTVTTIAGTGENGYNEGPASGAKFNNPCATAMDANGNMYVAERNGRRIRKIT 229
Score = 40.4 bits (93), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 51/104 (49%), Gaps = 9/104 (8%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVA----DKSNHVIRKITN--LGVTTIAGGGSKK 145
G+ DG ARF+ P A+D K N+YV ++++ +R IT VTTIAG G
Sbjct: 334 GWVDGIGTVARFNNPWGIAMDNKSNLYVTGLGEGRNSNCVRMITPDVWNVTTIAGAG--D 391
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G A+G + F+ + V + I D N +R+I ++
Sbjct: 392 SGYAEGTGSSVRFNGPTGIA-VDKNGNMYILDMANNRVRKITVE 434
>gi|393778566|ref|ZP_10366837.1| Ig family protein [Ralstonia sp. PBA]
gi|392714468|gb|EIZ02071.1| Ig family protein [Ralstonia sp. PBA]
Length = 342
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 57/111 (51%), Gaps = 1/111 (0%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ +V AG G G +DG +A F+ P A+D GN+YVAD N+++R+IT GV
Sbjct: 227 ITPGGLVTTFAGSGTTGSADGPGNTASFNSPGGVALDGDGNVYVADTGNNLLRRITRAGV 286
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ G + G +G A F + + V + ++D N LIR++
Sbjct: 287 VSTLAGQAGVTGAQNGIGSAARFKQPYGVV-VDADGTVYVADTFNNLIRKV 336
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 79/163 (48%), Gaps = 24/163 (14%)
Query: 47 PHSVI-DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFD 104
P+ +I D+ G +L V D ++ P E + +AG G+ G DG +A FD
Sbjct: 148 PYGIIFDKNG--NLFVSDVQTGILRKIT-PAGETT---NIAGGGISSGPVDGNGAAAYFD 201
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFE 163
A+D GNIY + S + IRKIT G VTT AG G+ G ADGP ASF++
Sbjct: 202 GLGGIALDAAGNIYATELSGNRIRKITPGGLVTTFAGSGTT--GSADGPGNTASFNS--- 256
Query: 164 LTFVPHICAL------LISDHGNQLIRQINLKPEDCSKSSQSG 200
P AL ++D GN L+R+I + + Q+G
Sbjct: 257 ----PGGVALDGDGNVYVADTGNNLLRRITRAGVVSTLAGQAG 295
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 38/82 (46%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT 136
+ ++ V AG G G + G +A F P D GN++V+D ++RKIT G T
Sbjct: 119 AAQAEVSLFAGSGAAGSATGIGSAASFSLPYGIIFDKNGNLFVSDVQTGILRKITPAGET 178
Query: 137 TIAGGGSKKEGRADGPAQNASF 158
T GG G DG A F
Sbjct: 179 TNIAGGGISSGPVDGNGAAAYF 200
>gi|29347802|ref|NP_811305.1| NHL repeat-containing protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|383120108|ref|ZP_09940841.1| hypothetical protein BSIG_4606 [Bacteroides sp. 1_1_6]
gi|29339703|gb|AAO77499.1| NHL repeat-containing protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251838367|gb|EES66454.1| hypothetical protein BSIG_4606 [Bacteroides sp. 1_1_6]
Length = 440
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 72/134 (53%), Gaps = 13/134 (9%)
Query: 65 SRSAFYTLS-------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
S+SA Y +P + + + + G G G+ DG+ A+F +P+ A+D +GN+
Sbjct: 311 SKSAVYRFKLTRDADGWPALDGDIDEYIPGAGA-GFRDGDVQEAQFKEPRGIAIDKEGNL 369
Query: 118 YVADKSNHVIRKI-TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
Y+AD N+ IRK+ T L VTTIAG G+ G DG A F+ + + ++ L I
Sbjct: 370 YIADVGNNRIRKVDTKLNVVTTIAGSGA--AGYKDGDPLEAQFNQPWGV-YLDKNEFLYI 426
Query: 176 SDHGNQLIRQINLK 189
+D N IR++ ++
Sbjct: 427 ADQNNHCIRKLAIE 440
>gi|302821969|ref|XP_002992645.1| hypothetical protein SELMODRAFT_6806 [Selaginella moellendorffii]
gi|300139609|gb|EFJ06347.1| hypothetical protein SELMODRAFT_6806 [Selaginella moellendorffii]
Length = 79
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 43/65 (66%), Gaps = 1/65 (1%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEG 147
G GY DG +RF++P+S A+ G ++VAD +N IRKI+ G VTTIAGG S++ G
Sbjct: 15 GESGYVDGPAAKSRFNRPQSLAICDNGAVFVADTTNLAIRKISKDGEVTTIAGGSSRRPG 74
Query: 148 RADGP 152
AD P
Sbjct: 75 IADSP 79
>gi|336180276|ref|YP_004585651.1| NHL repeat containing protein [Frankia symbiont of Datisca
glomerata]
gi|334861256|gb|AEH11730.1| NHL repeat containing protein [Frankia symbiont of Datisca
glomerata]
Length = 398
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 51/88 (57%), Gaps = 2/88 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
A + P + AVD GN+Y++D NH IRKI LG +TTIAG G+ GPA A +
Sbjct: 133 AELNSPATTAVDSDGNVYISDSHNHRIRKIDPLGIITTIAGTGTAGFSGDGGPATAAQLN 192
Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
+ L V ++ ISD+ NQ IR+I+
Sbjct: 193 EPYGLA-VATDDSIYISDYENQRIRKID 219
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 59/110 (53%), Gaps = 4/110 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
++ +AG G G+S DG P +A + ++P AV +IY++D N IRKI LG +TT
Sbjct: 167 IITTIAGTGTAGFSGDGGPATAAQLNEPYGLAVATDDSIYISDYENQRIRKIDPLGIITT 226
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
IAG G+ GPA A N L V L IS+ GN IR+++
Sbjct: 227 IAGTGTAGFSGDGGPALQAQIKNPNNLA-VAADGTLYISELGNARIRKVS 275
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 58/114 (50%), Gaps = 15/114 (13%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TT 137
++ +AG G G+S DG P A+ P + AV G +Y+++ N IRK++ GV TT
Sbjct: 223 IITTIAGTGTAGFSGDGGPALQAQIKNPNNLAVAADGTLYISELGNARIRKVSPNGVITT 282
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQI 186
+AG G+ G GPA A VP + L I+D+ N+ IR++
Sbjct: 283 VAGNGTSGYGGDGGPATAAQLR-------VPSVALSPDGVLYIADYSNERIRRV 329
>gi|375148602|ref|YP_005011043.1| NHL repeat containing protein [Niastella koreensis GR20-10]
gi|361062648|gb|AEW01640.1| NHL repeat containing protein [Niastella koreensis GR20-10]
Length = 716
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 62/116 (53%), Gaps = 5/116 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ ++ AG+G +G+S G+ A+ +P + AVD GN+Y++D SN VIRK+
Sbjct: 132 VTTSGIMSIFAGNGTEGFSGDGGQAAQAKLYRPTAVAVDKDGNLYISDASNKVIRKVNKQ 191
Query: 134 GVTTIAGGGSKKEGRA--DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
GV + G + G A GPA A + + V + + I+D N ++R+IN
Sbjct: 192 GVISTIAGVPGRAGYAGDGGPATKALLTQPAGIA-VDYSGNIYIADPSNSVVRKIN 246
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 63/119 (52%), Gaps = 5/119 (4%)
Query: 76 LSEESVVKRLAG-DGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++++ V+ +AG G GY+ DG P + A +P AVD GNIY+AD SN V+RKI
Sbjct: 188 VNKQGVISTIAGVPGRAGYAGDGGPATKALLTQPAGIAVDYSGNIYIADPSNSVVRKINP 247
Query: 133 LGV-TTIAGGGSKKEGRADGPAQNASFS-NDFELTFVPHICALLISDHGNQLIRQINLK 189
GV TT AG G+ GPA A F + V + SD+ N IR+I+ K
Sbjct: 248 AGVITTFAGNGTAGYSGDGGPAIKAQFQMGSPQGLAVDPAGNVYASDYQNHAIRKISSK 306
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 67/136 (49%), Gaps = 10/136 (7%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFD--KPKSFAV 111
S ++ + D S S ++ V+ AG+G GYS DG P A+F P+ AV
Sbjct: 229 SGNIYIADPSNSVVRKIN----PAGVITTFAGNGTAGYSGDGGPAIKAQFQMGSPQGLAV 284
Query: 112 DMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
D GN+Y +D NH IRKI++ G +TTIAG G+ GPA A + +
Sbjct: 285 DPAGNVYASDYQNHAIRKISSKGIITTIAGTGAPDYAGDGGPAILAKIWYPIGIA-TDNA 343
Query: 171 CALLISDHGNQLIRQI 186
+ I+D N IR+I
Sbjct: 344 GNVFITDSYNNTIREI 359
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 57/101 (56%), Gaps = 4/101 (3%)
Query: 89 GVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKK 145
G GY G +ARF++P++ A D N+Y+AD NHVIRK+ + G ++T+AG G+
Sbjct: 33 GSPGYGGDAGPANNARFNQPRAVATDNDNNLYIADMRNHVIRKVNSNGIISTVAGNGTAG 92
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
GPA A + +T + + + I+D + +I+++
Sbjct: 93 TAGDGGPATAAQLAQPTGMT-IDNDGNIYIADFNSSVIKKV 132
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 48/89 (53%), Gaps = 2/89 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI-AGGGSKKEGRADGPAQNASF 158
+A+ +P +D GNIY+AD ++ VI+K+T G+ +I AG G++ G A A
Sbjct: 102 AAQLAQPTGMTIDNDGNIYIADFNSSVIKKVTTSGIMSIFAGNGTEGFSGDGGQAAQAKL 161
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
+ V L ISD N++IR++N
Sbjct: 162 YRPTAVA-VDKDGNLYISDASNKVIRKVN 189
>gi|383784367|ref|YP_005468936.1| hypothetical protein LFE_1113 [Leptospirillum ferrooxidans C2-3]
gi|383083279|dbj|BAM06806.1| hypothetical protein LFE_1113 [Leptospirillum ferrooxidans C2-3]
Length = 382
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 67/152 (44%), Gaps = 6/152 (3%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + P L V DS + + V LAG +DG A F +
Sbjct: 125 NPEGLAISPDGKTLYVADSRNNRIRKIDL---ATKTVSTLAGHAFASGNDGIGDHAGFAQ 181
Query: 106 PKSFAVDMKGN-IYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
P+ A+ G +Y+ D N++IRK +T L VTT+AG G+ G ADG A+F
Sbjct: 182 PRGLAISPDGKTLYIGDSGNNMIRKMDLTTLQVTTLAGQGALVPGHADGVGTQATFLEPR 241
Query: 163 ELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
L P L I+D N LIR++ L S
Sbjct: 242 GLAISPDGQILYIADTRNHLIRKLVLATNSVS 273
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT----NLGVTTIAGGGSKKE 146
+G +DG SARF+ P+ G+IYVAD N +IRKIT V TIAG
Sbjct: 53 RGDNDGNGASARFEFPQGIVAAPDGSIYVADTGNDMIRKITMSSGTGTVETIAGVNHHAR 112
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
R DG A F+N L P L ++D N IR+I+L + S
Sbjct: 113 FR-DGAGAAARFNNPEGLAISPDGKTLYVADSRNNRIRKIDLATKTVS 159
>gi|111219919|ref|YP_710713.1| serine/threonine protein kinase [Frankia alni ACN14a]
gi|111147451|emb|CAJ59101.1| Putative serine/threonine protein kinase [Frankia alni ACN14a]
Length = 779
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 63/128 (49%), Gaps = 15/128 (11%)
Query: 42 QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS 100
+L+ +V DR S DL + DS + + +V +AG G +G+S DG P +
Sbjct: 622 RLQATTAAVADR--SGDLYIADSEANRIRRID----SRGIVTTVAGTGAKGFSGDGGPAT 675
Query: 101 -ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG-------GSKKEGRADGP 152
A + P AVD GN+Y+AD +NH IR++T G+ T G GS K +GP
Sbjct: 676 RAMLNNPTGVAVDRAGNLYIADSANHRIRRVTPDGIITAFAGSGAAFVWGSPKSDETNGP 735
Query: 153 AQNASFSN 160
A + N
Sbjct: 736 ATSVVLWN 743
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 64/132 (48%), Gaps = 8/132 (6%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+++KR G DG P SA R + D G++Y+AD + IR+I + G VTT
Sbjct: 599 TLIKRGTGLSPSPPGDGGPASAGRLQATTAAVADRSGDLYIADSEANRIRRIDSRGIVTT 658
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS 197
+AG G+K GPA A +N + V L I+D N IR++ D ++
Sbjct: 659 VAGTGAKGFSGDGGPATRAMLNNPTGVA-VDRAGNLYIADSANHRIRRVT---PDGIITA 714
Query: 198 QSGSALGAVSVW 209
+GS GA VW
Sbjct: 715 FAGS--GAAFVW 724
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 35/59 (59%), Gaps = 2/59 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
V +AG G G S DG P A F + S AVD GNIY++D+SN V R T+ VTTI
Sbjct: 486 VSTVAGSGETGTSGDGGPALQAAFQRLGSVAVDAAGNIYISDESNRVRRVDTHGTVTTI 544
Score = 37.0 bits (84), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 2/90 (2%)
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQ 154
G G A ++ K VD GN+Y+AD+++ IR+I G V+T+AG G GPA
Sbjct: 446 GPDGVAAINEFKDLTVDAAGNVYLADQADPRIRRIDRRGYVSTVAGSGETGTSGDGGPAL 505
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIR 184
A+F V + ISD N++ R
Sbjct: 506 QAAFQR-LGSVAVDAAGNIYISDESNRVRR 534
>gi|290995104|ref|XP_002680171.1| predicted protein [Naegleria gruberi]
gi|284093791|gb|EFC47427.1| predicted protein [Naegleria gruberi]
Length = 928
Score = 61.6 bits (148), Expect = 4e-07, Method: Composition-based stats.
Identities = 39/105 (37%), Positives = 56/105 (53%), Gaps = 4/105 (3%)
Query: 85 LAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG+G G+ +G +A+ P S V +Y+AD SNH IRKI G + TIAG
Sbjct: 46 IAGNGTAGFRGDNGPATNAQLYNPYSVFVSSNNEVYIADFSNHRIRKILENGKIVTIAGN 105
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G+ +GPA NA +N + + FV + I D+ N IR+I
Sbjct: 106 GTGGFSGDNGPATNAQLNNPYSV-FVSSNNEVYIVDYNNHRIRKI 149
Score = 60.5 bits (145), Expect = 7e-07, Method: Composition-based stats.
Identities = 39/112 (34%), Positives = 59/112 (52%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G +A+ + P S V +Y+ D +NH IRKI N
Sbjct: 95 ENGKIVTIAGNGTGGFSGDNGPATNAQLNNPYSVFVSSNNEVYIVDYNNHRIRKILKNGN 154
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+ +GPA NA +N + FV + I+D+ N IR+I
Sbjct: 155 IVTIAGNGTGGFSGDNGPATNAQLNNPMGV-FVSSNNEVYIADYYNHRIRKI 205
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 40/137 (29%), Positives = 60/137 (43%), Gaps = 19/137 (13%)
Query: 69 FYTLSFPLSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
F L L++ + +AG+G G+ +G +A+ P S V +Y+ D +NH
Sbjct: 340 FIDLKRSLTKNGNIVTIAGNGTAGFRGDNGPATNAQLYNPYSVFVSSNNEVYIVDYNNHR 399
Query: 127 IRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFEL----------------TFVPH 169
IRKI N + TIAG G+ +GPA NA +N + FV
Sbjct: 400 IRKILKNGNIVTIAGNGTGGFSGDNGPATNAQLNNPMVVIHLGPATNAQLNNPYSVFVSS 459
Query: 170 ICALLISDHGNQLIRQI 186
+ I D+ N IR+I
Sbjct: 460 NNEVYIVDYNNHRIRKI 476
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 2/92 (2%)
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQ 154
G +A+ + P S V +Y+ D +NH IRKI G + TIAG G+ +GPA
Sbjct: 442 GPATNAQLNNPYSVFVSSNNEVYIVDYNNHRIRKILENGNIVTIAGNGTGGFSGDNGPAT 501
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
NA +N + FV + I+D+ N IR+I
Sbjct: 502 NAQLNNPMGV-FVSSNNEVYIADYYNHRIRKI 532
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 35/101 (34%), Positives = 51/101 (50%), Gaps = 17/101 (16%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG+G G+S +G +A+ + P V +Y+AD NH IRKI G + TIAG
Sbjct: 158 IAGNGTGGFSGDNGPATNAQLNNPMGVFVSSNNEVYIADYYNHRIRKILENGNIVTIAGN 217
Query: 142 GSKKEGRADGPAQNASFSND--FELTFVPHICALLISDHGN 180
G+ A FS D F++ PHI L++ +GN
Sbjct: 218 GT------------AGFSGDSPFDIRTYPHIGNKLLTGNGN 246
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 32/88 (36%), Positives = 44/88 (50%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
SA+ + P V +Y+AD N IRKI N + TIAG G+ +GPA NA
Sbjct: 7 SAQLNSPFGVFVSSNNEVYIADYGNQRIRKILKNGNIVTIAGNGTAGFRGDNGPATNAQL 66
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
N + + FV + I+D N IR+I
Sbjct: 67 YNPYSV-FVSSNNEVYIADFSNHRIRKI 93
Score = 47.4 bits (111), Expect = 0.007, Method: Composition-based stats.
Identities = 33/98 (33%), Positives = 47/98 (47%), Gaps = 17/98 (17%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
E + +AG+G G+S +G +A+ + P V +Y+AD NH IRKI G
Sbjct: 478 ENGNIVTIAGNGTGGFSGDNGPATNAQLNNPMGVFVSSNNEVYIADYYNHRIRKILENGN 537
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND--FELTFVPHI 170
+ TIAG G+ A FS D F++ PHI
Sbjct: 538 IVTIAGNGT------------AGFSGDSPFDIRTYPHI 563
>gi|373955343|ref|ZP_09615303.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
gi|373891943|gb|EHQ27840.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
Length = 929
Score = 61.2 bits (147), Expect = 4e-07, Method: Composition-based stats.
Identities = 39/87 (44%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
F++P VD+ GN+YVAD + + I+KIT G VT +AG GS G +G ASF
Sbjct: 199 FNQPNGVTVDLNGNVYVADAATNSIKKITPTGSVTVVAGNGSM--GSNNGIGSAASFYYP 256
Query: 162 FELTFVPHICALLISDHGNQLIRQINL 188
F +T V L +SD+GN LIR+I+L
Sbjct: 257 FSVT-VDGAGNLYVSDNGNNLIRKIDL 282
Score = 53.9 bits (128), Expect = 7e-05, Method: Composition-based stats.
Identities = 29/73 (39%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V LAG G+ ++DG +A F P +D GN+Y+AD N+ +RK+T LG VTT+AG
Sbjct: 286 VTTLAGSGMAAFADGTGTAASFYGPCGGTLDAMGNLYIADGVNNRVRKVTPLGVVTTVAG 345
Query: 141 GGSKKEGRADGPA 153
G++ +G +
Sbjct: 346 NGTRATINGNGTS 358
Score = 52.0 bits (123), Expect = 2e-04, Method: Composition-based stats.
Identities = 35/78 (44%), Positives = 45/78 (57%), Gaps = 3/78 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V +AG+G G ++G +A F P S VD GN+YV+D N++IRKI G VTT+AG
Sbjct: 232 VTVVAGNGSMGSNNGIGSAASFYYPFSVTVDGAGNLYVSDNGNNLIRKIDLAGAVTTLAG 291
Query: 141 GGSKKEGRADGPAQNASF 158
G ADG ASF
Sbjct: 292 SG--MAAFADGTGTAASF 307
Score = 50.8 bits (120), Expect = 6e-04, Method: Composition-based stats.
Identities = 37/108 (34%), Positives = 55/108 (50%), Gaps = 6/108 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
VV +AG+G + +G SA + P +D+ G +YVA+ + IRK+ G TI
Sbjct: 339 VVTTVAGNGTRATINGNGTSASLNTPTGATIDIAGIVYVAELDGNCIRKVDPSGNVTIL- 397
Query: 141 GGSKKEGRADGPAQNASFS--NDFELTFVPHICALLISDHGNQLIRQI 186
GS G A+G ASF ND + I ++D+GN +IR+I
Sbjct: 398 AGSNVAGSANGIGTAASFRRPNDVQADQSGFI---YVTDYGNNVIRKI 442
Score = 41.6 bits (96), Expect = 0.39, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 28/53 (52%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
V LAG V G ++G +A F +P D G IYV D N+VIRKI G
Sbjct: 394 VTILAGSNVAGSANGIGTAASFRRPNDVQADQSGFIYVTDYGNNVIRKILTTG 446
>gi|114566902|ref|YP_754056.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
gi|114337837|gb|ABI68685.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
Length = 1030
Score = 60.8 bits (146), Expect = 5e-07, Method: Composition-based stats.
Identities = 41/116 (35%), Positives = 62/116 (53%), Gaps = 10/116 (8%)
Query: 81 VVKRLAGDGV------QGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ +AG+GV GYS G SA+ + P A D GN+Y+AD +NH IRK+
Sbjct: 594 IISTVAGNGVLTGSYKSGYSGDGGSATSAQLNYPYGVAFDASGNMYIADSNNHCIRKVDT 653
Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
LG ++T AG G+ GPA +A +N L+F + + I+D N IR ++
Sbjct: 654 LGIISTAAGNGTYGYSGDGGPATSAQLNNPNGLSF-DNRGNMYIADTYNHRIRMVD 708
Score = 55.8 bits (133), Expect = 2e-05, Method: Composition-based stats.
Identities = 39/121 (32%), Positives = 63/121 (52%), Gaps = 5/121 (4%)
Query: 71 TLSFPLSEES-VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
T L++ + ++ +AG+G GYS G SA + P A D GN+Y+AD SN +
Sbjct: 471 TFRLTLTQNTGIISTVAGNGTAGYSGDGGLAASALLNYPHGLAFDGNGNLYIADASNRRV 530
Query: 128 RKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
RKI + G +TT+AG G+ G A A + + + F + + I+D N IR++
Sbjct: 531 RKIDSAGIITTVAGNGTSGYSGDGGSAIAAKITCPYGVAFDSN-GNMYIADIFNHRIRKV 589
Query: 187 N 187
+
Sbjct: 590 D 590
Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats.
Identities = 39/118 (33%), Positives = 62/118 (52%), Gaps = 12/118 (10%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TT 137
++ AG+G GYS DG P SA+ + P + D +GN+Y+AD NH IR + GV +T
Sbjct: 656 IISTAAGNGTYGYSGDGGPATSAQLNNPNGLSFDNRGNMYIADTYNHRIRMVDPNGVIST 715
Query: 138 IAGGGSKKEGRAD--------GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+AG G+ + + G A +A +N +TF + I+D N IR+++
Sbjct: 716 VAGNGNSGDRYGNDGGYSGDGGLATSAQLNNPNGITF-DSSGNMYIADSNNNCIRKVD 772
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 38/130 (29%), Positives = 63/130 (48%), Gaps = 12/130 (9%)
Query: 81 VVKRLAGDGVQGYSDGEPG----------SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V+ +AG+G G G G SA+ + P D GN+Y+AD +N+ IRK+
Sbjct: 712 VISTVAGNGNSGDRYGNDGGYSGDGGLATSAQLNNPNGITFDSSGNMYIADSNNNCIRKV 771
Query: 131 TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ G ++T AG G+ GPA +A N + + + L I+D+ + IR++ L
Sbjct: 772 DHSGMISTFAGNGTSGHFGDGGPATSAQLRNPVGVA-LDNSGNLFIADYFDHSIRKVVLA 830
Query: 190 PEDCSKSSQS 199
+ S S +
Sbjct: 831 AQQNSTISPT 840
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 44/143 (30%), Positives = 71/143 (49%), Gaps = 16/143 (11%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAV 111
G+ +L + D+S + ++ +AG+G GYS G+ GSA + P A
Sbjct: 516 GNGNLYIADASNRRVRKID----SAGIITTVAGNGTSGYS-GDGGSAIAAKITCPYGVAF 570
Query: 112 DMKGNIYVADKSNHVIRKITNLG-VTTIAGG----GSKKEGRA--DGPAQNASFSNDFEL 164
D GN+Y+AD NH IRK+ G ++T+AG GS K G + G A +A + + +
Sbjct: 571 DSNGNMYIADIFNHRIRKVDPAGIISTVAGNGVLTGSYKSGYSGDGGSATSAQLNYPYGV 630
Query: 165 TFVPHICALLISDHGNQLIRQIN 187
F + I+D N IR+++
Sbjct: 631 AFDAS-GNMYIADSNNHCIRKVD 652
>gi|428180908|gb|EKX49774.1| hypothetical protein GUITHDRAFT_67798, partial [Guillardia theta
CCMP2712]
Length = 247
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 60/106 (56%), Gaps = 4/106 (3%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
+AG G +G DG S R + P+ D G++Y+AD SNH IR++ ++ + T+AG G
Sbjct: 74 VAGCGRRGCQDGSHESGRLNGPRGLTFDPHGDLYIADSSNHRIRRLRSQDMFLETVAGSG 133
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ E R DG NASFS + L + L++S+ + IR+++
Sbjct: 134 -EGESR-DGTLLNASFSFPYGLAWDALTNTLIVSEQESHKIRRVDF 177
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/60 (46%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V +AG G G+ DGE A F++P A D +GNI+VAD N IR IT G V T+ G
Sbjct: 182 VTTIAGTGECGHRDGEASQATFNEPCFIACDQQGNIFVADSRNDSIRCITRHGFVHTLVG 241
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 65/147 (44%), Gaps = 17/147 (11%)
Query: 43 LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSAR 102
L +PH DL + DSS L S++ ++ +AG G DG +A
Sbjct: 98 LTFDPHG--------DLYIADSSNHRIRRLR---SQDMFLETVAGSGEGESRDGTLLNAS 146
Query: 103 FDKPKSFAVD-MKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFS 159
F P A D + + V+++ +H IR++ VTTIAG G + G DG A A+F
Sbjct: 147 FSFPYGLAWDALTNTLIVSEQESHKIRRVDFAANSVTTIAGTG--ECGHRDGEASQATF- 203
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
N+ + ++D N IR I
Sbjct: 204 NEPCFIACDQQGNIFVADSRNDSIRCI 230
>gi|336178839|ref|YP_004584214.1| serine/threonine protein kinase [Frankia symbiont of Datisca
glomerata]
gi|334859819|gb|AEH10293.1| serine/threonine protein kinase [Frankia symbiont of Datisca
glomerata]
Length = 776
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 62/115 (53%), Gaps = 4/115 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ V+ +AG G GYS DG P +A + +P A D GN+Y+AD +N+ IR++++
Sbjct: 603 IGTNDVITTVAGSGTSGYSGDGGPATAAQLARPGGLAADTAGNLYIADNANNRIRRVSSD 662
Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
GV T+AG G+ GPA A +N + V + I+D N +R+++
Sbjct: 663 GVIITVAGSGTSGYSGDGGPATAAQLANPGSVA-VTDDGRVYIADTDNNRVRRVD 716
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/99 (39%), Positives = 53/99 (53%), Gaps = 4/99 (4%)
Query: 92 GYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGR 148
GYS DG P +A + KP S VD G +Y+AD NH IR+I TN +TT+AG G+
Sbjct: 563 GYSGDGGPATAAQLAKPTSVLVDADGTLYIADTGNHRIRRIGTNDVITTVAGSGTSGYSG 622
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
GPA A + L L I+D+ N IR+++
Sbjct: 623 DGGPATAAQLARPGGLA-ADTAGNLYIADNANNRIRRVS 660
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 56/114 (49%), Gaps = 3/114 (2%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S + V+ +AG G GYS DG P +A + P S AV G +Y+AD N+ +R++
Sbjct: 659 VSSDGVIITVAGSGTSGYSGDGGPATAAQLANPGSVAVTDDGRVYIADTDNNRVRRVDAD 718
Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
GV TT+AG GPA A L L I+D+ N IR++
Sbjct: 719 GVITTVAGSDEAGYSGDGGPATAARLCEPNGLGLDTTERLLYITDNCNDRIRRV 772
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 54/106 (50%), Gaps = 4/106 (3%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG GV G+S +G +A KP VD G +Y D N+ IR+I G +TT+AG
Sbjct: 444 VAGTGVAGFSGDNGPATAADLSKPDDPLVDNTGAVYFTDTGNNRIRRIGADGIITTVAGT 503
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G+ +GPA A F+ ++ L I+D N IR+IN
Sbjct: 504 GTYGFSGDNGPAAQAHFATPTDVAR-DRAGNLYIADTDNNRIRRIN 548
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + ++ +AG G G+S +G A F P A D GN+Y+AD N+ IR+I +
Sbjct: 491 IGADGIITTVAGTGTYGFSGDNGPAAQAHFATPTDVARDRAGNLYIADTDNNRIRRINVV 550
Query: 134 GVTTIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G T G D GPA A + + V L I+D GN IR+I
Sbjct: 551 GTVTTVAGTGTPGYSGDGGPATAAQLAKPTSV-LVDADGTLYIADTGNHRIRRI 603
>gi|423215200|ref|ZP_17201728.1| hypothetical protein HMPREF1074_03260 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692463|gb|EIY85701.1| hypothetical protein HMPREF1074_03260 [Bacteroides xylanisolvens
CL03T12C04]
Length = 451
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 57/98 (58%), Gaps = 1/98 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY +G + R ++P+ A D GN++V +KS H+IRKIT GV ++ G + G DG
Sbjct: 355 GYVNGSGATVRLNQPRQPAFDEDGNMFVPEKSAHIIRKITPAGVASLYAGIPGQSGFGDG 414
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ A F++ +T P ++ ++D N +IR++ ++
Sbjct: 415 LPELAKFNSPECVTVYPD-NSVYVADRENHVIRRVTVE 451
Score = 44.3 bits (103), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 28/43 (65%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
G G+ DG P A+F+ P+ V ++YVAD+ NHVIR++T
Sbjct: 407 GQSGFGDGLPELAKFNSPECVTVYPDNSVYVADRENHVIRRVT 449
>gi|86610239|ref|YP_479001.1| NHL repeat-containing protein [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86558781|gb|ABD03738.1| NHL repeat protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 649
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 46/78 (58%), Gaps = 3/78 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V AG + GY DG A+F +P A+ G++YVAD++N IR+IT G VTT+AG
Sbjct: 426 VSTFAGTCIAGYRDGGLDRAQFREPSGLALGSDGSLYVADRANRRIRRITPAGEVTTVAG 485
Query: 141 GGSKKEGRADGPAQNASF 158
G + G DGPA+ A
Sbjct: 486 TG--QPGSVDGPAEQAQL 501
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 51/97 (52%), Gaps = 4/97 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRAD 150
GY DG ARF A D GN++VAD+ NH IR++ G V+T+A G + G D
Sbjct: 541 GYRDGPLAEARFQTLGGLAFDAAGNLWVADRDNHRIRRLQPGGQVSTLA--GQNQPGWQD 598
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
GP A F L +P ++++ D G +R+I+
Sbjct: 599 GPTSVARFDQPAGLAVLPD-GSVVVVDAGLPGLRRIS 634
Score = 57.0 bits (136), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 61/134 (45%), Gaps = 6/134 (4%)
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-N 132
F LS E ++ AG G G DG A+FD P+ D KG ++VAD NH +R I
Sbjct: 364 FRLSPEGELEVWAGTGQAGRRDGAANQAQFDSPQGLLWDPKGGLWVADSGNHCLRYINLQ 423
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE- 191
V+T A G+ G DG A F L +L ++D N+ IR+I E
Sbjct: 424 RQVSTFA--GTCIAGYRDGGLDRAQFREPSGLALG-SDGSLYVADRANRRIRRITPAGEV 480
Query: 192 -DCSKSSQSGSALG 204
+ + Q GS G
Sbjct: 481 TTVAGTGQPGSVDG 494
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 55/107 (51%), Gaps = 7/107 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V +AG G G DG A+ +P + AVD +GN+++AD H +R+++ G VTT++G
Sbjct: 480 VTTVAGTGQPGSVDGPAEQAQLLQPTALAVDKEGNLWIAD--GHRLRRLSAGGRVTTLSG 537
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ G DGP A F L F L ++D N IR++
Sbjct: 538 ---AEPGYRDGPLAEARFQTLGGLAFD-AAGNLWVADRDNHRIRRLQ 580
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 2/81 (2%)
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
AVD +G Y+AD++ H I +++ G + G+ + GR DG A A F + L + P
Sbjct: 345 IAVDPEGAFYLADRAQHRIFRLSPEGELEVW-AGTGQAGRRDGAANQAQFDSPQGLLWDP 403
Query: 169 HICALLISDHGNQLIRQINLK 189
L ++D GN +R INL+
Sbjct: 404 K-GGLWVADSGNHCLRYINLQ 423
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVTTIA 139
V LAG G+ DG ARFD+P AV G++ V D +R+I+ + G T
Sbjct: 585 VSTLAGQNQPGWQDGPTSVARFDQPAGLAVLPDGSVVVVDAGLPGLRRISRVASGPVTPT 644
Query: 140 GGGS 143
GGS
Sbjct: 645 PGGS 648
>gi|359151030|ref|ZP_09183771.1| Redoxin domain-containing protein [Streptomyces sp. S4]
Length = 568
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 64/116 (55%), Gaps = 7/116 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S + +V D++R L+ ESVV+R+ G+G +G +DG P ARF +P+ + +G
Sbjct: 140 SGNFLVSDTTRHQLVELAE--DGESVVRRIGGEGERGLADGGPAEARFSEPQGLVLTPEG 197
Query: 116 N-IYVADKSNHVIRKITNLG--VTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
+ + VAD NH +R++ VTT AG G + + GPA+ S+ +++ +
Sbjct: 198 DAVIVADTVNHALRRVELASGVVTTPAGTGRQWWQGSPTSGPAREVDLSSPWDVAW 253
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 46/103 (44%), Gaps = 2/103 (1%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK 144
AG +G DG A F +P A D ++VAD +R + G V T G G
Sbjct: 282 AGTTNEGLVDGPGAEAWFAQPSGLAADGDARLWVADSETSALRWVDPEGTVHTAVGTGLF 341
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G DG A A F + +T +P ++ +SD N +R+ +
Sbjct: 342 DFGHRDGEAAQALFQHPLAVTVLPD-GSVAVSDTYNHALRRYD 383
Score = 40.4 bits (93), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 49/105 (46%), Gaps = 7/105 (6%)
Query: 87 GDGVQGYSDGEP--GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG-GGS 143
GDG Y EP G+ RF P + GN V+D + H + ++ G + + GG
Sbjct: 115 GDGP--YVAPEPVAGALRF--PGRAVLLPSGNFLVSDTTRHQLVELAEDGESVVRRIGGE 170
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ G ADG A FS L P A++++D N +R++ L
Sbjct: 171 GERGLADGGPAEARFSEPQGLVLTPEGDAVIVADTVNHALRRVEL 215
>gi|288921012|ref|ZP_06415304.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288347587|gb|EFC81872.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 882
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 45/117 (38%), Positives = 64/117 (54%), Gaps = 8/117 (6%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + +AG GV GYS DG P +A + + AV G+IY+AD NH IRKI+
Sbjct: 648 IAADGTISTIAGTGVAGYSGDGGPATAAQLKSAEKLAVAPDGSIYIADYENHRIRKISTD 707
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFS--NDFELTFVPHICALLISDHGNQLIRQIN 187
G +TTIAG G GPA A+ + ND E V L I++ G+ I++IN
Sbjct: 708 GIITTIAGTGVDGYSGEGGPATAATLNGPNDVE---VASDGTLYIANLGSDTIQKIN 761
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 48/89 (53%), Gaps = 2/89 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
+A+ D P S D KGNIY+ D N+ IRKI G ++TIAG G GPA A
Sbjct: 618 AAKLDGPGSAVTDGKGNIYIPDARNYRIRKIAADGTISTIAGTGVAGYSGDGGPATAAQL 677
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
+ +L P ++ I+D+ N IR+I+
Sbjct: 678 KSAEKLAVAPD-GSIYIADYENHRIRKIS 705
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 79/160 (49%), Gaps = 13/160 (8%)
Query: 36 TVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF------PLSEESVVKRLAGDG 89
T +DG+ E P + G +D+ V +S Y + ++ + + +AG G
Sbjct: 716 TGVDGYSGEGGPATAATLNGPNDVEV--ASDGTLYIANLGSDTIQKINTDGTIHTVAGTG 773
Query: 90 VQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKE 146
QGYS DG P + A+ P S ++ G +Y+AD N+ IRK+ N +TTIAG GS+
Sbjct: 774 EQGYSGDGGPATKAQLSIP-SVSLGRNGTLYIADYGNNRIRKVDANGTITTIAGTGSEGS 832
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
GPA A F+ D AL I+D GN IR+I
Sbjct: 833 SGDGGPATEAKFT-DPSSVVEDSTGALYIADSGNNRIRRI 871
>gi|357008076|ref|ZP_09073075.1| copper amine oxidase domain-containing protein [Paenibacillus elgii
B69]
Length = 533
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 53/94 (56%), Gaps = 4/94 (4%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPA 153
DG+ ++ F++P+ A D GN+YVAD NH IRKI G V+T+AG G GR DG
Sbjct: 119 DGKSDASLFNEPQGLAADANGNVYVADSGNHAIRKIDTAGQVSTVAGNGLL--GRKDGEG 176
Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
++A F ++ V L ++D N IR I+
Sbjct: 177 KDALFYRPTDVA-VAADGTLYVADSLNHAIRSIS 209
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 95/212 (44%), Gaps = 35/212 (16%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG---------VQG-YSDGEPG 99
V+D G +LIV DS + ++ V LAG G VQG ++DG
Sbjct: 255 VLDAKG--NLIVSDSGNQRIRYIDL---QQGKVTTLAGGGQAATNKELHVQGGFADGSAS 309
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
ARF P A+ +G + +AD NH IR + + V+TIAG + G ADG +A+
Sbjct: 310 DARFSFPMGLALTEEGGLVIADSQNHAIRYLLDGQVSTIAGAADRITGNADGIEGSAALH 369
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLK--PEDCSKSSQSGSALGAVSVWVLVSVLSC 217
++ + ++L +D N +R+++L P+ K +V V+V
Sbjct: 370 RPMDVAVLAD-GSILAADTYNNKLREVSLYRLPDGLPKDG---------TVKVVVDRQQ- 418
Query: 218 LVSLVIGFVARPYIIRHTGRLDPPLQHDMEAL 249
I F A+P I+ GR P++ EAL
Sbjct: 419 -----IAFEAQPEIV--NGRTMVPIRAITEAL 443
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 60/133 (45%), Gaps = 13/133 (9%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG------- 134
V +AG+G+ G DGE A F +P AV G +YVAD NH IR I+ G
Sbjct: 160 VSTVAGNGLLGRKDGEGKDALFYRPTDVAVAADGTLYVADSLNHAIRSISPSGEVKTLNA 219
Query: 135 -----VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
V G S ADG ++A F+ L L++SD GNQ IR I+L+
Sbjct: 220 LSPRVVELFPGQVSPAGDFADGDLKSAKFNEPTALVLDAK-GNLIVSDSGNQRIRYIDLQ 278
Query: 190 PEDCSKSSQSGSA 202
+ + G A
Sbjct: 279 QGKVTTLAGGGQA 291
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 60/120 (50%), Gaps = 11/120 (9%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
PL+E + LAG+G G +DG SA F P V G + V+D N +IRK++
Sbjct: 40 PLTEMAT---LAGNGGLGSADGAGVSASFRMPGGLTVLKDGTVLVSDSRNQLIRKLSQGT 96
Query: 135 VTTIAGGGSKKEGRA-------DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
V+T AG K++ + DG + +AS N+ + + ++D GN IR+I+
Sbjct: 97 VSTFAGAAYKQDSKGFPVGALLDGKS-DASLFNEPQGLAADANGNVYVADSGNHAIRKID 155
>gi|29349488|ref|NP_812991.1| hypothetical protein BT_4080 [Bacteroides thetaiotaomicron
VPI-5482]
gi|298384984|ref|ZP_06994543.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|383120427|ref|ZP_09941155.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
gi|29341397|gb|AAO79185.1| putative cell surface protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251840523|gb|EES68605.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
gi|298262128|gb|EFI04993.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 456
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 52/100 (52%), Gaps = 2/100 (2%)
Query: 88 DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKE 146
+G GY DG G A F+KP VD +GNI + D NH IRKIT + G + G +
Sbjct: 355 EGKSGYLDGPIGQAMFNKPGQMCVDSEGNIILTDTENHCIRKITMSTGYVSTLAGKPQNS 414
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +G A++A F + + + + I D N+ IR++
Sbjct: 415 GYVNGSAEDAQFKKPLGIC-IDNDDVMYIGDSENRAIRRL 453
>gi|261404592|ref|YP_003240833.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
Y412MC10]
gi|261281055|gb|ACX63026.1| copper amine oxidase domain protein [Paenibacillus sp. Y412MC10]
Length = 1280
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 57/104 (54%), Gaps = 4/104 (3%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
A ++P AVD KG +Y+AD NH IRKI G VTTIAG G + G DG AQ+A F
Sbjct: 138 AFLNRPSGIAVDGKGQVYIADSGNHAIRKIDQAGRVTTIAGNG--RIGLKDGKAQDALFH 195
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
++ V L ++D N +IR+I+ E + S S A+
Sbjct: 196 EPQDVA-VTEDGILYVADTLNHVIRRISPDGEVTTIGSPSTRAV 238
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 47/93 (50%), Gaps = 3/93 (3%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY DG +ARF+ P+ A ++ + +AD NH +R++ + V+T+AGG G ADG
Sbjct: 318 GYLDGAADAARFNSPRGLAWSVEDGLLIADSHNHAVRQLKDERVSTMAGG---TRGYADG 374
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
F+ ++ L I+D N +R
Sbjct: 375 IESEVRFNAPADIAVASDSGELFIADQRNGAVR 407
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 69/155 (44%), Gaps = 15/155 (9%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
+N S I G + + DS A + + V +AG+G G DG+ A F
Sbjct: 140 LNRPSGIAVDGKGQVYIADSGNHAIRKID----QAGRVTTIAGNGRIGLKDGKAQDALFH 195
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK--EGRAD-----GPAQNAS 157
+P+ AV G +YVAD NHVIR+I+ G T G S + + RA G +N S
Sbjct: 196 EPQDVAVTEDGILYVADTLNHVIRRISPDGEVTTIGSPSTRAVQVRAGVIALAGDYKNGS 255
Query: 158 FS----NDFELTFVPHICALLISDHGNQLIRQINL 188
+ N+ + L +SD GN IR I+
Sbjct: 256 LAEAQFNEPAGLALDGKGNLYVSDSGNHAIRYIDF 290
>gi|118161380|gb|ABK64062.1| conserved hypothetical protein [Janthinobacterium lividum]
Length = 779
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 47/120 (39%), Positives = 63/120 (52%), Gaps = 12/120 (10%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV-------I 127
++ + V LAG G SDG+ SARF AVD G IYVAD +++ I
Sbjct: 269 ITPDGTVTTLAGISRTYGTSDGDAKSARFGGISGIAVDTAGAIYVADAGSNINGIVGSRI 328
Query: 128 RKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
RK+T G V+T+AGG G ADG A+FS+ LT + L ++D GN IR+I
Sbjct: 329 RKLTPAGIVSTLAGG--SAYGSADGAGAVATFSSLNGLT-IDKQGNLFVADEGNHTIRKI 385
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 51/104 (49%), Gaps = 13/104 (12%)
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPA 153
+DG ARF ++ A D GN+Y+ D NH +RKIT GV T G + G ADG
Sbjct: 634 ADGSLAQARFGYLRAIAFDASGNLYLTDSLNHNVRKITPAGVVTTLAGTTGVAGDADGRG 693
Query: 154 QNASFSNDFELTFVPHICAL------LISDHGNQLIRQINLKPE 191
ASF+ PH AL ++D N L+R+I+ E
Sbjct: 694 SAASFNG-------PHGIALDKAGNVYVADTENNLVRRISTSGE 730
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 58/118 (49%), Gaps = 8/118 (6%)
Query: 76 LSEESVVKRLAGDGVQ---GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV----IR 128
++ VV LAG Q +DG +ARF+ P AVD GN+YV+D + + +R
Sbjct: 385 ITPAGVVSTLAGTASQLRPAVADGIGAAARFNLPYGLAVDGAGNVYVSDSNPGLQLNGVR 444
Query: 129 KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
KIT G T G + G DG A A F+ + L ++D GNQLIR++
Sbjct: 445 KITPAGEVTTITGSNAGIGVTDGLASEARFAGPQAIALHSD-GTLFVADTGNQLIRRV 501
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 49/91 (53%), Gaps = 2/91 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
L+ +V LAG G +DG A F +D +GN++VAD+ NH IRKIT G
Sbjct: 331 LTPAGIVSTLAGGSAYGSADGAGAVATFSSLNGLTIDKQGNLFVADEGNHTIRKITPAGV 390
Query: 135 VTTIAGGGSK-KEGRADGPAQNASFSNDFEL 164
V+T+AG S+ + ADG A F+ + L
Sbjct: 391 VSTLAGTASQLRPAVADGIGAAARFNLPYGL 421
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 71/155 (45%), Gaps = 12/155 (7%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+S ++ LAG G +DG+ +A FD + VD G++YV ++ IRKIT G
Sbjct: 217 ISPAGLLTTLAGSSSCGAADGKGTAASFDSMLAITVDASGSVYVGGGAS--IRKITPDGT 274
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL-------IRQINL 188
T G S+ G +DG A++A F V A+ ++D G+ + IR+ L
Sbjct: 275 VTTLAGISRTYGTSDGDAKSARFGG-ISGIAVDTAGAIYVADAGSNINGIVGSRIRK--L 331
Query: 189 KPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVI 223
P + GSA G+ V+ S L L I
Sbjct: 332 TPAGIVSTLAGGSAYGSADGAGAVATFSSLNGLTI 366
Score = 43.9 bits (102), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAG 140
G +DG ARF P++ A+ G ++VAD N +IR+++ +GVT T+AG
Sbjct: 463 GVTDGLASEARFAGPQAIALHSDGTLFVADTGNQLIRRVSAVGVTSTLAG 512
Score = 41.2 bits (95), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 8/117 (6%)
Query: 76 LSEESVVKRLAGDGVQGY----SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S V LAG+ G DG A F +P V G Y+AD + IR++
Sbjct: 501 VSAVGVTSTLAGNPGHGSFLSSGDGTGAQATFAQPAGIVVGSDGVAYIADAFRNTIRRVG 560
Query: 132 NLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
N G VTT+AG + DG A FS+ + + L ++D+ Q +R+++
Sbjct: 561 NDGVVTTLAGSYENRSQPTDGQGAKAGFSSTNGIA-IDGNATLYVADY--QAVRKVD 614
>gi|290974673|ref|XP_002670069.1| predicted protein [Naegleria gruberi]
gi|284083624|gb|EFC37325.1| predicted protein [Naegleria gruberi]
Length = 821
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 77/152 (50%), Gaps = 21/152 (13%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEE-----------SVVKRLAGDGVQGYSDGEPG 99
++ PG L+ + + Y + FPLS + + AG+G G + GE G
Sbjct: 101 VENPG---LLAFSPNETTLY-VCFPLSNKVQRMDFLPNGTEIFSDFAGNGNSG-NTGEGG 155
Query: 100 ---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPAQN 155
+A+ ++P AVD G++Y++D N +IRK++ + G+ T G S+ G A+N
Sbjct: 156 LAINAQLNQPSGVAVDSNGDVYISDSGNGLIRKVSISSGIITKFAGTSQGFAGDGGLAKN 215
Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
A SN L F P+ L I+D N ++R+IN
Sbjct: 216 AKLSNPRGLNFGPN-GDLYIADSDNNVVRRIN 246
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 68/146 (46%), Gaps = 8/146 (5%)
Query: 44 EINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSA 101
++N S + + D+ + DS +S ++ + AG QG++ G +A
Sbjct: 161 QLNQPSGVAVDSNGDVYISDSGNGLIRKVSI---SSGIITKFAGTS-QGFAGDGGLAKNA 216
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSN 160
+ P+ G++Y+AD N+V+R+I + G +TTIAG + G A+ A N
Sbjct: 217 KLSNPRGLNFGPNGDLYIADSDNNVVRRINSSGIITTIAGDTTSGYSGDGGDAKLAKMKN 276
Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
+ V + ISD N +IR++
Sbjct: 277 PINV-IVSKTNEIFISDADNNVIRKV 301
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 51/98 (52%), Gaps = 9/98 (9%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + P + DL + DS + ++ ++ +AGD GYS G+ G A+ K
Sbjct: 220 NPRGLNFGP-NGDLYIADSDNNVVRRIN----SSGIITTIAGDTTSGYS-GDGGDAKLAK 273
Query: 106 ---PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
P + V I+++D N+VIRK++N ++TIAG
Sbjct: 274 MKNPINVIVSKTNEIFISDADNNVIRKVSNGNISTIAG 311
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 51/91 (56%), Gaps = 3/91 (3%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG-GSKKEGRADGPAQNASF 158
A+ + P S V I +AD SN+VIRK+ G + TIAG GS+ +G A ++
Sbjct: 496 AKLNAPNSVKVSSSDEIVIADTSNNVIRKVFKNGTIITIAGTIGSQGYSGDNGLAISSVL 555
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLK 189
S+ ++ P+ + I+D GN +IR+I+ K
Sbjct: 556 SSPSDVAIAPN-GEVFIADWGNHVIRKIDTK 585
Score = 40.4 bits (93), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 47/94 (50%), Gaps = 8/94 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYS--DGEPGSARF 103
P+SV + SSD IV+ + + F + + +AG G QGYS +G S+
Sbjct: 501 PNSV--KVSSSDEIVIADTSNNVIRKVF---KNGTIITIAGTIGSQGYSGDNGLAISSVL 555
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
P A+ G +++AD NHVIRKI G+ T
Sbjct: 556 SSPSDVAIAPNGEVFIADWGNHVIRKIDTKGIIT 589
>gi|29349195|ref|NP_812698.1| hypothetical protein BT_3787 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341103|gb|AAO78892.1| conserved hypothetical protein with NHL repeat [Bacteroides
thetaiotaomicron VPI-5482]
Length = 452
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 52/98 (53%), Gaps = 1/98 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY +G +ARFD P+ A D GN++V + H IRKIT G ++ G + G DG
Sbjct: 356 GYVNGSGVTARFDNPRQPAFDQDGNMFVPEYGRHTIRKITPTGEVSLYAGLPGQAGFTDG 415
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ A F N E V +L ++D N LIR++ ++
Sbjct: 416 LPEKARF-NKPECVTVYLDNSLYVADRDNHLIRRVTVE 452
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 36/57 (63%), Gaps = 2/57 (3%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
P E S+ L G G++DG P ARF+KP+ V + ++YVAD+ NH+IR++T
Sbjct: 396 PTGEVSLYAGLPGQA--GFTDGLPEKARFNKPECVTVYLDNSLYVADRDNHLIRRVT 450
>gi|325104254|ref|YP_004273908.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
gi|324973102|gb|ADY52086.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
Length = 439
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 84/171 (49%), Gaps = 24/171 (14%)
Query: 29 EDGYTVTTV--------IDGHQLEI---NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPL 76
E Y V+T+ IDG+ L+ NP + +D G D+I+ D + + +
Sbjct: 117 ESNYIVSTIAGSTTFGFIDGNGLDARFRNPDGILVDMNG--DIIITDRTNHSIR----KM 170
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
+ VV LAG GV GY++G+PG +F+ P +D GNI V +K IRKI G V
Sbjct: 171 TTAGVVSTLAGTGVSGYANGKPG--QFNTPWQSTMDAAGNIIVIEKDGGRIRKIAPDGAV 228
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ IAG GS G DG A F++ + V + ++D N IR+I
Sbjct: 229 SLIAGTGS--LGFTDGNVSVARFNHALD-GVVDSEGNIFVADRNNYRIRKI 276
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 53/103 (51%), Gaps = 7/103 (6%)
Query: 42 QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA 101
Q S +D G+ +I D R ++ + V +AG G G++DG A
Sbjct: 194 QFNTPWQSTMDAAGNIIVIEKDGGRIR------KIAPDGAVSLIAGTGSLGFTDGNVSVA 247
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
RF+ VD +GNI+VAD++N+ IRKIT G V+T G G+
Sbjct: 248 RFNHALDGVVDSEGNIFVADRNNYRIRKITPAGMVSTFMGDGT 290
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 33/52 (63%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ V +AG+G G DG +A F++P +D KG+IYVAD NH IRKI
Sbjct: 385 KATVSTIAGNGTAGRGDGLGHAATFNQPYDVVMDAKGDIYVADNVNHSIRKI 436
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 62/125 (49%), Gaps = 15/125 (12%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-- 130
+++ V+ + G +GY DG PG + + D +GNI +AD SN+ IRKI
Sbjct: 319 ITKSGVMTTIVGS-TKGYDDGTPGNPLTVKLGDIFGLNFDNEGNILLADASNNRIRKITP 377
Query: 131 ------TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
T V+TIAG G+ GR DG A+F+ +++ + ++D+ N IR
Sbjct: 378 GVGNDWTKATVSTIAGNGT--AGRGDGLGHAATFNQPYDVVMDAK-GDIYVADNVNHSIR 434
Query: 185 QINLK 189
+I K
Sbjct: 435 KIIYK 439
>gi|301064161|ref|ZP_07204608.1| RHS repeat-associated core domain protein [delta proteobacterium
NaphS2]
gi|300441781|gb|EFK06099.1| RHS repeat-associated core domain protein [delta proteobacterium
NaphS2]
Length = 2050
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 58/110 (52%), Gaps = 3/110 (2%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTT 137
++ +AG G G+S DG P A P A+D GN+Y+ D N+ +RK+ TN +TT
Sbjct: 804 IITTVAGSGDYGFSGDGGPAIEASLSFPMGIAIDSAGNLYILDSDNNRVRKVDTNGIITT 863
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+AG G+ GPA AS S+ + L ISD GN IR+++
Sbjct: 864 VAGNGNWSYNGDGGPAVEASLSSAASGIAIDSAGNLYISDTGNYCIRKVD 913
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 56/110 (50%), Gaps = 4/110 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTT 137
++ +AG+GV G+S DG P A A+D GN+Y+ D SNH +RK+ TN +TT
Sbjct: 917 IITTVAGNGVAGFSGDGGPAVEASLGWAMGIAIDSAGNLYILDGSNHRVRKVDTNGIITT 976
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+AG GPA AS + + L ISD N IR+++
Sbjct: 977 VAGSDDYGFSGDGGPAIEASLGYAVGIA-IDSAENLYISDSSNHCIRRVD 1025
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 64/136 (47%), Gaps = 9/136 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG--SARFDKPKSFAVD 112
+ +L +LDS + + ++ +AG+G Y+ DG P ++ A+D
Sbjct: 839 AGNLYILDSDNNRVRKVD----TNGIITTVAGNGNWSYNGDGGPAVEASLSSAASGIAID 894
Query: 113 MKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC 171
GN+Y++D N+ IRK+ TN +TT+AG G GPA AS + +
Sbjct: 895 SAGNLYISDTGNYCIRKVDTNGIITTVAGNGVAGFSGDGGPAVEASLGWAMGIA-IDSAG 953
Query: 172 ALLISDHGNQLIRQIN 187
L I D N +R+++
Sbjct: 954 NLYILDGSNHRVRKVD 969
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 4/110 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTT 137
V+ +AG+G Y+ DG P A A D GN+++AD N+ IRK+ TN +TT
Sbjct: 748 VLTTVAGNGNWSYNGDGGPAIEASLRSSYGIATDSAGNLHIADWGNNRIRKVDTNGIITT 807
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+AG G GPA AS S + + L I D N +R+++
Sbjct: 808 VAGSGDYGFSGDGGPAIEASLSFPMGIA-IDSAGNLYILDSDNNRVRKVD 856
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 57/126 (45%), Gaps = 7/126 (5%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
++ +AG G+S DG P A A+D N+Y++D SNH IR++ G + T
Sbjct: 973 IITTVAGSDDYGFSGDGGPAIEASLGYAVGIAIDSAENLYISDSSNHCIRRVDTGGIIAT 1032
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK---PEDCS 194
+AG G PA+ + + V L ISD N+ I ++ + P S
Sbjct: 1033 VAGNGIYGSIEDGVPAKKSPLGYPVGVA-VDTAGNLYISDEANERILKVGVPHNLPIPSS 1091
Query: 195 KSSQSG 200
S ++G
Sbjct: 1092 ASMETG 1097
>gi|380693812|ref|ZP_09858671.1| hypothetical protein BfaeM_07498 [Bacteroides faecis MAJ27]
Length = 456
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 52/100 (52%), Gaps = 2/100 (2%)
Query: 88 DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKE 146
+G GY DG G A F+KP VD +GNI + D NH IRKIT + G + G +
Sbjct: 355 EGKSGYLDGPIGQAMFNKPGQMCVDSEGNIILTDTENHCIRKITMSTGYVSTLAGKPQNS 414
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +G A++A F + + + + I D N+ IR++
Sbjct: 415 GYVNGSAEDAQFKKPLGIC-IDNDDVMYIGDSENRAIRRL 453
>gi|290975761|ref|XP_002670610.1| predicted protein [Naegleria gruberi]
gi|284084171|gb|EFC37866.1| predicted protein [Naegleria gruberi]
Length = 308
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 59/112 (52%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G +A+F+ P S V K +Y+ D SNH IRKI N
Sbjct: 39 ENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITDYSNHRIRKILENGN 98
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+ +GPA NA N + FV + I+D N IR+I
Sbjct: 99 IITIAGNGTVGFSGDNGPATNAQLYNPSSV-FVSSNNEVYIADFCNHRIRKI 149
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 58/112 (51%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G +A+F+ P S V K +Y+ D SNH IRKI N
Sbjct: 151 ENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITDYSNHRIRKILENGN 210
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+ +GPA NA N + FV + +D N IR+I
Sbjct: 211 IITIAGNGTVGFSGDNGPATNAQLYNPSSV-FVSSNNEVYFTDQHNNRIRKI 261
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G +A+ P S V +Y+AD NH IRKI N
Sbjct: 95 ENGNIITIAGNGTVGFSGDNGPATNAQLYNPSSVFVSSNNEVYIADFCNHRIRKILENGN 154
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+ +GPA NA F+ + FV + I+D+ N IR+I
Sbjct: 155 IVTIAGNGNYGFSGDNGPATNAQFNYPCSV-FVSSKNEVYITDYSNHRIRKI 205
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 45/87 (51%), Gaps = 2/87 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
A+ P S V +Y+AD NH IRKI N + TIAG G+ +GPA NA F+
Sbjct: 8 AQLFNPSSVFVSSNNEVYIADFCNHRIRKILENGNIVTIAGNGNYGFSGDNGPATNAQFN 67
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
+ FV + I+D+ N IR+I
Sbjct: 68 YPCSV-FVSSKNEVYITDYSNHRIRKI 93
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 50/103 (48%), Gaps = 4/103 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G +A+ P S V +Y D+ N+ IRKI N
Sbjct: 207 ENGNIITIAGNGTVGFSGDNGPATNAQLYNPSSVFVSSNNEVYFTDQHNNRIRKILENGN 266
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISD 177
+ TIAG G+ +GPA NA N + FV + I+D
Sbjct: 267 IITIAGNGNYGFSGDNGPATNAQLYNPNSV-FVSSNNEVYITD 308
>gi|351729452|ref|ZP_08947143.1| NHL repeat-containing protein [Acidovorax radicis N35]
Length = 659
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 42/69 (60%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
V ++DG+ +ARF +PK+ AVD GN+YVAD NH++R+I+ G T G G
Sbjct: 44 VSNHADGDGPAARFYQPKAVAVDTAGNMYVADTENHLVRRISPTGAVTTLAGKPGVCGNQ 103
Query: 150 DGPAQNASF 158
DG + A+F
Sbjct: 104 DGTSDTATF 112
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 53/111 (47%), Gaps = 9/111 (8%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH-------VIRKITNL 133
V LAG G G++DG+ +A+F + A D GN+YV D H IRKIT
Sbjct: 482 VTLLAGSLGTVGHADGQGAAAQFSALGNMATDAAGNVYVVDGLFHEVNKIGPTIRKITPS 541
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
G V+T+AG + G ADG A F+ D + AL + GN I
Sbjct: 542 GMVSTLAGNPAAAPGYADGTGAAAVFTVDLGFRYPQQTAALAVDAQGNVYI 592
Score = 43.9 bits (102), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 46/106 (43%), Gaps = 10/106 (9%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG--GSKKEGRADGPAQNASFSNDFE 163
P A D G +Y AD ++HV+R+ G T+ G G +G DG A A F
Sbjct: 179 PVHLATDSTGALYAADPNDHVVRRYATDGQATVVSGTVGPNNQGTTDGTASQAKFGAFKA 238
Query: 164 LTFVPHICALLISDHGNQLIRQIN--------LKPEDCSKSSQSGS 201
+ F ++ + G IR+I ++ + C++S GS
Sbjct: 239 MAFDAADRLFVLDNTGLPAIRRIGSDGSATTLVRAQGCTESRAPGS 284
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 11/67 (16%)
Query: 76 LSEESVVKRLAGD--GVQGYSDGEPGSARF---------DKPKSFAVDMKGNIYVADKSN 124
++ +V LAG+ GY+DG +A F + + AVD +GN+Y+ D ++
Sbjct: 538 ITPSGMVSTLAGNPAAAPGYADGTGAAAVFTVDLGFRYPQQTAALAVDAQGNVYITDGAH 597
Query: 125 HVIRKIT 131
HV+R+IT
Sbjct: 598 HVVRRIT 604
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 30/61 (49%), Gaps = 2/61 (3%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
P ++V LA G DG+ GSARFD P + A G +YV D N +R + G
Sbjct: 321 PQGTSTLVAGLAS--APGADDGQGGSARFDAPGALAWSPSGTLYVRDDGNRSLRTVMPDG 378
Query: 135 V 135
V
Sbjct: 379 V 379
Score = 38.5 bits (88), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 9/68 (13%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD--------KSNHV 126
+S V LAG GV G DG +A F +P S AVD G +YVA+ + +
Sbjct: 84 ISPTGAVTTLAGKPGVCGNQDGTSDTATFCRPSSIAVDKAGTVYVAETRPSSTTLSAANP 143
Query: 127 IRKITNLG 134
IRKI+ G
Sbjct: 144 IRKISTTG 151
>gi|222056335|ref|YP_002538697.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
gi|221565624|gb|ACM21596.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
Length = 372
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 43/118 (36%), Positives = 59/118 (50%), Gaps = 10/118 (8%)
Query: 76 LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITN 132
L+ V LAG Q G +DG +ARF+ P D N+YVAD N++IRK IT
Sbjct: 44 LTLTGTVSVLAGQAPQMGTADGTGSAARFNAPSGITTDGT-NLYVADTGNNLIRKVVITT 102
Query: 133 LGVTTIAG----GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VTT+AG G ++ G DG A F+ F +T L ++D N IR++
Sbjct: 103 GAVTTLAGTVGTGTAQTSGSTDGTGSAAKFNAPFAIT--TDGTNLYVADTNNNTIRKV 158
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 67/152 (44%), Gaps = 10/152 (6%)
Query: 40 GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV---QGYSDG 96
G N S I G+ +L V D+ + + + + G G G +DG
Sbjct: 67 GSAARFNAPSGITTDGT-NLYVADTGNNLIRKVVITTGAVTTLAGTVGTGTAQTSGSTDG 125
Query: 97 EPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPAQ 154
+A+F+ P FA+ G N+YVAD +N+ IRK+ G T G G ADG
Sbjct: 126 TGSAAKFNAP--FAITTDGTNLYVADTNNNTIRKVVIATGTVTTLAGSVGIPGSADGIGP 183
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
F++ +T L +SD GN+ IR++
Sbjct: 184 AGLFNSPGGIT--TDGTNLYVSDTGNRTIRKV 213
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 50/164 (30%), Positives = 71/164 (43%), Gaps = 14/164 (8%)
Query: 33 TVTTV-----IDGHQLEINPHSVIDRPG--SSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
TVTT+ I G I P + + PG ++D L S + T+ + V L
Sbjct: 164 TVTTLAGSVGIPGSADGIGPAGLFNSPGGITTDGTNLYVSDTGNRTIRKVVIATGAVTTL 223
Query: 86 AGD-GVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKIT-NLGVTTIAGGG 142
AG G G +DG SA F F + G +++VAD N IRKI G+ T G
Sbjct: 224 AGSAGTPGSTDGVGPSALFGTV--FGITTDGTSLFVADTDNSTIRKIVIATGMVTTLAGS 281
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ G ADG A F+ F +T L ++D IR++
Sbjct: 282 AGVSGIADGTGSTAKFNAPFGIT--TDGTNLYVTDSRQGSIRKV 323
>gi|168701257|ref|ZP_02733534.1| NHL repeat containing protein [Gemmata obscuriglobus UQM 2246]
Length = 358
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 45/111 (40%), Positives = 58/111 (52%), Gaps = 7/111 (6%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTT 137
++R AG G +GY+ DG A FD PK +D GN++V D N VIRKI VTT
Sbjct: 248 IERFAGTGKKGYTGDGTKALDATFDGPKEIDIDKDGNVFVVDTENEVIRKIDAKSGVVTT 307
Query: 138 IAGGG-SKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
IAG G +K G D GPA A+ + P AL I D + IR++
Sbjct: 308 IAGKGRTKTPGLGDNGPATGATLGRPHGVAVGPD-GALYIGDTNSHRIRKV 357
Score = 40.8 bits (94), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 58/113 (51%), Gaps = 5/113 (4%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG-VT 136
V+ G G +G++ DG P + A D+P A D GN+Y +D NH++RK+ G +T
Sbjct: 18 VITSPVGSGKKGFAGDGGPAAKAELDQPFDVAFDKAGNLYFSDTFNHLVRKVDAKTGTIT 77
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
T+AG G K G G A AS + + + L I D N +R+++ K
Sbjct: 78 TVAGNGRKGFGGDGGKATEASLNEPYGIELDAD-GNLYIVDRLNFCVRKVDGK 129
>gi|196229731|ref|ZP_03128595.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
gi|196226057|gb|EDY20563.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
Length = 360
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 64/132 (48%), Gaps = 7/132 (5%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--- 130
++ + + AG G +GYS DG P A F+ P D GN Y+AD +NH IR++
Sbjct: 74 VTPDGKIHTAAGIGQKGYSGDGGPALEATFNLPHEIRFDRAGNYYIADMANHAIRRVDAK 133
Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
T + +TT AG G GPA A + F P +L I D GN IR++++
Sbjct: 134 TKI-ITTFAGTGKPGYSGDGGPAAQAQLKQPHSIQFGPD-GSLYICDVGNNCIRKVDMTT 191
Query: 191 EDCSKSSQSGSA 202
S + +G A
Sbjct: 192 GTISTFAGTGKA 203
Score = 44.3 bits (103), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 64/143 (44%), Gaps = 10/143 (6%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARF 103
PH + DR G + + D + A + ++ ++ AG G GYS DG P + A+
Sbjct: 106 PHEIRFDRAG--NYYIADMANHAIRRVD---AKTKIITTFAGTGKPGYSGDGGPAAQAQL 160
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
+P S G++Y+ D N+ IRK+ T ++T AG G + P +
Sbjct: 161 KQPHSIQFGPDGSLYICDVGNNCIRKVDMTTGTISTFAGTGKAGDTPDGSPIEGTPLKGP 220
Query: 162 FELTFVPHICALLISDHGNQLIR 184
+ F L++ GNQ+++
Sbjct: 221 RSMDFDKEGNLWLVTREGNQVLK 243
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 9/92 (9%)
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADK 122
+R L F ++ ++ AG G +G++ +G P A PK AVD +GN+++AD
Sbjct: 235 TREGNQVLKFD-AKTGIISIAAGTGKKGFTGNGGPALEATLSGPKGIAVDAQGNVWLADT 293
Query: 123 SNHVIRKIT--NLGVTTIAGGGSKKEGRADGP 152
+H IR+I + +AG G R DGP
Sbjct: 294 ESHSIREINAKTGAIELVAGDGQ----RGDGP 321
>gi|111219920|ref|YP_710714.1| serine/threonine protein kinase [Frankia alni ACN14a]
gi|111147452|emb|CAJ59102.1| Putative serine/threonine protein kinase [Frankia alni ACN14a]
Length = 725
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 49/86 (56%), Gaps = 2/86 (2%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
+KP + AVD GN+Y+ DKSN +R++ G VTT+AG G + GPA A ++
Sbjct: 395 LNKPDAMAVDTAGNLYIVDKSNQRVRRVDRDGVVTTVAGNGIRGFTGDGGPAIRAELADP 454
Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
+ V + ISD GNQ +R++N
Sbjct: 455 AGIA-VDAAGDIYISDQGNQRVRRVN 479
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 61/126 (48%), Gaps = 15/126 (11%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + VV +AG+G++G++ DG P A P AVD G+IY++D+ N +R++
Sbjct: 422 VDRDGVVTTVAGNGIRGFTGDGGPAIRAELADPAGIAVDAAGDIYISDQGNQRVRRVNPA 481
Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELT-----------FVPHICALLISDHGNQ 181
GV TT AG G +GP + FS D L +V + I D N
Sbjct: 482 GVITTFAGTGVFGFSGENGP-KIGGFSGDGVLARQAMLDEPSALWVDRAGNVYICDGSND 540
Query: 182 LIRQIN 187
IR+++
Sbjct: 541 RIRKVD 546
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 1/89 (1%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
+AR P+S AVD+ G +YV D+ N+ +R+I G +TT AG G+ GPA A+
Sbjct: 571 AARLQWPESLAVDVAGTMYVTDQGNNRVRRIDTHGIITTAAGTGTMGFSGDGGPATRAAI 630
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
V + ++D IR+I+
Sbjct: 631 HTVGAGVTVDDAGNIYLADPQVNRIRRID 659
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 58/127 (45%), Gaps = 17/127 (13%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG--------------SARFDKPKSFAVDMKGNIYVAD 121
++ V+ AG GV G+S GE G A D+P + VD GN+Y+ D
Sbjct: 478 VNPAGVITTFAGTGVFGFS-GENGPKIGGFSGDGVLARQAMLDEPSALWVDRAGNVYICD 536
Query: 122 KSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN 180
SN IRK+ GV +T+ G G G GPA A L V + ++D GN
Sbjct: 537 GSNDRIRKVDTAGVISTVVGSGGDGFGGDGGPATAARLQWPESLA-VDVAGTMYVTDQGN 595
Query: 181 QLIRQIN 187
+R+I+
Sbjct: 596 NRVRRID 602
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 53/119 (44%), Gaps = 6/119 (5%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+ ++ AG G G+S DG P + A VD GNIY+AD + IR+I
Sbjct: 601 IDTHGIITTAAGTGTMGFSGDGGPATRAAIHTVGAGVTVDDAGNIYLADPQVNRIRRIDT 660
Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
G +TTIAG G A +A+ S L F + LL+S+ I I L P
Sbjct: 661 HGIITTIAGTGVAGNAGNGQRALDAAISYPSSLLF--YRGYLLVSEVNGSRILAIALGP 717
>gi|153867703|ref|ZP_01997987.1| NHL repeat protein [Beggiatoa sp. SS]
gi|152144978|gb|EDN72014.1| NHL repeat protein [Beggiatoa sp. SS]
Length = 263
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 64/112 (57%), Gaps = 4/112 (3%)
Query: 86 AGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
AGDG G++ DG P +A + + P +D GN+Y+AD NH IRKI + G ++T+AG G
Sbjct: 11 AGDGNLGFAGDGGPATAAQLNSPLGMGIDSAGNLYIADTENHRIRKIDSSGNISTVAGDG 70
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
++ DG A ++ N+ + L I+D GN ++R++++ + S
Sbjct: 71 NRGY-SGDGAAAVSAKLNNPMWVSLDSAGNLYIADTGNNVVRKLDIASGNIS 121
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 55/110 (50%), Gaps = 8/110 (7%)
Query: 82 VKRLAGD---GVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VT 136
+ +AGD GV G DG P +A + P +D GN+Y+AD SNH IRK+ G +T
Sbjct: 120 ISTVAGDSSVGVAG--DGGPATAAQLSYPTGIDIDTAGNLYIADSSNHRIRKVDTTGNIT 177
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
TIAG G+ DG A+ N V + I+D N IR++
Sbjct: 178 TIAGDGTPGF-AGDGKIATAAQLNAPTQVMVDSTGQVYIADTSNHRIRKV 226
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 3/61 (4%)
Query: 82 VKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
+ +AGDG G++ DG+ +A+ + P VD G +Y+AD SNH IRK+ T+ +TT+
Sbjct: 176 ITTIAGDGTPGFAGDGKIATAAQLNAPTQVMVDSTGQVYIADTSNHRIRKVSTDNIITTM 235
Query: 139 A 139
A
Sbjct: 236 A 236
>gi|290995070|ref|XP_002680154.1| predicted protein [Naegleria gruberi]
gi|284093774|gb|EFC47410.1| predicted protein [Naegleria gruberi]
Length = 636
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 61/114 (53%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ ++ +AG G GYS DG SA+ + P A+ G IY+AD +N+ IRKIT
Sbjct: 401 ITTSGIISTIAGTGTSGYSGDGSSATSAQLNSPYGIAISSSGEIYIADTNNNRIRKITTS 460
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++TIAG G++ A +A N + + + + ++D+ N IR+I
Sbjct: 461 GIISTIAGTGTQGYSGDGSSATSAQLYNPYGVA-ISSRGEIYVADYNNNRIRKI 513
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 60/114 (52%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ ++ +AG G QGYS DG SA+ P A+ +G IYVAD +N+ IRKIT
Sbjct: 457 ITTSGIISTIAGTGTQGYSGDGSSATSAQLYNPYGVAISSRGEIYVADYNNNRIRKITTS 516
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++TIAG G+ A +A N + + + + I+D N IR++
Sbjct: 517 GIISTIAGTGTSGYSGDGSSAISAQLYNPYGVA-ISSSDEIYITDTNNNRIRKL 569
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 16/144 (11%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ ++ +AG+G GYS DG SA+ + P A+ IY+AD N+ IRKIT
Sbjct: 239 ITTSGIISTIAGNGTAGYSGDGSSATSAQLNSPSGIAISSSDEIYIADMFNNRIRKITTS 298
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
G ++TIAG G+ G + + S + +L F P+ A + I+D N IR+I
Sbjct: 299 GIISTIAGTGTS------GYSGDGSSATSIQLYF-PYGVAVSLSDEIYIADMFNNRIRKI 351
Query: 187 NLKPEDCSKSSQSGSALGAVSVWV 210
+ + G L A + ++
Sbjct: 352 TTSGIISTIAGGIGDGLSATTAYI 375
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 56/117 (47%), Gaps = 12/117 (10%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ ++ +AG+G GYS SA+ P A+ Y+AD +NH IRKIT G
Sbjct: 129 ITTSGIISTIAGNGTAGYSGDVATSAKLYYPSGIAISSSDETYIADTNNHRIRKITTSGI 188
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQI 186
++TIAG G+ G + + S + +L + + + I D N IR+I
Sbjct: 189 ISTIAGNGTA------GYSGDGSSAKSAQLYYPSGVAISSSDEIYIVDRSNNRIRKI 239
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ ++ +AG+G GYS DG SA+ P A+ IY+ D+SN+ IRKIT
Sbjct: 183 ITTSGIISTIAGNGTAGYSGDGSSAKSAQLYYPSGVAISSSDEIYIVDRSNNRIRKITTS 242
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++TIAG G+ DG + ++ N + + I+D N IR+I
Sbjct: 243 GIISTIAGNGTAGY-SGDGSSATSAQLNSPSGIAISSSDEIYIADMFNNRIRKI 295
Score = 50.1 bits (118), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 59/123 (47%), Gaps = 24/123 (19%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ ++ +AG+G GYS DG SA+ P A+ IY+ D+SN+ IRKIT
Sbjct: 73 ITTSGIISTIAGNGTAGYSGDGSSAKSAQLYYPSGVAISSSDEIYIVDRSNNRIRKITTS 132
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELT---FVPHICAL------LISDHGNQLI 183
G ++TIAG G+ A +S D + + P A+ I+D N I
Sbjct: 133 GIISTIAGNGT------------AGYSGDVATSAKLYYPSGIAISSSDETYIADTNNHRI 180
Query: 184 RQI 186
R+I
Sbjct: 181 RKI 183
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 54/112 (48%), Gaps = 6/112 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ ++ +AG G DG + + +F G IY+AD +NH IRKIT G
Sbjct: 351 ITTSGIISTIAG----GIGDGLSATTAYINAITFEFSSSGEIYIADTNNHRIRKITTSGI 406
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
++TIAG G+ A +A ++ + + + + I+D N IR+I
Sbjct: 407 ISTIAGTGTSGYSGDGSSATSAQLNSPYGIA-ISSSGEIYIADTNNNRIRKI 457
Score = 40.0 bits (92), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 44/93 (47%), Gaps = 12/93 (12%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
SA+ P A+ Y+AD +NH IRKIT G ++TIAG G+ G + + S
Sbjct: 43 SAKLYYPSGIAISSSDETYIADTNNHRIRKITTSGIISTIAGNGTA------GYSGDGSS 96
Query: 159 SNDFELTFVPHIC-----ALLISDHGNQLIRQI 186
+ +L + + + I D N IR+I
Sbjct: 97 AKSAQLYYPSGVAISSSDEIYIVDRSNNRIRKI 129
>gi|116624994|ref|YP_827150.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116228156|gb|ABJ86865.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 912
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 62/108 (57%), Gaps = 5/108 (4%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ LAG GV GYS DG P SA+ + P++ AVD GN+YVAD +N+ +RKI G +TT+
Sbjct: 511 ISTLAGTGVSGYSGDGGPATSAQLNGPQAVAVDGSGNVYVADTANNRVRKIGPTGLITTV 570
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG G GPA +A N L + + I+D G+ +R++
Sbjct: 571 AGNGIGGFSGDGGPATSAQVGNPNGLAL-DSVGNVFITD-GSARVRKL 616
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 50/105 (47%), Gaps = 3/105 (2%)
Query: 87 GDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
GDG QG++ G P P S VD GN+Y AD N+ IRK++ V T AG G
Sbjct: 294 GDGTQGFAGDGGAPNKVEMSLPTSVQVDSSGNLYFADSLNNRIRKLSGGNVNTYAGNGIV 353
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G A NA + + V L +SD N L+R+++ K
Sbjct: 354 ARSGDGGAATNAQLNTPLGVA-VDAAGNLYVSDTLNNLVRRVDTK 397
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 59/109 (54%), Gaps = 4/109 (3%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTT 137
V+ +AG G G++ G SA+ + P S A+D GN+Y+A+ SN+ IRK+ TN ++T
Sbjct: 454 VMSTVAGSGTSGFAGDGGAATSAQLNAPFSVALDAAGNLYIAEFSNNRIRKVATNGNIST 513
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+AG G GPA +A N + V + ++D N +R+I
Sbjct: 514 LAGTGVSGYSGDGGPATSAQL-NGPQAVAVDGSGNVYVADTANNRVRKI 561
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 45/85 (52%), Gaps = 1/85 (1%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
+ + P+ AVD GN+Y+AD NH +RK++ ++T+AG G+ G A +A +
Sbjct: 422 QLNNPQGLAVDSAGNLYIADTQNHRVRKVSGGVMSTVAGSGTSGFAGDGGAATSAQLNAP 481
Query: 162 FELTFVPHICALLISDHGNQLIRQI 186
F + L I++ N IR++
Sbjct: 482 FSVAL-DAAGNLYIAEFSNNRIRKV 505
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 67/133 (50%), Gaps = 10/133 (7%)
Query: 76 LSEESVVKRLAGDG---VQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + ++ AG+G V G+ G A P + AVD N+Y+A +++ +R++
Sbjct: 113 VNPQGIISTFAGNGNVSVPGFWGDSGAATDASIHLPVAIAVDSSNNVYIAASADNTVRRV 172
Query: 131 TNLGVTTI-AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
T G+ I AG G K G A A + ++TF P LLI+D GN +IRQ+
Sbjct: 173 TTDGIINIFAGAGYKGYYGDAGAAGLAGLTGPQDITFGPK-GVLLIADTGNAVIRQVG-- 229
Query: 190 PEDCSKSSQSGSA 202
D S+ SG+A
Sbjct: 230 -TDGVISTVSGNA 241
Score = 43.9 bits (102), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 5/117 (4%)
Query: 32 YTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS--FPLSEESVVKRLAGDG 89
YT++TV G ++ G I L F + + F + + +AG+
Sbjct: 11 YTISTVAGGAPPPTPVAALSTSIGQPRKIALSGGNMYFSSGNSVFKIDGSGTLTLVAGNS 70
Query: 90 VQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
G+S DG P +A+ + P+ A+D GN+Y+AD N+ +RK+ G ++T AG G+
Sbjct: 71 RAGFSGDGGPAVNAQLNSPQGVALDSAGNLYIADSQNNRVRKVNPQGIISTFAGNGN 127
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 82 VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTI 138
V AG+G+ S G +A+ + P AVD GN+YV+D N+++R++ GV TT
Sbjct: 344 VNTYAGNGIVARSGDGGAATNAQLNTPLGVAVDAAGNLYVSDTLNNLVRRVDTKGVITTF 403
Query: 139 AG 140
AG
Sbjct: 404 AG 405
Score = 37.0 bits (84), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 44/95 (46%), Gaps = 1/95 (1%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
SA+ P A+D GN+++ D S V + + ++TIAGGG++ G A A +
Sbjct: 587 SAQVGNPNGLALDSVGNVFITDGSARVRKLFISGIISTIAGGGNRGYSGDGGNAFAAQLN 646
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
L + AL ++D N +R + + S
Sbjct: 647 GPSGLA-INSTGALFVADALNNAVRMLQISASGIS 680
>gi|290972086|ref|XP_002668792.1| predicted protein [Naegleria gruberi]
gi|284082314|gb|EFC36048.1| predicted protein [Naegleria gruberi]
Length = 679
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 57/112 (50%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G +A+F+ P S V K +Y+ D SNH IRKI N
Sbjct: 39 ENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITDYSNHSIRKILENGN 98
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+ GPA NA N + FV + +D N IR+I
Sbjct: 99 IITIAGNGTVGFSGDSGPATNAQLYNPSSV-FVSSKNEVYFTDQHNNRIRKI 149
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 57/112 (50%), Gaps = 5/112 (4%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G +A+ P S V +Y+ D SNH IRKI N
Sbjct: 151 ENGNIITIAGNGTYGFSGDNGPATNAQLYNPYSVFVSSNNEVYITDYSNHRIRKILENGN 210
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+ +GPA NA N FV + + ISD NQ IR+I
Sbjct: 211 IVTIAGNGNYGFSGDNGPATNAQL-NRPNSVFVSN-NEVYISDQSNQRIRKI 260
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 57/112 (50%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S G +A+ P S V K +Y D+ N+ IRKI N
Sbjct: 95 ENGNIITIAGNGTVGFSGDSGPATNAQLYNPSSVFVSSKNEVYFTDQHNNRIRKILENGN 154
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+ +GPA NA N + + FV + I+D+ N IR+I
Sbjct: 155 IITIAGNGTYGFSGDNGPATNAQLYNPYSV-FVSSNNEVYITDYSNHRIRKI 205
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 60/112 (53%), Gaps = 6/112 (5%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G +A+ ++P S V +Y++D+SN IRKI N
Sbjct: 207 ENGNIVTIAGNGNYGFSGDNGPATNAQLNRPNSVFVS-NNEVYISDQSNQRIRKILENGN 265
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+ +GPA NA N FV + + ISD NQ IR+I
Sbjct: 266 IITIAGNGNYGFSGDNGPATNAQL-NRPNSVFVSN-NEVYISDQSNQRIRKI 315
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 45/87 (51%), Gaps = 2/87 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
A+ P S V +Y+AD NH IRKI N + TIAG G+ +GPA NA F+
Sbjct: 8 AQLFNPSSVFVSSNNEVYIADFCNHRIRKILENGNIVTIAGNGNYGFSGDNGPATNAQFN 67
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
+ FV + I+D+ N IR+I
Sbjct: 68 YPCSV-FVSSKNEVYITDYSNHSIRKI 93
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 52/104 (50%), Gaps = 18/104 (17%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G +A+ ++P S V +Y++D+SN IRKI N
Sbjct: 262 ENGNIITIAGNGNYGFSGDNGPATNAQLNRPNSVFVS-NNEVYISDQSNQRIRKILENGN 320
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND--FELTFVPHICALLIS 176
+ TIAG G+ A FS D F + PHI L++
Sbjct: 321 IVTIAGNGT------------AGFSGDSPFNIIMYPHIGNKLLT 352
>gi|290982061|ref|XP_002673749.1| predicted protein [Naegleria gruberi]
gi|284087335|gb|EFC41005.1| predicted protein [Naegleria gruberi]
Length = 185
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 62/126 (49%), Gaps = 6/126 (4%)
Query: 74 FPLSEESVVKRLAGDGV-QGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIR--- 128
F + V + +AG+G+ Q DG G+ A + P+ A+DM GN+Y+A++ H IR
Sbjct: 16 FTSHAQYVTQTVAGNGISQSLGDGFLGTQASLNSPRDVALDMYGNMYIAEEKAHKIRFIN 75
Query: 129 KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
K T + ++T+ G G G A NA F L L ISD GN +IR IN+
Sbjct: 76 KTTGI-ISTLVGNGIAGSVGDGGLAINAQLHLPFSLAIDSTGQNLFISDSGNHVIRHINM 134
Query: 189 KPEDCS 194
S
Sbjct: 135 TTRIIS 140
>gi|116620673|ref|YP_822829.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223835|gb|ABJ82544.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 912
Score = 59.7 bits (143), Expect = 1e-06, Method: Composition-based stats.
Identities = 49/188 (26%), Positives = 87/188 (46%), Gaps = 10/188 (5%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTL---SFPLS 77
L+ G + Y ++T+ G + G + +D++ + +++ F +
Sbjct: 10 LLFCGSVWAQQYVISTIAGGAAPPTPVAAAKASVGDPTRVAVDAAGNVYFSSLHSIFKVD 69
Query: 78 EESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+ R AG+G G S G+ SA+ P AVD GNI+VAD+ +V+R+I G
Sbjct: 70 SSGTMTRFAGNGRPGNSGDGGQASSAQLLFPMGLAVDAAGNIFVADRDANVVRRIATTGI 129
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
+ T+AG G+ GPA A + F + V + I+D N ++R++ D +
Sbjct: 130 IQTVAGNGTPGYQGDGGPATTAQLNAPFAVA-VDGQGNVFIADTNNVVVRRVT---PDGT 185
Query: 195 KSSQSGSA 202
S+ +GS
Sbjct: 186 ISTYAGSG 193
Score = 58.9 bits (141), Expect = 2e-06, Method: Composition-based stats.
Identities = 37/115 (32%), Positives = 65/115 (56%), Gaps = 4/115 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + V+ +AG G G+S G +A+ ++P+ AVD +GN+YVAD N+ IRKI L
Sbjct: 524 VTTDGVIHTIAGFGTPGFSGDGGAATAAKLNRPRGVAVDAQGNVYVADTGNNRIRKIDPL 583
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G ++T+AG GS + DG A + D V + +++ G+ +R+++
Sbjct: 584 GNISTVAGDGSTEFIPGDGIATQQGLT-DPRGVAVDRAGNIYVAETGHNRVRRVS 637
Score = 58.2 bits (139), Expect = 4e-06, Method: Composition-based stats.
Identities = 39/114 (34%), Positives = 66/114 (57%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ +++ +AG+G GY DG P + A+ + P + AVD +GN+++AD +N V+R++T
Sbjct: 124 IATTGIIQTVAGNGTPGYQGDGGPATTAQLNAPFAVAVDGQGNVFIADTNNVVVRRVTPD 183
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T AG G++ G A+NA F E V L I+D N IR++
Sbjct: 184 GTISTYAGSGARGFAGDGGAARNAWFDGP-EGVAVDANGVLYIADTFNGRIRRV 236
Score = 50.8 bits (120), Expect = 6e-04, Method: Composition-based stats.
Identities = 39/116 (33%), Positives = 62/116 (53%), Gaps = 7/116 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ + + AG G +G++ G+ G+AR FD P+ AVD G +Y+AD N IR++
Sbjct: 180 VTPDGTISTYAGSGARGFA-GDGGAARNAWFDGPEGVAVDANGVLYIADTFNGRIRRVAA 238
Query: 133 LG-VTTIAGGGSKKE-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +TT AG GS G +GP +A+ S ++ V I+D GN +R +
Sbjct: 239 DGTITTAAGVGSTGVFGGDNGPPASAALSLPTDVA-VDRSGNPYIADFGNSRVRMV 293
Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats.
Identities = 39/105 (37%), Positives = 56/105 (53%), Gaps = 4/105 (3%)
Query: 86 AGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAGGG 142
AG+G Y DG P + A ++P+ A D GN Y+AD ++V+RK+T GV TIAG G
Sbjct: 478 AGNGNASYFGDGLPATQASVNQPEGVAGDAAGNTYIADTFDNVVRKVTTDGVIHTIAGFG 537
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ DG A A+ N V + ++D GN IR+I+
Sbjct: 538 TPGFS-GDGGAATAAKLNRPRGVAVDAQGNVYVADTGNNRIRKID 581
Score = 47.0 bits (110), Expect = 0.007, Method: Composition-based stats.
Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 2/61 (3%)
Query: 85 LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
LAG+G+ + DG + A+FD P A+D GN+ + D N +RKI+ +TTIAG G
Sbjct: 366 LAGNGLASFGGDGAAATTAQFDTPTGVAIDAGGNVLIVDSQNQRLRKISRGVITTIAGTG 425
Query: 143 S 143
+
Sbjct: 426 T 426
Score = 46.6 bits (109), Expect = 0.011, Method: Composition-based stats.
Identities = 28/73 (38%), Positives = 34/73 (46%), Gaps = 9/73 (12%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG-------- 140
GV G +G P SA P AVD GN Y+AD N +R + N +TT+AG
Sbjct: 252 GVFGGDNGPPASAALSLPTDVAVDRSGNPYIADFGNSRVRMVANGVITTVAGRNNGAPIV 311
Query: 141 -GGSKKEGRADGP 152
G R DGP
Sbjct: 312 EGEEAVNARLDGP 324
Score = 43.5 bits (101), Expect = 0.10, Method: Composition-based stats.
Identities = 39/129 (30%), Positives = 60/129 (46%), Gaps = 8/129 (6%)
Query: 82 VKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ +AGDG + DG P+ AVD GNIYVA+ ++ +R+++ G +TTI
Sbjct: 586 ISTVAGDGSTEFIPGDGIATQQGLTDPRGVAVDRAGNIYVAETGHNRVRRVSTGGTITTI 645
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
AG G G A + + + V + ++D GN IR L P S + Q
Sbjct: 646 AGNGQCCYTGDGGLGTAAQLNQPWGIA-VDSAGNIYVADSGNNAIRL--LAP--VSANIQ 700
Query: 199 SGSALGAVS 207
G+ + A S
Sbjct: 701 VGAVVNAAS 709
Score = 41.6 bits (96), Expect = 0.38, Method: Composition-based stats.
Identities = 43/147 (29%), Positives = 63/147 (42%), Gaps = 18/147 (12%)
Query: 17 LLFSLVSSGL--LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF 74
LL +L +GL DG TT Q + ID G +++++DS +S
Sbjct: 362 LLTTLAGNGLASFGGDGAAATTA----QFDTPTGVAIDAGG--NVLIVDSQNQRLRKIS- 414
Query: 75 PLSEESVVKRLAGDGVQGYSDGE---PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V+ +AG G G+ +GE P +A+ + P+ A D GN +VAD N +R+
Sbjct: 415 ----RGVITTIAGTGTAGF-NGEVVLPATAQLNTPRGVAADAYGNYFVADTGNRRVREGQ 469
Query: 132 NLGVTTIAGGGSKKEGRADG-PAQNAS 157
G G DG PA AS
Sbjct: 470 PGGNFFTKAGNGNASYFGDGLPATQAS 496
Score = 38.9 bits (89), Expect = 2.2, Method: Composition-based stats.
Identities = 26/79 (32%), Positives = 38/79 (48%), Gaps = 2/79 (2%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S + +AG+G Y+ DG G+A + ++P AVD GNIYVAD N+ IR + +
Sbjct: 636 VSTGGTITTIAGNGQCCYTGDGGLGTAAQLNQPWGIAVDSAGNIYVADSGNNAIRLLAPV 695
Query: 134 GVTTIAGGGSKKEGRADGP 152
G GP
Sbjct: 696 SANIQVGAVVNAASNLPGP 714
>gi|294817139|ref|ZP_06775781.1| Receptor protein kinase [Streptomyces clavuligerus ATCC 27064]
gi|294321954|gb|EFG04089.1| Receptor protein kinase [Streptomyces clavuligerus ATCC 27064]
Length = 510
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 64/114 (56%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + + G GV G+S DG P +A R ++P AVD G +YV + +N+ IRK+
Sbjct: 126 ITADGTISTVVGTGVAGFSGDGGPATAARLNRPFGVAVDSTGVLYVTEYNNNRIRKVAAD 185
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T+AG G+ DGPA +A + + + V A+ I+D N IR+I
Sbjct: 186 GTISTVAGTGATGNRGDDGPAASAQLNRPYGVA-VDGAGAVYIADTDNHRIRKI 238
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 62/108 (57%), Gaps = 4/108 (3%)
Query: 82 VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ +AG GV G++ +G +A+ ++P AVD G +Y +D +NH +RKIT G V+T+
Sbjct: 20 ISTVAGTGVAGFTGDNGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRKITTDGKVSTV 79
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG GS +GPA +A + E+ V A+ ++D N +R+I
Sbjct: 80 AGIGSAGYRGDNGPAVSAQLNCPREVA-VDGAGAVYVTDAENHRVRKI 126
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 60/114 (52%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + +AG G G DG SA+ ++P AVD G +Y+AD NH IRKIT
Sbjct: 182 VAADGTISTVAGTGATGNRGDDGPAASAQLNRPYGVAVDGAGAVYIADTDNHRIRKITAD 241
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T+AG G+ GPA A + + + L ++D+GN IR+I
Sbjct: 242 GTISTVAGTGTGGFDGDGGPATAARLNRPMGV-VIDSTGTLYVADYGNHRIRKI 294
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + V +AG G GY +G SA+ + P+ AVD G +YV D NH +RKIT
Sbjct: 70 ITTDGKVSTVAGIGSAGYRGDNGPAVSAQLNCPREVAVDGAGAVYVTDAENHRVRKITAD 129
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T+ G G GPA A + F + V L ++++ N IR++
Sbjct: 130 GTISTVVGTGVAGFSGDGGPATAARLNRPFGVA-VDSTGVLYVTEYNNNRIRKV 182
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
+AR ++P +D G +YVAD NH IRKIT G + T+AG G+ G GP +A
Sbjct: 264 AARLNRPMGVVIDSTGTLYVADYGNHRIRKITTDGKIGTVAGTGTGGFGGDGGPPASAQV 323
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
+N + V + L I+D+GN +R++
Sbjct: 324 NNSLGVA-VDCVDTLYIADYGNHRVRKV 350
>gi|326445969|ref|ZP_08220703.1| hypothetical protein SclaA2_33112 [Streptomyces clavuligerus ATCC
27064]
Length = 508
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 64/114 (56%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + + G GV G+S DG P +A R ++P AVD G +YV + +N+ IRK+
Sbjct: 124 ITADGTISTVVGTGVAGFSGDGGPATAARLNRPFGVAVDSTGVLYVTEYNNNRIRKVAAD 183
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T+AG G+ DGPA +A + + + V A+ I+D N IR+I
Sbjct: 184 GTISTVAGTGATGNRGDDGPAASAQLNRPYGVA-VDGAGAVYIADTDNHRIRKI 236
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 62/108 (57%), Gaps = 4/108 (3%)
Query: 82 VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ +AG GV G++ +G +A+ ++P AVD G +Y +D +NH +RKIT G V+T+
Sbjct: 18 ISTVAGTGVAGFTGDNGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRKITTDGKVSTV 77
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG GS +GPA +A + E+ V A+ ++D N +R+I
Sbjct: 78 AGIGSAGYRGDNGPAVSAQLNCPREVA-VDGAGAVYVTDAENHRVRKI 124
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 60/114 (52%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + +AG G G DG SA+ ++P AVD G +Y+AD NH IRKIT
Sbjct: 180 VAADGTISTVAGTGATGNRGDDGPAASAQLNRPYGVAVDGAGAVYIADTDNHRIRKITAD 239
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T+AG G+ GPA A + + + L ++D+GN IR+I
Sbjct: 240 GTISTVAGTGTGGFDGDGGPATAARLNRPMGV-VIDSTGTLYVADYGNHRIRKI 292
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + V +AG G GY +G SA+ + P+ AVD G +YV D NH +RKIT
Sbjct: 68 ITTDGKVSTVAGIGSAGYRGDNGPAVSAQLNCPREVAVDGAGAVYVTDAENHRVRKITAD 127
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T+ G G GPA A + F + V L ++++ N IR++
Sbjct: 128 GTISTVVGTGVAGFSGDGGPATAARLNRPFGVA-VDSTGVLYVTEYNNNRIRKV 180
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
+AR ++P +D G +YVAD NH IRKIT G + T+AG G+ G GP +A
Sbjct: 262 AARLNRPMGVVIDSTGTLYVADYGNHRIRKITTDGKIGTVAGTGTGGFGGDGGPPASAQV 321
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
+N + V + L I+D+GN +R++
Sbjct: 322 NNSLGVA-VDCVDTLYIADYGNHRVRKV 348
>gi|411003633|ref|ZP_11379962.1| hypothetical protein SgloC_12561 [Streptomyces globisporus C-1027]
Length = 605
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 59/112 (52%), Gaps = 7/112 (6%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
+V D++R L ++ V+R G G +G SDG PG ARF +P+ AV G +
Sbjct: 188 FLVSDTTRHRLVELD---ADGETVRRHFGTGERGLSDGGPGEARFSEPQGLAVLPDGRVA 244
Query: 119 VADKSNHVIRKIT-NLGVT-TIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
VAD NH IR + GVT T+AG G + + GPA+ S+ +++ +
Sbjct: 245 VADTVNHAIRALDLTTGVTSTLAGTGRQWWQGTPTSGPARGVDLSSPWDVAW 296
>gi|254390149|ref|ZP_05005369.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
27064]
gi|197703856|gb|EDY49668.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
27064]
Length = 527
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 64/114 (56%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + + G GV G+S DG P +A R ++P AVD G +YV + +N+ IRK+
Sbjct: 143 ITADGTISTVVGTGVAGFSGDGGPATAARLNRPFGVAVDSTGVLYVTEYNNNRIRKVAAD 202
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T+AG G+ DGPA +A + + + V A+ I+D N IR+I
Sbjct: 203 GTISTVAGTGATGNRGDDGPAASAQLNRPYGVA-VDGAGAVYIADTDNHRIRKI 255
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 62/108 (57%), Gaps = 4/108 (3%)
Query: 82 VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ +AG GV G++ +G +A+ ++P AVD G +Y +D +NH +RKIT G V+T+
Sbjct: 37 ISTVAGTGVAGFTGDNGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRKITTDGKVSTV 96
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG GS +GPA +A + E+ V A+ ++D N +R+I
Sbjct: 97 AGIGSAGYRGDNGPAVSAQLNCPREVA-VDGAGAVYVTDAENHRVRKI 143
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 60/114 (52%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + +AG G G DG SA+ ++P AVD G +Y+AD NH IRKIT
Sbjct: 199 VAADGTISTVAGTGATGNRGDDGPAASAQLNRPYGVAVDGAGAVYIADTDNHRIRKITAD 258
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T+AG G+ GPA A + + + L ++D+GN IR+I
Sbjct: 259 GTISTVAGTGTGGFDGDGGPATAARLNRPMGV-VIDSTGTLYVADYGNHRIRKI 311
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + V +AG G GY +G SA+ + P+ AVD G +YV D NH +RKIT
Sbjct: 87 ITTDGKVSTVAGIGSAGYRGDNGPAVSAQLNCPREVAVDGAGAVYVTDAENHRVRKITAD 146
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T+ G G GPA A + F + V L ++++ N IR++
Sbjct: 147 GTISTVVGTGVAGFSGDGGPATAARLNRPFGVA-VDSTGVLYVTEYNNNRIRKV 199
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
+AR ++P +D G +YVAD NH IRKIT G + T+AG G+ G GP +A
Sbjct: 281 AARLNRPMGVVIDSTGTLYVADYGNHRIRKITTDGKIGTVAGTGTGGFGGDGGPPASAQV 340
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
+N + V + L I+D+GN +R++
Sbjct: 341 NNSLGVA-VDCVDTLYIADYGNHRVRKV 367
>gi|116619867|ref|YP_822023.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223029|gb|ABJ81738.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 592
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 54/112 (48%), Gaps = 2/112 (1%)
Query: 77 SEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+ +++ AG+G QG+S G+ A F+ P + AVD G +YV D+SN IR+I G
Sbjct: 184 AASNIISTFAGNGSQGFSGDGGQAAQASFNNPTALAVDAAGTVYVTDQSNQRIRRIDTGG 243
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V T G DG + A+ N + L I D NQ +R++
Sbjct: 244 VITTVAGNGNAAFSGDGGSATAASLNYPGGIVLDSSGTLYIVDSVNQRVRKV 295
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 63/125 (50%), Gaps = 4/125 (3%)
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADK 122
SR L+ ++ + +AG+G G++ G+ A+ ++ D GN+Y+A++
Sbjct: 3 SRFVLPLLAAAIASGQTITTVAGNGTAGFAGDTGQATQAQINRAVGLVTDANGNLYLAEE 62
Query: 123 SNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
N+ +RK+ GV TT+AG G+ GPA A + L + + +SD GN+
Sbjct: 63 LNNRVRKVDTGGVITTLAGIGTAGFAGDGGPAAQAQLNGPLGLC-IDGSGNIYVSDQGNK 121
Query: 182 LIRQI 186
+R+I
Sbjct: 122 RVRKI 126
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 47/88 (53%), Gaps = 5/88 (5%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
F P AVD GN+ + D+ H IRK+ +N+ ++T AG GS+ G A ASF+
Sbjct: 155 FAIPIRCAVDKSGNLLIVDQGAHKIRKVEAASNI-ISTFAGNGSQGFSGDGGQAAQASFN 213
Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
N L V + ++D NQ IR+I+
Sbjct: 214 NPTALA-VDAAGTVYVTDQSNQRIRRID 240
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 51/108 (47%), Gaps = 3/108 (2%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
V+ +AG+G +S G +A + P +D G +Y+ D N +RK++ ++T+
Sbjct: 244 VITTVAGNGNAAFSGDGGSATAASLNYPGGIVLDSSGTLYIVDSVNQRVRKVSGTTISTV 303
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG G+ G A A +N F +T L + D N +R+I
Sbjct: 304 AGTGTAGFSGDGGAALQAQLNNPFPIT-ADGSGNLYVGDVSNNRVRKI 350
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 47/130 (36%), Positives = 65/130 (50%), Gaps = 16/130 (12%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
V+ LAG G G++ DG P + A+ + P +D GNIYV+D+ N +RKI G +TT
Sbjct: 75 VITTLAGIGTAGFAGDGGPAAQAQLNGPLGLCIDGSGNIYVSDQGNKRVRKIAPSGTITT 134
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPE 191
+AG GS G G A A F+ +P CA LLI D G IR++
Sbjct: 135 VAGSGSAASGGDGGAATAAGFA-------IPIRCAVDKSGNLLIVDQGAHKIRKVEAASN 187
Query: 192 DCSKSSQSGS 201
S + +GS
Sbjct: 188 IISTFAGNGS 197
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 11/88 (12%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESV----VKRLAGDGVQGYSDGEPGSA---RF 103
++ PG IVLDSS + + S V + +AG G G+S G+ G+A +
Sbjct: 268 LNYPGG---IVLDSSGTLYIVDSVNQRVRKVSGTTISTVAGTGTAGFS-GDGGAALQAQL 323
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ P D GN+YV D SN+ +RKIT
Sbjct: 324 NNPFPITADGSGNLYVGDVSNNRVRKIT 351
>gi|423214677|ref|ZP_17201205.1| hypothetical protein HMPREF1074_02737 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692583|gb|EIY85820.1| hypothetical protein HMPREF1074_02737 [Bacteroides xylanisolvens
CL03T12C04]
Length = 419
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 73/152 (48%), Gaps = 20/152 (13%)
Query: 56 SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
S + IV D +++ S +S + K +AGDGV+G Y DGEPG SA+
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGEPGNPLSAKVGA 329
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT--------NLGVTTIAGGGSKKEGRADGPAQNAS 157
A G +Y++D + + IRK+T + TIAG G K G +DG A+
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETIAGSG--KAGFSDGKGLKAT 387
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLK 189
F+ +E+ + ++ N LIR+I +K
Sbjct: 388 FNQPYEIIITEDCKTMYVAGAVNYLIRRITVK 419
>gi|262405589|ref|ZP_06082139.1| NHL repeat containing protein [Bacteroides sp. 2_1_22]
gi|294644531|ref|ZP_06722285.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
gi|294805793|ref|ZP_06764670.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
gi|345510784|ref|ZP_08790345.1| hypothetical protein BSAG_00303 [Bacteroides sp. D1]
gi|262356464|gb|EEZ05554.1| NHL repeat containing protein [Bacteroides sp. 2_1_22]
gi|292640084|gb|EFF58348.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
gi|294447014|gb|EFG15604.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
gi|345454355|gb|EEO48593.2| hypothetical protein BSAG_00303 [Bacteroides sp. D1]
Length = 419
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 73/152 (48%), Gaps = 20/152 (13%)
Query: 56 SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
S + IV D +++ S +S + K +AGDGV+G Y DGEPG SA+
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGEPGNPLSAKVGA 329
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT--------NLGVTTIAGGGSKKEGRADGPAQNAS 157
A G +Y++D + + IRK+T + TIAG G K G +DG A+
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETIAGSG--KAGFSDGKGLKAT 387
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLK 189
F+ +E+ + ++ N LIR+I +K
Sbjct: 388 FNQPYEIIITEDCKTMYVAGAVNYLIRRITVK 419
>gi|452992643|emb|CCQ95895.1| hypothetical protein CULT_320031 [Clostridium ultunense Esp]
Length = 329
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/138 (33%), Positives = 68/138 (49%), Gaps = 21/138 (15%)
Query: 53 RPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSAR---------- 102
R ++ +LDS A ++ ++ V+ +AG G GY+ G+ G A+
Sbjct: 182 RGPEGNIYILDSKNYAVRKINI---DKETVELIAGTGKPGYT-GDGGDAKDATFGGNKES 237
Query: 103 -FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSN 160
FD P S ++D GNIYV D NHV+R ITN G V TIAG + +GR +N +
Sbjct: 238 YFDGPWSLSLDEVGNIYVGDTQNHVVRMITNKGKVYTIAGNINVIKGR-----KNRTNEK 292
Query: 161 DFELTFVPHICALLISDH 178
D +P IC++ D
Sbjct: 293 DLFKINLPLICSMDYYDR 310
>gi|336403610|ref|ZP_08584322.1| hypothetical protein HMPREF0127_01635 [Bacteroides sp. 1_1_30]
gi|335945536|gb|EGN07347.1| hypothetical protein HMPREF0127_01635 [Bacteroides sp. 1_1_30]
Length = 419
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 73/152 (48%), Gaps = 20/152 (13%)
Query: 56 SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
S + IV D +++ S +S + K +AGDGV+G Y DGEPG SA+
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGEPGNPLSAKVGA 329
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT--------NLGVTTIAGGGSKKEGRADGPAQNAS 157
A G +Y++D + + IRK+T + TIAG G K G +DG A+
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETIAGSG--KAGFSDGKGLKAT 387
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLK 189
F+ +E+ + ++ N LIR+I +K
Sbjct: 388 FNQPYEIIITEDCKTMYVAGAVNYLIRRITVK 419
>gi|428185691|gb|EKX54543.1| hypothetical protein GUITHDRAFT_63359, partial [Guillardia theta
CCMP2712]
Length = 305
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 70/143 (48%), Gaps = 9/143 (6%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
++P + P S L+V DS ++S+ E VV AG G +G+ DG A+FD
Sbjct: 57 LHPKGICPLP--SGLLVCDSGHHRIRSVSY--DGERVVA-FAGSGKRGHRDGPVQVAQFD 111
Query: 105 KPKSFAV-DMKGNIYVADKSNHVIRKITNLGVTTIAGGGS--KKEGRADGPAQNASFSND 161
P S V +I VAD N+ +R+I N VTT+AGG + G DG ++ A F
Sbjct: 112 TPCSICVCPSDKSIIVADSGNNAVRRIANGMVTTLAGGSGPDRAGGFVDGESEGAKFRRP 171
Query: 162 FELTFVPHICALLISDHGNQLIR 184
+ F LL+ D GN +R
Sbjct: 172 TFVMFDKEE-TLLVIDSGNHCLR 193
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 58/136 (42%), Gaps = 18/136 (13%)
Query: 56 SSDLIVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
+S ++V D+ +A ++ PL E K A SA + PK
Sbjct: 16 TSVVVVADTGNNALRLVALGRQGGPLVEGICTKSFAA------------SASWLHPKGIC 63
Query: 111 VDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
+ + V D +H IR ++ G +A GS K G DGP Q A F + P
Sbjct: 64 -PLPSGLLVCDSGHHRIRSVSYDGERVVAFAGSGKRGHRDGPVQVAQFDTPCSICVCPSD 122
Query: 171 CALLISDHGNQLIRQI 186
+++++D GN +R+I
Sbjct: 123 KSIIVADSGNNAVRRI 138
Score = 44.3 bits (103), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 61/128 (47%), Gaps = 4/128 (3%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
L+V+DS +S E V+ LAG G +DG + + P++ + G+I
Sbjct: 182 LLVIDSGNHCLRVMSPDWKE---VRTLAGGPKAGGTDGAVDTCELNHPEASCLLEDGSIL 238
Query: 119 VADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDH 178
+AD+ N+ IR++ + + G+ G DG + ++F+ + + ++ISD
Sbjct: 239 IADRENNKIRRLDGDLRSLSSWAGNGCWGATDGLIEESTFNKPCGVCCLEDG-TIVISDS 297
Query: 179 GNQLIRQI 186
GN IR +
Sbjct: 298 GNNCIRLV 305
>gi|329935738|ref|ZP_08285543.1| hypothetical protein SGM_1035 [Streptomyces griseoaurantiacus M045]
gi|329304829|gb|EGG48702.1| hypothetical protein SGM_1035 [Streptomyces griseoaurantiacus M045]
Length = 617
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 64/113 (56%), Gaps = 7/113 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +L+V D++R L+ ES V+R+ G G +G++DG +A F +P+ A+ G
Sbjct: 198 SGNLLVSDTTRHQLVELA--ADGESEVRRI-GSGARGFADGTAEAAAFSEPQGLALLEDG 254
Query: 116 NIYVADKSNHVIRK--ITNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFEL 164
++ VAD NH +R+ + GVTT+AG G + + GPA+ S+ +++
Sbjct: 255 SVVVADTVNHALRRLDVATGGVTTLAGTGRQWWQGSPTSGPAREVDLSSPWDV 307
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 52/102 (50%), Gaps = 3/102 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
GDG + EP + RF P + GN+ V+D + H + ++ G + + GS
Sbjct: 173 GDGPYVAPEPEPTTLRF--PGKALLLPSGNLLVSDTTRHQLVELAADGESEVRRIGSGAR 230
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G ADG A+ A+FS L + +++++D N +R++++
Sbjct: 231 GFADGTAEAAAFSEPQGLALL-EDGSVVVADTVNHALRRLDV 271
>gi|399034109|ref|ZP_10732477.1| gluconolactonase [Flavobacterium sp. CF136]
gi|398067543|gb|EJL59038.1| gluconolactonase [Flavobacterium sp. CF136]
Length = 418
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 76/168 (45%), Gaps = 19/168 (11%)
Query: 38 IDGHQLEINPHSVIDRP----------GSSDLIVLDSSRSAFYTLS-----FPLSEESVV 82
++G+ + P V+ RP G++ + LDS + + + L+ ++
Sbjct: 212 VNGYIYKFTPSGVLTRPRNNEGALIRVGAASGMCLDSKNNLYISEDATSKIMKLTPTGIL 271
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
AG G+ +G A+F KP D N+YV D+ N+ IRKI G V+T+AG
Sbjct: 272 TIFAGANGTGFQNGNIKEAKFSKPSGIVFDKMDNLYVVDRYNNRIRKIAVDGTVSTVAGS 331
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G + G DG A+F+ E+T + L I + N IR I K
Sbjct: 332 GIR--GNKDGITSEATFNFPQEIT-IDKYNNLYIVEPDNNRIRMITSK 376
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 33/59 (55%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ + V +AG G++G DG A F+ P+ +D N+Y+ + N+ IR IT+ G
Sbjct: 319 IAVDGTVSTVAGSGIRGNKDGITSEATFNFPQEITIDKYNNLYIVEPDNNRIRMITSKG 377
>gi|290986743|ref|XP_002676083.1| predicted protein [Naegleria gruberi]
gi|284089683|gb|EFC43339.1| predicted protein [Naegleria gruberi]
Length = 733
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 56/108 (51%), Gaps = 4/108 (3%)
Query: 82 VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
+ +AG G G+S +G +A+ P S V +Y+AD+SNH IRKI N + TI
Sbjct: 43 IATIAGKGTCGFSGDNGPATNAQIYYPSSVFVSSNNEVYIADQSNHRIRKILENGNIVTI 102
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG G +GPA NA + + FV + I D+GN +R+I
Sbjct: 103 AGNGIGGFSGDNGPATNAQIYYPYSV-FVSSNNVVYIVDYGNNRVRKI 149
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 57/112 (50%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG 134
E + +AG+G+ G+S +G +A+ P S V +Y+ D N+ +RKI N
Sbjct: 95 ENGNIVTIAGNGIGGFSGDNGPATNAQIYYPYSVFVSSNNVVYIVDYGNNRVRKILGNGN 154
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+ +GPA NA +N + FV + I+D N IR+I
Sbjct: 155 IVTIAGNGTSGFSGDNGPATNAQLNNPVGV-FVSSNNEVYIADQSNHRIRKI 205
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 59/105 (56%), Gaps = 5/105 (4%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
+AG+G G+S +G +A+ + P V +Y+AD+SNH IRKI N + TIAG
Sbjct: 158 IAGNGTSGFSGDNGPATNAQLNNPVGVFVSSNNEVYIADQSNHRIRKILENGNIVTIAGN 217
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G+ G +GPA NA + + FV + + I D+GN IR+I
Sbjct: 218 GTGGFGGDNGPATNAQLYIPYSV-FVSN-NEVYIVDYGNNRIRKI 260
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 48/91 (52%), Gaps = 18/91 (19%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
+AG+G G+S +G +A+ ++P S V +Y+AD +NH IRKI N + TIAG
Sbjct: 269 IAGNGTSGFSGDNGPATNAQLNRPSSVFVS-NNEVYIADLNNHKIRKILENGNIITIAGN 327
Query: 142 GSKKEGRADGPAQNASFSND--FELTFVPHI 170
G+K FS D F++ PHI
Sbjct: 328 GTK------------GFSGDSPFDIRMYPHI 346
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 5/108 (4%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
A+ P V +Y+AD SNH IRKI N + TIAG G+ +GPA NA
Sbjct: 8 AQLYNPVGIFVSSNNEVYIADYSNHRIRKILKNGNIATIAGKGTCGFSGDNGPATNAQIY 67
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVS 207
+ FV + I+D N IR+I E+ + + +G+ +G S
Sbjct: 68 YPSSV-FVSSNNEVYIADQSNHRIRKI---LENGNIVTIAGNGIGGFS 111
>gi|320103301|ref|YP_004178892.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
gi|319750583|gb|ADV62343.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
Length = 676
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 4/111 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI--TNLGVTTI 138
+K + G+G G DG A F++P+ +D ++ +YVAD NH IR I T VTT+
Sbjct: 236 LKAVVGNGKIGMRDGAYERASFNRPQGIRLDALRNRLYVADTENHAIRAIDLTTRSVTTV 295
Query: 139 AGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
AG G G GPA+ ++ ++L +P L++ G I +I+
Sbjct: 296 AGTGEMVYPGLPGGPARRFGLNSPWDLVQIPETNQFLVAMAGTHQIYKIDF 346
Score = 40.4 bits (93), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 42/82 (51%), Gaps = 7/82 (8%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
L V D+ + + F + ++ L GD +G +D P RFD+P +V + G +Y
Sbjct: 441 LYVADTYNNKIKVIDFS-ARTKTIRTLVGDRRRGDTDDPP---RFDQPGGLSV-LGGQLY 495
Query: 119 VADKSNHVIR--KITNLGVTTI 138
VAD +NH IR + L V T+
Sbjct: 496 VADTNNHKIRVVDLATLAVKTL 517
>gi|441163236|ref|ZP_20968263.1| hypothetical protein SRIM_29923 [Streptomyces rimosus subsp.
rimosus ATCC 10970]
gi|440616375|gb|ELQ79517.1| hypothetical protein SRIM_29923 [Streptomyces rimosus subsp.
rimosus ATCC 10970]
Length = 238
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 64/117 (54%), Gaps = 4/117 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + ++ +AG+G+ GY SDG P +A + P A D GN+Y+AD N+ +RK+
Sbjct: 71 VTPQGIITTVAGNGIAGYVSDGGPATATQLSSPCGVAADGAGNLYIADLGNNRVRKVDTK 130
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G +TT+AG G+ GPA A+ N V + I+D+ N +R+++ K
Sbjct: 131 GIITTVAGNGTGGYVSDGGPAT-ATQLNGPHSVAVDRDGNVYIADYHNHRVRKVDSK 186
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 40/62 (64%), Gaps = 2/62 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + ++ +AG+G GY SDG P +A + + P S AVD GN+Y+AD NH +RK+ +
Sbjct: 127 VDTKGIITTVAGNGTGGYVSDGGPATATQLNGPHSVAVDRDGNVYIADYHNHRVRKVDSK 186
Query: 134 GV 135
G+
Sbjct: 187 GL 188
>gi|357485931|ref|XP_003613253.1| hypothetical protein MTR_5g034550 [Medicago truncatula]
gi|355514588|gb|AES96211.1| hypothetical protein MTR_5g034550 [Medicago truncatula]
Length = 154
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 63/124 (50%), Gaps = 7/124 (5%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
+ +G G+ D + AR + PK VD +GNIYVAD N IRKI+ LG +
Sbjct: 11 SAEGYSGHVDEKLREARMNHPKGITVDDRGNIYVADIINMAIRKIS-LGNNMTYLSFLYE 69
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
E + + F++ +V +LL+ D G Q IR+I L+ +DC+ +S LG
Sbjct: 70 E------SLILFYLLLFDVIYVGSSYSLLVIDRGKQAIREIQLRFDDCAYQYESRFPLGK 123
Query: 206 VSVW 209
++ +
Sbjct: 124 LNKF 127
>gi|424869138|ref|ZP_18292858.1| uncharacterized protein [Leptospirillum sp. Group II 'C75']
gi|124514239|gb|EAY55753.1| conserved protein of unknown function [Leptospirillum rubarum]
gi|387220840|gb|EIJ75456.1| uncharacterized protein [Leptospirillum sp. Group II 'C75']
Length = 389
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 64/153 (41%), Gaps = 8/153 (5%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + P L V DS + + V +AG DG A F+
Sbjct: 132 NPEGMAISPDGKTLYVADSRNNMIRKIDL---ATKTVSTIAGHSFPSSGDGVGKEAGFET 188
Query: 106 PKSFAVDMKGN-IYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
P+ A+ G +YVAD N+ IRKI TN VTT+AG G G ADG A+F
Sbjct: 189 PRGLAISPDGKTLYVADSGNNAIRKIDLATNT-VTTLAGAGKLMSGSADGVGVQATFHEP 247
Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
L L I+D N LIR++ L S
Sbjct: 248 RSLAISGDGQVLYIADTRNNLIRKMVLATNSVS 280
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
Y DG +ARF+ P+ A+ G +YVAD N++IRKI V+TIAG G
Sbjct: 120 YRDGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKIDLATKTVSTIAGHSFPSSG-- 177
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
DG + A F L P L ++D GN IR+I+L + + +G
Sbjct: 178 DGVGKEAGFETPRGLAISPDGKTLYVADSGNNAIRKIDLATNTVTTLAGAG 228
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 54/121 (44%), Gaps = 18/121 (14%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---------------- 134
+G DG ARF+ P+ G IY+AD N +IRKI++ G
Sbjct: 47 RGAIDGPGREARFEFPQGILAAPDGTIYIADTGNDMIRKISSNGLSGATPTPAELHHLSV 106
Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
V IAG + R DG NA F+N + P L ++D N +IR+I+L +
Sbjct: 107 DVEDIAGSDHRARYR-DGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKIDLATKTV 165
Query: 194 S 194
S
Sbjct: 166 S 166
>gi|410478931|ref|YP_006766568.1| hypothetical protein LFML04_1390 [Leptospirillum ferriphilum ML-04]
gi|406774183|gb|AFS53608.1| hypothetical protein LFML04_1390 [Leptospirillum ferriphilum ML-04]
Length = 427
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 64/153 (41%), Gaps = 8/153 (5%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + P L V DS + + V +AG DG A F+
Sbjct: 170 NPEGMAISPDGKTLYVADSRNNMIRKIDL---ATKTVSTIAGHSFPSSGDGVGKEAGFET 226
Query: 106 PKSFAVDMKGN-IYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
P+ A+ G +YVAD N+ IRKI TN VTT+AG G G ADG A+F
Sbjct: 227 PRGLAISPDGKTLYVADSGNNAIRKIDLATNT-VTTLAGAGKLMSGSADGVGVQATFHEP 285
Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
L L I+D N LIR++ L S
Sbjct: 286 RSLAISGDGQVLYIADTRNNLIRKMVLATNSVS 318
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
Y DG +ARF+ P+ A+ G +YVAD N++IRKI V+TIAG G
Sbjct: 158 YRDGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKIDLATKTVSTIAGHSFPSSG-- 215
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
DG + A F L P L ++D GN IR+I+L + + +G
Sbjct: 216 DGVGKEAGFETPRGLAISPDGKTLYVADSGNNAIRKIDLATNTVTTLAGAG 266
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 54/121 (44%), Gaps = 18/121 (14%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---------------- 134
+G DG ARF+ P+ G IY+AD N +IRKI++ G
Sbjct: 85 RGAIDGPGREARFEFPQGILAAPDGTIYIADTGNDMIRKISSNGLSGATPTPAELHHLSV 144
Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
V IAG + R DG NA F+N + P L ++D N +IR+I+L +
Sbjct: 145 DVEDIAGSDHRARYR-DGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKIDLATKTV 203
Query: 194 S 194
S
Sbjct: 204 S 204
>gi|318059218|ref|ZP_07977941.1| hypothetical protein SSA3_14816 [Streptomyces sp. SA3_actG]
gi|318079124|ref|ZP_07986456.1| hypothetical protein SSA3_21010 [Streptomyces sp. SA3_actF]
Length = 610
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 67/124 (54%), Gaps = 8/124 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + +V D++R A L ++ V+R G G +G DG+ ARF++P
Sbjct: 183 PGKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGSGERGLRDGDAAGARFNEP 238
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGR-ADGPAQNASFSNDF 162
+ A+ G + VAD NH +R++ + V+T+AG G + +G GPA+ S S+ +
Sbjct: 239 QGLALLPDGAVVVADTVNHALRRVDLASGEVSTLAGTGRQWWQGEPVAGPAREVSLSSPW 298
Query: 163 ELTF 166
++ +
Sbjct: 299 DVAW 302
>gi|197118445|ref|YP_002138872.1| lipoprotein [Geobacter bemidjiensis Bem]
gi|197087805|gb|ACH39076.1| repeat-containing lipoprotein, putative [Geobacter bemidjiensis
Bem]
Length = 652
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 59/108 (54%), Gaps = 8/108 (7%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
V LAG GV G ++G G+ARF P A D N+YVAD NH +RKI + VTT+
Sbjct: 162 VTTLAGTAGVVGSTNGTGGAARFSSPSGVATDGA-NLYVADSLNHRVRKIVLASAAVTTL 220
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG G + ADG ASF++ + +L ++D GN +R+I
Sbjct: 221 AGSGLQD--FADGTGIAASFNSPRGI--ATDGVSLYLADQGNSAVRRI 264
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 56/124 (45%), Gaps = 8/124 (6%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEG 147
G G +DG +A F P D + N+YVAD +NH +R+I+ G T G + G
Sbjct: 375 GTVGSADGVGVAAAFTSPYDLTTDGR-NVYVADTNNHTVRQISIATGAVTTLAGTPDRAG 433
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS----KSSQSGSAL 203
DG ASF F L +SD GN +R+I + S + +GSA
Sbjct: 434 SGDGTGAAASFR--FPSGLTTDGTNLFVSDTGNNTVRKIVIATGAVSTLAGTAGATGSAD 491
Query: 204 GAVS 207
GA S
Sbjct: 492 GAGS 495
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 45/102 (44%), Gaps = 2/102 (1%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEG 147
G G +DG ARF P A D ++YV D N VIRK+ G T+ G + G
Sbjct: 58 GSAGSTDGAASQARFSIPGDVASDGT-SLYVVDTGNSVIRKVVLATGATSTVAGLAGITG 116
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
DG A F+ + L +SD GN IR IN+
Sbjct: 117 SGDGTGAVARFNYPSGIALSSDNSTLYVSDTGNNTIRSINVA 158
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 11/112 (9%)
Query: 82 VKRLAGDG--VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTT 137
V LAGDG G +D G+ARF P ++ G +YVAD N ++RK + +T
Sbjct: 314 VSTLAGDGGGSPGSTDAVGGAARFSAPAGLVLN-AGVLYVADTGNDLLRKVAVATGASST 372
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+AG G ADG A+F++ ++LT + ++D N +RQI++
Sbjct: 373 LAG----TVGSADGVGVAAAFTSPYDLTTDGR--NVYVADTNNHTVRQISIA 418
Score = 45.1 bits (105), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 53/123 (43%), Gaps = 15/123 (12%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
V LAG G G +DG +ARF P D N+YVAD N+ IRKI IA
Sbjct: 477 VSTLAGTAGATGSADGAGSAARFHSPNGLTTDGT-NLYVADSGNNAIRKI------VIAT 529
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
G A +A S+ + +T L I+D GN I + L S + +G
Sbjct: 530 GAVSTA-----VASSAGLSSPYGVT--TDGTTLFITDSGNHRICKFTLAGSAFSSLTVTG 582
Query: 201 SAL 203
+A
Sbjct: 583 AAF 585
>gi|444306537|ref|ZP_21142300.1| NHL repeat containing protein [Arthrobacter sp. SJCon]
gi|443481173|gb|ELT44105.1| NHL repeat containing protein [Arthrobacter sp. SJCon]
Length = 673
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 78/166 (46%), Gaps = 10/166 (6%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDL-IVLDSSRSAFYTLSF-PLSEESVVKRLA 86
ED T + GH LE++ S D S L V+ + SF P+S E V +A
Sbjct: 324 EDAAGFTGRLSGHPLEVSLSSPWDLVWSRKLNAVVIAMAGTHQIFSFDPVSGE--VDIIA 381
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGVTTIAGG 141
G+G++G DG A F +P A D GNI+VAD +RK+ ++ V + G
Sbjct: 382 GNGLEGLLDGPAHEAWFAQPSGLAEDGDGNIWVADSETSALRKLVIDDAGSITVESALGK 441
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G G DGPA A + +T +P ++ I+D N +R+ +
Sbjct: 442 GLFDFGFRDGPAVEARLQHPLGVTVLPD-GSVAIADTYNGAVRRYD 486
>gi|294630411|ref|ZP_06708971.1| NHL repeat protein [Streptomyces sp. e14]
gi|292833744|gb|EFF92093.1| NHL repeat protein [Streptomyces sp. e14]
Length = 609
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 66/115 (57%), Gaps = 7/115 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S + +V D++R L+ ESVV+R+ G G +G++DG A F++P+ A+ G
Sbjct: 193 SGNFLVSDTTRHQLVELA--ADGESVVRRI-GSGARGFADGAAEDAAFNEPQGLALLDDG 249
Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGRA-DGPAQNASFSNDFELTF 166
+ VAD NH +R+ +TT+AG G + +G A +GPA+ + S+ +++ +
Sbjct: 250 AVVVADTVNHALRRFDPATGRITTLAGTGRQWMQGEATEGPAREVNLSSPWDVAW 304
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 50/101 (49%), Gaps = 3/101 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
GDG + EP + RF P + GN V+D + H + ++ G + + GS
Sbjct: 168 GDGPYVAPEPEPTTLRF--PGKALLLPSGNFLVSDTTRHQLVELAADGESVVRRIGSGAR 225
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G ADG A++A+F+ L + A++++D N +R+ +
Sbjct: 226 GFADGAAEDAAFNEPQGLALLDD-GAVVVADTVNHALRRFD 265
>gi|290990564|ref|XP_002677906.1| NHL domain-containing protein kinase [Naegleria gruberi]
gi|284091516|gb|EFC45162.1| NHL domain-containing protein kinase [Naegleria gruberi]
Length = 2733
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 58/109 (53%), Gaps = 6/109 (5%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TT 137
+ +AG G GYS G+ G SA+F +P+ AV G IY+AD NHV+RKI+ V +T
Sbjct: 548 IATVAGSGASGYS-GDGGLLTSAKFQQPQGVAVSSNGEIYIADTENHVVRKISTSNVLST 606
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
IAG GS G A A + + + + I+D+ N IR+I
Sbjct: 607 IAGTGSYGYNGDGGLAITAKLFSPIGVA-ISSSGEVFIADNNNHRIRKI 654
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 57/113 (50%), Gaps = 6/113 (5%)
Query: 81 VVKRLAGDGVQ----GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
V++R+ + G DG P + + + +SF + G IY+AD NH IRK++ LG +
Sbjct: 762 VIRRVISGNISTIAGGVGDGGPATGGYIQAQSFDISSTGEIYIADTENHRIRKVSTLGKI 821
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+TIAG G+ G A A S+ + V + I+D N IR++ L
Sbjct: 822 STIAGTGAMGYSGDGGLAITAKLSSPVGVA-VSSTGEVFIADRDNHRIRKVTL 873
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 61/116 (52%), Gaps = 8/116 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+S +V+ +AG G GY +G+ G +A+ P A+ G +++AD +NH IRKI
Sbjct: 598 ISTSNVLSTIAGTGSYGY-NGDGGLAITAKLFSPIGVAISSSGEVFIADNNNHRIRKIAK 656
Query: 133 LG-VTTIAGGG-SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++TI G G + +G D A NA N +T P L I+D N IR++
Sbjct: 657 DGYISTIVGTGLNGYDGDGD-LATNAKLGNPQGVTLSP-TGELFIADQNNHRIRKV 710
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 56/109 (51%), Gaps = 7/109 (6%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ +AG G GYS G+ G +A+ P AV G +++AD+ NH IRK+T G +TT
Sbjct: 821 ISTIAGTGAMGYS-GDGGLAITAKLSSPVGVAVSSTGEVFIADRDNHRIRKVTLSGIITT 879
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
IAG G+ A+ A ++ +T + I+D N IRQI
Sbjct: 880 IAGNGTSGFNSDGIEAKFAQLNSPSSVTISGG--EIYIADTNNHRIRQI 926
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 65/132 (49%), Gaps = 7/132 (5%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKG 115
D+ + D+ + +SF ++ +AG G+ G + DG SA+ + P AV G
Sbjct: 471 DVYIADTYNNRIRKVSF---STGIISTVAGTGIAGQNGDGNLATSAQLNFPSGVAVSSNG 527
Query: 116 NIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
+IY+AD NH IRKI TN + T+AG G+ G +A F + V +
Sbjct: 528 DIYIADTENHRIRKIETNGYIATVAGSGASGYSGDGGLLTSAKFQQP-QGVAVSSNGEIY 586
Query: 175 ISDHGNQLIRQI 186
I+D N ++R+I
Sbjct: 587 IADTENHVVRKI 598
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 63/114 (55%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++++ + + G G+ GY DG+ + A+ P+ + G +++AD++NH IRK+ +
Sbjct: 654 IAKDGYISTIVGTGLNGYDGDGDLATNAKLGNPQGVTLSPTGELFIADQNNHRIRKVASN 713
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T+AG G+ A NA ++ + F I + I+D N++IR++
Sbjct: 714 GYISTVAGNGNFGFSGDGDLATNAELNSPSGIAF-SSIGTMYIADRLNRVIRRV 766
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 41/65 (63%), Gaps = 4/65 (6%)
Query: 81 VVKRLAGDGVQGY-SDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
++ +AG+G G+ SDG E A+ + P S + G IY+AD +NH IR+I+N G + T
Sbjct: 876 IITTIAGNGTSGFNSDGIEAKFAQLNSPSSVTIS-GGEIYIADTNNHRIRQISNTGIIKT 934
Query: 138 IAGGG 142
IAG G
Sbjct: 935 IAGNG 939
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 57/120 (47%), Gaps = 17/120 (14%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ ++ +AG G GY DG+ +A+ P V+ G++Y+AD N+ IRKI+
Sbjct: 230 IATNGIITTIAGTGTHGY-DGDGALAINAQLYSPTGVVVNSNGDVYIADSGNNRIRKISK 288
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
++T+AG G G + +A + P A ++I+D N IR+I
Sbjct: 289 GYISTVAGNGIAGYSGDGGLSTSAQLA-------TPQSVAINLNGEIIIADSNNNRIRKI 341
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 56/122 (45%), Gaps = 8/122 (6%)
Query: 72 LSFP----LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNH 125
L+FP ++ + +AG G Y + A+ P A+D G Y++D NH
Sbjct: 166 LNFPSSIAITTNGNISTIAGTGTGSYGGDNAIATGAQLYAPACIAIDSNGEFYISDSMNH 225
Query: 126 VIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
IRKI TN +TTIAG G+ A NA + + V + I+D GN IR
Sbjct: 226 RIRKIATNGIITTIAGTGTHGYDGDGALAINAQLYSPTGVV-VNSNGDVYIADSGNNRIR 284
Query: 185 QI 186
+I
Sbjct: 285 KI 286
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 64/143 (44%), Gaps = 14/143 (9%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARF 103
NP V P + +L + D + ++ + +AG+G G+S DG+ + A
Sbjct: 685 NPQGVTLSP-TGELFIADQNNHRIR----KVASNGYISTVAGNGNFGFSGDGDLATNAEL 739
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
+ P A G +Y+AD+ N VIR++ + ++TIAGG G Q SF
Sbjct: 740 NSPSGIAFSSIGTMYIADRLNRVIRRVISGNISTIAGGVGDGGPATGGYIQAQSFD---- 795
Query: 164 LTFVPHICALLISDHGNQLIRQI 186
+ + I+D N IR++
Sbjct: 796 ---ISSTGEIYIADTENHRIRKV 815
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 31/46 (67%), Gaps = 1/46 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSK 144
+A+ P A+ + G IY+AD +N+ IRKI TN +TTIAG G++
Sbjct: 107 AAQLFYPYGVALSLGGEIYIADSNNNRIRKIATNGIITTIAGTGTR 152
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 59/116 (50%), Gaps = 17/116 (14%)
Query: 80 SVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-V 135
++V R+ G +GY+ D SA + +S AV+ G++Y+AD N+ IRK++ + G +
Sbjct: 435 NIVSRVVGS--EGYNGDDIAATSALLNYARSIAVNSNGDVYIADTYNNRIRKVSFSTGII 492
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQI 186
+T+AG G G + + + +L F + + I+D N IR+I
Sbjct: 493 STVAGTG------IAGQNGDGNLATSAQLNFPSGVAVSSNGDIYIADTENHRIRKI 542
>gi|302520584|ref|ZP_07272926.1| NHL repeat containing protein [Streptomyces sp. SPB78]
gi|302429479|gb|EFL01295.1| NHL repeat containing protein [Streptomyces sp. SPB78]
Length = 634
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 66/124 (53%), Gaps = 8/124 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + +V D++R A L ++ V+R G G +G DG+ ARF++P
Sbjct: 183 PGKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGSGERGLRDGDAAGARFNEP 238
Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSK-KEGR-ADGPAQNASFSNDF 162
+ A+ G + VAD NH +R++ V+T+AG G + +G GPA+ S S+ +
Sbjct: 239 QGLALLPDGAVVVADTVNHALRRVDPASGEVSTLAGTGRQWWQGEPVAGPAREVSLSSPW 298
Query: 163 ELTF 166
++ +
Sbjct: 299 DVAW 302
>gi|428183007|gb|EKX51866.1| hypothetical protein GUITHDRAFT_84912 [Guillardia theta CCMP2712]
Length = 351
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 1/105 (0%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGS 143
LAG G G++DG+ S+ F++P + G +YVAD N+ +RK+ T+ G
Sbjct: 64 LAGTGTPGFADGDWNSSSFNQPCGVSTSPDGQWLYVADSGNNRVRKVNTTSGATLTLVGK 123
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G ADG A S S + P L I+D +Q IRQ+ L
Sbjct: 124 GDLGSADGEAATVSLSTPLSVAASPDGRYLYIADSLSQRIRQVRL 168
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 102 RFDKPKSFAVDMKG-NIYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRADGPAQNASF 158
RF+ P A+ +YV D+ NH IRK++ + TT+AG G+ G ADG ++SF
Sbjct: 25 RFNCPYGLALSPDDLWLYVGDRDNHRIRKVSMRDGSTTTLAGTGTP--GFADGDWNSSSF 82
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
+ ++ P L ++D GN +R++N
Sbjct: 83 NQPCGVSTSPDGQWLYVADSGNNRVRKVN 111
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 1/106 (0%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGS 143
L G G G +DGE + P S A G +Y+AD + IR++ T GS
Sbjct: 120 LVGKGDLGSADGEAATVSLSTPLSVAASPDGRYLYIADSLSQRIRQVRLADGWTRTLAGS 179
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ +DGP +SF+ + P + ++D N+ +R+I ++
Sbjct: 180 GNQSFSDGPPSESSFNTPAAIAVSPDGLTVYVADLNNERVRKIAVR 225
>gi|329927682|ref|ZP_08281814.1| copper amine oxidase N-terminal domain protein [Paenibacillus sp.
HGF5]
gi|328938318|gb|EGG34710.1| copper amine oxidase N-terminal domain protein [Paenibacillus sp.
HGF5]
Length = 1251
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 56/104 (53%), Gaps = 4/104 (3%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
A ++P AVD KG +Y+AD NH IRKI G V TIAG G + G DG AQ+A F
Sbjct: 109 AFLNRPSGIAVDGKGQVYIADSGNHAIRKIDQAGRVNTIAGNG--RIGLKDGKAQDALFH 166
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
++ V L ++D N +IR+I+ E + S S A+
Sbjct: 167 EPQDVA-VTEDGILYVADTLNHVIRRISPDGEVTTIGSPSTRAV 209
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 60/142 (42%), Gaps = 17/142 (11%)
Query: 55 GSSDLIVLDSSRSAFYTLSF------------PLSEESVVKRLAGDGVQGYSDGEPGSAR 102
G +L V DS A + F P S+ L D GY DG +AR
Sbjct: 242 GKGNLYVSDSGNQAIRYIDFGKGTVSTAAGSVPSSDGYAKDALYAD--PGYLDGAADAAR 299
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
F+ P+ A ++ + +AD NH +R++ + VTT+ GG G ADG F+
Sbjct: 300 FNSPRGLAWSVEDGLLIADSHNHAVRQLKDERVTTMVGG---TRGYADGIESEVRFNAPA 356
Query: 163 ELTFVPHICALLISDHGNQLIR 184
++ L I+D N +R
Sbjct: 357 DIGVASDSGELFIADQRNGAVR 378
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 70/155 (45%), Gaps = 15/155 (9%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
+N S I G + + DS A + + V +AG+G G DG+ A F
Sbjct: 111 LNRPSGIAVDGKGQVYIADSGNHAIRKID----QAGRVNTIAGNGRIGLKDGKAQDALFH 166
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK--EGRAD-----GPAQNAS 157
+P+ AV G +YVAD NHVIR+I+ G T G S + + RA G +N S
Sbjct: 167 EPQDVAVTEDGILYVADTLNHVIRRISPDGEVTTIGSPSTRAVQVRAGVIALAGDYKNGS 226
Query: 158 FS----NDFELTFVPHICALLISDHGNQLIRQINL 188
+ N+ + L +SD GNQ IR I+
Sbjct: 227 LAEAQFNEPAGLALDGKGNLYVSDSGNQAIRYIDF 261
>gi|410098158|ref|ZP_11293137.1| hypothetical protein HMPREF1076_02315 [Parabacteroides goldsteinii
CL02T12C30]
gi|409222431|gb|EKN15373.1| hypothetical protein HMPREF1076_02315 [Parabacteroides goldsteinii
CL02T12C30]
Length = 446
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 4/104 (3%)
Query: 89 GVQ---GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
GVQ G+ DGE A F++PK D G +YVAD NHVIRKIT GV + G +
Sbjct: 344 GVQNETGWQDGEREEALFNEPKQICFDQDGVMYVADAGNHVIRKITRDGVVSTVIGIAGV 403
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+G DG ++A F + + + + D N +R++ ++
Sbjct: 404 KGYVDGSPEDALFQYPTGVA-IDKEGTIYVGDARNNCVRKLAIE 446
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ + VV + G GV+GY DG P A F P A+D +G IYV D N+ +RK+
Sbjct: 388 ITRDGVVSTVIGIAGVKGYVDGSPEDALFQYPTGVAIDKEGTIYVGDARNNCVRKL 443
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 44/93 (47%), Gaps = 12/93 (12%)
Query: 73 SFPLSEESVVKRLAGDGVQGYS------DGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
+F + ++ V +AG V G DG A FD+P VD + NI+V+++ H
Sbjct: 127 TFSYTTQTTVSTIAGKPVGGDDGHGEMIDGTLAEASFDQPWFLCVDAEKNIFVSERLGHA 186
Query: 127 IRKITNLG--VTTIAGGGSKKEGRADGPAQNAS 157
+R+I V+T+ G EG P +A+
Sbjct: 187 VRQINEEKNVVSTLVKG----EGNCQNPNASAT 215
>gi|428186137|gb|EKX54988.1| hypothetical protein GUITHDRAFT_59145, partial [Guillardia theta
CCMP2712]
Length = 231
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 64/142 (45%), Gaps = 19/142 (13%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
+ V AG G+QG+ DG G ARF +P + + +YVAD +NH IR I + V
Sbjct: 17 NNFVSYFAGSGIQGFRDGAAGDARFSEPLGI-IHFRNKLYVADSNNHAIRSIDPISGEVA 75
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLKP 190
T+A G+++ G DG A + L P L ++D GN +I+ ++
Sbjct: 76 TVA--GNQQRGNLDGAASQSQLDTPTSLLNRPEGVVFDSHGHLYVADTGNHVIKMLDFSS 133
Query: 191 --------EDCSKSSQSGSALG 204
+ S +S GS G
Sbjct: 134 GKVLRTIGTEVSSTSNRGSRFG 155
>gi|302543897|ref|ZP_07296239.1| putative NHL repeat protein [Streptomyces hygroscopicus ATCC 53653]
gi|302461515|gb|EFL24608.1| putative NHL repeat protein [Streptomyces himastatinicus ATCC
53653]
Length = 608
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 64/117 (54%), Gaps = 8/117 (6%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
PG + +V D++R L+ ESV++R+ G G +G +DG ARF +P+ A
Sbjct: 184 PGGT-FLVSDTTRHQLVELA--ADGESVLRRI-GTGERGLADGRADRARFSEPQGLAALP 239
Query: 114 KGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
G + VAD NH +R+ VTT+AG G + + A+GPA++ S+ ++L +
Sbjct: 240 DGTVAVADTVNHALRRFDPATGEVTTLAGTGRQWWQGSPAEGPARDIDLSSPWDLAW 296
>gi|332027687|gb|EGI67755.1| NHL repeat-containing protein 2 [Acromyrmex echinatior]
Length = 686
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 89/197 (45%), Gaps = 20/197 (10%)
Query: 5 FVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDS 64
+Y+ +AI L + + GL L+ + + L +V+ + L++ DS
Sbjct: 180 LLYTKIAIAYFKSLKQMSNHGLPLKPAHHLLPSSKDGLLFPGKLTVLQLEQETKLVISDS 239
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
+ + E V+ + G QG+ DG+ +ARF+ P+ V + IYVAD +N
Sbjct: 240 GNNRIVITN----EHGRVEHVIGGCNQGFKDGDFKNARFNSPQGVCV-LNNIIYVADNNN 294
Query: 125 HVIRKI--TNLGVTTIAGGGSKKEGRADG----------PAQNASFSNDFELTFVPHICA 172
H IRKI V+TIAG GS+ R G P A + ++++ T VP
Sbjct: 295 HAIRKIDLAKRIVSTIAGTGSQGCDRKGGKHGTDQALSSPWDVAIYHHEYKGTMVP---V 351
Query: 173 LLISDHGNQLIRQINLK 189
LLI+ G I + L+
Sbjct: 352 LLIAIAGTHQIWALFLE 368
>gi|206602867|gb|EDZ39348.1| Conserved protein of unknown function [Leptospirillum sp. Group II
'5-way CG']
Length = 374
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 64/153 (41%), Gaps = 8/153 (5%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + P L + DS + + V +AG DG A F+
Sbjct: 117 NPEGMAISPDGKTLYIADSRNNMIRKIDL---ATKTVSTIAGHSFPSSGDGVGKEAGFET 173
Query: 106 PKSFAVDMKGN-IYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
P+ A+ G +YVAD N+ IRKI TN VTT+AG G G ADG A+F
Sbjct: 174 PRGLAISPDGKTLYVADSGNNAIRKIDLATNT-VTTLAGAGKLMSGSADGVGVQATFHEP 232
Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
L L I+D N LIR++ L S
Sbjct: 233 RSLAISGDGQVLYIADTRNNLIRKMVLATNSVS 265
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
Y DG +ARF+ P+ A+ G +Y+AD N++IRKI V+TIAG G
Sbjct: 105 YRDGVGANARFNNPEGMAISPDGKTLYIADSRNNMIRKIDLATKTVSTIAGHSFPSSG-- 162
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
DG + A F L P L ++D GN IR+I+L + + +G
Sbjct: 163 DGVGKEAGFETPRGLAISPDGKTLYVADSGNNAIRKIDLATNTVTTLAGAG 213
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 3/106 (2%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGR 148
+G DG +ARF+ P+ G IY+AD N +IRK+ + V +AG + R
Sbjct: 47 RGAIDGAGTNARFEFPQGILAAPDGTIYIADTGNDMIRKMNPSTKSVENVAGSDHRARYR 106
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
DG NA F+N + P L I+D N +IR+I+L + S
Sbjct: 107 -DGVGANARFNNPEGMAISPDGKTLYIADSRNNMIRKIDLATKTVS 151
>gi|224536973|ref|ZP_03677512.1| hypothetical protein BACCELL_01849 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521412|gb|EEF90517.1| hypothetical protein BACCELL_01849 [Bacteroides cellulosilyticus
DSM 14838]
Length = 446
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 51/97 (52%), Gaps = 1/97 (1%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
+GY+DG A+F++P+ D N YV D++N++IRK+ G + G ++ G AD
Sbjct: 347 KGYTDGPGSKAQFNEPQQGCFDNDDNFYVCDQNNNLIRKVEPSGQVSTFAGRREEWGWAD 406
Query: 151 GP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + A F F + + + I+D N+ IR I
Sbjct: 407 GDLRKEARFDRPFGIAYNRNTSEFYIADKNNKRIRII 443
>gi|410096564|ref|ZP_11291551.1| hypothetical protein HMPREF1076_00729 [Parabacteroides goldsteinii
CL02T12C30]
gi|409226528|gb|EKN19437.1| hypothetical protein HMPREF1076_00729 [Parabacteroides goldsteinii
CL02T12C30]
Length = 457
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 59/117 (50%), Gaps = 1/117 (0%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S P ++ ++ + G G +G +A+ + + A D +GN+Y+AD +NHVI K+
Sbjct: 342 STPWITQNEIEHIVGTGKGAAKEGNGKAAQLGEIEGMAADQEGNVYLADYTNHVIWKVDE 401
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
TI G + G DG Q A F+ +++ P L ++D N LIR I ++
Sbjct: 402 EFNATIFAGVPGESGYKDGKPQEALFNKPYDVAATPD-GILYVADTYNYLIRCIAIQ 457
>gi|196228192|ref|ZP_03127059.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
gi|196227595|gb|EDY22098.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
Length = 357
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 76/141 (53%), Gaps = 8/141 (5%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
+ ++ ++ +AG G +G + DG P +A F+ +F V G++ +AD N+++R+I
Sbjct: 71 VGKDGILHVIAGTGAKGATGDGGPATAATFNGIHNFVVLRNGDLLLADSFNNLLRRIDAK 130
Query: 133 LGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
GV TT+AGG K GPA++A FS E+ P L +D GN+ +R I+L
Sbjct: 131 TGVITTVAGGVKKGFAGDGGPAKDALFSTLIEIALNPAGTKLYCADIGNRRVRCIDLATN 190
Query: 192 DCSKSSQSGSAL----GAVSV 208
+ + +G A GAV+V
Sbjct: 191 SITTVAGNGKAGVPADGAVAV 211
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 53/110 (48%), Gaps = 7/110 (6%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVT 136
+K + G G +G S D PG A + PK A+D G++ +AD NHVIR+ T L +T
Sbjct: 248 IKTVVGTGAKGLSGDAGPGLQATMNGPKYIALDRDGSVLIADAENHVIRRYSPKTGL-IT 306
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+AG G G DG +T P AL I+D N I +I
Sbjct: 307 RVAGTGKPGNGGLDGDPLQCQLHRPHGVTVGPD-GALYITDSYNDRILKI 355
>gi|153808835|ref|ZP_01961503.1| hypothetical protein BACCAC_03135 [Bacteroides caccae ATCC 43185]
gi|149128661|gb|EDM19879.1| NHL repeat protein [Bacteroides caccae ATCC 43185]
Length = 426
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 50/98 (51%), Gaps = 1/98 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY DG+ +ARF+ P D+ G +Y+AD SNH IR I G + G + G DG
Sbjct: 330 GYEDGKRLNARFNFPSQICFDLDGIMYIADSSNHCIRSIDREGAVSTVIGVPGRAGYVDG 389
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+A F + + V + I+D N+ IR++ ++
Sbjct: 390 TPDDALFDEPWGVA-VDEEGTIYIADTKNKCIRKLAIQ 426
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 26/42 (61%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G GY DG P A FD+P AVD +G IY+AD N IRK+
Sbjct: 382 GRAGYVDGTPDDALFDEPWGVAVDEEGTIYIADTKNKCIRKL 423
>gi|440789641|gb|ELR10946.1| NHL repeat containing 2 isoform 3, putative [Acanthamoeba
castellanii str. Neff]
Length = 934
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 74/161 (45%), Gaps = 9/161 (5%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
PH V +DR G + V DS LS + + + + D G DG+ +ARF
Sbjct: 298 PHGVAVDREGDR-MFVADSGHHRILVLSLDGAFRTAIG--SDDATTGLVDGDYATARFHS 354
Query: 106 PKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKK-EGRADGPAQNASFSNDF 162
P + + +YVAD NH IR ++ V T+AG G + E + G A S SN +
Sbjct: 355 PLGLSYAGEDKLYVADSENHCIRCVRLAEERVVTVAGTGQRGYERKGGGRALEWSLSNPW 414
Query: 163 ELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
++ H L I+ G I + N + E+ S S SGS L
Sbjct: 415 DV--ASHGHDLYIAMAGTHQIWKYNEQSEEISLVSGSGSEL 453
>gi|386841352|ref|YP_006246410.1| hypothetical protein SHJG_5269 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374101653|gb|AEY90537.1| hypothetical protein SHJG_5269 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451794647|gb|AGF64696.1| hypothetical protein SHJGH_5033 [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 611
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 64/113 (56%), Gaps = 7/113 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +V D++R L+ E+VV+R+ G G +G++DG P A F +P+ A+ G
Sbjct: 189 SGTFLVSDTTRHQLVELAE--DAETVVRRI-GSGTRGFADGGPREASFSEPQGLALLDDG 245
Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGRA-DGPAQNASFSNDFEL 164
++ VAD NH +R++ VTT+AG G + +G A GPA+ S+ +++
Sbjct: 246 SVVVADTVNHALRRLDPATGEVTTLAGTGKQWWQGSATSGPAREVDLSSPWDV 298
>gi|393789630|ref|ZP_10377750.1| hypothetical protein HMPREF1068_04030 [Bacteroides nordii
CL02T12C05]
gi|392650346|gb|EIY44015.1| hypothetical protein HMPREF1068_04030 [Bacteroides nordii
CL02T12C05]
Length = 435
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 1/98 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY+DG A FD P+ + + ++++AD +NHVIRKIT GV + G + G DG
Sbjct: 339 GYADGPCEYALFDTPRQIILTAENDLFLADSNNHVIRKITQDGVVSTVIGQAGMTGNQDG 398
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ A F+ F + V + I D NQ IR++ ++
Sbjct: 399 TPEVALFNEPFGVA-VDTDGTIYIGDSKNQSIRRLAIE 435
Score = 44.7 bits (104), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 51/122 (41%), Gaps = 22/122 (18%)
Query: 31 GYTVTTVIDGHQLEINPHS----VIDRPGSSD----LIVLDSSRSAFYTLSFPL----SE 78
YT I + L+ H +++ PG +D + D+ R T L S
Sbjct: 311 AYTNAHCIYTYNLKTGAHKLFAGMVNTPGYADGPCEYALFDTPRQIILTAENDLFLADSN 370
Query: 79 ESVVKRLAGDGV----------QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
V++++ DGV G DG P A F++P AVD G IY+ D N IR
Sbjct: 371 NHVIRKITQDGVVSTVIGQAGMTGNQDGTPEVALFNEPFGVAVDTDGTIYIGDSKNQSIR 430
Query: 129 KI 130
++
Sbjct: 431 RL 432
>gi|423220897|ref|ZP_17207391.1| hypothetical protein HMPREF1061_04164 [Bacteroides caccae
CL03T12C61]
gi|392622375|gb|EIY16503.1| hypothetical protein HMPREF1061_04164 [Bacteroides caccae
CL03T12C61]
Length = 426
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 50/98 (51%), Gaps = 1/98 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY DG+ +ARF+ P D+ G +Y+AD SNH IR I G + G + G DG
Sbjct: 330 GYEDGKRLNARFNFPSQICFDLDGIMYIADSSNHCIRSIDREGAVSTVIGVPGRAGYVDG 389
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+A F + + V + I+D N+ IR++ ++
Sbjct: 390 TPDDALFDEPWGVA-VDEEGTIYIADTKNKCIRKLAIQ 426
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 26/42 (61%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G GY DG P A FD+P AVD +G IY+AD N IRK+
Sbjct: 382 GRAGYVDGTPDDALFDEPWGVAVDEEGTIYIADTKNKCIRKL 423
>gi|288923319|ref|ZP_06417453.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288345335|gb|EFC79730.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 752
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 52/103 (50%), Gaps = 3/103 (2%)
Query: 101 ARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
A FD + +D +G IYVAD N+ IR+I G +TT+AG G+ GPA AS
Sbjct: 415 AEFDSARDLTIDTTRGVIYVADTENNRIRRIDRTGRITTVAGNGTAGFDGDGGPATAASL 474
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
+ P L I+D+GN IR+I + + QSGS
Sbjct: 475 QEPSGVVVAPD-GTLFIADNGNHRIRRIGTNGVITTIAGQSGS 516
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 73/153 (47%), Gaps = 28/153 (18%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRL---------AGDGVQGY-SDGEPGSA-RFDKPKS 108
+ +D++R Y +E + ++R+ AG+G G+ DG P +A +P
Sbjct: 423 LTIDTTRGVIYVAD---TENNRIRRIDRTGRITTVAGNGTAGFDGDGGPATAASLQEPSG 479
Query: 109 FAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG----GGSKKEGRADG--------PAQN 155
V G +++AD NH IR+I TN +TTIAG GG E DG PA N
Sbjct: 480 VVVAPDGTLFIADNGNHRIRRIGTNGVITTIAGQSGSGGFAGEVSEDGLIFAGDGLPAVN 539
Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
A +N + +LL++D NQ IR+I L
Sbjct: 540 AKLNNP-NSLLLDDDGSLLVADGTNQRIRRIGL 571
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 44/81 (54%), Gaps = 4/81 (4%)
Query: 82 VKRLAGDGVQGYS-DGEPGS-ARFDKPKS-FAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ +AG G G+S DG P + A+ + AVD G IY++D +N IR+IT G +TT
Sbjct: 642 ISTVAGTGDAGFSGDGGPATRAKISTVGADVAVDSTGTIYLSDPANSRIRRITPDGIITT 701
Query: 138 IAGGGSKKEGRADGPAQNASF 158
IAG G + GPA A
Sbjct: 702 IAGTGVAQYSGNGGPAVEAEL 722
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 1/89 (1%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
+A P + A G+IY+ D+ N +R+I G ++T+AG G GPA A
Sbjct: 606 AATLSYPCALARGPDGSIYLVDQDNFRLRRIDPAGRISTVAGTGDAGFSGDGGPATRAKI 665
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
S V + +SD N IR+I
Sbjct: 666 STVGADVAVDSTGTIYLSDPANSRIRRIT 694
>gi|383111216|ref|ZP_09932033.1| hypothetical protein BSGG_2316 [Bacteroides sp. D2]
gi|313694781|gb|EFS31616.1| hypothetical protein BSGG_2316 [Bacteroides sp. D2]
Length = 419
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 20/152 (13%)
Query: 56 SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
S + IV D +++ S +S + K +AGDGV+G Y DG+PG SA+
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGDPGNPLSAKVGA 329
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT--------NLGVTTIAGGGSKKEGRADGPAQNAS 157
A G +Y++D + + IRK+T + TIAG G K G +DG A+
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDASGDYSKGTLETIAGSG--KSGFSDGKGLKAT 387
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLK 189
F+ +E+ + ++ N LIR+I +K
Sbjct: 388 FNQPYEIIITKDCKTMYVAGAVNYLIRRITVK 419
>gi|442323754|ref|YP_007363775.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
gi|441491396|gb|AGC48091.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
Length = 906
Score = 57.8 bits (138), Expect = 5e-06, Method: Composition-based stats.
Identities = 41/109 (37%), Positives = 54/109 (49%), Gaps = 7/109 (6%)
Query: 82 VKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTT 137
V LAG Q G+ DG AR +P AVD GNI+VAD NH +R+I VTT
Sbjct: 479 VSTLAGVAFQDGFVDGPVARARLRRPVGVAVDGLGNIFVADTGNHAVRRIAPDAARTVTT 538
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
IAG G+ G +GP + + + P L ++D GN I +I
Sbjct: 539 IAGLGTP--GVGEGPGATTALRSPQSIAVAPD-GTLYVADTGNHRIVRI 584
Score = 55.1 bits (131), Expect = 3e-05, Method: Composition-based stats.
Identities = 43/131 (32%), Positives = 58/131 (44%), Gaps = 7/131 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTI 138
V +AG G G +G + P+S AV G +YVAD NH I +I G V+T
Sbjct: 536 VTTIAGLGTPGVGEGPGATTALRSPQSIAVAPDGTLYVADTGNHRIVRIARDGRWTVSTF 595
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS--KS 196
AG ++GRADG A F L F L ++D N + +I + + S
Sbjct: 596 AGSREGRQGRADGTGPAARFQTPTSLVFAG--TDLYVTDTFNHRLARITPQARVSTVIGS 653
Query: 197 SQSGSALGAVS 207
SGS G S
Sbjct: 654 RGSGSTNGPAS 664
Score = 50.1 bits (118), Expect = 9e-04, Method: Composition-based stats.
Identities = 39/110 (35%), Positives = 56/110 (50%), Gaps = 8/110 (7%)
Query: 82 VKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTI 138
V AG +G QG +DG +ARF P S V ++YV D NH + +IT V+T+
Sbjct: 592 VSTFAGSREGRQGRADGTGPAARFQTPTSL-VFAGTDLYVTDTFNHRLARITPQARVSTV 650
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G S+ G +GPA A + F AL + D GN+LIR++ +
Sbjct: 651 IG--SRGSGSTNGPASQARLHRPTAVAFGDG--ALWVVDTGNRLIRRVAM 696
Score = 40.4 bits (93), Expect = 0.76, Method: Composition-based stats.
Identities = 27/87 (31%), Positives = 43/87 (49%), Gaps = 3/87 (3%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P S AV + G + + D N IR + V T AG G+ G DG A+ A+FS +
Sbjct: 725 PMSGAVHVDGRLLLTDTGNERIRMLVGGRVRTYAGSGA--HGARDGTAEQATFSLPTGIA 782
Query: 166 FVPHICALLISDHGNQLIRQINLKPED 192
+P+ +L+ D G +R++ + D
Sbjct: 783 ALPN-GNMLVVDQGASTLRELRMPAAD 808
>gi|443625938|ref|ZP_21110373.1| putative NHL repeat protein [Streptomyces viridochromogenes Tue57]
gi|443340614|gb|ELS54821.1| putative NHL repeat protein [Streptomyces viridochromogenes Tue57]
Length = 632
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 73/139 (52%), Gaps = 14/139 (10%)
Query: 39 DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
DG + P + R PG S D +V D++R L+ E+VV+R+ G G +
Sbjct: 169 DGPYVAPEPEPTVLRFPGKVLLLPSGDFLVSDTTRHQLVELAG--DGETVVRRI-GQGSR 225
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGR 148
G +DG ARF +P+ A+ G + VAD NH++R+ TT+AG G + +G+
Sbjct: 226 GSTDGYADRARFQEPQGLALLPDGTVAVADTVNHLVRRFDPETGRTTTLAGTGIQWMQGQ 285
Query: 149 A-DGPAQNASFSNDFELTF 166
A GP ++ S S+ +++ +
Sbjct: 286 ATSGPGRDVSLSSPWDVAW 304
Score = 37.7 bits (86), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 51/117 (43%), Gaps = 5/117 (4%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
GDG + EP RF P + G+ V+D + H + ++ G T + G
Sbjct: 168 GDGPYVAPEPEPTVLRF--PGKVLLLPSGDFLVSDTTRHQLVELAGDGETVVRRIGQGSR 225
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
G DG A A F L +P + ++D N L+R+ + PE ++ +G+ +
Sbjct: 226 GSTDGYADRARFQEPQGLALLPD-GTVAVADTVNHLVRRFD--PETGRTTTLAGTGI 279
>gi|365121900|ref|ZP_09338810.1| hypothetical protein HMPREF1033_02156 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643613|gb|EHL82920.1| hypothetical protein HMPREF1033_02156 [Tannerella sp.
6_1_58FAA_CT1]
Length = 443
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 52/104 (50%), Gaps = 13/104 (12%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY DG A F++P+ +D N+Y+AD NHVIRKIT G + G + G DG
Sbjct: 347 GYLDGIGSYAMFNQPRQLILDEDDNLYIADTENHVIRKITPQGQVSTVIGQAGVAGYQDG 406
Query: 152 PAQNASFSNDFELTFVPH-IC-----ALLISDHGNQLIRQINLK 189
+ A F PH +C + I D+ NQ IR++ ++
Sbjct: 407 DPEVALFDR-------PHGVCINKEGIIYIGDYENQCIRRLAIE 443
>gi|290989086|ref|XP_002677176.1| predicted protein [Naegleria gruberi]
gi|284090782|gb|EFC44432.1| predicted protein [Naegleria gruberi]
Length = 2103
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 62/115 (53%), Gaps = 6/115 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+S ++ +AG+G G+ G+ G A ++P A+ G +Y+AD NH IRK++N
Sbjct: 525 VSNNGIITTIAGNGTAGFC-GDGGLAVNTCLNRPNGIAISSSGELYIADYGNHRIRKVSN 583
Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +TTIAG G+ A NAS + +++ + + I+D GN IR+I
Sbjct: 584 NGIITTIAGNGNTIYNGDGIDAANASLYSPVDVSIGAN-NEIYIADAGNYRIRKI 637
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 72/153 (47%), Gaps = 9/153 (5%)
Query: 38 IDGHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
ID + +N PH V P S ++ D + + + +AG+G Y+ D
Sbjct: 105 IDATKATVNSPHGVAFHPTSGEVYFADYANHRIR----KIYSNGTITTIAGNGEPNYNGD 160
Query: 96 GEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPA 153
P SA+ P AV G +++AD N+ IRKI+N G +TTIAG G+ GPA
Sbjct: 161 NIPATSAQLGYPIGIAVSSGGEVFIADTLNNRIRKISNNGMITTIAGNGTGGYSGDGGPA 220
Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
NA + F + + ++ +D N IR++
Sbjct: 221 VNAMLNTPFGVA-IDSDGNIVFADLVNNRIRKV 252
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 65/132 (49%), Gaps = 8/132 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +LIV +++ S + + L+ +V +AG G DG S F F + G
Sbjct: 790 SENLIVCETTSS--WKIRKILATTGMVSTIAG----GIGDGLNASNAFLVSTLFDISNSG 843
Query: 116 NIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
IY+AD NH IRKI TN + TIAG G G A +A +N ++ V L+
Sbjct: 844 EIYIADTGNHRIRKIFTNGTIITIAGNGIGGYAGDGGQATSAQLNNPKDVA-VSSNGELI 902
Query: 175 ISDHGNQLIRQI 186
I+D+ N IR++
Sbjct: 903 IADYSNNRIRKV 914
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 62/125 (49%), Gaps = 35/125 (28%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLG-VT 136
++ +AGDG GY+ DG + A + P A+D N Y+AD +N+ IRK+++ G +T
Sbjct: 313 IISSIAGDGQSGYNYDGIDATIATLNNPVGIAIDSSNNEFYIADSNNNRIRKVSSSGKIT 372
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL-----ISDHGNQL--------I 183
TIAGG S SF N+ VP I A + +S + NQL
Sbjct: 373 TIAGGTS-------------SFGNN-----VPAIPAFISPIGSMSLNNNQLSFADSSGQF 414
Query: 184 RQINL 188
RQINL
Sbjct: 415 RQINL 419
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 56/115 (48%), Gaps = 5/115 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI-T 131
+S ++ +AG+G GY+ DG + A + P A G +Y AD +NH IRKI +
Sbjct: 83 ISNNGIISTIAGNGQAGYNGDGIDATKATVNSPHGVAFHPTSGEVYFADYANHRIRKIYS 142
Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
N +TTIAG G + PA +A + V + I+D N IR+I
Sbjct: 143 NGTITTIAGNGEPNYNGDNIPATSAQLGYPIGIA-VSSGGEVFIADTLNNRIRKI 196
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 61/140 (43%), Gaps = 10/140 (7%)
Query: 76 LSEESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
+S ++ +AG+G Y+ DG + +A P ++ IY+AD N+ IRKI TN
Sbjct: 581 VSNNGIITTIAGNGNTIYNGDGIDAANASLYSPVDVSIGANNEIYIADAGNYRIRKIFTN 640
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFS-------NDFELTFVPHICALLISDHGNQLIRQ 185
+ TIAG G+ +G NA+ N + F + ++ + N I
Sbjct: 641 GTIVTIAGTGTNGFSGDNGLGSNATIGYPSSVLFNSGNVYFTDIVYCVIRKIYSNGTITT 700
Query: 186 INLKPEDCSKSSQSGSALGA 205
I+ K C+ G A A
Sbjct: 701 ISGKAGTCTYGGDGGKASNA 720
Score = 41.6 bits (96), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 74/158 (46%), Gaps = 10/158 (6%)
Query: 31 GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
GYT T+I + NP+S+ + ++D+ V+ +S S Y +LAG
Sbjct: 432 GYT--TIIQTYS---NPNSI--KFFNNDMYVMYTSNSVSYIDRIVNLNYGPAVKLAGAIF 484
Query: 91 QGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGR 148
+ SA + P ++ G + ++D +NH IRK++N G +TTIAG G+
Sbjct: 485 YNGDNIPATSAIMNYPSHCVINSTNGELIISDMNNHRIRKVSNNGIITTIAGNGTAGFCG 544
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G A N + + + L I+D+GN IR++
Sbjct: 545 DGGLAVNTCLNRPNGIA-ISSSGELYIADYGNHRIRKV 581
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 44/84 (52%), Gaps = 4/84 (4%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGS 143
AG G G+ +A+ P A+ G+IY++D NH IR I+++ ++ IAG G
Sbjct: 705 AGTCTYGGDGGKASNAQLSYPAGIAISSTGDIYISDNYNHRIRVISSVTGIISNIAGTG- 763
Query: 144 KKEGRADGPAQN-ASFSNDFELTF 166
+ E DG ++ +F+ LTF
Sbjct: 764 RSEYNGDGLHESITNFAYPVGLTF 787
Score = 37.0 bits (84), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 51/107 (47%), Gaps = 4/107 (3%)
Query: 82 VKRLAGDGVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
+ +AG G DG P +A + + A G I +AD N+ IRKI+N G ++TIA
Sbjct: 36 ITTVAGGG-NSVGDGLPAVNALLNSVRDVAY-YSGVIVIADTLNNRIRKISNNGIISTIA 93
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G A A+ ++ + F P + +D+ N IR+I
Sbjct: 94 GNGQAGYNGDGIDATKATVNSPHGVAFHPTSGEVYFADYANHRIRKI 140
>gi|37522270|ref|NP_925647.1| hypothetical protein gll2701 [Gloeobacter violaceus PCC 7421]
gi|35213270|dbj|BAC90642.1| gll2701 [Gloeobacter violaceus PCC 7421]
Length = 351
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 55/114 (48%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + ++ +AG G Y DG P ARF+ P AVD GN+ VAD NH IR I
Sbjct: 124 VGTDGRIETIAGTGEADYGGDGGPAKQARFNDPAGVAVDALGNVLVADTYNHRIRTIGPD 183
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + T+AG G GPA A + + P +LI+D GN IR I
Sbjct: 184 GTIRTVAGTGQAGYSGDGGPATAARLDFPWGVAVAPD-GRILIADTGNNRIRSI 236
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 53/177 (29%), Positives = 88/177 (49%), Gaps = 16/177 (9%)
Query: 17 LLFSLVSSGLLLEDGYTVTTVIDGHQLEIN-----PHSVIDRPGSSDLIVLDSSRSAFYT 71
+ F+L+ + L + + +V+ G+ +++ P V+ P + +L V++ SR+
Sbjct: 13 VWFALLGTAL---GSFALRSVVAGNHIKVAEEPLYPVFVLPAP-AGELFVVEHSRNQILK 68
Query: 72 LSFPLSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
L P +V+ AG+G YS DG P + A D GN+Y+AD ++H +R++
Sbjct: 69 L-VPGRLPTVI---AGNGTSDYSGDGGPATRAGLFMMGIARDRAGNLYIADHNHHRVRRV 124
Query: 131 TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + TIAG G G GPA+ A F ND V + +L++D N IR I
Sbjct: 125 GTDGRIETIAGTGEADYGGDGGPAKQARF-NDPAGVAVDALGNVLVADTYNHRIRTI 180
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 55/114 (48%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + ++ +AG G GYS DG P +A R D P AV G I +AD N+ IR I
Sbjct: 180 IGPDGTIRTVAGTGQAGYSGDGGPATAARLDFPWGVAVAPDGRILIADTGNNRIRSIGPD 239
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + T+AG G G GPA A +L H L ++D N +R+I
Sbjct: 240 GTIRTVAGTGQAGFGGDGGPAVKARLERP-QLAVADHRGNLFVADTNNNRVRRI 292
Score = 40.0 bits (92), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 48/88 (54%), Gaps = 11/88 (12%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
AR ++P+ D +GN++VAD +N+ +R+I G ++T+AGG + PA A+
Sbjct: 263 ARLERPQLAVADHRGNLFVADTNNNRVRRIAPDGTISTVAGG--------EPPA--AARL 312
Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
ND V L I+D GN + +I+
Sbjct: 313 NDPFAVGVDERGHLYIADTGNFRVLKID 340
>gi|406836139|ref|ZP_11095733.1| NHL repeat containing protein [Schlesneria paludicola DSM 18645]
Length = 365
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 67/135 (49%), Gaps = 13/135 (9%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKG 115
D+I++ +A Y + +E V+ LAG G +GY+ DG G A+ + PK A+D +G
Sbjct: 236 DVILVLREGNAVYRID---RKEKSVRHLAGTGKKGYAGDGGDGKLAQVNGPKGIAIDHQG 292
Query: 116 NIYVADKSNHVIRKITNL--GVTTIAGGGSKKEGRADGPAQNASFS--NDFELTFVPHIC 171
NI + D NHVIR I L + T+ G G+ DGP N N FV
Sbjct: 293 NILLCDTENHVIRIIERLTGKIDTLVGDGTI----GDGPDGNPRHCRLNRPHGVFVALDG 348
Query: 172 ALLISDHGNQLIRQI 186
+ I D GN IR++
Sbjct: 349 TVYIGDSGNHKIRKL 363
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 50/100 (50%), Gaps = 6/100 (6%)
Query: 94 SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRA 149
DG P A+F +P D GN Y++D + VIR+I T + +TT+AG
Sbjct: 99 GDGGPALRAKFHEPHEIRFDRNGNYYISDMKSDVIRRIDAKTQI-ITTVAGTAKPGFTGD 157
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
GPA A F+N ++ LLI D N +RQ++L+
Sbjct: 158 GGPATKAEFNNPIAVSLDGD-ARLLICDIKNHRVRQVDLE 196
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 82 VKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ L GDG G DG P R ++P V + G +Y+ D NH IRK+T
Sbjct: 314 IDTLVGDGTIGDGPDGNPRHCRLNRPHGVFVALDGTVYIGDSGNHKIRKLT 364
Score = 41.2 bits (95), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 53/115 (46%), Gaps = 6/115 (5%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTI 138
+ +AG GV +S DG P A P + G +Y D +NHVIR++ G T
Sbjct: 28 IATIAGTGVDEHSGDGGPALKAGLSNPFGLEIAPDGMLYFCDFTNHVIRRMDLKTGFLTT 87
Query: 139 AGGGSKKEGRA--DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
G + G A GPA A F E+ F + ISD + +IR+I+ K +
Sbjct: 88 VAGTPRNPGFAGDGGPALRAKFHEPHEIRFDRN-GNYYISDMKSDVIRRIDAKTQ 141
>gi|326432343|gb|EGD77913.1| NHL repeat protein [Salpingoeca sp. ATCC 50818]
Length = 1505
Score = 57.8 bits (138), Expect = 5e-06, Method: Composition-based stats.
Identities = 39/114 (34%), Positives = 57/114 (50%), Gaps = 9/114 (7%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV--- 135
V+ +AG GV G+S G P AR D P+ AV G++ +AD NH +R+ N+GV
Sbjct: 1391 VITTVAGTGVAGFSGDGGLPTDARLDTPRGVAVHSSGSLAIADSGNHRVREF-NIGVGAA 1449
Query: 136 ---TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
TT AG G + A + + + +TF P +L D N+ +RQI
Sbjct: 1450 GIITTTAGNGQRGYNGDGMVATDTALNFPTGITFSPLTDNVLFVDRRNRRVRQI 1503
Score = 37.7 bits (86), Expect = 5.1, Method: Composition-based stats.
Identities = 36/117 (30%), Positives = 51/117 (43%), Gaps = 27/117 (23%)
Query: 84 RLAGDGVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK-------ITNLGV 135
R+AGD DG+ AR + P AV +Y+AD NH +R IT +
Sbjct: 1343 RVAGD----EGDGQLAIGARLNMPAGIAV-YDHMLYIADSGNHRVRAVDLYTQVITTVAG 1397
Query: 136 TTIAG----GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
T +AG GG + R D P A S+ +L I+D GN +R+ N+
Sbjct: 1398 TGVAGFSGDGGLPTDARLDTPRGVAVHSSG----------SLAIADSGNHRVREFNI 1444
>gi|116622401|ref|YP_824557.1| Ig domain-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116225563|gb|ABJ84272.1| Ig domain protein, group 1 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 2770
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 63/111 (56%), Gaps = 5/111 (4%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV-T 136
++ +AG G +GYS DG+P + A+ + P A+D GN+++AD NH +R++ N GV T
Sbjct: 141 IITTVAGTGERGYSGDGQPATLAKINSPYHIALDGHGNLFIADDGNHRVRRVDGNSGVIT 200
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
T+AG G+ D A +A N + + L I+D+GN +R ++
Sbjct: 201 TVAGTGNAGYNGDDQQATHADLQNPRGV-LIDASGNLYIADYGNHRVRVVD 250
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 10/114 (8%)
Query: 82 VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTI 138
+ +AG G+QG+S G +AR P A+ G++Y+AD NH IRK+ GV TT+
Sbjct: 25 ISTVAGTGIQGFSGDSGPATAARLFDPTDVAIHPNGDLYIADTYNHRIRKVDKNGVITTV 84
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQIN 187
AG G G A + + EL I L I+D G+ IR+++
Sbjct: 85 AGTGQATN--VGGDANDNILAVSAELNHPSGIAFDTAGNLYIADTGHDRIRRVD 136
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 52/112 (46%), Gaps = 4/112 (3%)
Query: 81 VVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-T 137
V+ +AG G GY+ D + A P+ +D GN+Y+AD NH +R + GV T
Sbjct: 198 VITTVAGTGNAGYNGDDQQATHADLQNPRGVLIDASGNLYIADYGNHRVRVVDATGVIHT 257
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
AG G G A A+ L + ++D +Q +RQ+N++
Sbjct: 258 FAGTGVYGFSGDGGAAMAANLKGPIGLG-TDAAGNIYVADGQDQRVRQVNIQ 308
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 46/90 (51%), Gaps = 3/90 (3%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNAS 157
SA + P A D GN+Y+AD + IR++ + +TT+AG G + PA A
Sbjct: 105 SAELNHPSGIAFDTAGNLYIADTGHDRIRRVDGVTGIITTVAGTGERGYSGDGQPATLAK 164
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQIN 187
++ + + H L I+D GN +R+++
Sbjct: 165 INSPYHIALDGH-GNLFIADDGNHRVRRVD 193
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 12/113 (10%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVT 136
V+ AG GV G+S G +A P D GNIYVAD + +R+ I ++
Sbjct: 254 VIHTFAGTGVYGFSGDGGAAMAANLKGPIGLGTDAAGNIYVADGQDQRVRQVNIQTRVIS 313
Query: 137 TIAGGGSKKEGRADGPAQNASFSN---DFELTFVPHICALLISDHGNQLIRQI 186
T+ G G GPA +A+ S+ F+ T + I+ +G+ +IR +
Sbjct: 314 TVVGNGQSGYSGDGGPATDATISSASATFDAT-----GRIYIAQYGSNVIRSV 361
>gi|290992224|ref|XP_002678734.1| predicted protein [Naegleria gruberi]
gi|284092348|gb|EFC45990.1| predicted protein [Naegleria gruberi]
Length = 1366
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 80/165 (48%), Gaps = 18/165 (10%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
L Y +TTV G + +P I + + DLIV D+ + +S+ VV R+AG
Sbjct: 399 LPSSYYITTVARGLK---DPQK-ISKLSNGDLIVSDTGDHSIKKISY---STGVVTRIAG 451
Query: 88 DGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG---G 141
GV G+S DG + A+F+KP AV + IY+AD NH IR I N ++T+ G G
Sbjct: 452 TGVAGFSGDGGLATLAQFNKPYGIAVTINDEIYIADSLNHRIRFIDVNGNISTVVGTSIG 511
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
S G A NA D L+ L I+D N IR++
Sbjct: 512 FSGDGGLATAAKLNAPM--DVSLSASGD---LYIADRDNYRIRKV 551
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 50/188 (26%), Positives = 85/188 (45%), Gaps = 8/188 (4%)
Query: 2 ASHFVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIV 61
+++ V SLAI L+ + ++ G L + G E + R + D++V
Sbjct: 9 SANIVGFSLAIVLIMMAMVVMVKGTSLNPYSVKISTFAGGNGEFKDPQKLARLSNGDILV 68
Query: 62 LDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYV 119
D+ A ++ V+ +AG GV G++ +G +A+ +KP AV IY
Sbjct: 69 TDALGHAIK----KINGTGVITTIAGTGVAGFAGDNGPAINAQVNKPYGIAVSSNDEIYF 124
Query: 120 ADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDH 178
AD NH IRKI G +TT+ G G+ G A + + + +++ L I+D
Sbjct: 125 ADSMNHRIRKIDITGNITTVVGTGANTFSGDGGLATDCTMNTPMDVSLSAS-GELYIADM 183
Query: 179 GNQLIRQI 186
N IR++
Sbjct: 184 YNYRIRKV 191
>gi|299144811|ref|ZP_07037879.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
gi|298515302|gb|EFI39183.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
Length = 459
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 58/124 (46%), Gaps = 19/124 (15%)
Query: 85 LAGD-GVQGYSDGEPGSARFDKP--------KSFAVDMKGNIY---VADKSNHVIRKITN 132
AG+ G GY D SARFD P + + + + +IY + D +NH IRKIT
Sbjct: 336 FAGEFGSYGYKDASQNSARFDNPMQGCFVLNEEYVAEQRADIYDFYLTDAANHCIRKITP 395
Query: 133 LG-VTTIAGGGSKK-----EGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
G VTT AG GS G DG P + A F+ L + + D+GN +R
Sbjct: 396 DGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLCYEESTGTFYVGDNGNHRVRT 455
Query: 186 INLK 189
I L+
Sbjct: 456 IALQ 459
>gi|326435583|gb|EGD81153.1| hypothetical protein PTSG_13151 [Salpingoeca sp. ATCC 50818]
Length = 1551
Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats.
Identities = 39/114 (34%), Positives = 57/114 (50%), Gaps = 9/114 (7%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV--- 135
V+ +AG GV G+S G P AR D P+ AV G++ +AD NH +R+ N+GV
Sbjct: 1437 VITTVAGTGVAGFSGDGGLPTDARLDTPRGVAVHSSGSLAIADSGNHRVREF-NIGVGAA 1495
Query: 136 ---TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
TT AG G + A + + + +TF P +L D N+ +RQI
Sbjct: 1496 GIITTTAGNGQRGYNGDGMVATDTALNFPTGITFSPLTDNVLFVDRRNRRVRQI 1549
Score = 38.9 bits (89), Expect = 2.1, Method: Composition-based stats.
Identities = 43/146 (29%), Positives = 63/146 (43%), Gaps = 30/146 (20%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVK---RLAGDGVQGYSDGEPG-SARFDKPKSFAVDM 113
+L+ D A Y ++ + SVV R+AGD DG+ AR + P AV
Sbjct: 1360 NLLFSDEENHAVYMVNPHTTIISVVAGTPRVAGD----EGDGQLAIGARLNMPAGIAV-Y 1414
Query: 114 KGNIYVADKSNHVIRK-------ITNLGVTTIAG----GGSKKEGRADGPAQNASFSNDF 162
+Y+AD NH +R IT + T +AG GG + R D P A S+
Sbjct: 1415 DHMLYIADSGNHRVRAVDLYTQVITTVAGTGVAGFSGDGGLPTDARLDTPRGVAVHSSG- 1473
Query: 163 ELTFVPHICALLISDHGNQLIRQINL 188
+L I+D GN +R+ N+
Sbjct: 1474 ---------SLAIADSGNHRVREFNI 1490
>gi|293372451|ref|ZP_06618835.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
gi|292632634|gb|EFF51228.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
Length = 446
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 58/124 (46%), Gaps = 19/124 (15%)
Query: 85 LAGD-GVQGYSDGEPGSARFDKP--------KSFAVDMKGNIY---VADKSNHVIRKITN 132
AG+ G GY D SARFD P + + + + +IY + D +NH IRKIT
Sbjct: 323 FAGEFGSYGYKDASQNSARFDNPMQGCFVLNEEYVAEQRADIYDFYLTDAANHCIRKITP 382
Query: 133 LG-VTTIAGGGSKK-----EGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
G VTT AG GS G DG P + A F+ L + + D+GN +R
Sbjct: 383 DGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLCYEESTGTFYVGDNGNHRVRT 442
Query: 186 INLK 189
I L+
Sbjct: 443 IALQ 446
>gi|325965074|ref|YP_004242980.1| thiol-disulfide isomerase-like thioredoxin [Arthrobacter
phenanthrenivorans Sphe3]
gi|323471161|gb|ADX74846.1| thiol-disulfide isomerase-like thioredoxin [Arthrobacter
phenanthrenivorans Sphe3]
Length = 674
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 50/166 (30%), Positives = 76/166 (45%), Gaps = 10/166 (6%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDL-IVLDSSRSAFYTLSF-PLSEESVVKRLA 86
ED + + GH L++ S D S L V+ + SF PL+ V +A
Sbjct: 325 EDAAGFSGQLVGHPLDVALSSPWDVVWSRKLNAVVIAMAGTHQIFSFDPLT--GSVAIVA 382
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGVTTIAGG 141
G+G++G DG P A F +P A D GNI+VAD +RK+ + V + G
Sbjct: 383 GNGLEGLLDGAPHEAWFAQPSGLAEDADGNIWVADSETSALRKLVIDDDGGVAVESAIGK 442
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G G DGPA A + +T +P ++ I+D N +R+ +
Sbjct: 443 GLFDFGFRDGPAAEARLQHPLGVTVLPD-GSVAIADTYNGAVRRYD 487
>gi|290975813|ref|XP_002670636.1| predicted protein [Naegleria gruberi]
gi|284084197|gb|EFC37892.1| predicted protein [Naegleria gruberi]
Length = 2442
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 66/140 (47%), Gaps = 11/140 (7%)
Query: 82 VKRLAGDGVQG-YSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
+ +AG G+ G Y DG + A+ P V G+IY+AD NH IRK+ +N +TTI
Sbjct: 919 ITTVAGSGLAGFYGDGGLATNAKLSYPMGIYVSKNGDIYIADYGNHRIRKVFSNGTITTI 978
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
AG GS A + ++ + P+ + I+D GN LIR+I L DC +
Sbjct: 979 AGTGSINYNGDIQEATLTTLNSPRGVYMSPNEDEIYIADSGNNLIRKIKL---DCPSDYE 1035
Query: 199 SGSALGAVSVWVLVSVLSCL 218
LG V ++ C
Sbjct: 1036 LDKRLGEC-----VQIVKCF 1050
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 60/114 (52%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
++ + V+ +AG G+ +G +A + P+S V+ KG IYV+D NH +RKI N
Sbjct: 857 ITSDGVISLVAGSYYGGFGGDNGPAANAVLNSPESLFVNSKGEIYVSDTVNHRVRKIDLN 916
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+TT+AG G G A NA S + +V + I+D+GN IR++
Sbjct: 917 GTITTVAGSGLAGFYGDGGLATNAKLSYPMGI-YVSKNGDIYIADYGNHRIRKV 969
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 64/148 (43%), Gaps = 22/148 (14%)
Query: 48 HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-----VQGYSDGEPGSAR 102
+ VI P +S++ +L S R ++++ +G V G DG
Sbjct: 785 NQVIIHPQTSEMFILSSDR---------------IRKVNTNGTISTVVGGIGDGAQAIYA 829
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT-IAGGGSKKEGRADGPAQNASFSND 161
P V G IY+ D NH IR IT+ GV + +AG G +GPA NA N
Sbjct: 830 LINPTDIHVTNNGTIYLTDSLNHCIRMITSDGVISLVAGSYYGGFGGDNGPAANAVL-NS 888
Query: 162 FELTFVPHICALLISDHGNQLIRQINLK 189
E FV + +SD N +R+I+L
Sbjct: 889 PESLFVNSKGEIYVSDTVNHRVRKIDLN 916
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 61/111 (54%), Gaps = 9/111 (8%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKS-FAVDMKG-NIYVADKSNHVIRKITNLGVTTI 138
+V +AG GV GYS G+ G A + + + M G ++Y+AD +N+VIRK++N VTT+
Sbjct: 703 IVSTIAGTGVLGYS-GDGGLATNANFRWIYGLWMVGSDLYIADSTNNVIRKLSNNIVTTV 761
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI--SDHGNQLIRQIN 187
AG G+ PA +A+ + ++ P + I SD IR++N
Sbjct: 762 AGTGNANFNGDGIPALSANINTANQVIIHPQTSEMFILSSDR----IRKVN 808
Score = 43.5 bits (101), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 57/114 (50%), Gaps = 5/114 (4%)
Query: 76 LSEESVVKRLAG-DGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
L + +V+R+AG G+ G+S G +A +V G+IYVA+ N IRKI+N
Sbjct: 529 LGTDGIVRRIAGITGICGFSGDGGLATNALMGVSWGLSVSPSGDIYVAESINSRIRKISN 588
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+TT+AG D A +AS N F A ++D GN+ +R++
Sbjct: 589 GIITTVAGTSVAGYNGDDIAATSASL-NGPPAVFATS-TAYYLADAGNRRVRKV 640
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 52/110 (47%), Gaps = 6/110 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPGSAR----FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT 136
++ +AG+G G S+ + P++ V ++Y AD SN R +TN V+
Sbjct: 646 IITTIAGNGASGSSEENINATSAIIGTSSPRALWVSNDDSVYYADVSNGKTRVVTNGIVS 705
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
TIAG G G A NA+F + L V L I+D N +IR++
Sbjct: 706 TIAGTGVLGYSGDGGLATNANFRWIYGLWMVGS--DLYIADSTNNVIRKL 753
>gi|291000698|ref|XP_002682916.1| dual specificity protein phosphatase [Naegleria gruberi]
gi|284096544|gb|EFC50172.1| dual specificity protein phosphatase [Naegleria gruberi]
Length = 1502
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 4/105 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
+ AG+G +G S G FD P V + N++V+D NH + +I N G T A G
Sbjct: 421 ITNFAGNGNRGCSTGRVTDVMFDTPTGICV-IGDNVFVSDTGNHRVVRINN-GFVTNAAG 478
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ G DG QN+ ++ +L H L I+D GN IRQ+
Sbjct: 479 NPRTAGFMDGKVQNSILNSPTDL--FSHENTLYINDFGNSRIRQV 521
>gi|260904571|ref|ZP_05912893.1| NHL repeat-containing protein [Brevibacterium linens BL2]
Length = 647
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 89/179 (49%), Gaps = 31/179 (17%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V + P S +L+V DS + + + S +++++R+ G G +G +DG+ SA F +P
Sbjct: 186 PGKVTELP-SGNLLVADSGHHSL--VEYDASGQNIIRRI-GTGERGANDGDFTSASFSEP 241
Query: 107 KSFAV---DMKG----NIYVADKSNHVIRKITNLG---VTTIAGGGSK------------ 144
V D+ ++ VAD NH +R I NL VTT+AG GS+
Sbjct: 242 GGITVLPDDVAAKAGYHLVVADTVNHTLRGI-NLDTETVTTVAGTGSQHMVGAIDNVVGT 300
Query: 145 --KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
+ GR DGPA + S+ +++ ++P ++++ GN I + PE S SG+
Sbjct: 301 HGELGRYDGPALDVKLSSPWDVLYIPATAEVVVAMAGNHTIWSFD--PETGSIRLLSGT 357
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 76/161 (47%), Gaps = 9/161 (5%)
Query: 39 DGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
DG L++ +P V+ P +++++V + ++ E ++ L+G +G +D
Sbjct: 308 DGPALDVKLSSPWDVLYIPATAEVVVAMAGNHTIWSFD---PETGSIRLLSGTMNEGLAD 364
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPA 153
GE +A F + + G++++AD IR++ + V+T+ G G G DGPA
Sbjct: 365 GEAEAAWFAQTSGLDLSSDGDVFIADSETSAIRRLDPSTGAVSTLIGEGLFDFGFRDGPA 424
Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
A + + +P ++ I+D N IR+ + + S
Sbjct: 425 AEARLQHPLGVRSLPD-GSIAIADTYNGAIRRYDFTTNEVS 464
>gi|223993873|ref|XP_002286620.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977935|gb|EED96261.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1937
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 75/150 (50%), Gaps = 24/150 (16%)
Query: 73 SFPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ L+ + V +AG G +G+ DG ARF P+ AVD G +YVAD NH IR I
Sbjct: 218 NYTLNPQGFVFTMAGSTSGEEGFVDGVESDARFRHPEGVAVDHDGYVYVADTGNHAIRMI 277
Query: 131 TNLG-VTTIAGGG--SKKEGRADGPAQNASFSN---DFELTFVPH--------------- 169
+ G V+T+AG G K+G A+ AQ +S ++ + + P+
Sbjct: 278 SPSGRVSTLAGNGVPGYKDGLANSGAQLSSPTDIAVWRDWAWWPYENPVDPDSFIYKNGN 337
Query: 170 -ICALLISDHGNQLIRQINLKPEDCSKSSQ 198
ALL++D N +R+I E+ S++ +
Sbjct: 338 GTLALLVADTDNHRVRKITGDIEEDSETGE 367
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 35/50 (70%), Gaps = 1/50 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
GY+DGE ARFD P+ V G ++VAD +NH++R+I+ LG V T+AG
Sbjct: 392 GYADGEKDEARFDSPQGITVASDGRVFVADTNNHLVREISRLGTVVTVAG 441
Score = 38.1 bits (87), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 59/155 (38%), Gaps = 45/155 (29%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAV-------------DMKGNIY--- 118
+S V LAG+GV GY DG S A+ P AV D IY
Sbjct: 277 ISPSGRVSTLAGNGVPGYKDGLANSGAQLSSPTDIAVWRDWAWWPYENPVDPDSFIYKNG 336
Query: 119 -------VADKSNHVIRKIT---------------NLGVTTIAG-----GGSKKEGRADG 151
VAD NH +RKIT N+ V AG + + G ADG
Sbjct: 337 NGTLALLVADTDNHRVRKITGDIEEDSETGEKMWNNVRVECFAGRCGSVSSNPEPGYADG 396
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A F + +T V + ++D N L+R+I
Sbjct: 397 EKDEARFDSPQGIT-VASDGRVFVADTNNHLVREI 430
>gi|333025645|ref|ZP_08453709.1| hypothetical protein STTU_3149 [Streptomyces sp. Tu6071]
gi|332745497|gb|EGJ75938.1| hypothetical protein STTU_3149 [Streptomyces sp. Tu6071]
Length = 610
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 67/124 (54%), Gaps = 8/124 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + +V D++R A L ++ V+R G G +G DG+ ARF++P
Sbjct: 183 PAKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGSGERGLLDGDATGARFNEP 238
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGR-ADGPAQNASFSNDF 162
+ A+ G + VAD NH +R++ + V+T+AG G + +G GPA+ S S+ +
Sbjct: 239 QGLALLPDGAVVVADTVNHALRRVDLASGEVSTLAGTGRQWWQGEPVAGPAREVSLSSPW 298
Query: 163 ELTF 166
++ +
Sbjct: 299 DVAW 302
>gi|421097867|ref|ZP_15558546.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
str. 200901122]
gi|410799150|gb|EKS01231.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
str. 200901122]
Length = 357
Score = 57.0 bits (136), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 60/108 (55%), Gaps = 5/108 (4%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
GV G+ +G+ ++ F P +D + N+YV + SNH IRKI N G V+T++GG
Sbjct: 162 GVDGFQNGDRLNSLFKSPFFMDLDRERNLYVGELSNHAIRKINLNSGTVSTLSGG---VL 218
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
G DG +A F + +T+ +LL++D N IR+I+LK S
Sbjct: 219 GYLDGDLASAQFKSPLGITYDQKTDSLLVADIQNHNIRKIDLKASTVS 266
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 2/95 (2%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + I LD + ++ V+ AG G++ DG +A F+
Sbjct: 21 NPSIFMGRAWVENTI-LDCILKECHLCKLKVTNNPVISLFAGTGIKESIDGTTQTASFNT 79
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
P +D GNI+V+D++ ++IRKI G VTT++
Sbjct: 80 PFGLELDTFGNIFVSDQTANLIRKIDRFGNVTTLS 114
>gi|290975887|ref|XP_002670673.1| predicted protein [Naegleria gruberi]
gi|284084234|gb|EFC37929.1| predicted protein [Naegleria gruberi]
Length = 1074
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 75/143 (52%), Gaps = 9/143 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFD 104
P+ + S ++I++D+S + +S + +AG G GYS +G+ +A+F+
Sbjct: 152 PYGLALNSSSGEVIIVDTSNNVIR----KVSSIGNITTIAGTGAAGYSGDNGQATNAKFN 207
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFE 163
P++ A G ++VAD NH IRKI+N G VTT+AG G+ A+NA +
Sbjct: 208 APRA-AFYSNGELFVADSRNHRIRKISNSGIVTTVAGTGTAGFNGDSILAKNAQLNYPSG 266
Query: 164 LTFVPHICALLISDHGNQLIRQI 186
++ V + ISD N IR+I
Sbjct: 267 IS-VNSNGEIFISDSVNNRIRKI 288
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 62/117 (52%), Gaps = 16/117 (13%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ +AG GV G+S G+ G SA + P AV+ G + ++D +NH IRK++ G +TT
Sbjct: 828 ISTVAGSGVAGFS-GDGGLATSATLNNPSYVAVNSNGELLISDTNNHRIRKVSLNGIITT 886
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTF-----VPHICALLISDHGNQLIRQINLK 189
IAG G+ +G NAS ++L + V L I+D N IR++N+
Sbjct: 887 IAGNGTAGY---NGEGNNASL---YQLNYPSGLVVSSTGDLFIADSYNHRIRKLNVN 937
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 65/134 (48%), Gaps = 12/134 (8%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GN 116
D+ ++DS S LS + +AG G DG + +SFA+ K G
Sbjct: 758 DVFIIDSFNSRIRKLS-----SGKISTVAG----GLGDGSSAVNSYLNSQSFAISSKSGE 808
Query: 117 IYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
I++AD +NH IRKI TN ++T+AG G G A +A+ +N V LLI
Sbjct: 809 IFIADSNNHRIRKIATNGDISTVAGSGVAGFSGDGGLATSATLNNP-SYVAVNSNGELLI 867
Query: 176 SDHGNQLIRQINLK 189
SD N IR+++L
Sbjct: 868 SDTNNHRIRKVSLN 881
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 64/120 (53%), Gaps = 10/120 (8%)
Query: 76 LSEESVVKRLAGDGVQGYS---DGEPG---SARFDKPKSFAVDMKGN-IYVADKSNHVIR 128
+S + +AG+G Y G+ G +A+ P + AV++ N IY+AD NH IR
Sbjct: 598 VSTSGYISTIAGNGTVAYKATFSGDNGLAINAQLFVPFAVAVNLTNNDIYIADSGNHRIR 657
Query: 129 KITNLG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
K+++ +TT+AG G+ +G A NA + F ++ + + + ISD NQ +R++
Sbjct: 658 KVSSSSGIITTVAGTGTSGFSGDNGLATNAKLNFPFSIS-IGNSGEIYISDQYNQRVRKV 716
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 3/120 (2%)
Query: 69 FYTLSFPLSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
Y S+ +S + +AG G G DG + R P S + G+ Y +D SNH
Sbjct: 424 MYLGSYLSKIKSSLTAVAGTGAIGADSGDGLAITERLHNPNSIFISSNGDSYFSDSSNHK 483
Query: 127 IRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
IRK++N +TTIAG G+ DG + ++ N+ + V + SD N IR+I
Sbjct: 484 IRKLSNGYITTIAGTGTSGY-SGDGSSATSAKLNNPKGVVVSSSGEIYFSDSENHRIRKI 542
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 49/100 (49%), Gaps = 9/100 (9%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARF 103
NP+S+ + D DSS LS + +AG G GYS DG SA+
Sbjct: 462 NPNSIFISS-NGDSYFSDSSNHKIRKLS-----NGYITTIAGTGTSGYSGDGSSATSAKL 515
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
+ PK V G IY +D NH IRKI+ G ++T+AG G
Sbjct: 516 NNPKGVVVSSSGEIYFSDSENHRIRKISTGGIISTVAGSG 555
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 63/129 (48%), Gaps = 19/129 (14%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKIT- 131
+S ++ +AG+G GY +GE +A + + P V G++++AD NH IRK+
Sbjct: 878 VSLNGIITTIAGNGTAGY-NGEGNNASLYQLNYPSGLVVSSTGDLFIADSYNHRIRKLNV 936
Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTF-----VPHICALLISDHGNQLIRQI 186
N ++T AG G G + + S +L F V + L ISD N IR++
Sbjct: 937 NGTISTSAGNG------IAGLSGDGSLPTSAQLNFPTGLAVSSVGELYISDDSNNRIRKV 990
Query: 187 NLKPEDCSK 195
+L CS+
Sbjct: 991 SLT---CSQ 996
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 55/109 (50%), Gaps = 4/109 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
V AG G +S DG + + + P V+ G ++AD SNHVIRK+ TN ++TI
Sbjct: 69 VSTFAGVGTSSFSGDGALATQSEINFPYGVIVNSLGETFIADTSNHVIRKVSTNGKISTI 128
Query: 139 AGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG S DG A NA ++ + L ++I D N +IR++
Sbjct: 129 AGTASSYGYSGDGGLATNALLNSPYGLALNSSSGEVIIVDTSNNVIRKV 177
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 53/100 (53%), Gaps = 6/100 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFD 104
P +V ++D+ + DS +S S ++ +AG G G+S +G +A+ +
Sbjct: 634 PFAVAVNLTNNDIYIADSGNHRIRKVS---SSSGIITTVAGTGTSGFSGDNGLATNAKLN 690
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
P S ++ G IY++D+ N +RK+ G ++TIAG G+
Sbjct: 691 FPFSISIGNSGEIYISDQYNQRVRKVAANGYISTIAGSGA 730
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 28/113 (24%), Positives = 54/113 (47%), Gaps = 8/113 (7%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSAR-FDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + +AG G G++ DG ++ F+ P + + G++++ D N IRK+++
Sbjct: 716 VAANGYISTIAGSGAIGFNGDGLAATSTCFNYPSGVSSNSNGDVFIIDSFNSRIRKLSSG 775
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
++T+AG G DG + S+ N + I+D N IR+I
Sbjct: 776 KISTVAG------GLGDGSSAVNSYLNSQSFAISSKSGEIFIADSNNHRIRKI 822
Score = 37.0 bits (84), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 42/92 (45%), Gaps = 1/92 (1%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQ 154
DG G+A P A+ + I +AD N IRK+++ V+T AG G+ DG
Sbjct: 29 DGMTGNALLRYPYGIALGLNSEILIADTFNQRIRKVSSSDVSTFAGVGTSSF-SGDGALA 87
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
S N V + I+D N +IR++
Sbjct: 88 TQSEINFPYGVIVNSLGETFIADTSNHVIRKV 119
>gi|290973961|ref|XP_002669715.1| predicted protein [Naegleria gruberi]
gi|284083266|gb|EFC36971.1| predicted protein [Naegleria gruberi]
Length = 710
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG+G +G+S +G +A+ + P V +Y+AD SNHVIRKI+ G + TIAG
Sbjct: 144 IAGNGTKGFSGDNGPATNAQLNGPAGVFVS-NNEVYIADYSNHVIRKISQNGTIVTIAGN 202
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +G A NA N TFV + ISD N +IR+I
Sbjct: 203 GKPGFSGDNGLATNAQLYNP-SGTFVSSNNEVYISDCFNHVIRKI 246
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 59/112 (52%), Gaps = 5/112 (4%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
+ + +AG+G G+ +G +A+ + P++ V +Y+AD N IRK+ N
Sbjct: 81 QNGTIVTIAGNGKPGFGGDNGLATNAQLNYPRNVYVSSNNEVYIADFCNQRIRKVLQNGN 140
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+K +GPA NA N FV + + I+D+ N +IR+I
Sbjct: 141 IITIAGNGTKGFSGDNGPATNAQL-NGPAGVFVSN-NEVYIADYSNHVIRKI 190
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S+ + +AG+G G+S +G +A+ P V +Y++D NHVIRKI
Sbjct: 190 ISQNGTIVTIAGNGKPGFSGDNGLATNAQLYNPSGTFVSSNNEVYISDCFNHVIRKILQN 249
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + TIAG G +G A NA + + FV + ISD N IR++
Sbjct: 250 GTIVTIAGNGKGGFSGDNGLATNAQLYSPLGV-FVSSNNEVYISDCFNHRIRKV 302
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 46/98 (46%), Gaps = 17/98 (17%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+ + +AG+G G+S +G +A+ P V +Y++D NH IRK+ + G
Sbjct: 248 QNGTIVTIAGNGKGGFSGDNGLATNAQLYSPLGVFVSSNNEVYISDCFNHRIRKVLHNGN 307
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND--FELTFVPHI 170
+ TIAG G+ FS D F+++ PH
Sbjct: 308 IVTIAGNGT------------PGFSGDSPFDISLYPHF 333
Score = 37.4 bits (85), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 27/49 (55%), Gaps = 1/49 (2%)
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSK 144
E SA+ + P S V +++AD SN IRK+ N + TIAG G K
Sbjct: 2 EATSAQLNNPFSVFVSSNNEVFIADYSNQSIRKVLQNGNIVTIAGNGDK 50
>gi|323343289|ref|ZP_08083516.1| hypothetical protein HMPREF0663_10051 [Prevotella oralis ATCC
33269]
gi|323095108|gb|EFZ37682.1| hypothetical protein HMPREF0663_10051 [Prevotella oralis ATCC
33269]
Length = 436
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 76/182 (41%), Gaps = 23/182 (12%)
Query: 30 DGYTVTTVIDGHQLEINPHSVIDRP------GSSDLIVLDSSRSAFYTLSFP-------- 75
DGY T G ++INP + P G S + + +SF
Sbjct: 256 DGYIYTRYFYGDIIKINPRTYEVTPVYKTQEGDSYGLTFNPLHPNILYMSFRENAGTMAN 315
Query: 76 -------LSEESVVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
E+ +RL+ + G + DG+ A+F P D GNIYVAD++NH I
Sbjct: 316 SICSIDVTDPENTYRRLSSSNISGGHRDGKLEKAQFHNPTQIYCDADGNIYVADRNNHCI 375
Query: 128 RKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
R+I+ + G + +G DG A F N+ + A+ ++D GN +R++
Sbjct: 376 RRISPDDMVETVLGMPETKGWKDGAKSEALF-NEPTGIGIGKDGAVYVADWGNGRVRKLT 434
Query: 188 LK 189
+
Sbjct: 435 IN 436
>gi|322435234|ref|YP_004217446.1| hypothetical protein AciX9_1613 [Granulicella tundricola MP5ACTX9]
gi|321162961|gb|ADW68666.1| NHL repeat containing protein [Granulicella tundricola MP5ACTX9]
Length = 781
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 7/112 (6%)
Query: 81 VVKRLAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGV 135
++ +AG+G ++ SA P+ +D GN+ +AD +NH IR+I T L +
Sbjct: 225 IITTIAGNGTPAFTSDNIAATSATLYLPRGITLDPSGNLLIADSANHRIRRIDAVTGL-I 283
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
TT+AG G++ PA AS + P L +D NQ IRQI+
Sbjct: 284 TTLAGDGTQTYAGDSTPATTASLDTPRAIALSPATLPTL-TDTANQRIRQID 334
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 58/115 (50%), Gaps = 5/115 (4%)
Query: 77 SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TN 132
+ ++ +AG G G S DG + A+ D P + A+D N+Y+AD NH+IR++ T
Sbjct: 107 AATQIITTVAGTGTPGRSPDGTLATKAQLDTPTAIALDSSQNLYIADTRNHIIRRVDATT 166
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+TT+AG G++ GPA A L L ++D N IR+I+
Sbjct: 167 HLITTLAGTGTQGFSGDAGPALAAQIDTPTGLALDAS-NNLYLADTHNHRIRRID 220
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 46/98 (46%), Gaps = 4/98 (4%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG--PAQNASFSNDFE 163
P D +GN+Y+A+ + H+I +++ G TI G + DG P Q A +
Sbjct: 25 PSGLVYDAQGNLYLAETAAHLILRVSPSGALTIVAGTGTQGYAGDGTLPTQ-ALLDSPTA 83
Query: 164 LTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
L P L ++D N IR+I+ + + + +G+
Sbjct: 84 LAITP-TGDLYLADTHNHAIRRIDAATQIITTVAGTGT 120
>gi|451982091|ref|ZP_21930422.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
gi|451760645|emb|CCQ91702.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
Length = 724
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 63/120 (52%), Gaps = 14/120 (11%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + ++ +AG+G G+S +G A + P +D+KGN+Y+AD++NH +R++
Sbjct: 81 IGKNGIITTVAGNGNAGFSGDEGPALEAALNFPAGLCLDLKGNLYIADRNNHRVRRVDTK 140
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQIN 187
G +TT+AG G G +GPA+ + L F + + ISD N + ++N
Sbjct: 141 GIITTVAGTGEPDFGMEEGPAE------EIPLHFPSDVACDSQGQVYISDRSNNRVLKMN 194
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 6/132 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ + ++ +AG+G +S G+ G A P D +GN+Y+AD++N+ +RK+
Sbjct: 249 VTPDGIITTVAGEGTHFFS-GDFGPATRCSLAYPTDVVTDDRGNLYIADRNNNRVRKVDT 307
Query: 133 LGVTTIAGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
G+ T G K E D A S F L F P LLI D + +R ++L
Sbjct: 308 NGIITTVMGTGKNEYNGDNEIASETSLHLPFALAFTPD-QHLLIVDRNHHRVRSMHLVQH 366
Query: 192 DCSKSSQSGSAL 203
+ +G AL
Sbjct: 367 TVQTVAGNGQAL 378
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 3/68 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + ++ +AG GVQG+S DG P A+ P A D +GN+Y D+ N+ +R++
Sbjct: 636 VDKNGIITTVAGTGVQGFSGDGGPAVQAQLKSPYRMAFDSEGNLYFTDRDNNRVRRVDTH 695
Query: 134 G-VTTIAG 140
G +TT+AG
Sbjct: 696 GTITTLAG 703
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 54/113 (47%), Gaps = 8/113 (7%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+ + ++ LA D DGE P AV+ KG ++V+D + IRKI N G
Sbjct: 532 VDAQGSIQTLAKDAWLAIEDGEV------HPNGLAVNDKGEVFVSDSGSSKIRKIDNEGN 585
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
VTT AG GS ++ GPA A + L F P L IS+ IR+++
Sbjct: 586 VTTYAGDGSFEDKGDGGPALLAGIRSPGGLVFSPS-GELYISEENTHRIRKVD 637
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 12/86 (13%)
Query: 110 AVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
AVD +GNI+++ +S + IRKI G +TT+AG G+ +GPA A+ L F
Sbjct: 61 AVDKEGNIFISHRSKNRIRKIGKNGIITTVAGNGNAGFSGDEGPALEAA------LNFPA 114
Query: 169 HIC-----ALLISDHGNQLIRQINLK 189
+C L I+D N +R+++ K
Sbjct: 115 GLCLDLKGNLYIADRNNHRVRRVDTK 140
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 57/115 (49%), Gaps = 4/115 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + +AG G+ GY G A P VD GN+Y+AD+ N+ +RK+T
Sbjct: 193 MNPQGQIVTVAGLGMAGYGGDFGPAIDALLKYPFGIHVDESGNLYIADRGNNRVRKVTPD 252
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G +TT+AG G+ GPA S + ++ L I+D N +R+++
Sbjct: 253 GIITTVAGEGTHFFSGDFGPATRCSLAYPTDVV-TDDRGNLYIADRNNNRVRKVD 306
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 52/115 (45%), Gaps = 4/115 (3%)
Query: 76 LSEESVVKRLAGDG-VQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ E V AGDG + DG P A P G +Y+++++ H IRK+
Sbjct: 580 IDNEGNVTTYAGDGSFEDKGDGGPALLAGIRSPGGLVFSPSGELYISEENTHRIRKVDKN 639
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G +TT+AG G + GPA A + + + F L +D N +R+++
Sbjct: 640 GIITTVAGTGVQGFSGDGGPAVQAQLKSPYRMAFDSE-GNLYFTDRDNNRVRRVD 693
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
P A D +G +Y++D+SN+ + K+ G + T+AG G G GPA +A F +
Sbjct: 169 PSDVACDSQGQVYISDRSNNRVLKMNPQGQIVTVAGLGMAGYGGDFGPAIDALLKYPFGI 228
Query: 165 TFVPHICALLISDHGNQLIRQI 186
V L I+D GN +R++
Sbjct: 229 -HVDESGNLYIADRGNNRVRKV 249
>gi|307202136|gb|EFN81636.1| NHL repeat-containing protein 2 [Harpegnathos saltator]
Length = 688
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 20/143 (13%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
LI+ DS S + + ++ + G QG+ DG+ SA+F+ P+ + + IY
Sbjct: 236 LIISDSGNSRILITN----QHGEIEHIIGGTNQGFEDGDFKSAKFNSPQGVCM-LDDVIY 290
Query: 119 VADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADG----------PAQNASFSNDFELTF 166
VAD +NH IRKI + V+TIAG GS+ R G P A + ++++ T
Sbjct: 291 VADNNNHAIRKINLSEKSVSTIAGTGSQSHDRCGGRSGTNQDLSSPWDVAIYYHEYKNTT 350
Query: 167 VPHICALLISDHGNQLIRQINLK 189
VP LLI+ G I + LK
Sbjct: 351 VP---VLLIAIAGTHQIWALFLK 370
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 7/104 (6%)
Query: 92 GYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
G DG+ SA+ P A + ++ IYVAD NH I+ + +A G +
Sbjct: 459 GDMDGKEYSAKLQHPLGIAWNHLEKQIYVADTYNHKIKSVD------VATGYCRTLFGDR 512
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
P SF+ +L P L ++D N ++ INLK ++ S
Sbjct: 513 KPNSMFSFNEPSDLAVSPDGNILYVADTNNHALKVINLKSKEIS 556
>gi|290982352|ref|XP_002673894.1| predicted protein [Naegleria gruberi]
gi|284087481|gb|EFC41150.1| predicted protein [Naegleria gruberi]
Length = 2807
Score = 57.0 bits (136), Expect = 9e-06, Method: Composition-based stats.
Identities = 40/109 (36%), Positives = 56/109 (51%), Gaps = 4/109 (3%)
Query: 82 VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ +AG G GYS G +A+ P++ AV G IY+AD NH IRKI G ++TI
Sbjct: 572 ISTIAGTGSVGYSGDGGLATNAQLYYPQTVAVSSSGEIYIADAYNHRIRKINTSGYISTI 631
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
AG GS G A +A F + + + + I+D N IR+IN
Sbjct: 632 AGTGSVGYSGDGGLATSAQLYYPFSVA-ISSVGEIYIADTYNHRIRKIN 679
Score = 53.5 bits (127), Expect = 8e-05, Method: Composition-based stats.
Identities = 32/97 (32%), Positives = 51/97 (52%), Gaps = 2/97 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRAD 150
G+ DG+ + F SF + G I +AD +NH IRKIT LG ++TI+G G+ +
Sbjct: 802 GFGDGDMATTSFISAYSFEFTLNGEIIIADSNNHRIRKITTLGYISTISGTGTAGYNGDE 861
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
A N+ +N + + + I+D N IR++N
Sbjct: 862 ILATNSQLNNPNGIALSSN-SEIYIADTNNHRIRKVN 897
Score = 50.8 bits (120), Expect = 6e-04, Method: Composition-based stats.
Identities = 36/95 (37%), Positives = 48/95 (50%), Gaps = 14/95 (14%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
SA+ + P+S A+ G IY+AD +NH IRKI G ++TIAG GS G A NA
Sbjct: 536 SAQLNYPRSVAISSSGEIYIADFNNHRIRKINISGYISTIAGTGSVGYSGDGGLATNAQL 595
Query: 159 SNDFELTFVPHICA------LLISDHGNQLIRQIN 187
+ P A + I+D N IR+IN
Sbjct: 596 -------YYPQTVAVSSSGEIYIADAYNHRIRKIN 623
Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats.
Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 2/89 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
SA+ + P S AV G IY+ D +N+ IRKI G ++TIAG G+ A +A
Sbjct: 704 SAQLNYPFSVAVSSVGEIYIVDTNNYRIRKINTSGYISTIAGTGTGGYNGDSILATSAQL 763
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
+ + LT + ++++D+ N IR+IN
Sbjct: 764 NYPYGLT-ISSTSEIIVADYYNHRIRKIN 791
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 36/129 (27%), Positives = 63/129 (48%), Gaps = 7/129 (5%)
Query: 82 VKRLAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ ++G G GY+ E +++ + P A+ IY+AD +NH IRK+ G ++TI
Sbjct: 846 ISTISGTGTAGYNGDEILATNSQLNNPNGIALSSNSEIYIADTNNHRIRKVNASGYISTI 905
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
AG G+ DG ++ N + +LI+D+ N IR+I + S+
Sbjct: 906 AGTGTGGYN-GDGVLATSAQLNYPNGIAIQENGEILIADNNNHRIRKIR---TNGYISTI 961
Query: 199 SGSALGAVS 207
+GS +G +
Sbjct: 962 AGSGIGGFT 970
Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats.
Identities = 35/108 (32%), Positives = 50/108 (46%), Gaps = 10/108 (9%)
Query: 82 VKRLAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ +AG G GY+ SA+ + P + I VAD NH IRKI G ++TI
Sbjct: 740 ISTIAGTGTGGYNGDSILATSAQLNYPYGLTISSTSEIIVADYYNHRIRKINTSGYISTI 799
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG G DG SF + + F + ++I+D N IR+I
Sbjct: 800 AG------GFGDGDMATTSFISAYSFEFTLN-GEIIIADSNNHRIRKI 840
Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats.
Identities = 41/109 (37%), Positives = 55/109 (50%), Gaps = 4/109 (3%)
Query: 82 VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ +AG G GYS G SA+ P S A+ G IY+AD NH IRKI G ++TI
Sbjct: 628 ISTIAGTGSVGYSGDGGLATSAQLYYPFSVAISSVGEIYIADTYNHRIRKINTSGYISTI 687
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+G GS G A +A + F + V + + I D N IR+IN
Sbjct: 688 SGTGSGGYSGDGGLATSAQLNYPFSVA-VSSVGEIYIVDTNNYRIRKIN 735
Score = 45.4 bits (106), Expect = 0.027, Method: Composition-based stats.
Identities = 38/117 (32%), Positives = 53/117 (45%), Gaps = 21/117 (17%)
Query: 82 VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ +AG G GYS G SA+ P ++ G IY+AD NH IRK+T G ++TI
Sbjct: 251 ISTVAGSGTAGYSGDGGLATSAQLYYPLGVSISSAGEIYIADYYNHRIRKVTTSGYISTI 310
Query: 139 AG--------GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
AG GG + P ++ SN + I+D N IR+IN
Sbjct: 311 AGTTYGFSGDGGLATSAQLYYPNGVSTGSNG----------EIFIADTYNYRIRKIN 357
Score = 43.1 bits (100), Expect = 0.13, Method: Composition-based stats.
Identities = 38/113 (33%), Positives = 60/113 (53%), Gaps = 5/113 (4%)
Query: 80 SVVKRLAGDG-VQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
SV ++AG G + GY DG SAR + P A G++Y+AD +N+ IRK+T G +
Sbjct: 457 SVASKVAGTGTLNGYGGDGNLATSARLNYPYGTAFSSNGDMYIADTNNNRIRKVTTSGYI 516
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+T+AG G+ G A +A + + + + I+D N IR+IN+
Sbjct: 517 STVAGSGTGGYSGDGGLATSAQLNYPRSVA-ISSSGEIYIADFNNHRIRKINI 568
Score = 42.7 bits (99), Expect = 0.15, Method: Composition-based stats.
Identities = 27/88 (30%), Positives = 46/88 (52%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
SA+ + P S ++ G IY+AD N IRK+T G ++T+AG G+ G A +A
Sbjct: 215 SAQVNYPFSVSISSTGEIYIADYYNQRIRKVTTYGFISTVAGSGTAGYSGDGGLATSAQL 274
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
++ + + I+D+ N IR++
Sbjct: 275 YYPLGVS-ISSAGEIYIADYYNHRIRKV 301
Score = 40.0 bits (92), Expect = 1.1, Method: Composition-based stats.
Identities = 31/88 (35%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
SA+ PK A+ IY+AD NH IRK+ TN ++T+AG G+ G G A +A
Sbjct: 159 SAQLYYPKGVAISPSNEIYIADTYNHRIRKVNTNGYISTVAGSGTGGYGGDGGLATSAQV 218
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
+ F ++ + + I+D+ NQ IR++
Sbjct: 219 NYPFSVS-ISSTGEIYIADYYNQRIRKV 245
Score = 39.7 bits (91), Expect = 1.2, Method: Composition-based stats.
Identities = 50/198 (25%), Positives = 88/198 (44%), Gaps = 28/198 (14%)
Query: 10 LAITLLSLLFSLVSSGLLLEDG----YTVTTVIDG------------HQLEINPHSVIDR 53
+ + LL++L +V S ++ Y +TT+ G QL I I
Sbjct: 1 MKVVLLAILLCIVMSAKFIDTQSLLLYNITTIAGGGSVVGDGLLATNAQLAIPAGVAISS 60
Query: 54 PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFA 110
G ++I+ DS+ + +S + +AG G + G+ G +A+ + P + +
Sbjct: 61 IG--EIIIADSNNNRIRKIS----TNGYISTIAGTGSASFG-GDGGLAINAQLNSPSAVS 113
Query: 111 VDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
+ G++Y++D N IRKI TN ++TIAG G+ G A +A + P
Sbjct: 114 ISSNGDMYISDTGNQRIRKISTNGYISTIAGSGTGGYSGDGGLATSAQLYYPKGVAISPS 173
Query: 170 ICALLISDHGNQLIRQIN 187
+ I+D N IR++N
Sbjct: 174 -NEIYIADTYNHRIRKVN 190
>gi|290978635|ref|XP_002672041.1| predicted protein [Naegleria gruberi]
gi|284085614|gb|EFC39297.1| predicted protein [Naegleria gruberi]
Length = 706
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 61/119 (51%), Gaps = 5/119 (4%)
Query: 75 PLSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT 131
P E + +AG+G++G+ +G A+ P AV+ N +Y+AD NH IRK++
Sbjct: 5 PKGMEQTIWTIAGNGIKGFYGDNGLAIEAKLSSPYGVAVNPYSNDVYIADYRNHCIRKVS 64
Query: 132 NLG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
L +TTIAG G G A +A + ++ P L I+D+ N IR+I+L
Sbjct: 65 ALDNKITTIAGTGVAGYSGDGGLALHAQLNCPLSVSIHPKNEELYIADYSNHRIRKISL 123
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 70/143 (48%), Gaps = 14/143 (9%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARF 103
P V P + D+ ++D + +S + ++ AG+G G+ GE G +A+
Sbjct: 154 PQCVKIHPITFDIYIIDFVNNKIRRIS---NSSGIISTFAGNGTAGFC-GEGGFATNAQL 209
Query: 104 DKPKSFAVD-MKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSN 160
+ P + G++Y+AD +NH +RK+ +TT+AG G K G +DG +A +
Sbjct: 210 NGPSGLDFNPTTGDVYIADSNNHRVRKVNCKSGIITTLAGTG--KAGYSDGI--DAQLNY 265
Query: 161 DFELTFVPHICALLISDHGNQLI 183
++++F + ++D N I
Sbjct: 266 PYDVSFCTRGQIIYVTDRSNNRI 288
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 68/149 (45%), Gaps = 10/149 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK- 105
P SV P + +L + D S +S S V AG+ G S G+ G A F +
Sbjct: 96 PLSVSIHPKNEELYIADYSNHRIRKISLTYGTISTV---AGNNTGGTS-GDGGLALFAQL 151
Query: 106 --PKSFAVD-MKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSN 160
P+ + + +IY+ D N+ IR+I+N ++T AG G+ G A NA +
Sbjct: 152 YFPQCVKIHPITFDIYIIDFVNNKIRRISNSSGIISTFAGNGTAGFCGEGGFATNAQLNG 211
Query: 161 DFELTFVPHICALLISDHGNQLIRQINLK 189
L F P + I+D N +R++N K
Sbjct: 212 PSGLDFNPTTGDVYIADSNNHRVRKVNCK 240
>gi|239989050|ref|ZP_04709714.1| hypothetical protein SrosN1_17240 [Streptomyces roseosporus NRRL
11379]
gi|291446053|ref|ZP_06585443.1| alkyl hydroperoxide reductase/Thiol specific antioxidant/Mal
allergen [Streptomyces roseosporus NRRL 15998]
gi|291349000|gb|EFE75904.1| alkyl hydroperoxide reductase/Thiol specific antioxidant/Mal
allergen [Streptomyces roseosporus NRRL 15998]
Length = 605
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 58/112 (51%), Gaps = 7/112 (6%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
+V D++R L ++ V+R G G +G +DG P ARF +P+ AV G +
Sbjct: 188 FLVSDTTRHRLVELD---ADGETVRRHFGTGERGLNDGGPDEARFSEPQGLAVLPDGRVA 244
Query: 119 VADKSNHVIRKIT-NLGVT-TIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
VAD NH IR + GVT T+AG G + + GPA+ S+ ++L +
Sbjct: 245 VADTVNHAIRALDLTTGVTSTLAGTGRQWWQGTPTSGPAREVDLSSPWDLAW 296
>gi|375146761|ref|YP_005009202.1| NHL repeat containing protein [Niastella koreensis GR20-10]
gi|361060807|gb|AEV99798.1| NHL repeat containing protein [Niastella koreensis GR20-10]
Length = 439
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 83/196 (42%), Gaps = 19/196 (9%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEI---NPHSVI----DRPGSSDLIVLDSSRSAFYTLS 73
L S + D +T ++G L++ N ++ + GS +V + SA T
Sbjct: 55 LFSGSNYINDTNQITVTLNGKPLKVVGVNGSQIMAVVPKKAGSGHFVVKIGADSAVSTGI 114
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFD-------KPKSFAVDMKGNIYVADKSNHV 126
F V AG G G+++G A FD + VD N+YVAD NH
Sbjct: 115 FNYVYTRTVSTFAGSGKAGFANGMGTDAMFDFGGQSWYRSMGIIVDDNLNLYVADPGNHC 174
Query: 127 IRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
IRKI +N VTT G S G ADG A FS +++ F + D N IR+
Sbjct: 175 IRKIDSNANVTTFCGSPS-GAGYADGKGTAAKFSLPYDVAFDAQ-GNIWCVDPANWDIRK 232
Query: 186 INLKPEDCSKSSQSGS 201
I P+ + + GS
Sbjct: 233 I--APDGTATTWAWGS 246
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 51/109 (46%), Gaps = 14/109 (12%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA---------DKSNHVIRKI--TNLGVTT 137
GV GY +G A+FD P A+D GN+YVA S+ IR I + V+T
Sbjct: 331 GVPGYVNGAAAVAKFDHPWGLAIDTAGNLYVAGNGTYDGNTGNSDQAIRYIEAASFNVST 390
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG GS G AD + A+FS + V L + D N IR+I
Sbjct: 391 FAGSGS--AGYADAVGEAAAFSAPTGIA-VDKNGTLYVLDKNNNRIRKI 436
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 28/49 (57%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V AG G GY+D +A F P AVD G +YV DK+N+ IRKI
Sbjct: 388 VSTFAGSGSAGYADAVGEAAAFSAPTGIAVDKNGTLYVLDKNNNRIRKI 436
Score = 43.5 bits (101), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 30/53 (56%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
GY+DG+ +A+F P A D +GNI+ D +N IRKI G T GS+
Sbjct: 195 GYADGKGTAAKFSLPYDVAFDAQGNIWCVDPANWDIRKIAPDGTATTWAWGSQ 247
>gi|88601128|gb|ABD46562.1| NHL repeat-containing protein [Vermamoeba vermiformis]
Length = 309
Score = 56.6 bits (135), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 45/85 (52%), Gaps = 1/85 (1%)
Query: 76 LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ + VV LAG G G DG ARF+ P VD GN+YVAD N+ +RK+T G
Sbjct: 29 ITPDGVVVTLAGHPGHWGSRDGVGSKARFNGPSGLDVDTDGNVYVADYYNNTMRKVTPEG 88
Query: 135 VTTIAGGGSKKEGRADGPAQNASFS 159
+ T G + G DG + A F+
Sbjct: 89 IVTTIAGHVGQWGSTDGSGEKARFN 113
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 64/114 (56%), Gaps = 11/114 (9%)
Query: 82 VKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
V +AG +Q G +DG+ +ARF++P +VD GN++V+D NH IRKI+ LG VTTIA
Sbjct: 202 VTTIAGRVMQPGSADGKLTAARFNQPSGISVDKIGNLFVSDYYNHTIRKISPLGEVTTIA 261
Query: 140 GGGSKK---EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
G + EG D N F N + +IC D N ++R++ ++P
Sbjct: 262 GMFGHQGAVEGFGDHIRLNHPFRNTIDSAGNLYIC-----DEYNSIVRKL-IRP 309
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 47/95 (49%), Gaps = 1/95 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G DG ARF P AV G I+V D+ NH IR I+ +G T G + G ADG
Sbjct: 158 GSEDGNGQQARFFGPSGIAVAPDGTIFVCDRYNHTIRSISIVGEVTTIAGRVMQPGSADG 217
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A F+ ++ V I L +SD+ N IR+I
Sbjct: 218 KLTAARFNQPSGIS-VDKIGNLFVSDYYNHTIRKI 251
Score = 40.0 bits (92), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 39/81 (48%), Gaps = 1/81 (1%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P+ A+D NIY+ + +N+ IRKIT GV G G DG A F+ L
Sbjct: 5 PRGIAIDKHDNIYICEWNNNTIRKITPDGVVVTLAGHPGHWGSRDGVGSKARFNGPSGLD 64
Query: 166 FVPHICALLISDHGNQLIRQI 186
V + ++D+ N +R++
Sbjct: 65 -VDTDGNVYVADYYNNTMRKV 84
>gi|393782357|ref|ZP_10370541.1| hypothetical protein HMPREF1071_01409 [Bacteroides salyersiae
CL02T12C01]
gi|392673185|gb|EIY66648.1| hypothetical protein HMPREF1071_01409 [Bacteroides salyersiae
CL02T12C01]
Length = 446
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 51/98 (52%), Gaps = 1/98 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY+DG A F++P+ V +Y+AD +NHVIRKI+ G+ + G + + G DG
Sbjct: 350 GYADGPCEYAMFNEPRQMIVSADNELYLADTNNHVIRKISQDGIVSTVIGLAGQSGFMDG 409
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ A F F + + I D NQ +R++ ++
Sbjct: 410 TPEEALFDKPFGVALDTD-GTIYIGDSENQCVRRLAIE 446
>gi|251771416|gb|EES51995.1| conserved protein of unknown function [Leptospirillum
ferrodiazotrophum]
Length = 373
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 58/117 (49%), Gaps = 3/117 (2%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRK--ITNLGVTT 137
V LAG G +DG G+A F P++ A+ G +Y++D N++IRK + VTT
Sbjct: 148 AVTTLAGRAFPGSNDGTGGAAGFYGPRALALTPDGKTLYISDSGNNMIRKLDVATGTVTT 207
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
+AG G+ G DG ASF + + L ++D N LIR+I L S
Sbjct: 208 VAGKGALAPGMDDGIGGAASFRDPRGIALSSDGSVLYVADTRNNLIRKIVLATNAVS 264
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 56/112 (50%), Gaps = 5/112 (4%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRA 149
+ DG +ARF+ P+ A+ G +Y+AD N+ IRK +++ VTT+AG G
Sbjct: 104 FRDGNGTAARFNNPEGLAISSDGKTLYIADSRNNRIRKMDLSSGAVTTLAG--RAFPGSN 161
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
DG A F L P L ISD GN +IR++++ + + G+
Sbjct: 162 DGTGGAAGFYGPRALALTPDGKTLYISDSGNNMIRKLDVATGTVTTVAGKGA 213
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 48/102 (47%), Gaps = 5/102 (4%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTTIAGGGSKKE 146
+G DG ARF+ + G +YVAD N +IRK+T G VTTIAG
Sbjct: 44 RGAVDGTGSQARFEYVQGIVAAPDGTLYVADTGNDLIRKVTVSGGTATVTTIAGVNHHAR 103
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
R DG A F+N L L I+D N IR+++L
Sbjct: 104 FR-DGNGTAARFNNPEGLAISSDGKTLYIADSRNNRIRKMDL 144
>gi|307110397|gb|EFN58633.1| hypothetical protein CHLNCDRAFT_140870 [Chlorella variabilis]
Length = 415
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 90/202 (44%), Gaps = 25/202 (12%)
Query: 54 PGSSDL--IVLDSSRSAFYTLSFPLSEESVVK---------RLAG-DGVQGYSDGEPGSA 101
PGS I +D R Y + S +V + +AG DG GY DG G A
Sbjct: 42 PGSGRFHNIAVDDPRGPHYVYAVDTSRHTVGRIALDTGTWETVAGLDGTPGYRDGPEGPA 101
Query: 102 R----FDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQN 155
F++P + G++ VAD N IR+I V+T+A G + G ADGPA
Sbjct: 102 GQPALFNRPSAICQMPHGHLAVADTGNACIRQIDAATKQVSTLA-GRCGEPGAADGPAAE 160
Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC-SKSSQSGSALGAVSV-WVLVS 213
A F + + + C++ + D +R + + +C S+ SG + SV W+L
Sbjct: 161 AQFGSSIKSIACAN-CSVFVGDVSTGRLRLVRVDDAECLGASNPSGRFITRESVKWLLAG 219
Query: 214 VLSCLVSLVIGFVARPYIIRHT 235
V+ L L +G R Y+ RH
Sbjct: 220 VV-MLSILALG--GRRYLQRHA 238
>gi|182437571|ref|YP_001825290.1| hypothetical protein SGR_3778 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|178466087|dbj|BAG20607.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
NBRC 13350]
Length = 605
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 58/112 (51%), Gaps = 7/112 (6%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
L+V D++R L ++ V+R G G +G DG P ARF +P+ AV G +
Sbjct: 188 LLVSDTTRHRLVELD---ADGETVRRHFGTGERGLRDGGPDEARFSEPQGLAVLPDGRVA 244
Query: 119 VADKSNHVIRKIT-NLGVT-TIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
VAD NH IR + GVT T+AG G + + GPA+ S+ +++ +
Sbjct: 245 VADTVNHAIRALDLTTGVTSTLAGTGRQWWQGTPTSGPAREVDLSSPWDVAW 296
>gi|456874551|gb|EMF89837.1| hypothetical protein LEP1GSC005_3835 [Leptospira santarosai str.
ST188]
Length = 356
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
GV G+ +G+ ++ F P +D + N+YV + NH IRKI N G V+T++GG
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TL 217
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + +T+ +LL++D + IR+I+LK + + S+ G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 78/203 (38%), Gaps = 48/203 (23%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I LD Y ++ VV AG G++ DG +A F
Sbjct: 20 NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
P VD GNI+V+D+ ++IRKI G VTT+
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTSLLLEDPSGIKFDPITGDKYVSCK 138
Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
AG S +G +G N+SF F + + L + + GN
Sbjct: 139 DGAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197
Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
IR+INL S S+ SG LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGTLG 218
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G GY DG+ SA+F P D K + + VAD +H IRKI N V+T
Sbjct: 209 VSTLSG-GTLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267
Query: 139 AGGGSKKEGRADGPAQNASFS 159
G G + DG NAS +
Sbjct: 268 VGNGIGAD--IDGNGLNASLN 286
>gi|290977087|ref|XP_002671270.1| predicted protein [Naegleria gruberi]
gi|284084837|gb|EFC38526.1| predicted protein [Naegleria gruberi]
Length = 1095
Score = 56.6 bits (135), Expect = 1e-05, Method: Composition-based stats.
Identities = 53/161 (32%), Positives = 75/161 (46%), Gaps = 13/161 (8%)
Query: 31 GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
G+T T ++ L +P + +SDL + D +S + +AG G
Sbjct: 319 GFTDNTTVNLATLA-SPELMNFAKNTSDLYIADKGNHRIRKIS-----NGFITTIAGQGS 372
Query: 91 QGYSDGEPGSARFD---KPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKK 145
Y GE +R KPK A+D GNIY+AD SNH +RKI+ L +TTIAG GS
Sbjct: 373 PSYC-GENVDSRLSALSKPKGAALDSLGNIYIADTSNHRVRKISYLDGTITTIAGTGSFG 431
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A +A + + F I + I+D GN IR+I
Sbjct: 432 YNGDGILATSAQVNKPTGIAF-DSIGNIYIADSGNNRIRKI 471
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 43/111 (38%), Positives = 57/111 (51%), Gaps = 4/111 (3%)
Query: 79 ESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV 135
+ + +AG G GY+ DG SA+ +KP A D GNIY+AD N+ IRKI TN +
Sbjct: 418 DGTITTIAGTGSFGYNGDGILATSAQVNKPTGIAFDSIGNIYIADSGNNRIRKILTNGTI 477
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
TIAG G PA NA ++ +T V + I+D N IR I
Sbjct: 478 LTIAGVGLGGYNGDGIPATNAKLNSPVSVT-VDSNDLVYITDTYNHRIRLI 527
Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats.
Identities = 48/144 (33%), Positives = 67/144 (46%), Gaps = 9/144 (6%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DG-EPGSARF 103
NP S P S L + DS+ + FP + +AG G GY+ DG E A+
Sbjct: 56 NPASSALGPDGS-LFIADSNNHRIRQV-FP---NGTITTIAGTGFSGYNGDGIEAIRAQL 110
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
P S V+ G + +D N+ IRKI TN + TI G G +G A+NA+ + +
Sbjct: 111 KNPVSVVVNSMGEVLFSDNGNNRIRKILTNGTIITIVGTGVDSFSGDNGLARNAAINYPY 170
Query: 163 ELTFVPHICALLISDHGNQLIRQI 186
LT LL +D N IRQ+
Sbjct: 171 GLTLNSKE-ELLFTDVNNNRIRQV 193
Score = 42.4 bits (98), Expect = 0.20, Method: Composition-based stats.
Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 4/106 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
+ +AG GV G A P S A+ G++++AD +NH IR++ N +TTIAG
Sbjct: 35 ITTIAGGGV--CDGGLAIKASLANPASSALGPDGSLFIADSNNHRIRQVFPNGTITTIAG 92
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G A A N + V + +L SD+GN IR+I
Sbjct: 93 TGFSGYNGDGIEAIRAQLKNPVSVV-VNSMGEVLFSDNGNNRIRKI 137
Score = 41.2 bits (95), Expect = 0.51, Method: Composition-based stats.
Identities = 31/94 (32%), Positives = 47/94 (50%), Gaps = 11/94 (11%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNA 156
P + + + P+S A D N+Y+AD N+ IRK+ TN + T+AG G+ A NA
Sbjct: 550 PNNTKLNYPQSIAFDSSNNMYIADTYNNRIRKMFTNGTIITVAGTGTLGYNGDGIEATNA 609
Query: 157 SFSNDFELTFVPHIC----ALLISDHGNQLIRQI 186
+L + I LLI+D N IR++
Sbjct: 610 ------QLKYPQGIAIDGDELLIADSYNNRIRRV 637
Score = 38.9 bits (89), Expect = 2.6, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 35/60 (58%), Gaps = 3/60 (5%)
Query: 85 LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG G+ GY+ DG P + A+ + P S VD +Y+ D NH IR I G ++T+ GG
Sbjct: 480 IAGVGLGGYNGDGIPATNAKLNSPVSVTVDSNDLVYITDTYNHRIRLILPSGNISTVIGG 539
Score = 37.7 bits (86), Expect = 5.8, Method: Composition-based stats.
Identities = 28/88 (31%), Positives = 44/88 (50%), Gaps = 4/88 (4%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQN-ASFS 159
A+ + P AV G +Y +D SN + + TN +TT AG + G D N A+ +
Sbjct: 276 AKINSPYGVAVSSLGEVYFSD-SNRIRKVSTNGTITTFAG--TSLYGFTDNTTVNLATLA 332
Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
+ + F + L I+D GN IR+I+
Sbjct: 333 SPELMNFAKNTSDLYIADKGNHRIRKIS 360
>gi|197120448|ref|YP_002132399.1| fibronectin type III domain-containing protein [Anaeromyxobacter
sp. K]
gi|196170297|gb|ACG71270.1| Fibronectin type III domain protein [Anaeromyxobacter sp. K]
Length = 460
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 57/112 (50%), Gaps = 8/112 (7%)
Query: 80 SVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGV 135
+ V LAG Q G +DG SA F P+ A D +YVAD N+ IR + T + V
Sbjct: 244 AAVTTLAGSTTQAGAADGPGASATFRWPRGVATDGT-TLYVADTMNNAIRALDLATGV-V 301
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+T+AG + G ADG A FS + L H LL++D GN IR ++
Sbjct: 302 STLAGDPASYAGYADGRGTAARFSAPYGLAM--HGGELLVADSGNSAIRAVD 351
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 57/112 (50%), Gaps = 8/112 (7%)
Query: 82 VKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTT 137
V LAGD G+ DG +A F+ P+ D +YVAD +N+++R+I VTT
Sbjct: 190 VTTLAGDPRTYAGHQDGIGTAASFNSPRGITSDGT-YLYVADTANNLVRRIDLATAAVTT 248
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+AG + + G ADGP +A+F + L ++D N IR ++L
Sbjct: 249 LAGS-TTQAGAADGPGASATFR--WPRGVATDGTTLYVADTMNNAIRALDLA 297
>gi|422003150|ref|ZP_16350382.1| hypothetical protein LSS_06614 [Leptospira santarosai serovar
Shermani str. LT 821]
gi|417258114|gb|EKT87507.1| hypothetical protein LSS_06614 [Leptospira santarosai serovar
Shermani str. LT 821]
Length = 356
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
GV G+ +G+ ++ F P +D + N+YV + NH IRKI N G V+T++GG
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TL 217
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + +T+ +LL++D + IR+I+LK + + S+ G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 78/203 (38%), Gaps = 48/203 (23%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I LD Y ++ VV AG G++ DG +A F
Sbjct: 20 NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
P VD GNI+V+D+ ++IRKI G VTT+
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTSLLLEDPSGIKFDPITGDKYVSCK 138
Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
AG S +G +G N+SF F + + L + + GN
Sbjct: 139 DSAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197
Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
IR+INL S S+ SG LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGTLG 218
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G GY DG+ SA+F P D K + + VAD +H IRKI N V+T
Sbjct: 209 VSTLSG-GTLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267
Query: 139 AGGGSKKEGRADGPAQNASFS 159
G G + DG NAS +
Sbjct: 268 VGNGIGAD--IDGNGLNASLN 286
>gi|326778226|ref|ZP_08237491.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces griseus XylebKG-1]
gi|326658559|gb|EGE43405.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces griseus XylebKG-1]
Length = 615
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 58/112 (51%), Gaps = 7/112 (6%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
L+V D++R L ++ V+R G G +G DG P ARF +P+ AV G +
Sbjct: 198 LLVSDTTRHRLVELD---ADGETVRRHFGTGERGLRDGGPDEARFSEPQGLAVLPDGRVA 254
Query: 119 VADKSNHVIRKIT-NLGVT-TIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
VAD NH IR + GVT T+AG G + + GPA+ S+ +++ +
Sbjct: 255 VADTVNHAIRALDLTTGVTSTLAGTGRQWWQGTPTSGPAREVDLSSPWDVAW 306
>gi|29348305|ref|NP_811808.1| hypothetical protein BT_2896 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383121762|ref|ZP_09942466.1| hypothetical protein BSIG_1754 [Bacteroides sp. 1_1_6]
gi|29340208|gb|AAO78002.1| NHL repeat protein [Bacteroides thetaiotaomicron VPI-5482]
gi|251841348|gb|EES69429.1| hypothetical protein BSIG_1754 [Bacteroides sp. 1_1_6]
Length = 452
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 1/103 (0%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
R AG+GV G DG+ S+R D+P AVD +GN+ V +RKIT G + G
Sbjct: 348 RFAGNGVWGIIDGQGVSSRMDQPNQIAVDAEGNLLVTTVYGRTVRKITPEGYVSTYAGIG 407
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ G DG A A F+ + + + + + D N IR I
Sbjct: 408 YQTGYVDGLAAEAKFNKPYGIA-IDAQGNVYVGDCENWRIRVI 449
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 52/106 (49%), Gaps = 13/106 (12%)
Query: 31 GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF-----PLSEESVVKRL 85
G V +IDG + S +D+P + I +D+ + T + ++ E V
Sbjct: 351 GNGVWGIIDGQGVS----SRMDQP---NQIAVDAEGNLLVTTVYGRTVRKITPEGYVSTY 403
Query: 86 AGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AG G Q GY DG A+F+KP A+D +GN+YV D N IR I
Sbjct: 404 AGIGYQTGYVDGLAAEAKFNKPYGIAIDAQGNVYVGDCENWRIRVI 449
>gi|410448598|ref|ZP_11302672.1| hypothetical protein LEP1GSC068_3151 [Leptospira sp. Fiocruz
LV3954]
gi|410017668|gb|EKO79726.1| hypothetical protein LEP1GSC068_3151 [Leptospira sp. Fiocruz
LV3954]
Length = 356
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
GV G+ +G+ ++ F P +D + N+YV + NH IRKI N G V+T++GG
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TL 217
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + +T+ +LL++D + IR+I+LK + + S+ G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 78/203 (38%), Gaps = 48/203 (23%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I LD Y ++ VV AG G++ DG +A F
Sbjct: 20 NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
P VD GNI+V+D+ ++IRKI G VTT+
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTSLLLEDPSGIKFDPITGDKYVSCK 138
Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
AG S +G +G N+SF F + + L + + GN
Sbjct: 139 DSAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197
Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
IR+INL S S+ SG LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGTLG 218
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 40/81 (49%), Gaps = 6/81 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G GY DG+ SA+F P D K + + VAD +H IRKI N V+T
Sbjct: 209 VSTLSG-GTLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267
Query: 139 AGGGSKKEGRADGPAQNASFS 159
G G DG NAS +
Sbjct: 268 VGNGIG--ANIDGNGLNASLN 286
>gi|359685014|ref|ZP_09255015.1| hypothetical protein Lsan2_10264 [Leptospira santarosai str.
2000030832]
Length = 356
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
GV G+ +G+ ++ F P +D + N+YV + NH IRKI N G V+T++GG
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TL 217
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + +T+ +LL++D + IR+I+LK + + S+ G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/203 (27%), Positives = 78/203 (38%), Gaps = 48/203 (23%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I LD Y ++ VV AG G+Q DG +A F
Sbjct: 20 NPSIFMSRAWAENTI-LDCILKECYLCKLKVTNNPVVSLFAGTGIQESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
P VD GNI+V+D+ ++IRKI G VTT+
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTSLLLEDPSGIKFDPITGDKYVSCK 138
Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
AG S +G +G N+SF F + + L + + GN
Sbjct: 139 DSAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197
Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
IR+INL S S+ SG LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGTLG 218
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G GY DG+ SA+F P D K + + VAD +H IRKI N V+T
Sbjct: 209 VSTLSG-GTLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267
Query: 139 AGGGSKKEGRADGPAQNASFS 159
G G + DG NAS +
Sbjct: 268 VGNGIGAD--IDGNGLNASLN 286
>gi|398331208|ref|ZP_10515913.1| hypothetical protein LalesM3_04029 [Leptospira alexanderi serovar
Manhao 3 str. L 60]
Length = 357
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
++G+ +G+ ++ F P +D + N+YV + NH IRKI N G V+T++GG S G
Sbjct: 163 IEGFQNGDRLNSLFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGTVSTLSGGVS---G 219
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
DG +A F + +T+ +LL++D N IR+I+LK + S+ G+ +GA
Sbjct: 220 YLDGDLASAQFKSPLGITYDQKTDSLLVADLQNHRIRKIDLKTS--TVSTFLGNGIGA 275
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 2/94 (2%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + I LD Y ++ V+ AG G+ DG +A F+
Sbjct: 22 NPSIFMSRAWVENTI-LDCILKECYLCKLKVTNNPVISLFAGTGINVSIDGTTQTASFNT 80
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
P +D GNI+V+D+S ++IRKI G +TT+
Sbjct: 81 PFGLELDTFGNIFVSDQSANLIRKIDRFGNITTL 114
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 57/108 (52%), Gaps = 5/108 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAG 140
V L+G GV GY DG+ SA+F P D K + + VAD NH IRKI +L +T++
Sbjct: 210 VSTLSG-GVSGYLDGDLASAQFKSPLGITYDQKTDSLLVADLQNHRIRKI-DLKTSTVST 267
Query: 141 G-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G+ DG NASF+ ++F + + +SD + IR ++
Sbjct: 268 FLGNGIGASIDGKGLNASFNGPAFISF-DNSGYMFVSDTNSNKIRIVD 314
>gi|418754100|ref|ZP_13310334.1| hypothetical protein LEP1GSC179_2637 [Leptospira santarosai str.
MOR084]
gi|409965522|gb|EKO33385.1| hypothetical protein LEP1GSC179_2637 [Leptospira santarosai str.
MOR084]
Length = 356
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
GV G+ +G+ ++ F P +D + N+YV + NH IRKI N G V+T++GG
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TL 217
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + +T+ +LL++D + IR+I+LK + + S+ G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 78/203 (38%), Gaps = 48/203 (23%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I+ D Y ++ VV AG G++ DG +A F
Sbjct: 20 NPSIFMSRAWAENTII-DCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
P VD GNI+V+D+ ++IRKI G VTT+
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTSLLLEDPSGIKFDPITGDKYVSCK 138
Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
AG S +G +G N+SF F + + L + + GN
Sbjct: 139 DSAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197
Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
IR+INL S S+ SG LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGTLG 218
Score = 40.0 bits (92), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G GY DG+ SA+F P D K + + VAD +H IRKI N V+T
Sbjct: 209 VSTLSG-GTLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267
Query: 139 AGGGSKKEGRADGPAQNASFS 159
G G + DG NAS +
Sbjct: 268 VGNGIGAD--IDGNGLNASLN 286
>gi|332668794|ref|YP_004451801.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Cellulomonas fimi ATCC 484]
gi|332337831|gb|AEE44414.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Cellulomonas fimi ATCC 484]
Length = 639
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 79/163 (48%), Gaps = 26/163 (15%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + L+V D+ L+ E++V+R+ G G +G+ DG P ARF +P
Sbjct: 181 PAKAVALPGGT-LLVADAGHHGLAELA--ADGETLVRRI-GSGERGFVDGGPDEARFSEP 236
Query: 107 KSFAVDMKG-------NIYVADKSNHVIR--KITNLGVTTIAGGGSK-KEGRADG----- 151
A+ G ++ VAD NH +R ++++ VTT+AG G + G AD
Sbjct: 237 NGLALVPDGLRATLGYDVLVADTVNHALRGVRLSDGSVTTVAGTGEQLMVGAADNVRGGG 296
Query: 152 -------PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
PA++ S+ +++ + P + A +++ GN + +
Sbjct: 297 TGAGYDGPARDVRLSSPWDVAWSPVLGAFVVAMAGNHTLWTFD 339
Score = 38.1 bits (87), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 48/106 (45%), Gaps = 3/106 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
V+ +AG +G DG A F +P +V G ++VAD +R + + V T
Sbjct: 343 VRHVAGTMNEGLLDGPGEGAWFAQPSGLSVGPDGRVWVADSETSALRWLDPADGTVGTAV 402
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
G G G DG A A + + +P ++L++D N +R+
Sbjct: 403 GEGLFDFGHRDGRADEARMQHPLGVAALPD-GSVLVADTYNGAVRR 447
>gi|32472743|ref|NP_865737.1| hypothetical protein RB3815 [Rhodopirellula baltica SH 1]
gi|32443980|emb|CAD73422.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length = 351
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 54/102 (52%), Gaps = 5/102 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-----VTTIAGGGSKKE 146
GY DG SARF+K + +D + +Y++D +NH +R++ V T AG G +
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTTDGEWMVDTYAGQGKEGP 165
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ ++A+F +T LLI+D GNQ++R I+L
Sbjct: 166 ATDNVDRRDATFHEPISVTLDAEGNRLLIADIGNQVVRSIDL 207
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
+ T + G ++ +D G+ L+VL+ + + + + LAG G +G
Sbjct: 211 LVTTLAGRTSKLKDPRAVDLDGNRRLLVLERNGNRLRRVK----SNGDITTLAGSGKKGK 266
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+DG+ A F+ PK V G +Y+AD NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 48/78 (61%), Gaps = 6/78 (7%)
Query: 110 AVDMKGN--IYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
AVD+ GN + V +++ + +R++ + G +TT+AG G K+G+ADG A+ ASF+ +
Sbjct: 227 AVDLDGNRRLLVLERNGNRLRRVKSNGDITTLAGSG--KKGKADGDAKQASFNGPKHMDV 284
Query: 167 VPHICALLISDHGNQLIR 184
P + I+D N L+R
Sbjct: 285 APD-GRVYIADDVNHLVR 301
>gi|418747486|ref|ZP_13303786.1| hypothetical protein LEP1GSC163_2583 [Leptospira santarosai str.
CBC379]
gi|410791609|gb|EKR89564.1| hypothetical protein LEP1GSC163_2583 [Leptospira santarosai str.
CBC379]
Length = 356
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
GV G+ +G+ ++ F P +D + N+YV + NH IRKI N G V+T++GG
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TL 217
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + +T+ +LL++D + IR+I+LK + + S+ G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 78/203 (38%), Gaps = 48/203 (23%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I+ D Y ++ VV AG G++ DG +A F
Sbjct: 20 NPSIFMSRAWAENTII-DCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
P VD GNI+V+D+ ++IRKI G VTT+
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTNLLLEDPSGIKFDPITGDKYVSCK 138
Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
AG S +G +G N+SF F + + L + + GN
Sbjct: 139 DSAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197
Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
IR+INL S S+ SG LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGTLG 218
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G GY DG+ SA+F P D K + + VAD +H IRKI N V+T
Sbjct: 209 VSTLSG-GTLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267
Query: 139 AGGGSKKEGRADGPAQNASFS 159
G G + DG NAS +
Sbjct: 268 VGNGIGAD--IDGNGLNASLN 286
>gi|290985345|ref|XP_002675386.1| predicted protein [Naegleria gruberi]
gi|284088982|gb|EFC42642.1| predicted protein [Naegleria gruberi]
Length = 788
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 56/109 (51%), Gaps = 4/109 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
++ +AG G +G+S DG P + A+ + P V +Y+ D N IRKI G + T
Sbjct: 241 IINTIAGTGEEGFSGDGGPATNAQLNVPCGVFVSATNEVYIVDSGNCRIRKILRNGIINT 300
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
IAG G + GPA NA SN + + FV + I+D N IR+I
Sbjct: 301 IAGTGEEGYSGDGGPAINAQISNVYNI-FVSQNDEVYIADTNNHRIRKI 348
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 6/106 (5%)
Query: 85 LAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
+AG G G+S G+ G +A+ P S V +Y+AD N+ +RKI G + TIAG
Sbjct: 189 IAGTGEDGFS-GDGGLAINAKLGYPCSVFVSKDDEVYIADSQNNRVRKILRNGIINTIAG 247
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + GPA NA + + FV + I D GN IR+I
Sbjct: 248 TGEEGFSGDGGPATNAQLNVPCGV-FVSATNEVYIVDSGNCRIRKI 292
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 47/95 (49%), Gaps = 17/95 (17%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
++ +AG G +GYS DG P +A+ + V +Y+AD +NH IRKI G + T
Sbjct: 297 IINTIAGTGEEGYSGDGGPAINAQISNVYNIFVSQNDEVYIADTNNHRIRKILKDGTIET 356
Query: 138 IAGGGSKKEGRADGPAQNASFSND--FELTFVPHI 170
IAG G K FS D F+ + PHI
Sbjct: 357 IAGNGEK------------GFSGDSPFDFSSHPHI 379
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 42/88 (47%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
SA D P S AV +Y+AD + IRKI N + TIAG G + GPA NA
Sbjct: 94 SAMLDFPTSIAVSSDNEVYIADLGIYRIRKILRNGNIVTIAGTGEEGFSGDGGPAINAQI 153
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
S + + + L D+ N IR+I
Sbjct: 154 SAVWNIALSQN-NELYFVDYSNCRIRKI 180
Score = 38.5 bits (88), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 49/105 (46%), Gaps = 4/105 (3%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG G +G+S DG P +A+ + A+ +Y D SN IRKI G + TIAG
Sbjct: 133 IAGTGEEGFSGDGGPAINAQISAVWNIALSQNNELYFVDYSNCRIRKILRNGTIVTIAGT 192
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G A NA + FV + I+D N +R+I
Sbjct: 193 GEDGFSGDGGLAINAKLGYPCSV-FVSKDDEVYIADSQNNRVRKI 236
>gi|227537797|ref|ZP_03967846.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
gi|227242411|gb|EEI92426.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
Length = 441
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 1/104 (0%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
KR+ G G+ DG A F+ PK D GN+++AD NH IR ++ G+ + G
Sbjct: 336 KRINAPGGSGFRDGPVADAVFNFPKDIKFDRAGNMFIADYGNHCIRMLSADGIVSTVAGQ 395
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
K G DG + F+ + + V + I+D N IR++
Sbjct: 396 PTKAGYKDGGPVESQFNQPWGVA-VNDQGDIYIADWNNARIRKL 438
>gi|290994196|ref|XP_002679718.1| predicted protein [Naegleria gruberi]
gi|284093336|gb|EFC46974.1| predicted protein [Naegleria gruberi]
Length = 762
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 57/105 (54%), Gaps = 4/105 (3%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG+G G+S DG +A+ + P S V K +++AD+ NH IRKI G + TIAG
Sbjct: 74 IAGNGKGGFSGDDGLATNAQLNCPSSVFVSNKNEVFIADQYNHRIRKILESGRIVTIAGN 133
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + +GPA +A + + FV ++ + D N +R+I
Sbjct: 134 GEEGFSGDNGPATSARLNCPMSI-FVSNMNEVYFVDSNNNRVRKI 177
Score = 45.1 bits (105), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 46/96 (47%), Gaps = 13/96 (13%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E V +AG G QG+S DG P + A+ P V + IY AD +N+ IRK+ N
Sbjct: 283 ENGNVVTVAGTGQQGFSGDGGPATRAQLACPTGLFVTLNNEIYFADSANNRIRKVLENGN 342
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
+ TIAG G K D P F+ + PHI
Sbjct: 343 IVTIAGNG-KHGYSGDAP---------FDFSLHPHI 368
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 41/87 (47%), Gaps = 2/87 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
A+ P V IY+AD+ NH +RKI G + TIAG G DG A NA
Sbjct: 36 AKLCYPFGVFVSPTNEIYIADQYNHRVRKILESGRIVTIAGNGKGGFSGDDGLATNAQL- 94
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
N FV + + I+D N IR+I
Sbjct: 95 NCPSSVFVSNKNEVFIADQYNHRIRKI 121
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 34/60 (56%), Gaps = 3/60 (5%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG+G +G+S +G SAR + P S V +Y D +N+ +RKI G + TIAG
Sbjct: 130 IAGNGEEGFSGDNGPATSARLNCPMSIFVSNMNEVYFVDSNNNRVRKILENGIIVTIAGN 189
>gi|290983166|ref|XP_002674300.1| predicted protein [Naegleria gruberi]
gi|284087889|gb|EFC41556.1| predicted protein [Naegleria gruberi]
Length = 1435
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 83/174 (47%), Gaps = 27/174 (15%)
Query: 31 GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL----- 85
G ++T + +G+ L+ + ++RP + SS+ Y F S++++L
Sbjct: 140 GTSITGLYNGNNLK-GTLTALNRPIGLFYTEIFSSQGELY---FADESNSLIRKLVIANG 195
Query: 86 -----AGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-T 137
AG+ V GYS GE SA P S V G IY+AD N VIRK++NLG+ T
Sbjct: 196 TVSNVAGNVVAGYSGDGGEATSASLRYPSSVYVSNSGEIYIADTFNCVIRKVSNLGIIGT 255
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQI 186
+AG G + G Q++ + + +L F I L I D GN IR++
Sbjct: 256 VAGVGGQC-----GYNQDSINATESKLNFPTGITVSDSGDLYIVDKGNHRIRKV 304
Score = 37.0 bits (84), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 43/167 (25%), Positives = 75/167 (44%), Gaps = 23/167 (13%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARF 103
P+ + +P + DL + D+ + S + R G G+ GY +GE + +
Sbjct: 450 PYEIAIQPSTQDLFISDTHNHRILRVD---STSGRIYRFMGSGIPGY-NGEGSDLLNLQL 505
Query: 104 DKPK--SFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
+ P+ SF+ D+ + V+D NH IRK+ N +TI G + G + P+ +D
Sbjct: 506 NNPQGLSFSGDI---LIVSDTFNHRIRKVFNSTSSTILG---TQFGGKEPPS--TLVLSD 557
Query: 162 FELTFVPHIC-----ALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
L F P L ++D N LIR ++ + +SQ+ S +
Sbjct: 558 IMLKF-PRFALLKGTILAVADTKNYLIRDFDISTQSVYGNSQTLSTI 603
Score = 37.0 bits (84), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 3/91 (3%)
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG-SKKEGRADG-PAQ 154
EP A+ A + K ++ +D NH I KI N + +AG G + + DG A
Sbjct: 384 EPTIAKLKYTYGVATNSKNEVFFSDAYNHRIVKIENGVLKQVAGKGVAGFDNNIDGLLAT 443
Query: 155 NASFSNDFELTFVPHICALLISD-HGNQLIR 184
+ S + +E+ P L ISD H ++++R
Sbjct: 444 DVSLNYPYEIAIQPSTQDLFISDTHNHRILR 474
>gi|290975594|ref|XP_002670527.1| predicted protein [Naegleria gruberi]
gi|284084087|gb|EFC37783.1| predicted protein [Naegleria gruberi]
Length = 443
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G SA+ P V +Y++D N+ IRK+ N
Sbjct: 39 ENGNIVTIAGNGTAGFSGDNGPATSAQLYNPFGIFVSSNNEVYISDSGNNRIRKVLENGN 98
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+ +GPA +A N F + FV + ISD N +IR+I
Sbjct: 99 IITIAGNGTVGFSGDNGPATSAQLYNPFGI-FVSANNEVYISDSNNNIIRKI 149
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
A+ + P+ V +Y+AD+ N IRK+ G + TIAG G+ +GPA +A
Sbjct: 8 AQLNYPRCVFVSSNNEVYIADEGNQRIRKVVENGNIVTIAGNGTAGFSGDNGPATSAQLY 67
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
N F + FV + ISD GN IR++
Sbjct: 68 NPFGI-FVSSNNEVYISDSGNNRIRKV 93
Score = 44.7 bits (104), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 17/98 (17%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G SA+ P V +Y++D +N++IRKI N
Sbjct: 95 ENGNIITIAGNGTVGFSGDNGPATSAQLYNPFGIFVSANNEVYISDSNNNIIRKILENGN 154
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND--FELTFVPHI 170
+ TIAG G++ FS D F++ PH+
Sbjct: 155 IVTIAGNGTR------------GFSGDSPFDIRMYPHV 180
>gi|300772471|ref|ZP_07082341.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
gi|300760774|gb|EFK57600.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
Length = 441
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 1/104 (0%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
KR+ G G+ DG A F+ PK D GN+++AD NH IR ++ G+ + G
Sbjct: 336 KRINAPGGSGFRDGPVADAVFNFPKDIKFDRAGNMFIADYGNHCIRMLSADGIVSTVAGQ 395
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
K G DG + F+ + + V + I+D N IR++
Sbjct: 396 PTKAGYKDGGPVESQFNQPWGVA-VNDQGDIYIADWNNARIRKL 438
>gi|290971645|ref|XP_002668597.1| predicted protein [Naegleria gruberi]
gi|284082068|gb|EFC35853.1| predicted protein [Naegleria gruberi]
Length = 223
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 64/119 (53%), Gaps = 14/119 (11%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S+ +++ +AG+G +G+S DG A+ + P S V+ G IY+ D +N IRK+
Sbjct: 86 VSKSGIIETIAGNGNEGFSGDDGLATQAQLNCPMSTFVNSNGEIYITDSNNFRIRKVQRN 145
Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQI 186
G+ TIAG G+ DG + S + + +L + H+ + ISD GN IR++
Sbjct: 146 GIIKTIAGNGN------DGFEGDGSSATNAQLNYPMHVFVSSNDEMYISDSGNNAIRKV 198
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 54/106 (50%), Gaps = 7/106 (6%)
Query: 85 LAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
+AG+G G S G+ G +A + P + V +Y+ D +NH IRK++ G + TIAG
Sbjct: 40 IAGNGKLGCS-GDGGLAINAELNCPMNVFV-FNEEVYITDSANHRIRKVSKSGIIETIAG 97
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G++ DG A A + TFV + I+D N IR++
Sbjct: 98 NGNEGFSGDDGLATQAQLNCPMS-TFVNSNGEIYITDSNNFRIRKV 142
Score = 37.0 bits (84), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ ++K +AG+G G+ DG + A+ + P V +Y++D N+ IRK++
Sbjct: 142 VQRNGIIKTIAGNGNDGFEGDGSSATNAQLNYPMHVFVSSNDEMYISDSGNNAIRKVSRN 201
Query: 134 G-VTTIAGGGSKKEG 147
G + T+AG G +G
Sbjct: 202 GIIETVAGEGPYSQG 216
>gi|386722458|ref|YP_006188784.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus K02]
gi|384089583|gb|AFH61019.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus K02]
Length = 537
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 72/159 (45%), Gaps = 18/159 (11%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G+S+G AR + P AV +G + +AD NH +R + +TT+AG G +K G DG
Sbjct: 302 GFSNGAALQARLNYPMGIAVTEEGGLLIADSQNHAVRYLFGGQLTTLAGAGEQKTGLLDG 361
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK-PEDCSKSSQSGSALGAVSVWV 210
A + ++ + L+ N+L R + PED +G AL V
Sbjct: 362 MEGKAGLNRPADVAVLGDGSVLVADSFNNRLRRLTGYRLPEDLP----AGDALKVV---- 413
Query: 211 LVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEAL 249
L + V+ F A+P I GRL P++ EAL
Sbjct: 414 -------LDAGVMKFDAQPEITE--GRLMVPVRAVTEAL 443
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 55/189 (29%), Positives = 79/189 (41%), Gaps = 21/189 (11%)
Query: 29 EDGYTVTTVIDGH---QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
+ GY V ++DG L P + P S L V D+ A + + V +
Sbjct: 109 DKGYPVGGLLDGAANASLFQEPLGLSAGPDGS-LYVADAGNHAIRRID----AKGNVTTV 163
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
AG G G DG+ +A F +P A G +YVAD H IR+I+ G T SK+
Sbjct: 164 AGSGRLGVKDGKGAAAEFYRPGDVAAAPDGTLYVADTLGHTIRRISPQGEVTTLTAPSKR 223
Query: 146 -----EGR-------ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
G+ ADGP A F+ + L +SD GNQ +R I+L
Sbjct: 224 VVEATPGQVAAAGDFADGPLAQAKFNEPTGIALDAK-GNLYVSDSGNQRVRYIDLAKGTV 282
Query: 194 SKSSQSGSA 202
+ + G+A
Sbjct: 283 TTVAGGGTA 291
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 52/111 (46%), Gaps = 8/111 (7%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
AG G G ++G A F P AV G V+D N VIRK+T V +AG +K
Sbjct: 48 AGTGRLGSANGTGLEASFRVPAGLAVLPDGTAAVSDSRNGVIRKLTGGRVDVLAGVFYRK 107
Query: 146 E-------GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ G DG A + F L+ P +L ++D GN IR+I+ K
Sbjct: 108 DDKGYPVGGLLDGAANASLFQEPLGLSAGPD-GSLYVADAGNHAIRRIDAK 157
>gi|116625892|ref|YP_828048.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116229054|gb|ABJ87763.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 2384
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 65/131 (49%), Gaps = 9/131 (6%)
Query: 78 EESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TN 132
+ ++ +AG GVQG++ G GSA+ P A D GN+Y+AD N IR+I T
Sbjct: 138 QTGMISTVAGTGVQGFAGDGGLAGSAQLSHPAGIAFDSAGNLYIADMGNFRIRRIDGQTG 197
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP-HICALLISDHGNQLIRQINLKPE 191
+ +TTIAG GS A A S + P LL+S+ G IR+++L
Sbjct: 198 V-ITTIAGDGSNTTSPDGVLAAGAGLSQPIWV--APDRSGGLLVSEMGAMRIRRVDLAGG 254
Query: 192 DCSKSSQSGSA 202
+ + +G+A
Sbjct: 255 GLTTVAGNGNA 265
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 71/138 (51%), Gaps = 12/138 (8%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVD 112
S+++ + DS + + +P ++ ++AG+G S G+ GSA ++P A D
Sbjct: 62 SNNVFIADSGNNRVVVVQYP---SGILYQIAGNGTA-TSSGDGGSALQSSVNRPMGLAAD 117
Query: 113 MKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
GN+Y+++ + IR+I T + ++T+AG G + G A +A S+ + F
Sbjct: 118 FNGNLYISEFQGNRIRRIDMQTGM-ISTVAGTGVQGFAGDGGLAGSAQLSHPAGIAF-DS 175
Query: 170 ICALLISDHGNQLIRQIN 187
L I+D GN IR+I+
Sbjct: 176 AGNLYIADMGNFRIRRID 193
>gi|196233467|ref|ZP_03132310.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
gi|196222463|gb|EDY16990.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
Length = 509
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 10/105 (9%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
AG+ +G +DG +A+F P A+D GN+ VAD++N +R+I+ G T G +
Sbjct: 205 FAGE-TEGAADGAGVTAQFKNPSGLAIDEHGNVDVADRTNQTLRRISPSGETETIAGTAG 263
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ G ADG + A F + P L I +HGN + + K
Sbjct: 264 EHGTADGAGKAARF-------WYPQ--GLAIDEHGNVYVADTSAK 299
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 53/98 (54%), Gaps = 7/98 (7%)
Query: 46 NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
NP + ID G+ D V D + +S P E + AG+ G +DG +ARF
Sbjct: 224 NPSGLAIDEHGNVD--VADRTNQTLRRIS-PSGETETIAGTAGE--HGTADGAGKAARFW 278
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGG 141
P+ A+D GN+YVAD S +RK+T N V+T+AGG
Sbjct: 279 YPQGLAIDEHGNVYVADTSAKSVRKVTPNGQVSTLAGG 316
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 60/132 (45%), Gaps = 12/132 (9%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGN 116
D+ V DSS ++ + V LAG G G +G ARF P A+ G
Sbjct: 330 DVFVTDSSNHVIQKIA----HDGKVSVLAGKPGESGAKNGHGEEARFHFPNGIAIGADGA 385
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA--LL 174
+YV + S+ IRKI G T G ++EGR DG A F + LT + L
Sbjct: 386 LYVVEHSS--IRKIDGKGNVTTFAGSLEEEGRNDGTGAAARF---WGLTAIAADARGNLY 440
Query: 175 ISDHGNQLIRQI 186
++DH + +R+I
Sbjct: 441 VADHELKNLRRI 452
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 35/68 (51%), Gaps = 10/68 (14%)
Query: 78 EESVVKRLAGDG----------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
E S ++++ G G +G +DG +ARF + A D +GN+YVAD +
Sbjct: 390 EHSSIRKIDGKGNVTTFAGSLEEEGRNDGTGAAARFWGLTAIAADARGNLYVADHELKNL 449
Query: 128 RKITNLGV 135
R+I+ GV
Sbjct: 450 RRISRSGV 457
>gi|421614357|ref|ZP_16055418.1| NHL repeat containing protein [Rhodopirellula baltica SH28]
gi|408494877|gb|EKJ99474.1| NHL repeat containing protein [Rhodopirellula baltica SH28]
Length = 351
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-----VTTIAGGGSKKE 146
GY DG SARF+K + +D + +Y++D +NH +R++ V T AG G +
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHCVRRLIQTSDGEWMVDTYAGQGEEGP 165
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ ++A+F +T LLI+D GNQ++R I+
Sbjct: 166 ATDNVDRRDATFHEPISVTLDAEGNRLLIADIGNQVVRSIDF 207
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
+ T + G + ++ +D G+ L+VL+ + + L + LAG G +G
Sbjct: 211 LVTTLAGRKSKLKDPRAVDLDGNGRLLVLERNGNRLRRLE----SNGDITTLAGSGKKGT 266
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+DG+ A F+ PK V G +Y+AD NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 49/85 (57%), Gaps = 4/85 (4%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
++ P++ +D G + V +++ + +R++ +N +TT+AG G K+G ADG A+ ASF+
Sbjct: 220 SKLKDPRAVDLDGNGRLLVLERNGNRLRRLESNGDITTLAGSG--KKGTADGDAKQASFN 277
Query: 160 NDFELTFVPHICALLISDHGNQLIR 184
+ P + I+D N L+R
Sbjct: 278 GPKHMDVAPD-GRVYIADDVNHLVR 301
>gi|158315874|ref|YP_001508382.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158111279|gb|ABW13476.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 772
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 65/127 (51%), Gaps = 15/127 (11%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + +AG G G+S DG P + A+ D+P S AV G +YVAD NH +R+I
Sbjct: 436 VDRAGTITTVAGTGADGFSGDGGPATEAQLDEPTSVAVAPDGTLYVADTRNHRVRRIGRD 495
Query: 134 G-VTTIAG-------GGSKKEGRA---DG-PAQNASFSNDFELTFVPHICALLISDHGNQ 181
G +TTIAG G ++G A DG PA NA N + +LLI+D N
Sbjct: 496 GIITTIAGQDEFGFAGEVSEDGLAYSGDGLPAVNAKL-NYPNTVLMETDGSLLIADGENN 554
Query: 182 LIRQINL 188
+R+I L
Sbjct: 555 RVRRIGL 561
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 61/125 (48%), Gaps = 5/125 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKS-FAVDMKGNIYVADKSNHVIRKITN 132
++ + + LAG G GYS DG P A+ + + VD GN+Y++D ++ +R+I
Sbjct: 615 IAGDGTISTLAGTGKTGYSGDGGPADQAQINAVGADLVVDAAGNVYLSDPGSNRVRRIAP 674
Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
G +TTIAG G K GPA A L + L I+D + +R + L P
Sbjct: 675 DGTITTIAGTGVSKYSGNGGPATAAELVYPGGLAL-DQLGNLYIADGIDSRVRAVRLPPG 733
Query: 192 DCSKS 196
C+ +
Sbjct: 734 SCAAA 738
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 1/88 (1%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
SARF P + A G++YVAD+ NH +R+I G ++T+AG G GPA A
Sbjct: 585 SARFSYPSALARGPDGSLYVADQDNHRVRRIAGDGTISTLAGTGKTGYSGDGGPADQAQI 644
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
+ V + +SD G+ +R+I
Sbjct: 645 NAVGADLVVDAAGNVYLSDPGSNRVRRI 672
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 56/101 (55%), Gaps = 3/101 (2%)
Query: 88 DGVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKK 145
DG+ DG P +A+ + P + ++ G++ +AD N+ +R+I G +TTIAG G++
Sbjct: 516 DGLAYSGDGLPAVNAKLNYPNTVLMETDGSLLIADGENNRVRRIGLDGIITTIAGTGAEG 575
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G GPA +A FS L P +L ++D N +R+I
Sbjct: 576 FGGDGGPATSARFSYPSALARGPD-GSLYVADQDNHRVRRI 615
>gi|302552781|ref|ZP_07305123.1| NHL repeat containing protein [Streptomyces viridochromogenes DSM
40736]
gi|302470399|gb|EFL33492.1| NHL repeat containing protein [Streptomyces viridochromogenes DSM
40736]
Length = 605
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 71/139 (51%), Gaps = 14/139 (10%)
Query: 39 DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
DG + P + R PG S + +V D++R L+ ESVV+R+ G G +
Sbjct: 165 DGPYVAPEPEPTVLRFPGKALLLPSGNFLVSDTTRHQLVELAG--DGESVVRRI-GSGAR 221
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSK--KEG 147
G++DG A F++P+ + G++ VAD NH +R+ + VTT+AG G + +
Sbjct: 222 GFADGSADEAAFNEPQGLTLLDDGSVVVADTVNHALRRLDLATGEVTTLAGTGRQWWQGS 281
Query: 148 RADGPAQNASFSNDFELTF 166
GPA+ S+ +++ +
Sbjct: 282 PTSGPAREVDLSSPWDVAW 300
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 55/114 (48%), Gaps = 3/114 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
GDG + EP RF P + GN V+D + H + ++ G + + GS
Sbjct: 164 GDGPYVAPEPEPTVLRF--PGKALLLPSGNFLVSDTTRHQLVELAGDGESVVRRIGSGAR 221
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
G ADG A A+F+ LT + +++++D N +R+++L + + + +G
Sbjct: 222 GFADGSADEAAFNEPQGLTLLDD-GSVVVADTVNHALRRLDLATGEVTTLAGTG 274
>gi|290995436|ref|XP_002680301.1| predicted protein [Naegleria gruberi]
gi|284093921|gb|EFC47557.1| predicted protein [Naegleria gruberi]
Length = 699
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 59/112 (52%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G+ G+S +G +A+ P S V +Y+AD+ NH IRKI N
Sbjct: 95 ENGNIITIAGNGIHGFSGDNGLATNAQLYTPCSVFVSSNNEVYIADQGNHRIRKILENGN 154
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G +G A NA ++ + + FV + I+D+ N IR+I
Sbjct: 155 IVTIAGNGIHGFSGDNGLATNAQLNSSYSV-FVSSNNEVYIADYFNNRIRKI 205
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G +G S +G +A+ ++P S V +Y+AD+ N+ IRKI N
Sbjct: 39 ENGTIVTIAGNGTKGSSGDNGLATNAQLNRPYSVFVSSNNEVYIADQGNNRIRKILENGN 98
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G +G A NA + FV + I+D GN IR+I
Sbjct: 99 IITIAGNGIHGFSGDNGLATNAQLYTPCSV-FVSSNNEVYIADQGNHRIRKI 149
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 48/88 (54%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
SA+ ++P + V +Y+AD N+ IRKI G + TIAG G+K +G A NA
Sbjct: 7 SAQLNRPNNVFVSSNNEVYIADCFNNRIRKILENGTIVTIAGNGTKGSSGDNGLATNAQL 66
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
+ + + FV + I+D GN IR+I
Sbjct: 67 NRPYSV-FVSSNNEVYIADQGNNRIRKI 93
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 56/109 (51%), Gaps = 17/109 (15%)
Query: 77 SEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNL 133
+E + +AG+G+ G++ +G +AR + P S V +Y+AD N+ IRKI N
Sbjct: 226 NENGNIITIAGNGIHGFNGDNGLATNARLNHPFSVFVSSNNEVYIADYYNNRIRKILENG 285
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSND--FELTFVPHICALLISDHGN 180
+ TIAG G+ A FS D F++ PHI L++ +GN
Sbjct: 286 NIITIAGNGT------------AGFSGDSPFDIRTYPHIGNKLLTGNGN 322
Score = 43.5 bits (101), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 58/132 (43%), Gaps = 24/132 (18%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI----- 130
E + +AG+G+ G+S +G +A+ + S V +Y+AD N+ IRKI
Sbjct: 151 ENGNIVTIAGNGIHGFSGDNGLATNAQLNSSYSVFVSSNNEVYIADYFNNRIRKILENGN 210
Query: 131 ----------------TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
N + TIAG G +G A NA ++ F + FV +
Sbjct: 211 IITIAGNGTHGFNGDNENGNIITIAGNGIHGFNGDNGLATNARLNHPFSV-FVSSNNEVY 269
Query: 175 ISDHGNQLIRQI 186
I+D+ N IR+I
Sbjct: 270 IADYYNNRIRKI 281
>gi|302036371|ref|YP_003796693.1| hypothetical protein NIDE1006 [Candidatus Nitrospira defluvii]
gi|300604435|emb|CBK40767.1| conserved protein of unknown function, contains NHL repeats
[Candidatus Nitrospira defluvii]
Length = 404
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 51/97 (52%), Gaps = 5/97 (5%)
Query: 95 DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADG 151
DG P AR + P + AVD GN+Y+AD NH +RK+ +T +AG G + G
Sbjct: 137 DGGPARRARLNFPSAVAVDRAGNLYIADTMNHRVRKVDGATGIITNVAGTGQARYSGDGG 196
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
PA +A+ + L AL I+D N +R+++L
Sbjct: 197 PAVSAAINEPTGLAVSDE--ALYIADQSNNRVRRVDL 231
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 39/72 (54%), Gaps = 7/72 (9%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKEGRADGPAQNAS 157
++PK VD KGN+Y+AD NHV+R++ +TT+AG G+ + PA A
Sbjct: 34 LNEPKGLCVDRKGNLYIADSENHVVRRVDRATGIITTVAGVCPSGTTGPAVPEPPA--AE 91
Query: 158 FSNDFELTFVPH 169
+ND E F H
Sbjct: 92 ETNDDEDPFADH 103
>gi|290978674|ref|XP_002672060.1| predicted protein [Naegleria gruberi]
gi|284085634|gb|EFC39316.1| predicted protein [Naegleria gruberi]
Length = 1219
Score = 55.8 bits (133), Expect = 2e-05, Method: Composition-based stats.
Identities = 46/204 (22%), Positives = 86/204 (42%), Gaps = 18/204 (8%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEIN------PHSVIDRPGSSDLIVLDSSRSAFYTLSF 74
++++G +L T T +G +E PH V + ++ + D+ + +
Sbjct: 388 ILTNGTILTIAGTGTKGYNGDGIEATSAQLNTPHDVALNLATGEIYIADTENNRIRKIL- 446
Query: 75 PLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVD-MKGNIYVADKSNHVIRKIT 131
+ +AG G GY+ DG A ++P +D G ++ +D N IR+++
Sbjct: 447 ---TNGTITTIAGTGDYGYNGDGIMAVDAWLNEPSGVEIDSTSGEVFFSDTENFRIRRVS 503
Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP- 190
N G+ T G K + DG A + + + E+ +VP + +D GN IR+ +
Sbjct: 504 NSGIITTIAGTGKSKFNGDGMATDTNLATPTEIQYVPSTTEIYFADSGNNRIRKFTINGM 563
Query: 191 ----EDCSKSSQSGSALGAVSVWV 210
S S +G + A W+
Sbjct: 564 MTTIAGTSTSGYNGDNMPATRAWL 587
Score = 50.8 bits (120), Expect = 5e-04, Method: Composition-based stats.
Identities = 38/116 (32%), Positives = 58/116 (50%), Gaps = 16/116 (13%)
Query: 82 VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITNLG-VTT 137
+ +AG G+ GY+ D SA+ + P +D G I+++D NH IRKI G +TT
Sbjct: 114 ITTIAGTGIAGYNGDDKSADSAQLNTPSGIVIDPNNGEIFISDSKNHRIRKILQNGKITT 173
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQINL 188
IAG G E +G A F+ ++P+ L I+D N IR+++L
Sbjct: 174 IAGTG---EAGYNGDGIEAKFAK----LYLPNGIDLYEKELFIADQNNHRIRKVSL 222
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 46/168 (27%), Positives = 69/168 (41%), Gaps = 8/168 (4%)
Query: 27 LLEDGYTVTTVIDGHQLEIN----PHSVIDRPGSSDLIVLDS---SRSAFYTLSFPLSEE 79
+LE G VT G N S ++ P S L L S S Y + +E
Sbjct: 278 ILESGIIVTIAGTGEAGFDNILNATQSKVNHPNSVTLNELGEVFISDSQNYRVRKITTES 337
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
++ + G+G + Y + P+ + G + AD SNH IRKI TN + TI
Sbjct: 338 GIITTIVGNGFEKYCTDLASNTPLFYPRGIISNENGEFFYADSSNHCIRKILTNGTILTI 397
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG G+K A +A + ++ + I+D N IR+I
Sbjct: 398 AGTGTKGYNGDGIEATSAQLNTPHDVALNLATGEIYIADTENNRIRKI 445
Score = 42.7 bits (99), Expect = 0.15, Method: Composition-based stats.
Identities = 44/155 (28%), Positives = 69/155 (44%), Gaps = 12/155 (7%)
Query: 39 DGHQLEIN---PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS- 94
DG + N P + P ++++ DS + + ++ +AG GY+
Sbjct: 522 DGMATDTNLATPTEIQYVPSTTEIYFADSGNNRIRKFTI----NGMMTTIAGTSTSGYNG 577
Query: 95 DGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADG 151
D P + A + P D K N +Y+AD SNH IRKI TN +TTIAG G +
Sbjct: 578 DNMPATRAWLNFPVGVTYDPKTNQVYIADLSNHRIRKILTNGTITTIAGTGKGGYNGNNL 637
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A + + + P+ ++ SD N +IR I
Sbjct: 638 TALSTQVNGPHGMILQPN-GDVIFSDSTNNVIRII 671
Score = 40.0 bits (92), Expect = 0.95, Method: Composition-based stats.
Identities = 33/112 (29%), Positives = 52/112 (46%), Gaps = 5/112 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
++ E +++ LAG QG DG + P A + + +AD +NH IRK+ TN
Sbjct: 1 MTSEQIIRNLAG---QGICDGHLSTNAPVNPSDCAF-LDEEMLIADATNHRIRKVFTNGT 56
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G A +A +N + + + I+D N IR+I
Sbjct: 57 IVTIAGNGFAGYNGDGLDATSAQLNNPVGIYVDTNSREVYIADSNNHRIRKI 108
>gi|116620348|ref|YP_822504.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223510|gb|ABJ82219.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 915
Score = 55.8 bits (133), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/114 (31%), Positives = 58/114 (50%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + V+ + G+G G S +G SAR P+ VD GN+Y+ D N+ +R++T
Sbjct: 566 VTADGVISTVVGNGTVGASGDEGAAASARLASPRGLTVDDNGNLYIGDTGNNRVRQVTAD 625
Query: 134 GVT-TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G+ TIAG G GPA +A+ L F+ AL +D N +R++
Sbjct: 626 GIMHTIAGTGPAGFAGDGGPAADAALDGPAGL-FLDGSGALYFADSNNNRVRRL 678
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 47/139 (33%), Positives = 73/139 (52%), Gaps = 10/139 (7%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKS 108
+DR G+ L ++D+S L+ + V+ +AG+G GY+ + A+ P +
Sbjct: 491 LDRGGT--LYIVDTSNHRV----LRLAADGTVQTVAGNGSGGYAGDGGSARFAQLRVPGA 544
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
A D KGN+Y+AD NH IRK+T GV +T+ G G+ +G A +A ++ LT V
Sbjct: 545 CAFDGKGNLYIADTGNHSIRKVTADGVISTVVGNGTVGASGDEGAAASARLASPRGLT-V 603
Query: 168 PHICALLISDHGNQLIRQI 186
L I D GN +RQ+
Sbjct: 604 DDNGNLYIGDTGNNRVRQV 622
Score = 50.4 bits (119), Expect = 7e-04, Method: Composition-based stats.
Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
+ +AG G G DG P +A +F A D GN+YVAD +NH +RK++ G VTTIA
Sbjct: 29 IDTVAGSGRIG--DGGPATAAQFSDMSGIAADRLGNLYVADTNNHRVRKVSLGGTVTTIA 86
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G G A NA + + L + + ++D GN+ +R+I
Sbjct: 87 GTGIAGFSGDGGLAVNARLNLPYGLA-LDDFGNIYVADLGNERVRRI 132
Score = 45.8 bits (107), Expect = 0.021, Method: Composition-based stats.
Identities = 29/67 (43%), Positives = 41/67 (61%), Gaps = 5/67 (7%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
V +AG G+ G+S G+ G +AR + P A+D GNIYVAD N +R+I + G + T
Sbjct: 82 VTTIAGTGIAGFS-GDGGLAVNARLNLPYGLALDDFGNIYVADLGNERVRRIGSDGAIVT 140
Query: 138 IAGGGSK 144
IAG G +
Sbjct: 141 IAGNGRR 147
Score = 45.1 bits (105), Expect = 0.035, Method: Composition-based stats.
Identities = 34/114 (29%), Positives = 58/114 (50%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + + +AG+G + S DG P P++ A+D KGN+Y+A+ H +R+++
Sbjct: 132 IGSDGAIVTIAGNGRRASSPDGAGPLDTSLLSPRNIAIDGKGNLYIAEFEGHRVRRLSAD 191
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + T+AG G G + A + L F AL I+D GN ++R+I
Sbjct: 192 GRLVTVAGTGIAGLGGDGFASVKAQVNYPAGLAF-DRAGALYIADSGNNVVRKI 244
Score = 42.0 bits (97), Expect = 0.25, Method: Composition-based stats.
Identities = 46/141 (32%), Positives = 65/141 (46%), Gaps = 20/141 (14%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVD 112
G+ +++V D AF ++ V++ +AG G G S DG SA PK +D
Sbjct: 437 GAGNVMVAD----AFNHRVVAINALHVLRPVAGTGTGGVSADGTAALSAALRGPKGVCLD 492
Query: 113 MKGNIYVADKSNH-VIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC 171
G +Y+ D SNH V+R + V T+AG GS G A+ A VP C
Sbjct: 493 RGGTLYIVDTSNHRVLRLAADGTVQTVAGNGSGGYAGDGGSARFAQLR-------VPGAC 545
Query: 172 A------LLISDHGNQLIRQI 186
A L I+D GN IR++
Sbjct: 546 AFDGKGNLYIADTGNHSIRKV 566
Score = 41.2 bits (95), Expect = 0.48, Method: Composition-based stats.
Identities = 35/110 (31%), Positives = 53/110 (48%), Gaps = 5/110 (4%)
Query: 82 VKRLAGDG-VQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTI 138
+ +AGDG DG SA +P + A+D KGN+Y+AD +R++ GV TT+
Sbjct: 347 IDTVAGDGYAHSVGDGAAATSAILHQPSAIALDSKGNLYIADTGTQRVREVLPNGVMTTL 406
Query: 139 AGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
AG G D PA A ++ L V ++++D N + IN
Sbjct: 407 AGTGVVGRDTGDAVPAGIAPLNSPMGLA-VDGAGNVMVADAFNHRVVAIN 455
Score = 40.4 bits (93), Expect = 0.71, Method: Composition-based stats.
Identities = 29/76 (38%), Positives = 34/76 (44%), Gaps = 3/76 (3%)
Query: 81 VVKRLAGDGVQGYSDGEP---GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
V+ LAG GV G G+ G A + P AVD GN+ VAD NH + I L V
Sbjct: 402 VMTTLAGTGVVGRDTGDAVPAGIAPLNSPMGLAVDGAGNVMVADAFNHRVVAINALHVLR 461
Query: 138 IAGGGSKKEGRADGPA 153
G ADG A
Sbjct: 462 PVAGTGTGGVSADGTA 477
>gi|428177953|gb|EKX46830.1| hypothetical protein GUITHDRAFT_70199, partial [Guillardia theta
CCMP2712]
Length = 305
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/165 (31%), Positives = 76/165 (46%), Gaps = 12/165 (7%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPL----SEESVVKRLAGDGVQGYSDGEPGSAR 102
P V P S L+V DS ++ S + V +AG+G G +DG +A+
Sbjct: 41 PSDVALSPDESFLVVADSRNHVIRRINKGAQGWSSNDVTVTTIAGNGQPGLADGSATTAQ 100
Query: 103 FDKPKSFAVDMKGN-IYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRADGPAQNASFS 159
F KP + G + VAD +N+ IRK+T V TIAG +G DG + F
Sbjct: 101 FQKPLGVDISPDGTWVAVADSANNRIRKVTLATGNVETIAG---FTQGYQDGVGTLSKFW 157
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
++F P L ++D N IR I+L S +++SG LG
Sbjct: 158 LPASVSFHPQGTGLAVADTNNNRIRWISL--PSYSVTTKSGRCLG 200
Score = 37.7 bits (86), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 3/106 (2%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT--NLGVTTIAGGGSKKEG 147
QGY DG ++F P S + +G + VAD +N+ IR I+ + VTT +G G
Sbjct: 144 QGYQDGVGTLSKFWLPASVSFHPQGTGLAVADTNNNRIRWISLPSYSVTTKSGRCLGIAG 203
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
A+G + A++++ + P + I+D N +IR+ + D
Sbjct: 204 YANGLGKTAAYNSPQYVRVSPDGVWIYIADGQNNVIRRARVDNGDV 249
>gi|332878080|ref|ZP_08445810.1| NHL repeat protein [Capnocytophaga sp. oral taxon 329 str. F0087]
gi|357047673|ref|ZP_09109271.1| NHL repeat protein [Paraprevotella clara YIT 11840]
gi|332684042|gb|EGJ56909.1| NHL repeat protein [Capnocytophaga sp. oral taxon 329 str. F0087]
gi|355529361|gb|EHG98795.1| NHL repeat protein [Paraprevotella clara YIT 11840]
Length = 461
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 57/114 (50%), Gaps = 2/114 (1%)
Query: 77 SEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+ E K AG G G++DG A+F P+ +D + N+Y+AD N+ IRKIT GV
Sbjct: 349 TREGNFKIYAGMQGRPGHADGLASDAQFHSPRQLVLDEEENLYIADSENNCIRKITPEGV 408
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ G K G DG A F+ + L + + + D N +RQ++++
Sbjct: 409 VSTVIGIPGKSGYKDGTPDVALFTQPWGLA-IDSEGIIYVGDKDNLCVRQLSIE 461
>gi|29347940|ref|NP_811443.1| hypothetical protein BT_2530 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383121619|ref|ZP_09942326.1| hypothetical protein BSIG_4948 [Bacteroides sp. 1_1_6]
gi|29339842|gb|AAO77637.1| conserved hypothetical protein, with a conserved domain
[Bacteroides thetaiotaomicron VPI-5482]
gi|251837946|gb|EES66035.1| hypothetical protein BSIG_4948 [Bacteroides sp. 1_1_6]
Length = 441
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 1/104 (0%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
KRL G + DG A F+ PK D GN++VAD NH IR I+ + T G
Sbjct: 336 KRLNAPGGSAFRDGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMISADNIVTTVAGQ 395
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G DG + F N + + V + I+D GN IR++
Sbjct: 396 PGVAGYKDGGPVESLFKNPWGVA-VNEQGDIYIADWGNARIRKL 438
>gi|365861024|ref|ZP_09400809.1| alkyl hydroperoxide reductase [Streptomyces sp. W007]
gi|364009547|gb|EHM30502.1| alkyl hydroperoxide reductase [Streptomyces sp. W007]
Length = 560
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 7/112 (6%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
+V D++R L ++ V+R G G +G SDG P ARF +P+ AV G +
Sbjct: 143 FLVSDTTRHRLVELD---ADGETVRRHFGTGERGLSDGGPDEARFSEPQGLAVLPDGRVA 199
Query: 119 VADKSNHVIRKIT-NLGVT-TIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
VAD NH +R + GVT T+AG G + + GPA+ S+ +++ +
Sbjct: 200 VADTVNHALRALDLTTGVTSTLAGTGRQWWQGTPTSGPAREVDLSSPWDVAW 251
>gi|421111981|ref|ZP_15572448.1| hypothetical protein LEP1GSC071_4063 [Leptospira santarosai str.
JET]
gi|410802632|gb|EKS08783.1| hypothetical protein LEP1GSC071_4063 [Leptospira santarosai str.
JET]
Length = 356
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
GV G+ +G+ ++ F P +D + N+YV + NH IRKI N G V+T++GG
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---IL 217
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + +T+ +LL++D + IR+I+LK + + S+ G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 78/203 (38%), Gaps = 48/203 (23%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + + I LD Y ++ VV AG G++ DG +A F
Sbjct: 20 NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
P VD GNI+V+D+ ++IRKI G VTT+
Sbjct: 79 PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTSLLLEDPSGIKFDPITGDKYVSCK 138
Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
AG S +G +G N+SF F + + L + + GN
Sbjct: 139 DSAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197
Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
IR+INL S S+ SG LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGILG 218
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 42/81 (51%), Gaps = 6/81 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G+ GY DG+ SA+F P D K + + VAD +H IRKI N V+T
Sbjct: 209 VSTLSG-GILGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267
Query: 139 AGGGSKKEGRADGPAQNASFS 159
G G + DG NAS +
Sbjct: 268 VGNGIGAD--IDGNGLNASLN 286
>gi|290978943|ref|XP_002672194.1| predicted protein [Naegleria gruberi]
gi|284085769|gb|EFC39450.1| predicted protein [Naegleria gruberi]
Length = 1483
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 60/131 (45%), Gaps = 6/131 (4%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S+ IV+ S++ L F L EE + LAG G+ D P P +
Sbjct: 733 STKYIVISDSKNN--RLRFYLKEEKTIYTLAG----GFGDNGPSENGIGTPFGVTISPNN 786
Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
IYV+D N++IR+I N +TT+AG G+ D A AS ++ + P ++
Sbjct: 787 EIYVSDVLNNLIRRIKNGKITTVAGSGAYGYNGDDISALKASLASPSNIVVNPLTNDIIF 846
Query: 176 SDHGNQLIRQI 186
SD N IR I
Sbjct: 847 SDTNNHRIRII 857
>gi|390957686|ref|YP_006421443.1| gluconolactonase [Terriglobus roseus DSM 18391]
gi|390412604|gb|AFL88108.1| gluconolactonase [Terriglobus roseus DSM 18391]
Length = 738
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 68/128 (53%), Gaps = 6/128 (4%)
Query: 76 LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S V+ +AG G QGY G +A + P + AV G++Y AD NH IR+I +
Sbjct: 96 VSSGGVLAVVAGTGHQGYVGDGGAATAAELNAPTAVAVAPDGSVYFADSGNHCIRRIASG 155
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
+TT+AG G+ G GPA A F + L F +L ++D GN+ +R+I P
Sbjct: 156 VITTVAGNGAPGFGGDGGPAMVARFRSPGGLAFAAD-GSLYVADTGNRRVRKI---PPGG 211
Query: 194 SKSSQSGS 201
S S+ +G+
Sbjct: 212 SVSTIAGT 219
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 51/112 (45%), Gaps = 1/112 (0%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
L S V LAG G G + G P+ A D +GN+YVAD NH I ++++ G
Sbjct: 41 LDPGSRVGTLAGSGRDGRTVAGTAQDVALGWPRGLAYDHEGNLYVADSRNHQIDRVSSGG 100
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V + G + DG A A+ N V ++ +D GN IR+I
Sbjct: 101 VLAVVAGTGHQGYVGDGGAATAAELNAPTAVAVAPDGSVYFADSGNHCIRRI 152
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 45/87 (51%), Gaps = 2/87 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
ARF P A G++YVAD N +RKI G V+TIAG G++ + G A ASF
Sbjct: 178 ARFRSPGGLAFAADGSLYVADTGNRRVRKIPPGGSVSTIAGTGTEDDAGDGGVATAASFR 237
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
+ L +P LLI+D +R +
Sbjct: 238 SPGALQVLPD-GRLLIADREAYRVRAL 263
>gi|302821385|ref|XP_002992355.1| hypothetical protein SELMODRAFT_6804 [Selaginella moellendorffii]
gi|300139771|gb|EFJ06505.1| hypothetical protein SELMODRAFT_6804 [Selaginella moellendorffii]
Length = 78
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 48/77 (62%), Gaps = 4/77 (5%)
Query: 79 ESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
+S ++ AG G GY DG +RF++P+S A+ G ++V D +N IRKI+ G V
Sbjct: 3 DSSLEAFAGSHVGESGYVDGPAAKSRFNRPQSLAICDNGAVFV-DTTNLAIRKISKNGEV 61
Query: 136 TTIAGGGSKKEGRADGP 152
TTIAGG S++ G AD P
Sbjct: 62 TTIAGGSSRRPGIADSP 78
>gi|290971688|ref|XP_002668616.1| predicted protein [Naegleria gruberi]
gi|284082092|gb|EFC35872.1| predicted protein [Naegleria gruberi]
Length = 348
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG+G +G+S +G +A+ + P V +Y+AD SNHVIRKI+ G + TIAG
Sbjct: 143 IAGNGTKGFSGDNGPATNAQLNGPAGVFVS-NNEVYIADYSNHVIRKISQNGTIVTIAGN 201
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +G A NA N TFV + ISD N +IR+I
Sbjct: 202 GKPGFSGDNGLATNAQLYNP-SGTFVSSNNEVYISDCFNHVIRKI 245
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 58/114 (50%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S+ + +AG+G G+S +G +A+ P V +Y++D NHVIRKI+
Sbjct: 189 ISQNGTIVTIAGNGKPGFSGDNGLATNAQLYNPSGTFVSSNNEVYISDCFNHVIRKISQN 248
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + TIAG G +GPA NA + + FV + ISD N IR+I
Sbjct: 249 GTIVTIAGNGKGGFSGDNGPATNAQLYSPLGV-FVSSDNEVYISDCFNHRIRKI 301
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 59/112 (52%), Gaps = 5/112 (4%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
+ + +AG+G G+ +G +A+ + P++ V +Y+AD N IRK+ N
Sbjct: 80 QNGTIVTIAGNGKPGFGGDNGLATNAQLNYPRNVYVSSNNEVYIADFCNQRIRKVLQNGN 139
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+K +GPA NA N FV + + I+D+ N +IR+I
Sbjct: 140 IITIAGNGTKGFSGDNGPATNAQL-NGPAGVFVSN-NEVYIADYSNHVIRKI 189
>gi|345008031|ref|YP_004810385.1| redoxin domain-containing protein [Streptomyces violaceusniger Tu
4113]
gi|344034380|gb|AEM80105.1| Redoxin domain protein [Streptomyces violaceusniger Tu 4113]
Length = 615
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 63/117 (53%), Gaps = 8/117 (6%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
PG + +V D++R L+ ESVV+R+ G G +G +DG ARF +P+ A+
Sbjct: 195 PGGT-FLVSDTTRHQLVELA--ADGESVVRRI-GAGERGLTDGTGERARFSEPQGLALLP 250
Query: 114 KGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
G + VAD NH +R+ VTT+AG G + + +GPA+ S+ +++ +
Sbjct: 251 DGTVAVADTVNHALRRFDPATGEVTTLAGTGRQWWQGSPTEGPAREVDLSSPWDVAW 307
>gi|262195555|ref|YP_003266764.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
gi|262078902|gb|ACY14871.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
Length = 404
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 61/113 (53%), Gaps = 6/113 (5%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRAD 150
G++DG ARFD+P AV + IYVAD +NH IRKIT G V+T+AG G +D
Sbjct: 241 GFADGNGAEARFDRPLGVAV-VDDEIYVADSANHRIRKITLDGEVSTLAGTGVAAS--SD 297
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQL-IRQINLKPEDCSKSSQSGSA 202
G A+F + +T L +++ G+ +R+I+L E + SG A
Sbjct: 298 GSLAEATFDTPYAMTRDSD-GNLYVTELGDSFRVRKIDLVAETVETIAGSGMA 349
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 33/64 (51%), Gaps = 3/64 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV-IRKITNLG--VTTI 138
V LAG GV SDG A FD P + D GN+YV + + +RKI + V TI
Sbjct: 284 VSTLAGTGVAASSDGSLAEATFDTPYAMTRDSDGNLYVTELGDSFRVRKIDLVAETVETI 343
Query: 139 AGGG 142
AG G
Sbjct: 344 AGSG 347
Score = 38.1 bits (87), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 35/83 (42%), Gaps = 15/83 (18%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
LAG G +DG A P AV GN+Y+A+ N IRKIT G V+T+
Sbjct: 86 LAGGEEAGTTDGTRAVATLRNPAGLAVGPNGNVYIAEFDNDRIRKITPEGTVSTL----- 140
Query: 144 KKEGRADGPAQNASFSNDFELTF 166
Q A F F L F
Sbjct: 141 ---------TQAAGFDRPFGLAF 154
>gi|357412450|ref|YP_004924186.1| alkyl hydroperoxide reductase [Streptomyces flavogriseus ATCC
33331]
gi|320009819|gb|ADW04669.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces flavogriseus ATCC 33331]
Length = 603
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 7/112 (6%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
+V DS+R L ++ V+ G G +G++DG ARF +P+ AV G I
Sbjct: 188 FLVSDSTRHRLVELE---ADGETVRGHFGTGERGFADGGREEARFSEPQGLAVLPDGRIA 244
Query: 119 VADKSNHVIRKIT-NLGVTTIAGGGSKK--EGRA-DGPAQNASFSNDFELTF 166
VAD NH IR + GVTT G ++ +G A DGPA S+ +++ +
Sbjct: 245 VADTVNHAIRALDLTTGVTTTLAGTGRQWWQGSATDGPATEVDLSSPWDIAW 296
>gi|290976072|ref|XP_002670765.1| predicted protein [Naegleria gruberi]
gi|284084327|gb|EFC38021.1| predicted protein [Naegleria gruberi]
Length = 854
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 60/112 (53%), Gaps = 5/112 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTT 137
+ +AG G GY +G+ G A P+ AV G +Y++D +N++IRK+ N + T
Sbjct: 73 ISTIAGTGSAGY-NGDGGLAIETMLSSPQGVAVSESGEVYLSDSTNNIIRKVYLNGTIQT 131
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
IAG ++ +GPA N F + L+ P L I+D N ++R+++L
Sbjct: 132 IAGSRTQGYSGDNGPAVNCQFFSPQGLSLSPSNSDLYIADTFNNVVRRLDLN 183
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 3/96 (3%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
SA+ PK V G +++AD N+ IRK+ + G + TIAG S G + +A
Sbjct: 319 SAQLSSPKGLVVSSSGEMFLADSGNNRIRKVFSNGIIVTIAGTSSVGFSGDGGLSTDAQL 378
Query: 159 SNDFELTF--VPHICALLISDHGNQLIRQINLKPED 192
+N L + + + ISD GN IR++ E+
Sbjct: 379 NNPVNLAIRQLRSLSEIYISDAGNHAIRKLTAFCEE 414
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 26/107 (24%), Positives = 46/107 (42%), Gaps = 5/107 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA--RFD 104
P + P +SDL + D+ + L ++ GDG +GY+ G+A F+
Sbjct: 155 PQGLSLSPSNSDLYIADTFNNVVRRLDL---NTGYIRPFVGDGGRGYTSPGDGNAFGWFN 211
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
+ + G + D N++IRK+ G+ ++G G DG
Sbjct: 212 VIRGLCLSPNGELIFTDTENNMIRKVYANGLYDVSGRGQTNSYLGDG 258
>gi|298387816|ref|ZP_06997366.1| hypothetical protein HMPREF9007_04616 [Bacteroides sp. 1_1_14]
gi|298259421|gb|EFI02295.1| hypothetical protein HMPREF9007_04616 [Bacteroides sp. 1_1_14]
Length = 440
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 48/104 (46%), Gaps = 1/104 (0%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
KRL G + DG A F+ PK D GN++VAD NH IR ++ + T G
Sbjct: 335 KRLNAPGGSAFRDGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMVSADNIVTTVAGQ 394
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G DG + F N + + V + I+D GN IR++
Sbjct: 395 PGVAGYKDGGPVESLFKNPWGVA-VNEQGDIYIADWGNARIRKL 437
>gi|289664052|ref|ZP_06485633.1| putative NHL repeat protein [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 263
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 44/84 (52%), Gaps = 3/84 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
L+ + V LAGDGV G +G RF+ P AVD +G +YVAD N I I G
Sbjct: 48 LTPQGKVTTLAGDGVAGDRNGAAAQVRFNGPVGVAVDAQGRVYVADTYNDRIGVIETDGQ 107
Query: 135 VTTIAGGGSKKEGRADGPAQNASF 158
V T+AGGG G ADG A F
Sbjct: 108 VRTLAGGG--LPGMADGIGTQAWF 129
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 39/59 (66%), Gaps = 2/59 (3%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
LAG G +G++DG +A F+ P +D GN+Y+AD NH IRK+T G VTT+AG G
Sbjct: 4 LAG-GREGFADGIGVAAAFNTPSGIMLDTAGNLYIADTGNHAIRKLTPQGKVTTLAGDG 61
>gi|440714668|ref|ZP_20895245.1| NHL repeat containing protein [Rhodopirellula baltica SWK14]
gi|436440433|gb|ELP33756.1| NHL repeat containing protein [Rhodopirellula baltica SWK14]
Length = 283
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-----VTTIAGGGSKKE 146
GY DG SARF+K + +D + +Y++D +NH +R++ V T AG G
Sbjct: 38 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTTDGEWMVDTYAGQGKAGP 97
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ ++A+F +T LLI+D GNQ++R I+L
Sbjct: 98 ATNNVDRRDATFHEPISVTLDAEGNRLLIADIGNQVVRSIDL 139
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
+ T + G ++ +D G+ L+VL+ + + + + LAG G +G
Sbjct: 143 LVTTLAGRTSKLKDPRAVDLDGNRRLLVLERNGNRLRRVE----SNGDITTLAGSGKKGT 198
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+DG+ A F+ PK V G +Y+AD NH++R
Sbjct: 199 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 233
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 52/88 (59%), Gaps = 8/88 (9%)
Query: 100 SARFDKPKSFAVDMKGN--IYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNA 156
+++ P+ AVD+ GN + V +++ + +R++ +N +TT+AG G K+G ADG A+ A
Sbjct: 151 TSKLKDPR--AVDLDGNRRLLVLERNGNRLRRVESNGDITTLAGSG--KKGTADGDAKQA 206
Query: 157 SFSNDFELTFVPHICALLISDHGNQLIR 184
SF+ + P + I+D N L+R
Sbjct: 207 SFNGPKHMDVAPD-GRVYIADDVNHLVR 233
>gi|417302109|ref|ZP_12089222.1| NHL repeat containing protein [Rhodopirellula baltica WH47]
gi|327541551|gb|EGF28082.1| NHL repeat containing protein [Rhodopirellula baltica WH47]
Length = 351
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-----VTTIAGGGSKKE 146
GY DG SARF+K + +D + +Y++D +NH +R++ V T AG G +
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTSDGEWMVDTYAGQGKEGP 165
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ ++A+F +T LLI+D GNQ++R I+
Sbjct: 166 ATNNVDRRDATFHEPISVTLDAEGNRLLIADIGNQVVRSIDF 207
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
+ T + G + ++ +D G+ L+VL+ + + + + LAG G +G
Sbjct: 211 LVTTLAGRKSKLKDPRAVDLDGNGRLLVLERNGNRLRRVE----SNGDITTLAGSGKKGT 266
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+DG+ A F+ PK V G +Y+AD NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301
Score = 43.5 bits (101), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 49/85 (57%), Gaps = 4/85 (4%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
++ P++ +D G + V +++ + +R++ +N +TT+AG G K+G ADG A+ ASF+
Sbjct: 220 SKLKDPRAVDLDGNGRLLVLERNGNRLRRVESNGDITTLAGSG--KKGTADGDAKQASFN 277
Query: 160 NDFELTFVPHICALLISDHGNQLIR 184
+ P + I+D N L+R
Sbjct: 278 GPKHMDVAPD-GRVYIADDVNHLVR 301
>gi|372267905|ref|ZP_09503953.1| hypothetical protein AlS89_08394 [Alteromonas sp. S89]
Length = 2227
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 55/109 (50%), Gaps = 4/109 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
++ +AG G +G+S DG P + A P +D +GN+Y+ D N IR++ G +TT
Sbjct: 1056 IISTVAGSGARGFSGDGGPATDASLSAPSGITLDPEGNLYIVDSGNRRIRRVGVDGRITT 1115
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
IAG G K G A A F + + + + ISD G +R+I
Sbjct: 1116 IAGNGGYKNTGDGGSALEAGFDDPLGIAYAAD-GGIYISDSGEGGVRRI 1163
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 54/109 (49%), Gaps = 8/109 (7%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
V+ AGDG G A + P G+IY+AD NH IR++ G ++T+A
Sbjct: 1008 VIDTFAGDG------GPATDASLNAPYDVICGPHGSIYIADSRNHRIRRVDVNGIISTVA 1061
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G G++ GPA +AS S +T P L I D GN+ IR++ +
Sbjct: 1062 GSGARGFSGDGGPATDASLSAPSGITLDPE-GNLYIVDSGNRRIRRVGV 1109
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 59/121 (48%), Gaps = 9/121 (7%)
Query: 71 TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
T++ L+ V+ +GDG + A D P G+IYVAD N +R+I
Sbjct: 941 TIAGTLASNVSVRGFSGDGAIATA------ATMDHPYGLDFCDDGSIYVADTGNDRVRRI 994
Query: 131 TNLGV-TTIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
GV TTIAG D GPA +AS + +++ PH ++ I+D N IR++++
Sbjct: 995 DRRGVITTIAGSEVIDTFAGDGGPATDASLNAPYDVICGPH-GSIYIADSRNHRIRRVDV 1053
Query: 189 K 189
Sbjct: 1054 N 1054
Score = 37.0 bits (84), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 57/117 (48%), Gaps = 11/117 (9%)
Query: 82 VKRLAGDGVQGYSD-GEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
+ +AG+G GY + G+ GSA FD P A G IY++D +R+I T+ +
Sbjct: 1113 ITTIAGNG--GYKNTGDGGSALEAGFDDPLGIAYAADGGIYISDSGEGGVRRIGTDGTIV 1170
Query: 137 TIAGGGSKKEGRA--DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
T+AG S G GPA A + ++ P +L + D N IR+++ PE
Sbjct: 1171 TVAGNKSPYWGFGGDGGPAIQADMTGVTDVAIGPE-GSLYLVDAYNFRIRKVS-NPE 1225
>gi|290985545|ref|XP_002675486.1| predicted protein [Naegleria gruberi]
gi|284089082|gb|EFC42742.1| predicted protein [Naegleria gruberi]
Length = 819
Score = 55.1 bits (131), Expect = 3e-05, Method: Composition-based stats.
Identities = 41/112 (36%), Positives = 56/112 (50%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++ + +AG G +G+S DG+P SA+ P S V ++ AD N+ IRKI G
Sbjct: 448 KDGTIVTIAGTGEEGFSGDGDPATSAQLSHPCSVFVSSTNEVFFADSGNYRIRKILRNGN 507
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G K PA NA S + FV + SD GN IR+I
Sbjct: 508 IVTIAGTGEKGYSGDGRPAINAQISY-VQNIFVSQNDEIYFSDFGNHRIRKI 558
Score = 54.3 bits (129), Expect = 6e-05, Method: Composition-based stats.
Identities = 40/105 (38%), Positives = 55/105 (52%), Gaps = 4/105 (3%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG G +G+S DG P SA+ D P V +Y+ D +NH IRKI G + TIAG
Sbjct: 567 IAGTGEKGFSGDGGPATSAQLDSPCGVFVSNNDEVYIVDYNNHRIRKILRNGIINTIAGT 626
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + GPA NA ++ + FV + I + GN IR+I
Sbjct: 627 GEEGFSGDGGPAINAQVNHPCGV-FVSSTNEVYIMNSGNYRIRKI 670
Score = 50.8 bits (120), Expect = 7e-04, Method: Composition-based stats.
Identities = 39/105 (37%), Positives = 54/105 (51%), Gaps = 4/105 (3%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG G +GYS DG P +A+ ++ V IY +D NH IRKI G + TIAG
Sbjct: 511 IAGTGEKGYSGDGRPAINAQISYVQNIFVSQNDEIYFSDFGNHRIRKILRNGTIVTIAGT 570
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G K GPA +A + + FV + + I D+ N IR+I
Sbjct: 571 GEKGFSGDGGPATSAQLDSPCGV-FVSNNDEVYIVDYNNHRIRKI 614
Score = 48.5 bits (114), Expect = 0.003, Method: Composition-based stats.
Identities = 38/109 (34%), Positives = 55/109 (50%), Gaps = 4/109 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTT 137
++ +AG G +G+S DG P +A+ + P V +Y+ + N+ IRKI N +TT
Sbjct: 619 IINTIAGTGEEGFSGDGGPAINAQVNHPCGVFVSSTNEVYIMNSGNYRIRKILRNANITT 678
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
IAG G K G A NA S + FV + I+D N IR+I
Sbjct: 679 IAGTGVKGYSGDGGLAINAQISY-VDNIFVSRNDEVYIADTENHRIRKI 726
Score = 46.2 bits (108), Expect = 0.015, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 51/96 (53%), Gaps = 11/96 (11%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ +AG GV+GYS G+ G +A+ ++ V +Y+AD +NH IRKI G + T
Sbjct: 293 ITTIAGTGVKGYS-GDGGLAINAQISYVENIFVSQNDEVYIADTNNHRIRKILKDGTIET 351
Query: 138 IAGGGSKKEGRADGP---AQNASFSNDFELTFVPHI 170
IAG G K G D P + + ND+ T +P I
Sbjct: 352 IAGNGEKGFG-GDSPFDFSSHPHIGNDY--TIIPKI 384
Score = 45.4 bits (106), Expect = 0.025, Method: Composition-based stats.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 4/105 (3%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG G +GYS DG P +A+ + V +Y +D NH IRKI G + TIAG
Sbjct: 128 IAGTGEEGYSGDGGPAINAQISAVNNIFVSQNDEVYFSDFRNHRIRKILRNGTIVTIAGT 187
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + GPA NA + + FV + + I D+ + IR++
Sbjct: 188 GEQGFSGDGGPAINAKLNTPCGV-FVSNNDEVYIVDYKSHRIRKM 231
Score = 45.1 bits (105), Expect = 0.031, Method: Composition-based stats.
Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 4/105 (3%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG G QG+S DG P +A+ + P V +Y+ D +H IRK+ G + TIAG
Sbjct: 184 IAGTGEQGFSGDGGPAINAKLNTPCGVFVSNNDEVYIVDYKSHRIRKMLQDGTIITIAGT 243
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + G GPA +A S+ + FV + I+D N IR+I
Sbjct: 244 GEQGFGGDGGPATSAQLSHPCGV-FVSSTNEVYITDSYNYRIRKI 287
Score = 43.9 bits (102), Expect = 0.068, Method: Composition-based stats.
Identities = 32/88 (36%), Positives = 43/88 (48%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
SA+F+ P V +Y++D N+ IRKI G + TIAG G + GPA NA
Sbjct: 89 SAQFNHPCDVFVSSTNEVYISDFGNYRIRKILRNGNIVTIAGTGEEGYSGDGGPAINAQI 148
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
S + FV + SD N IR+I
Sbjct: 149 SAVNNI-FVSQNDEVYFSDFRNHRIRKI 175
Score = 43.5 bits (101), Expect = 0.10, Method: Composition-based stats.
Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 11/96 (11%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ +AG GV+GYS G+ G +A+ + V +Y+AD NH IRKI G + T
Sbjct: 676 ITTIAGTGVKGYS-GDGGLAINAQISYVDNIFVSRNDEVYIADTENHRIRKILRNGTIKT 734
Query: 138 IAGGGSKKEGRADGP---AQNASFSNDFELTFVPHI 170
IAG G + G D P + + ND+ T +P I
Sbjct: 735 IAGNGEEGFG-GDSPFDFSSHPHIGNDY--TIIPKI 767
Score = 42.7 bits (99), Expect = 0.17, Method: Composition-based stats.
Identities = 33/88 (37%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
SA+ P V +Y+ D N+ IRKI G +TTIAG G K G A NA
Sbjct: 257 SAQLSHPCGVFVSSTNEVYITDSYNYRIRKILRNGNITTIAGTGVKGYSGDGGLAINAQI 316
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
S E FV + I+D N IR+I
Sbjct: 317 SY-VENIFVSQNDEVYIADTNNHRIRKI 343
Score = 40.0 bits (92), Expect = 1.0, Method: Composition-based stats.
Identities = 29/87 (33%), Positives = 41/87 (47%), Gaps = 2/87 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
A + P V +Y+AD NH +RKI G + TIAG G + PA +A S
Sbjct: 417 AMLNCPGGVFVAPNDEVYMADCQNHRVRKILKDGTIVTIAGTGEEGFSGDGDPATSAQLS 476
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
+ + FV + +D GN IR+I
Sbjct: 477 HPCSV-FVSSTNEVFFADSGNYRIRKI 502
>gi|408828763|ref|ZP_11213653.1| hypothetical protein SsomD4_16351 [Streptomyces somaliensis DSM
40738]
Length = 614
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 62/115 (53%), Gaps = 7/115 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +L+V D++R ++ E+VV+R GDG +G DG SARF +P+ G
Sbjct: 197 SGNLLVSDTTRHRLVEVA--PDGETVVRRY-GDGRRGLVDGPAESARFSEPQGMCALPDG 253
Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
I VAD NH +R + V+T+AG G++ + GPA+ + S+ +++ +
Sbjct: 254 RIVVADTVNHALRALDPETGAVSTLAGTGAQWMQGSPTSGPAREVALSSPWDVAW 308
Score = 42.4 bits (98), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 63/135 (46%), Gaps = 5/135 (3%)
Query: 68 AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP-KSFAVDMKGNIYVADKSNHV 126
A TL L E K G Y EP + P K+ A++ GN+ V+D + H
Sbjct: 151 AIGTLVAELEAEHGAKGTLRRGDGPYVPPEPTATHLRFPGKALALE-SGNLLVSDTTRHR 209
Query: 127 IRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ ++ G T + G + G DGPA++A FS + +P ++++D N +R
Sbjct: 210 LVEVAPDGETVVRRYGDGRRGLVDGPAESARFSEPQGMCALPD-GRIVVADTVNHALRA- 267
Query: 187 NLKPEDCSKSSQSGS 201
L PE + S+ +G+
Sbjct: 268 -LDPETGAVSTLAGT 281
>gi|159471860|ref|XP_001694074.1| predicted protein [Chlamydomonas reinhardtii]
gi|158277241|gb|EDP03010.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1591
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 60/125 (48%), Gaps = 10/125 (8%)
Query: 73 SFPLSEESVVKRLAGDG---------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
S +S++ ++R+A DG + G+ DG +ARF + AVD +GN Y +D S
Sbjct: 1221 SLVVSDQHCLRRVASDGFVTTIAGSSMPGHLDGPAATARFYNLRGVAVDGEGNCYCSDSS 1280
Query: 124 NHVIRKITNL-GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
NH +R + G+ + G + G DG A F N + +L ++D N
Sbjct: 1281 NHCVRLLHAADGMVSTFAGSPGQAGFRDGAGTEARFRNPCGIAINLQDGSLAVADAENNR 1340
Query: 183 IRQIN 187
+R+I+
Sbjct: 1341 LRRID 1345
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 48/84 (57%), Gaps = 8/84 (9%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKITNLG-VT 136
E+VV LAG G G+ DG A+F P AVD + ++ V+D+ H +R++ + G VT
Sbjct: 1185 EAVV--LAGGGGYGHRDGPGRKAKFACPMYLAVDARDSSLVVSDQ--HCLRRVASDGFVT 1240
Query: 137 TIAGGGSKKEGRADGPAQNASFSN 160
TIA GS G DGPA A F N
Sbjct: 1241 TIA--GSSMPGHLDGPAATARFYN 1262
>gi|344923600|ref|ZP_08777061.1| NHL repeat containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 1226
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 64/135 (47%), Gaps = 8/135 (5%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
GS + V DS+ + ++ V LAG GV Y++G +ARF P+ VD+
Sbjct: 35 GSGTVYVADSNNNVIK----KITPAGTVSVLAGSGVSDYAEGTGTAARFSYPEGVGVDVA 90
Query: 115 G-NIYVADKSNHVIRKI-TNLGVTT-IAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC 171
G +YVAD N VIRKI T+ VT+ +AG DG A F+ + V
Sbjct: 91 GTTVYVADSQNGVIRKIDTSTRVTSLVAGSPGALNAHVDGTYTTARFAYPTSVA-VDSSS 149
Query: 172 ALLISDHGNQLIRQI 186
L I D N IR+I
Sbjct: 150 NLYIGDTLNHCIRKI 164
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 32/50 (64%), Gaps = 3/50 (6%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAGGG 142
DG +ARF P S AVD N+Y+ D NH IRKI N+ VTT+AGGG
Sbjct: 129 DGTYTTARFAYPTSVAVDSSSNLYIGDTLNHCIRKIAPGNV-VTTLAGGG 177
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 50/107 (46%), Gaps = 3/107 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
S F+ AVD G +YVAD +N+VI+KIT G V+ +AG G G A S+
Sbjct: 22 SGGFNFASGVAVDGSGTVYVADSNNNVIKKITPAGTVSVLAGSGVSDYAEGTGTAARFSY 81
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
+ + ++D N +IR+I+ S + S AL A
Sbjct: 82 PEGVGVDVAG--TTVYVADSQNGVIRKIDTSTRVTSLVAGSPGALNA 126
>gi|290995663|ref|XP_002680402.1| predicted protein [Naegleria gruberi]
gi|284094023|gb|EFC47658.1| predicted protein [Naegleria gruberi]
Length = 974
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 99/198 (50%), Gaps = 25/198 (12%)
Query: 30 DGYTVTTVIDGHQLEINP------HSVIDRPGSSDLIVLDS------SRSAFYTLSFPLS 77
+G T V G + + +P +++I PG+ I+ DS S +F+ + L+
Sbjct: 210 EGRIATVVGSGQKGQESPDGVLAVNAIIGAPGN---IIFDSIGNMYLSDRSFHKVRKVLT 266
Query: 78 EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
++V +AG+G+ Y+ DG SA +P A+ G +Y+A+ H IRK+ TN
Sbjct: 267 NGTIVT-IAGNGMSAYNGDGILAVSASLFRPSGLALSSTGELYIAESYGHRIRKVLTNGT 325
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
+ TIAG G G A NA + E F+ LLISD GN+ IR+I+L D +
Sbjct: 326 IITIAGTGVAGYEGDGGLAVNA-LLDAPESIFLSSDNYLLISDFGNKRIRKISLL--DGT 382
Query: 195 KSSQSGSALG---AVSVW 209
S+ +G LG A SV+
Sbjct: 383 ISTIAGDGLGDGKAASVY 400
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 61/134 (45%), Gaps = 30/134 (22%)
Query: 77 SEESVVKRLAGDGVQ-GYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
S +++ AG Q DG+P + +P+ + G +Y+AD S I+K+++ G
Sbjct: 86 STNNIMSTFAGTNNQVNPEDGQPAVNTPLAEPRGLEISSNGEVYIADYSTCKIKKVSSQG 145
Query: 135 V-TTIAGGGSKKEGRADGPAQN------------ASFSNDFELTFVPHIC------ALLI 175
+ TTIAG GP QN +F+N+ +L+F I L I
Sbjct: 146 IMTTIAG---------TGPGQNNDRRLVSVYTGDGAFANETQLSFPQAIAFHPITGELHI 196
Query: 176 SDHGNQLIRQINLK 189
++ + IR+I L+
Sbjct: 197 AETQSYKIRKITLE 210
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
++ LAG V G+S DG P + A F +D +IYV+D NH IRKI+ + T
Sbjct: 663 IIVTLAGSNVAGFSGDGGPAADATFGTLGGIYLDSNDDIYVSDPDNHRIRKISLMCPTDY 722
Query: 139 AGGGSK 144
A GS+
Sbjct: 723 ALNGSE 728
>gi|290982388|ref|XP_002673912.1| predicted protein [Naegleria gruberi]
gi|284087499|gb|EFC41168.1| predicted protein [Naegleria gruberi]
Length = 2313
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 7/106 (6%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGG 142
LAG ++G++DG +A + P S ++ + +Y +D+ NH IR ++ + VTTI+G G
Sbjct: 500 LAGGSLKGFADGVGSNANLNYPDSISIGLNNMLYFSDRDNHAIRSVSTINALVTTISGSG 559
Query: 143 SKKEGRADGPAQNASFS--NDFELTFVPHICALLISDHGNQLIRQI 186
+GPA A + E+ I + D GNQ IR+I
Sbjct: 560 IAGYTGDEGPAIYAKLNLPGSIEVALNGDI---IFMDKGNQRIRKI 602
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 2/67 (2%)
Query: 85 LAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
+AG G G+S G SA + P + D GN+ ++D SN+ IRK+TN ++T+AG
Sbjct: 281 VAGTGTSGFSGDGGVATSALLNGPSALTFDSSGNMLISDSSNNRIRKVTNGIISTLAGTS 340
Query: 143 SKKEGRA 149
++ G
Sbjct: 341 NRNFGNG 347
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
VV +AG G GYS DG A+ + P+ + G I V+D NH IRKI N ++TI
Sbjct: 82 VVSTIAGTGNAGYSGDGANALFAQLNSPQGIGLLSGGAIIVSDTLNHRIRKIENGIISTI 141
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG GS A G A +A + L + + +D N +IR+I
Sbjct: 142 AGTGSPGY-TASGTATSALINTPLGLAVYNN--EVYFADSLNHVIRKI 186
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 56/112 (50%), Gaps = 17/112 (15%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
++V ++G G+ GY+ G+ G A + K P S V + G+I DK N IRKIT G
Sbjct: 549 NALVTTISGSGIAGYT-GDEGPAIYAKLNLPGSIEVALNGDIIFMDKGNQRIRKITKYGN 607
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ T+ G GS + F+N LT + + I+D GN LI+ +
Sbjct: 608 IFTLYGNGST-----------SVFNNANGLT-LGGSGEIYIADSGNNLIKML 647
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 2/78 (2%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT 136
E ++ +AG G GY+ G SA + P AV +Y AD NHVIRKI++ G
Sbjct: 134 ENGIISTIAGTGSPGYTASGTATSALINTPLGLAV-YNNEVYFADSLNHVIRKISSSGSI 192
Query: 137 TIAGGGSKKEGRADGPAQ 154
+ G G P Q
Sbjct: 193 SNEQGVGATSGSQTLPLQ 210
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 50/109 (45%), Gaps = 7/109 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDK--PKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTT 137
++R+ GV G S D KSF + G IY+AD NH I K I +TT
Sbjct: 440 LRRIESSGVLKLVVGSCNSGNQDYFLSKSFDISSDGIIYIADYYNHRIAKFVIGGTSLTT 499
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+AGG K G ADG NA+ + ++ + L SD N IR +
Sbjct: 500 LAGGSLK--GFADGVGSNANLNYPDSIS-IGLNNMLYFSDRDNHAIRSV 545
>gi|290992324|ref|XP_002678784.1| predicted protein [Naegleria gruberi]
gi|284092398|gb|EFC46040.1| predicted protein [Naegleria gruberi]
Length = 2331
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 58/107 (54%), Gaps = 2/107 (1%)
Query: 82 VKRLAGDGVQGYSDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
+ +AG+G GYSD +P A F+ S ++ G++Y++D +N+VIRK+++ G T
Sbjct: 509 ISSVAGNGYLGYSDNTQPLRASFNLAVSVLINNVGDMYISDYANNVIRKMSSSGAITTIA 568
Query: 141 GGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +K G AD A+ A + F L + L +D N IR++
Sbjct: 569 GQAKISGFADSLNAKTALLNGPFGLYYQESTQTLFFADSLNGRIRKL 615
Score = 40.8 bits (94), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 59/109 (54%), Gaps = 5/109 (4%)
Query: 81 VVKRLAGDGVQG-YSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTT 137
V+ +AG+G + Y DG G SA + P + G+IY+AD N IRK+ ++ +++
Sbjct: 452 VLSTIAGNGGERFYGDGGLGTSASLNYPYGIHLHTDGSIYIADTYNSKIRKLDSSKKISS 511
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+AG G P + ASF+ + + ++ + ISD+ N +IR++
Sbjct: 512 VAGNGYLGYSDNTQPLR-ASFNLAVSV-LINNVGDMYISDYANNVIRKM 558
>gi|262405960|ref|ZP_06082510.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|262356835|gb|EEZ05925.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
Length = 459
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 58/124 (46%), Gaps = 19/124 (15%)
Query: 85 LAGD-GVQGYSDGEPGSARFDKP--------KSFAVDMKGNIY---VADKSNHVIRKITN 132
AG+ G GY D SARFD P + + + + ++Y + D +NH IRKIT
Sbjct: 336 FAGEFGSYGYKDASQNSARFDNPMQGCFVLNEEYVAEQRLDVYDFYLTDAANHCIRKITP 395
Query: 133 LG-VTTIAGGGSKK-----EGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
G VTT AG GS G DG P + A F+ L + + D+GN +R
Sbjct: 396 DGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLCYEESTGTFYVGDNGNHRVRT 455
Query: 186 INLK 189
I L+
Sbjct: 456 IALQ 459
>gi|380692527|ref|ZP_09857386.1| hypothetical protein BfaeM_00899 [Bacteroides faecis MAJ27]
Length = 441
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 48/104 (46%), Gaps = 1/104 (0%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
KRL G + DG A F+ PK D GN++VAD NH IR ++ + T G
Sbjct: 336 KRLNAPGGSAFRDGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMVSADNIVTTVAGQ 395
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G DG + F N + + V + I+D GN IR++
Sbjct: 396 PGVAGYKDGGPLESLFKNPWGVA-VNEQGDIYIADWGNARIRKL 438
>gi|290985668|ref|XP_002675547.1| predicted protein [Naegleria gruberi]
gi|284089144|gb|EFC42803.1| predicted protein [Naegleria gruberi]
Length = 1037
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 62/111 (55%), Gaps = 4/111 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
V+ + G+G+ G+S DGE +A+ ++P VD G + D SN+ IR I+N G ++TI
Sbjct: 28 VEVIGGNGISGFSGDGELAINAKLNRPTKIVVDSAGRVVFCDNSNNRIRMISNNGTISTI 87
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
AG G+ G A NA + LT + I +LI+D N IR+I+L
Sbjct: 88 AGSGTGFVLGDGGLATNAILNMPTGLT-INSIGEILIADTSNFKIRKIDLN 137
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 71/142 (50%), Gaps = 8/142 (5%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG-SARFDK 105
P+ V+ S D+ + D+S +S S ++ +AG G G+ D A+ +
Sbjct: 445 PYGVV-TSSSGDVFIADTSNCRIRKVS---SSTGIITTVAG-GTCGFGDNVLAVDAQLNT 499
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
P +V+ KG +++AD +NH IRK+++ G ++TIAG G G A NA+ ++
Sbjct: 500 PYGISVNSKGELFIADTNNHRIRKVSSSGFISTIAGNGVGGFSGDGGLATNANLFKPSKV 559
Query: 165 TFVPHICALLISDHGNQLIRQI 186
V I + I+D IR+I
Sbjct: 560 V-VNSIGEIFIADSSTNRIRKI 580
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 57/115 (49%), Gaps = 5/115 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+S + +AG G G+ G+ G +A + P ++ G I +AD SN IRKI
Sbjct: 78 ISNNGTISTIAGSGT-GFVLGDGGLATNAILNMPTGLTINSIGEILIADTSNFKIRKIDL 136
Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +TTIAG G++ G A +A + +++ P + I+D N +R+I
Sbjct: 137 NGIITTIAGSGNQGFEGDGGLATSAPLNFPADVSVHPTTNEVFIADSNNHCVRKI 191
Score = 43.5 bits (101), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
+A KP V+ G I++AD S + IRKI TN + TIAG G+ + A N+
Sbjct: 550 NANLFKPSKVVVNSIGEIFIADSSTNRIRKILTNGTIITIAGNGNSGFNGDEADATNSQL 609
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
+ + + + + ISD GN IR++
Sbjct: 610 GSPYGIA-LSSTGEIYISDQGNNRIRKL 636
Score = 42.4 bits (98), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 54/115 (46%), Gaps = 4/115 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+S ++ + G+ +Q + PK A+ G +Y AD + IRKIT G
Sbjct: 305 VSLNGIITTVVGEYLQSQDGKLATDVVLNTPKGVAISPSGEVYFADSDKYAIRKITLSGT 364
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ TIAG G G A ++ SN +++ F+ + I+D N IR+I++
Sbjct: 365 IITIAGSG--LYGFAGDNGYSSQLSNSYDI-FINSLGEAFIADTYNHRIRKIDVN 416
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 73/150 (48%), Gaps = 14/150 (9%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFD 104
P V P ++++ + DS+ + L+ ++ G QG S G ++R +
Sbjct: 166 PADVSVHPTTNEVFIADSNNHCVRKI---LTNGTIFTVAGICGAQGQSSDGGLAINSRLN 222
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGS---KKEGRADGPAQNASFS 159
P A+ G+I++A++ N +RK+++ ++T AG G +G+A A NAS
Sbjct: 223 YPIGVAISSTGDIFIAERDNSKVRKVSSSTGIISTFAGNGVFGFMGDGKA---AVNASIY 279
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLK 189
N + + F ISD+G+ IR+++L
Sbjct: 280 NPYNVAF-NSAGEAFISDYGSGKIRKVSLN 308
>gi|111219921|ref|YP_710715.1| ascorbate-dependent monooxygenase [Frankia alni ACN14a]
gi|111147453|emb|CAJ59103.1| Putative ascorbate-dependent monooxygenase [Frankia alni ACN14a]
Length = 865
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 46/68 (67%), Gaps = 3/68 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ V+ +AG GV G+S DG P +A+ D+P A+D +G++Y+AD+ NH IR++
Sbjct: 558 INAAGVITTIAGTGVAGFSGDGGPAVAAQLDQPAGIALDNRGDLYIADRLNHRIRRVDPR 617
Query: 134 G-VTTIAG 140
G +TT+AG
Sbjct: 618 GIITTVAG 625
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 62/135 (45%), Gaps = 8/135 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG-YSDGEPGS-ARFDKPKSFAVDM 113
+ DL + D + + ++ AG +G + DG P + A D P A D
Sbjct: 665 AGDLFIADQGENRIRKVD----ARGIISTFAGSSGRGSFGDGGPATDALLDVPFGVAADA 720
Query: 114 KGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
G++Y+AD N IRK+ GV TT+AG + GPA AS D V +
Sbjct: 721 AGDVYIADTDNSRIRKVDTHGVITTVAGNRLRGFAGDGGPAVKASL-QDPRGIAVDAVGN 779
Query: 173 LLISDHGNQLIRQIN 187
L I+D GN IR+++
Sbjct: 780 LYITDRGNSRIRKVD 794
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 7/134 (5%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDM 113
+ D+ + D+ S + V+ +AG+ ++G++ DG P A P+ AVD
Sbjct: 721 AGDVYIADTDNSRIRKVD----THGVITTVAGNRLRGFAGDGGPAVKASLQDPRGIAVDA 776
Query: 114 KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
GN+Y+ D+ N IRK+ G +TT+AG G G A NA V H
Sbjct: 777 VGNLYITDRGNSRIRKVDTHGIITTLAGSGRPGSAGDGGLAGNAELGRPDGAVGVDHEGN 836
Query: 173 LLISDHGNQLIRQI 186
+ SD + IR +
Sbjct: 837 VFFSDRASGRIRVV 850
>gi|307182571|gb|EFN69764.1| NHL repeat-containing protein 2 [Camponotus floridanus]
Length = 687
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 85/197 (43%), Gaps = 20/197 (10%)
Query: 5 FVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDS 64
+Y+ + I L + + L L Y + + L +++ + LI+ DS
Sbjct: 182 LLYARVMIAYFKSLNQISNHSLPLRPAYHLLSSFKNGLLFPGKLAILSSEHGTKLIISDS 241
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
+ + + V+ G QG+ DG +ARF+ P+ V + IYVAD +N
Sbjct: 242 GNNRIVIAT----KHGEVEHFIGGCNQGFKDGSFKNARFNSPQGVCV-LNNTIYVADNNN 296
Query: 125 HVIRKI--TNLGVTTIAGGGSKKEGRADG----------PAQNASFSNDFELTFVPHICA 172
H IRKI T V+TIAG GS+ G P A + ++++ + VP
Sbjct: 297 HAIRKINLTENNVSTIAGTGSQGYDHKGGKQGINQVLSSPWDVAIYHHEYKDSIVP---V 353
Query: 173 LLISDHGNQLIRQINLK 189
LLI+ G I + L+
Sbjct: 354 LLIAIAGTHQIWALFLE 370
>gi|358461043|ref|ZP_09171215.1| serine/threonine protein kinase [Frankia sp. CN3]
gi|357074242|gb|EHI83734.1| serine/threonine protein kinase [Frankia sp. CN3]
Length = 866
Score = 54.7 bits (130), Expect = 4e-05, Method: Composition-based stats.
Identities = 39/114 (34%), Positives = 60/114 (52%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++++ VV +AG+ G+S DG P +A + P A D GN+Y+ D N+ +RKI
Sbjct: 574 IAKDGVVTPIAGNAQDGFSGDGGPATAAQLYGPGRVAWDKAGNLYIPDTQNYRVRKIDPS 633
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +TT+ G G+ GPA A N E V L ++D+ NQ IR++
Sbjct: 634 GKITTVVGIGTAGYSGDGGPATQAQI-NGVEGIAVTADGTLYLADYDNQRIRKV 686
Score = 50.8 bits (120), Expect = 5e-04, Method: Composition-based stats.
Identities = 35/115 (30%), Positives = 60/115 (52%), Gaps = 7/115 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ + ++ +AG G +GYS G P +A + D P S ++ G IY A+ + ++KI
Sbjct: 686 VTPDGIITTIAGTGEKGYS-GTPTTATQAKLDGPNSISLADDGTIYFANLGSDTVQKIDK 744
Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +TT AG G GPA +A+ S F+ H + I +G++ IR++
Sbjct: 745 AGMLTTFAGNGKTGRTGDGGPATSATLS--IPDVFLGHDGTVYICAYGSETIRKV 797
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 29/67 (43%), Positives = 41/67 (61%), Gaps = 3/67 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-N 132
++ + ++ +AG G +GY+ DG P +A+ P S VD G IYVAD N VIR+I N
Sbjct: 797 VTSDGIITTIAGTGAEGYTGDGGPANAAQLSDPTSVVVDAGGAIYVADNGNKVIRRIDPN 856
Query: 133 LGVTTIA 139
+TTIA
Sbjct: 857 GTITTIA 863
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 37/115 (32%), Positives = 62/115 (53%), Gaps = 5/115 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + ++ AG+G G + DG P SA P F + G +Y+ + IRK+T+
Sbjct: 742 IDKAGMLTTFAGNGKTGRTGDGGPATSATLSIPDVF-LGHDGTVYICAYGSETIRKVTSD 800
Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G+ TTIAG G++ GPA A S+ + V A+ ++D+GN++IR+I+
Sbjct: 801 GIITTIAGTGAEGYTGDGGPANAAQLSDPTSVV-VDAGGAIYVADNGNKVIRRID 854
Score = 44.7 bits (104), Expect = 0.039, Method: Composition-based stats.
Identities = 34/97 (35%), Positives = 50/97 (51%), Gaps = 13/97 (13%)
Query: 82 VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ + G G GYS DG P + A+ + + AV G +Y+AD N IRK+T G +TTI
Sbjct: 636 ITTVVGIGTAGYSGDGGPATQAQINGVEGIAVTADGTLYLADYDNQRIRKVTPDGIITTI 695
Query: 139 AGGGSK---------KEGRADGPAQNASFSNDFELTF 166
AG G K + + DGP + S ++D + F
Sbjct: 696 AGTGEKGYSGTPTTATQAKLDGP-NSISLADDGTIYF 731
>gi|294647916|ref|ZP_06725468.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
gi|294806312|ref|ZP_06765159.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
gi|345510467|ref|ZP_08790034.1| hypothetical protein BSAG_00676 [Bacteroides sp. D1]
gi|292636824|gb|EFF55290.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
gi|294446568|gb|EFG15188.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
gi|295085612|emb|CBK67135.1| IPT/TIG domain./NHL repeat. [Bacteroides xylanisolvens XB1A]
gi|345454427|gb|EEO48966.2| hypothetical protein BSAG_00676 [Bacteroides sp. D1]
Length = 476
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 58/124 (46%), Gaps = 19/124 (15%)
Query: 85 LAGD-GVQGYSDGEPGSARFDKP--------KSFAVDMKGNIY---VADKSNHVIRKITN 132
AG+ G GY D SARFD P + + + + ++Y + D +NH IRKIT
Sbjct: 353 FAGEFGSYGYKDASQNSARFDNPMQGCFVLNEEYVAEQRLDVYDFYLTDAANHCIRKITP 412
Query: 133 LG-VTTIAGGGSKK-----EGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
G VTT AG GS G DG P + A F+ L + + D+GN +R
Sbjct: 413 DGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLCYEESTGTFYVGDNGNHRVRT 472
Query: 186 INLK 189
I L+
Sbjct: 473 IALQ 476
>gi|290999745|ref|XP_002682440.1| predicted protein [Naegleria gruberi]
gi|284096067|gb|EFC49696.1| predicted protein [Naegleria gruberi]
Length = 731
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 64/118 (54%), Gaps = 5/118 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S ++ +AG GV GYS DG P + + + P +D NI +AD++NH IR I+N
Sbjct: 228 VSNSGIISTVAGTGVSGYSGDGIPANTTKLNTPNGITIDSNDNIIIADRNNHRIRLISNS 287
Query: 134 G--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
++T+AG G+ + A +A S ++T + + L+I+D N +IR + L
Sbjct: 288 SGIISTLAGNGTTGSRDEEVLATSAKLSRPADVT-IGYDGELIITDTDNFVIRIVKLN 344
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 77/149 (51%), Gaps = 12/149 (8%)
Query: 76 LSEESVVKRLAGDGVQGYSDGE---PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-- 130
+S ++ +AG G GY +G+ A+ P +VD KGN+Y+ADK NH IRKI
Sbjct: 508 VSNNGIISTIAGTGSAGY-NGDVIMATEAKLYLPHGVSVDNKGNVYIADKQNHRIRKILA 566
Query: 131 -TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
T + ++TIAG G + A + ++ +++T + I+D N IR+I
Sbjct: 567 STGM-ISTIAGTGQAGFNDDNMSALESRVNSPYDVTVDESGQVIYIADTNNHKIRRI--- 622
Query: 190 PEDCSKSSQSGSALGAVSVWVLVSVLSCL 218
++ + ++ +G+ +G + ++S S L
Sbjct: 623 -QNGNLTTIAGNGIGGYNQDGILSTQSQL 650
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 62/114 (54%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
+S++ VK +AG+G+ GY+ DG A+ + P A D GNIY++D NH +R + TN
Sbjct: 396 ISKDGSVKTIAGNGIGGYNGDGMLAIDAQLNYPHGVASDSIGNIYISDSYNHRVRIVFTN 455
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
++TIAG G+ + A ++ + F + + L ISD N +R++
Sbjct: 456 GTISTIAGNGNSGFNKDGIQATSSQLNYPFGIALNGN-DELFISDRSNHRVRKV 508
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 57/115 (49%), Gaps = 9/115 (7%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGN-IYVADKSNHVIRKITN 132
L+ ++ +AG G G++D + +R + P VD G IY+AD +NH IR+I N
Sbjct: 565 LASTGMISTIAGTGQAGFNDDNMSALESRVNSPYDVTVDESGQVIYIADTNNHKIRRIQN 624
Query: 133 LGVTTIAG---GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
+TTIAG GG ++G +Q ++++ P + I D N IR
Sbjct: 625 GNLTTIAGNGIGGYNQDGILSTQSQ---LYYPYDVSIDPVSGKIFIGDASNFRIR 676
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 64/115 (55%), Gaps = 16/115 (13%)
Query: 81 VVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ +AG+G G++ DG + S++ + P A++ ++++D+SNH +RK++N G ++T
Sbjct: 457 TISTIAGNGNSGFNKDGIQATSSQLNYPFGIALNGNDELFISDRSNHRVRKVSNNGIIST 516
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
IAG GS G + + + +L ++PH + + I+D N IR+I
Sbjct: 517 IAGTGSA------GYNGDVIMATEAKL-YLPHGVSVDNKGNVYIADKQNHRIRKI 564
>gi|289667480|ref|ZP_06488555.1| putative NHL repeat protein [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 310
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 44/84 (52%), Gaps = 3/84 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
L+ + V LAGDGV G +G RF+ P AVD +G +YVAD N I I G
Sbjct: 95 LTPQGKVTTLAGDGVAGDRNGAAAQVRFNGPVGVAVDAQGRVYVADTYNDRIGVIETDGQ 154
Query: 135 VTTIAGGGSKKEGRADGPAQNASF 158
V T+AGGG G ADG A F
Sbjct: 155 VRTLAGGG--LPGMADGIGTQAWF 176
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 42/68 (61%), Gaps = 2/68 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
L + V LAG G +G++DG +A F+ P +D GN+Y+AD NH IRK+T G
Sbjct: 42 LGADGHVHTLAG-GREGFADGIGVAAAFNTPSGIMLDTAGNLYIADTGNHAIRKLTPQGK 100
Query: 135 VTTIAGGG 142
VTT+AG G
Sbjct: 101 VTTLAGDG 108
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTTIAGGGSKK 145
++G +DG A+FD P A D G +Y+AD + +I LG V T+AGG +
Sbjct: 1 MRGVADGRAADAQFDDPYGLATDAHGTLYIADGGDD--NRIHGLGADGHVHTLAGG---R 55
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
EG ADG A+F N + L I+D GN IR++
Sbjct: 56 EGFADGIGVAAAF-NTPSGIMLDTAGNLYIADTGNHAIRKL 95
>gi|441163235|ref|ZP_20968262.1| leucine-rich repeat-containing protein [Streptomyces rimosus subsp.
rimosus ATCC 10970]
gi|440616374|gb|ELQ79516.1| leucine-rich repeat-containing protein [Streptomyces rimosus subsp.
rimosus ATCC 10970]
Length = 553
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 56/114 (49%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ ++ +AG+G GY SDG P R P A+D G++Y+ D NH IRK+T+
Sbjct: 24 VTPNGLITTVAGNGTAGYVSDGGPALGTRLHYPWGLALDEAGSLYIGDGHNHRIRKVTSD 83
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +TT+AG G+ GPA + + L I+D N +R +
Sbjct: 84 GIITTVAGNGTAGYVDDGGPAAGTRLYYPYGIAL-DRGGNLYIADCNNHRVRGV 136
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 35/58 (60%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + ++ +AG+G GY DG P + R P A+D GN+Y+AD +NH +R +T
Sbjct: 80 VTSDGIITTVAGNGTAGYVDDGGPAAGTRLYYPYGIALDRGGNLYIADCNNHRVRGVT 137
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 2/79 (2%)
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
VD +G++Y+AD+ NH +RK+T G +TT+AG G+ GPA + L
Sbjct: 3 LVVDAEGSLYIADRYNHRVRKVTPNGLITTVAGNGTAGYVSDGGPALGTRLHYPWGLAL- 61
Query: 168 PHICALLISDHGNQLIRQI 186
+L I D N IR++
Sbjct: 62 DEAGSLYIGDGHNHRIRKV 80
>gi|290979906|ref|XP_002672674.1| predicted protein [Naegleria gruberi]
gi|284086252|gb|EFC39930.1| predicted protein [Naegleria gruberi]
Length = 747
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 70/132 (53%), Gaps = 8/132 (6%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
DL + DS A + E ++ +AG+G+ G+ DG A+F+ +V G++
Sbjct: 512 DLYICDSINHAIRKID---HETGIISTIAGNGIAGFKDGNASIAQFNSNLGLSVLPNGDL 568
Query: 118 YVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALL 174
+AD +N+ IRK ++ V TIAGG + G DG A +A ++ ++ + +
Sbjct: 569 LIADYNNNRIRKYLASSKQVLTIAGGLAGYSG--DGQLATSAQLNHPTDVAYNASTGDVF 626
Query: 175 ISDHGNQLIRQI 186
I+D GN++IR+I
Sbjct: 627 IADFGNKVIRKI 638
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 47/123 (38%), Positives = 68/123 (55%), Gaps = 11/123 (8%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKITN 132
L+ V +AG G+ GYS DG+ SA+ + P A + G++++AD N VIRKI+N
Sbjct: 582 LASSKQVLTIAG-GLAGYSGDGQLATSAQLNHPTDVAYNASTGDVFIADFGNKVIRKISN 640
Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSN--DFELTFVPHICALLISDHGNQLIRQINL 188
++TIAG G + +G A AS SN + L+ P L ISD N LIR+IN+
Sbjct: 641 STGIISTIAGSGLSEY---NGDAMPASISNISPYGLSIHPITGELFISDSVNYLIRKINV 697
Query: 189 KPE 191
+
Sbjct: 698 NNQ 700
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 77/156 (49%), Gaps = 24/156 (15%)
Query: 40 GHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV------VKRLAGDGVQG 92
G++ +N P+SV S D+ + + F L ++ V V LAGD
Sbjct: 439 GNETALNYPYSVTQSEISGDVFI----GTTFKILKVSKKDKRVSTVAGSVSTLAGDNFNS 494
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRA 149
+ ++ ++P A + G++Y+ D NH IRKI T + ++TIAG G G
Sbjct: 495 IA------SQLNEPAGLAFAINGDLYICDSINHAIRKIDHETGI-ISTIAGNG--IAGFK 545
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
DG A A F+++ L+ +P+ LLI+D+ N IR+
Sbjct: 546 DGNASIAQFNSNLGLSVLPN-GDLLIADYNNNRIRK 580
>gi|255530479|ref|YP_003090851.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
gi|255343463|gb|ACU02789.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
Length = 453
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 48/104 (46%), Gaps = 1/104 (0%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
KRL G + DG A F+ PK D GN++VAD NH IR ++ + T G
Sbjct: 348 KRLNAPGGSAFRDGPLADALFNDPKEIKFDNSGNMFVADYGNHCIRMVSADNIVTTVAGQ 407
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
K G DG + F+ + + V + I+D N IR++
Sbjct: 408 PGKSGYKDGGPVESLFNQPWGVA-VNEQGDIYIADWSNARIRKL 450
>gi|449137215|ref|ZP_21772546.1| NHL repeat containing protein [Rhodopirellula europaea 6C]
gi|448884292|gb|EMB14794.1| NHL repeat containing protein [Rhodopirellula europaea 6C]
Length = 351
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGVTTIAGGGSKKE 146
GY DG SARF+K + +D +Y++D +NH +R++ V T AG G +
Sbjct: 106 GYVDGPARSARFNKLHNLMIDSNDVLYLSDHANHSVRRLMESEDGKWVVDTYAGNGKRGP 165
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ ++A+F +T LLI+D GNQ+IR ++L
Sbjct: 166 AVDNVDRRDATFHEPISVTLDVKDNRLLIADIGNQVIRSLDL 207
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
+ ++I G + ++ +D G L+VL+ + + + + LAG G +G
Sbjct: 211 MVSIIAGQKSKLKDPRAVDLDGERRLLVLERNGNRLRRVE----SNGDITTLAGSGKKGT 266
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+DG+ A F+ PK V G +Y+AD NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 49/86 (56%), Gaps = 4/86 (4%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
++ P++ +D + + V +++ + +R++ +N +TT+AG G K+G ADG A+ ASF
Sbjct: 219 KSKLKDPRAVDLDGERRLLVLERNGNRLRRVESNGDITTLAGSG--KKGTADGDAKQASF 276
Query: 159 SNDFELTFVPHICALLISDHGNQLIR 184
+ + P + I+D N L+R
Sbjct: 277 NGPKHMDVAPD-GRVYIADDVNHLVR 301
>gi|111219555|ref|YP_710349.1| serine/threonine-protein kinase [Frankia alni ACN14a]
gi|111147087|emb|CAJ58734.1| putative serine/threonine-protein kinase (partial match) [Frankia
alni ACN14a]
Length = 899
Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats.
Identities = 34/89 (38%), Positives = 51/89 (57%), Gaps = 2/89 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
+A+ D P S A D GNIY+ D N+ IRK+T GV +T+AG G+ GPA A
Sbjct: 632 AAQLDGPGSTARDKAGNIYIGDAKNNRIRKVTPAGVISTVAGTGTAGYSGDGGPATAAQL 691
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
++ ++T P ++ SD+ N IR+I+
Sbjct: 692 NSAEKVTTGPD-GSVYFSDYDNHRIRKID 719
Score = 51.2 bits (121), Expect = 4e-04, Method: Composition-based stats.
Identities = 48/141 (34%), Positives = 77/141 (54%), Gaps = 10/141 (7%)
Query: 55 GSSDLIVLDSSRSAFYTLSF----PLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKS 108
G +D+ + D F L+ ++ + ++ +AG G GY+ DG P SA+ +KP S
Sbjct: 749 GPNDITMTDDGTLYFVDLTSETIQKVTPDGIISTIAGTGEAGYTGDGGPARSAKLNKP-S 807
Query: 109 FAVDMKG-NIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
A+ G Y+AD +N+ +RKI G +TTIAG G++ G GPA A F N +
Sbjct: 808 LAIGPDGETFYIADYNNNRVRKIDPNGIITTIAGTGTEGSGGDGGPATAAQFKNPSSVV- 866
Query: 167 VPHICALLISDHGNQLIRQIN 187
V A+ ++D+GN +R+I+
Sbjct: 867 VDGSGAVYVADNGNDRVRRID 887
Score = 43.9 bits (102), Expect = 0.075, Method: Composition-based stats.
Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 13/99 (13%)
Query: 81 VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
V+ +AG G GYS DG P +A + + + G++Y +D NH IRKI G +TT
Sbjct: 667 VISTVAGTGTAGYSGDGGPATAAQLNSAEKVTTGPDGSVYFSDYDNHRIRKIDPAGIITT 726
Query: 138 IAGGG---------SKKEGRADGPAQNASFSNDFELTFV 167
G G + + R DGP + + ++D L FV
Sbjct: 727 YVGTGVAGYTGAGGAATQARIDGP-NDITMTDDGTLYFV 764
Score = 42.7 bits (99), Expect = 0.18, Method: Composition-based stats.
Identities = 28/88 (31%), Positives = 43/88 (48%), Gaps = 2/88 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFS 159
AR D P + G +Y D ++ I+K+T G+ +TIAG G GPA++A
Sbjct: 745 ARIDGPNDITMTDDGTLYFVDLTSETIQKVTPDGIISTIAGTGEAGYTGDGGPARSAKL- 803
Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
N L P I+D+ N +R+I+
Sbjct: 804 NKPSLAIGPDGETFYIADYNNNRVRKID 831
>gi|290996598|ref|XP_002680869.1| predicted protein [Naegleria gruberi]
gi|284094491|gb|EFC48125.1| predicted protein [Naegleria gruberi]
Length = 1407
Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats.
Identities = 53/201 (26%), Positives = 92/201 (45%), Gaps = 14/201 (6%)
Query: 8 SSLAITLLSLLFSLVSSGLLLEDGY-TVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
S L I +S+ +++ + GY ++++ ID I+ +S S DL + D+S
Sbjct: 21 SGLVIGAISVKYNITTIA-----GYNSLSSGIDAKMANIDVNSCFKDLSSGDLFISDNSN 75
Query: 67 SAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
+ + V +AG GV GY+ + + +A+ + P V IY+AD N
Sbjct: 76 HRIVRV---FAANGTVLTIAGTGVSGYNGDNIQATTAQLNTPTGVFV-FNSEIYIADSQN 131
Query: 125 HVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
+RKI TN + TIAG G+ DG ++ N FV + IS++ N I
Sbjct: 132 SRVRKIQTNGNIVTIAGNGNAGYN-GDGMLATNAYLNSPVDVFVSSNGNVYISEYQNHYI 190
Query: 184 RQINLKPEDCSKSSQSGSALG 204
R +N+ + + +G+ +G
Sbjct: 191 RMVNVSTGVITTVAGNGTQIG 211
Score = 41.2 bits (95), Expect = 0.49, Method: Composition-based stats.
Identities = 39/110 (35%), Positives = 55/110 (50%), Gaps = 10/110 (9%)
Query: 85 LAGDGVQGYSDG--EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
+AG GV GY+ + +A+ + P S + K +Y++D N IRKI TN + TIAG
Sbjct: 441 IAGTGVAGYNSDYMDASTAQLNYPSS-VFEFKNEVYISDSVNRRIRKIFTNGTIVTIAGT 499
Query: 142 GSK--KEGRADGPAQNASFSNDFELT--FVPHICALLISDHGNQLIRQIN 187
GS+ G NA + + T FV + I D N LIR+IN
Sbjct: 500 GSQPPSSGYLGDDGVNALSARLYFPTGIFVTSANEVFIVD--NFLIRKIN 547
Score = 39.3 bits (90), Expect = 1.6, Method: Composition-based stats.
Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 13/77 (16%)
Query: 81 VVKRLAGDGVQ----------GYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIR 128
V+ +AG+G Q GY+ DG P + AR P+ V IY+AD N IR
Sbjct: 199 VITTVAGNGTQIGTSGTGLGFGYNGDGIPATYARLTNPQGIFVTSNNEIYIADAGNFRIR 258
Query: 129 KI-TNLGVTTIAGGGSK 144
K+ TN + T+AG G +
Sbjct: 259 KVLTNGTIITVAGTGEE 275
Score = 39.3 bits (90), Expect = 2.0, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 39/62 (62%), Gaps = 3/62 (4%)
Query: 85 LAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
+AG G +GY+ DG +A+ D P +VD G I++A+ ++ +RK+ TN + TIAG
Sbjct: 269 VAGTGEEGYNGDGMLATAAKLDYPYGVSVDSNGEIWIAELGSNRLRKVLTNGTIVTIAGT 328
Query: 142 GS 143
G+
Sbjct: 329 GT 330
>gi|290972323|ref|XP_002668903.1| predicted protein [Naegleria gruberi]
gi|284082440|gb|EFC36159.1| predicted protein [Naegleria gruberi]
Length = 296
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 53/94 (56%), Gaps = 4/94 (4%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG--GGSKKEGRADGPAQN 155
+A + P AV G++Y+ADKSNHVIRK++ L +TTIAG G ++ ++ A N
Sbjct: 108 NAELNFPSGVAVHSNGDVYIADKSNHVIRKVSALNGKITTIAGIAGETELNKYSNSLATN 167
Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ ++ L ++ISD N +IR++ L
Sbjct: 168 TTLNSPQYLAVNSSTAEVIISDTNNNVIRKVYLN 201
>gi|385680260|ref|ZP_10054188.1| NHL repeat-containing protein [Amycolatopsis sp. ATCC 39116]
Length = 597
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 53/111 (47%), Gaps = 8/111 (7%)
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNA 156
+P RF P V +G I VAD NH I + + G T I GS GRADG A
Sbjct: 163 QPADLRF--PSKAVVTAEGRILVADTGNHSIAEFASDGETLIRRFGSGARGRADGAFDVA 220
Query: 157 SFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSGS 201
SF+ L +P A LL++D N L+R ++L + + + +G+
Sbjct: 221 SFAEPSGLALLPAEVADQAGYHLLVADTANHLLRGVDLVTGEVTTVAGTGN 271
Score = 43.9 bits (102), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV------DMKG-NIYVADKSNH 125
F E++++R G G +G +DG A F +P A+ D G ++ VAD +NH
Sbjct: 193 EFASDGETLIRRF-GSGARGRADGAFDVASFAEPSGLALLPAEVADQAGYHLLVADTANH 251
Query: 126 VIRKITNLG--VTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
++R + + VTT+AG G++ ++G DG A + ++ +++ + +++ GN
Sbjct: 252 LLRGVDLVTGEVTTVAGTGNQWRDGETDGDALSIDLTSPWDVIWWEPAGGAVVAMAGNHT 311
Query: 183 I 183
+
Sbjct: 312 L 312
>gi|418677567|ref|ZP_13238841.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
str. RM52]
gi|418687786|ref|ZP_13248945.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
str. Moskva]
gi|418742122|ref|ZP_13298495.1| putative lipoprotein [Leptospira kirschneri serovar Valbuzzi str.
200702274]
gi|400320757|gb|EJO68617.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
str. RM52]
gi|410738110|gb|EKQ82849.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
str. Moskva]
gi|410750480|gb|EKR07460.1| putative lipoprotein [Leptospira kirschneri serovar Valbuzzi str.
200702274]
Length = 358
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 65/119 (54%), Gaps = 9/119 (7%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
+ G +G+ ++ FD P +D++ N+YV + SNH IRKI NL V+T +GG S
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSGMVSTFSGGIS--- 219
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + + + +LL +D + IR+INLK + S+ G+ +GA
Sbjct: 220 GYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKINLKTS--TVSTLLGNGIGA 276
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++++ VV AG GV DG +A F P VD GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGIISTASFKTPFGLEVDTSGNIFVS 95
Query: 121 DKSNHVIRKI 130
D+ ++IRKI
Sbjct: 96 DQMTNLIRKI 105
>gi|326435919|gb|EGD81489.1| hypothetical protein PTSG_02206 [Salpingoeca sp. ATCC 50818]
Length = 6977
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 63/144 (43%), Gaps = 9/144 (6%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
+P V+ P + D++V D + ++ E V G G +G+ DG R
Sbjct: 912 SPQDVLSLP-NGDIVVCDRDNARVRVIT----PEGKVSTALGVGKRGHQDGVGAGVRLAG 966
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNL--GVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
P+ G+ V D NH IR++++ V TIAG + G DG A A F
Sbjct: 967 PRGMTQLASGSFVVTDAENHCIREVSSALAKVETIAG--CTRAGVRDGDAAAAEFRYPTH 1024
Query: 164 LTFVPHICALLISDHGNQLIRQIN 187
+P +LI+D GN IR I+
Sbjct: 1025 ALEIPRQKLILITDTGNHTIRAIS 1048
>gi|326431736|gb|EGD77306.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
Length = 1384
Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats.
Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 9/114 (7%)
Query: 81 VVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---- 134
V+ LAG GV G+ G P +AR + P+ A G I +AD N IR + N+G
Sbjct: 1270 VITALAGTGVAGFRGDGGAPMAARLNTPRGIAAMPAGEIAIADTGNSRIRTL-NVGGQGA 1328
Query: 135 --VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ T+AG G++ G A A+ + +TF P ++ D N+ +RQI
Sbjct: 1329 GVIETVAGTGARGFSGDGGVATEANLNFPTGITFSPSTNNIVFVDRRNRRVRQI 1382
>gi|225420274|ref|ZP_03762577.1| hypothetical protein CLOSTASPAR_06617 [Clostridium asparagiforme
DSM 15981]
gi|225041091|gb|EEG51337.1| hypothetical protein CLOSTASPAR_06617 [Clostridium asparagiforme
DSM 15981]
Length = 336
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 55/144 (38%), Gaps = 43/144 (29%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR-------------- 128
+ LAG G +GY D G A F P A D GN+Y+AD N VIR
Sbjct: 150 RTLAGRGREGYEDNRGGRALFSGPTGLAADDAGNLYIADTGNDVIRRLRPDGMVDTYLRG 209
Query: 129 -------------------------KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
K+ N + AGG EG+ADG A FS+
Sbjct: 210 LSGPVGLCWHEGALYVSDTGNHRVLKVENGEIVWTAGG---AEGQADGGFGQARFSSPTY 266
Query: 164 LTFVPHICALLISDHGNQLIRQIN 187
L V L +SD GN +R+I
Sbjct: 267 LA-VSEQGTLYVSDTGNAAVRKIE 289
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
G +G +DG G ARF P AV +G +YV+D N +RKI N V T+
Sbjct: 247 GAEGQADGGFGQARFSSPTYLAVSEQGTLYVSDTGNAAVRKIENGTVATV 296
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 53/109 (48%), Gaps = 6/109 (5%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
K L G+ + GY DG R ++P +G V+D+ NH++R I G T+AG G
Sbjct: 98 KGLYGEPLGGYGDGTRSEMRMEEPWDIVPYGEGYA-VSDRENHMVRYIDAQGSRTLAGRG 156
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
+EG D A FS L L I+D GN +IR+ L+P+
Sbjct: 157 --REGYEDNRGGRALFSGPTGLA-ADDAGNLYIADTGNDVIRR--LRPD 200
>gi|386849777|ref|YP_006267790.1| Teneurin-4 [Actinoplanes sp. SE50/110]
gi|359837281|gb|AEV85722.1| Teneurin-4 [Actinoplanes sp. SE50/110]
Length = 631
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 42/66 (63%), Gaps = 2/66 (3%)
Query: 80 SVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
+ V +AG G G S DG S A+ + P+ VD G+IYVAD NH++RKI+N ++T
Sbjct: 172 AAVSVIAGTGTAGTSPDGTAASLAKLNAPRDVKVDASGDIYVADTGNHLVRKISNGTIST 231
Query: 138 IAGGGS 143
+AG G+
Sbjct: 232 VAGTGA 237
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 42/81 (51%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P AVD G +Y+AD N+V+R++ +TT+AG G ADG + +D E
Sbjct: 36 PGGVAVDPAGTMYIADSGNNVVRQVAAGIITTVAGTGVAGGTPADGGLATSGNLSDPEDV 95
Query: 166 FVPHICALLISDHGNQLIRQI 186
V L I+D G+ IR++
Sbjct: 96 TVDSGGVLYIADTGHHRIRRV 116
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 6/113 (5%)
Query: 78 EESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++ +AG GV G + + G S P+ VD G +Y+AD +H IR++
Sbjct: 61 AAGIITTVAGTGVAGGTPADGGLATSGNLSDPEDVTVDSGGVLYIADTGHHRIRRVAGGV 120
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+TT+AG G G A S+ + + + + ++D GN I IN
Sbjct: 121 ITTVAGTGVAGS-LMTGVAATTLLSSPYGIDTANGM--VYVADTGNNRILMIN 170
Score = 37.0 bits (84), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 50/109 (45%), Gaps = 5/109 (4%)
Query: 81 VVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLGVTTI 138
V+ +AG GV G G + P + +D G +YVAD N+ I I V+ I
Sbjct: 120 VITTVAGTGVAGSLMTGVAATTLLSSP--YGIDTANGMVYVADTGNNRILMINGAAVSVI 177
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
AG G+ DG A + + N V + ++D GN L+R+I+
Sbjct: 178 AGTGTAGT-SPDGTAASLAKLNAPRDVKVDASGDIYVADTGNHLVRKIS 225
>gi|312198347|ref|YP_004018408.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311229683|gb|ADP82538.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 814
Score = 54.3 bits (129), Expect = 6e-05, Method: Composition-based stats.
Identities = 37/118 (31%), Positives = 65/118 (55%), Gaps = 6/118 (5%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S + + +AG+G G+S DG P +A +KP + A+ G +Y+ D N +R+++
Sbjct: 530 ISPDGTITTVAGNGQPGFSGDGGPATAAMLNKPVAVAIGPGGTLYIVDTFNMRVRQVSPD 589
Query: 134 G-VTTIAGGGSKKEGRAD--GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G + TIAG G + AD GPA NA+ + + L I+D+GN ++R++ +
Sbjct: 590 GIIQTIAGSGERPWNPADDGGPATNAALWYPSGIA-IDSAGDLFIADNGNDIVRRVGV 646
Score = 51.2 bits (121), Expect = 4e-04, Method: Composition-based stats.
Identities = 39/112 (34%), Positives = 59/112 (52%), Gaps = 4/112 (3%)
Query: 79 ESVVKRLAGD-GVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT 136
+ ++ +AG G + DG+P + A KP + A+D +G IY+AD NH IR+I GV
Sbjct: 647 DGIITTVAGRFGYGSWGDGKPATQAMISKPFNVALDRQGRIYIADSYNHKIRRIGLDGVI 706
Query: 137 -TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
TIAG G G A A+ + +T V + I+D GN +R+I+
Sbjct: 707 ETIAGTGVAGYSGDGGKATAATLRDPRGVT-VDAAGNVYITDSGNNRVRRID 757
Score = 50.4 bits (119), Expect = 7e-04, Method: Composition-based stats.
Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 12/86 (13%)
Query: 79 ESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
+ V++ +AG GV GYS G+ +A P+ VD GN+Y+ D N+ +R+I G +
Sbjct: 703 DGVIETIAGTGVAGYSGDGGKATAATLRDPRGVTVDAAGNVYITDSGNNRVRRIDTAGII 762
Query: 136 TTIAGG---GSKKEG------RADGP 152
TT+AG GSK R DGP
Sbjct: 763 TTVAGTAPPGSKSTDTSATALRPDGP 788
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 41/118 (34%), Positives = 59/118 (50%), Gaps = 10/118 (8%)
Query: 73 SFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
SFP V +AG G G++ DG P + A+ D P A+D GN+Y AD N+ +R+I
Sbjct: 477 SFP------VVAIAGTGQAGFAGDGGPAAQAQLDHPYGPAMDGFGNLYFADFDNNRVRRI 530
Query: 131 TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ G +TT+AG G GPA A + + P L I D N +RQ++
Sbjct: 531 SPDGTITTVAGNGQPGFSGDGGPATAAMLNKPVAVAIGPG-GTLYIVDTFNMRVRQVS 587
Score = 45.4 bits (106), Expect = 0.023, Method: Composition-based stats.
Identities = 33/118 (27%), Positives = 58/118 (49%), Gaps = 6/118 (5%)
Query: 76 LSEESVVKRLAGDGVQGYS---DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S + +++ +AG G + ++ DG P + A P A+D G++++AD N ++R++
Sbjct: 586 VSPDGIIQTIAGSGERPWNPADDGGPATNAALWYPSGIAIDSAGDLFIADNGNDIVRRVG 645
Query: 132 NLGVTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G+ T G DG PA A S F + + + I+D N IR+I L
Sbjct: 646 VDGIITTVAGRFGYGSWGDGKPATQAMISKPFNVA-LDRQGRIYIADSYNHKIRRIGL 702
>gi|337745906|ref|YP_004640068.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus KNP414]
gi|336297095|gb|AEI40198.1| copper amine oxidase domain protein [Paenibacillus mucilaginosus
KNP414]
Length = 537
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 72/159 (45%), Gaps = 18/159 (11%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G+S+G AR + P AV +G + +AD NH +R + ++T+AG G +K G DG
Sbjct: 302 GFSNGAALQARLNYPMGIAVTEEGGLLIADSQNHAVRYLFGGQLSTLAGAGEQKMGLLDG 361
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK-PEDCSKSSQSGSALGAVSVWV 210
A + ++ + L+ N+L R + PED +G AL V
Sbjct: 362 MEGKAGLNRPADVAVLGDGSVLVADSFNNRLRRLTGYRLPEDLP----AGDALKVV---- 413
Query: 211 LVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEAL 249
L + V+ F A+P I GRL P++ EAL
Sbjct: 414 -------LDAGVMKFDAQPEITE--GRLMVPVRAVTEAL 443
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 55/189 (29%), Positives = 79/189 (41%), Gaps = 21/189 (11%)
Query: 29 EDGYTVTTVIDGH---QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
+ GY V ++DG L P + P S L V D+ A + + V +
Sbjct: 109 DKGYPVGGLLDGAANASLFQEPLGLSAGPDGS-LYVADAGNHAIRRID----AKGNVTTV 163
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
AG G G DG+ +A F +P A G +YVAD H IR+I+ G T S++
Sbjct: 164 AGSGRLGAKDGKGAAAEFYRPGDVAAAPDGTLYVADTLGHTIRRISPQGEVTTLTAPSRR 223
Query: 146 -----EGR-------ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
G+ ADGP A F+ + L +SD GNQ IR I+L
Sbjct: 224 VVEATPGQVAAAGDFADGPLAQAKFNEPTGIALDAK-GNLYVSDSGNQRIRYIDLAKGTV 282
Query: 194 SKSSQSGSA 202
+ + G+A
Sbjct: 283 TTVAGGGTA 291
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 52/111 (46%), Gaps = 8/111 (7%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
AG G G ++G A F P AV G V+D N VIRK+T V +AG +K
Sbjct: 48 AGTGRLGSANGTGLEASFRVPAGLAVLPDGTAAVSDSRNGVIRKLTGGRVDVLAGVFYRK 107
Query: 146 E-------GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ G DG A + F L+ P +L ++D GN IR+I+ K
Sbjct: 108 DDKGYPVGGLLDGAANASLFQEPLGLSAGPD-GSLYVADAGNHAIRRIDAK 157
>gi|379719861|ref|YP_005311992.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus 3016]
gi|378568533|gb|AFC28843.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus 3016]
Length = 535
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 55/189 (29%), Positives = 79/189 (41%), Gaps = 21/189 (11%)
Query: 29 EDGYTVTTVIDGH---QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
+ GY V ++DG L P + P S L V D+ A + + V +
Sbjct: 109 DKGYPVGGLLDGAANASLFQEPLGLSAGPDGS-LYVADAGNHAIRRID----AKGNVTTV 163
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
AG G G DG+ +A F +P A G +YVAD H IR+I+ G T S++
Sbjct: 164 AGSGRLGAKDGKGAAAEFYRPGDVAAAPDGTLYVADTLGHTIRRISPQGEVTTLTAPSRR 223
Query: 146 -----EGR-------ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
G+ ADGP A F+ + L +SD GNQ IR I+L
Sbjct: 224 VVEATPGQVAAAGDFADGPLAQAKFNEPTGIALDAK-GNLYVSDSGNQRIRYIDLAKGTV 282
Query: 194 SKSSQSGSA 202
+ + G+A
Sbjct: 283 TTVAGGGTA 291
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 68/159 (42%), Gaps = 20/159 (12%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G+S+G AR + P AV +G + +AD NH +R + ++T+AG G +K G DG
Sbjct: 302 GFSNGAALQARLNYPMGIAVTEEGGLLIADSQNHDVRYLFGGQLSTLAGAGEQKMGLLDG 361
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK-PEDCSKSSQSGSALGAVSVWV 210
A + ++ + L+ N+L R + PED
Sbjct: 362 MEGKAGLNRPADVAVLGDGSVLVADSFNNRLRRLTGYRLPEDLP---------------- 405
Query: 211 LVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEAL 249
V L + V+ F A+P I GRL P++ EAL
Sbjct: 406 -AGVKVVLDAGVMKFDAQPEITE--GRLMVPVRAITEAL 441
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 52/111 (46%), Gaps = 8/111 (7%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
AG G G ++G A F P AV G V+D N VIRK+T V +AG +K
Sbjct: 48 AGTGRLGSANGTGLEASFRVPAGLAVLPDGTAAVSDSRNGVIRKLTGGRVDVLAGVFYRK 107
Query: 146 E-------GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ G DG A + F L+ P +L ++D GN IR+I+ K
Sbjct: 108 DDKGYPVGGLLDGAANASLFQEPLGLSAGPD-GSLYVADAGNHAIRRIDAK 157
>gi|455790313|gb|EMF42184.1| putative lipoprotein [Leptospira interrogans serovar Lora str. TE
1992]
Length = 294
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKI 130
D+ N++IRKI
Sbjct: 96 DQINNLIRKI 105
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
+ G +G+ ++ FD P +D + N+YV + SNH IRKI N G V+T++GG S G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---G 220
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
DG +A F + + + + +LL +D + IR+I+LK + + S+ G+ +GA
Sbjct: 221 YLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI N V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269
Query: 139 AGGGSKKEGRADGPAQNASF 158
G G + DG NASF
Sbjct: 270 LGNGIGAD--VDGNGTNASF 287
>gi|399157177|ref|ZP_10757244.1| NHL repeat containing protein, partial [SAR324 cluster bacterium
SCGC AAA001-C10]
Length = 352
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 52/108 (48%), Gaps = 6/108 (5%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTI 138
+ V LAG G +D SA+F+ PK D N+YVAD SNH IRKI + GV T
Sbjct: 238 TAVTTLAGQSDNGSTDATGTSAQFNYPKGITTDGT-NLYVADYSNHRIRKIVISTGVVTT 296
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G S +G D +ASF + L ++D N IR+I
Sbjct: 297 LAGSS--QGSTDATGTSASFY--YPSGITTDGTNLYVADRYNHRIRKI 340
>gi|158315873|ref|YP_001508381.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158111278|gb|ABW13475.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 850
Score = 53.9 bits (128), Expect = 7e-05, Method: Composition-based stats.
Identities = 42/109 (38%), Positives = 56/109 (51%), Gaps = 4/109 (3%)
Query: 82 VKRLAGDGVQGYS-DGEP-GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
V +AG G +S DG P GSA + P D GNIYVAD N+ +R+IT G +TTI
Sbjct: 517 VVTVAGTGEAAFSGDGGPAGSAALNGPFGMVADWAGNIYVADFDNNRVRRITADGTITTI 576
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
AG G GPA A + + +LI+D NQ IR+++
Sbjct: 577 AGTGEAGFSGDGGPATQARLRQPAAVA-LDSAGNILIADTFNQRIRRVD 624
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 36/98 (36%), Positives = 56/98 (57%), Gaps = 10/98 (10%)
Query: 47 PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARF 103
P SV +DRPG L + DS + + + V++ +AG G+ GYS DG P + A
Sbjct: 711 PISVAMDRPGR--LYIADSGNNRIRRIGL----DGVIETVAGTGLPGYSGDGGPATRATL 764
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
P+ AVD +G I++ D++N IR++ G +TT+AG
Sbjct: 765 RSPRGVAVDARGAIFITDRTNRRIRRVDPSGIITTVAG 802
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 36/114 (31%), Positives = 56/114 (49%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + +AG G G+S DG P + AR +P + A+D GNI +AD N IR++
Sbjct: 567 ITADGTITTIAGTGEAGFSGDGGPATQARLRQPAAVALDSAGNILIADTFNQRIRRVDPS 626
Query: 134 GVTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G T G + DG PA A+ + P + I+D GN IR++
Sbjct: 627 GTITTVAGKDDRGFSEDGVPATEATLWYPGGVVADP-TGNIYIADSGNNRIRRV 679
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 36/104 (34%), Positives = 53/104 (50%), Gaps = 3/104 (2%)
Query: 86 AGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAGGGS 143
GDG + DG P + A P S A+D G +Y+AD N+ IR+I GV T+AG G
Sbjct: 690 GGDGEGAFGDGGPAADALLAFPISVAMDRPGRLYIADSGNNRIRRIGLDGVIETVAGTGL 749
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
GPA A+ + + V A+ I+D N+ IR+++
Sbjct: 750 PGYSGDGGPATRATLRSPRGVA-VDARGAIFITDRTNRRIRRVD 792
Score = 47.4 bits (111), Expect = 0.007, Method: Composition-based stats.
Identities = 37/110 (33%), Positives = 55/110 (50%), Gaps = 4/110 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ +AG +G+S DG P + A P D GNIY+AD N+ IR++ G + T+
Sbjct: 629 ITTVAGKDDRGFSEDGVPATEATLWYPGGVVADPTGNIYIADSGNNRIRRVGTDGIIQTV 688
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
AGG + GPA +A + + + L I+D GN IR+I L
Sbjct: 689 AGGDGEGAFGDGGPAADALLAFPISVA-MDRPGRLYIADSGNNRIRRIGL 737
>gi|418714767|ref|ZP_13275259.1| putative lipoprotein [Leptospira interrogans str. UI 08452]
gi|410788941|gb|EKR82646.1| putative lipoprotein [Leptospira interrogans str. UI 08452]
Length = 358
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKI 130
D+ N++IRKI
Sbjct: 96 DQINNLIRKI 105
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
+ G +G+ ++ FD P +D + N+YV + SNH IRKI N G V+T++GG S G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---G 220
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
DG +A F + + + + +LL +D + IR+I+LK + + S+ G+ +GA
Sbjct: 221 YLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI N V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269
Query: 139 AGGGSKKEGRADGPAQNASF 158
G G + DG NASF
Sbjct: 270 LGNGIGAD--VDGNGTNASF 287
>gi|345851882|ref|ZP_08804843.1| hypothetical protein SZN_18986 [Streptomyces zinciresistens K42]
gi|345636654|gb|EGX58200.1| hypothetical protein SZN_18986 [Streptomyces zinciresistens K42]
Length = 468
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 63/114 (55%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + +AG G G+ DG P +A+ + P+ AVD G +Y+AD +NH IRK+T
Sbjct: 28 ITTDGRITTVAGTGAAGFRGDGGPAVAAQLNGPREVAVDGAGAVYIADSNNHRIRKVTPD 87
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T+AG G+ GPA A + +T V L ++D+ N IR+I
Sbjct: 88 GKISTVAGTGAGGFRGDGGPATAAQLNLPLGVT-VDGAGVLHVADYYNHRIRRI 140
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 46/85 (54%), Gaps = 2/85 (2%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
++P AVD G +YVAD S H +R+IT G +TT+AG G+ GPA A +
Sbjct: 1 MNRPYGIAVDSTGTVYVADFSGHRVRRITTDGRITTVAGTGAAGFRGDGGPAVAAQLNGP 60
Query: 162 FELTFVPHICALLISDHGNQLIRQI 186
E+ V A+ I+D N IR++
Sbjct: 61 REVA-VDGAGAVYIADSNNHRIRKV 84
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 46/94 (48%), Gaps = 14/94 (14%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
+A+ + P VD G ++VAD NH IR+IT GV +T+AG G+ GPA A
Sbjct: 110 AAQLNLPLGVTVDGAGVLHVADYYNHRIRRITADGVISTVAGTGAAGFRGDGGPAATAQL 169
Query: 159 SNDFELTFVPHICA------LLISDHGNQLIRQI 186
+ PH A L I+D N +R++
Sbjct: 170 NG-------PHGVALNAAGDLCIADLQNHRVRKV 196
>gi|418707100|ref|ZP_13267936.1| putative lipoprotein [Leptospira interrogans serovar Hebdomadis
str. R499]
gi|410763306|gb|EKR34037.1| putative lipoprotein [Leptospira interrogans serovar Hebdomadis
str. R499]
Length = 358
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKI 130
D+ N++IRKI
Sbjct: 96 DQINNLIRKI 105
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
+ G +G+ ++ FD P +D + N+YV + SNH IRKI N G V+T++GG S G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---G 220
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
DG +A F + + + + +LL +D + IR+I+LK + + S+ G+ +GA
Sbjct: 221 YLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI N V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269
Query: 139 AGGGSKKEGRADGPAQNASF 158
G G + DG NASF
Sbjct: 270 LGNGIGAD--VDGNGTNASF 287
>gi|417772389|ref|ZP_12420278.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Pomona]
gi|418684105|ref|ZP_13245295.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Kennewicki LC82-25]
gi|400324210|gb|EJO76509.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Kennewicki LC82-25]
gi|409945760|gb|EKN95775.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Pomona]
gi|455670390|gb|EMF35375.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Fox 32256]
Length = 358
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKI 130
D+ N++IRKI
Sbjct: 96 DQINNLIRKI 105
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 57/100 (57%), Gaps = 5/100 (5%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEGRA 149
G +G+ ++ FD P +D + N+YV + SNH IRKI N G V+T++GG S G
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---GYL 222
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
DG +A F + + + + +LL +D + IR+I+LK
Sbjct: 223 DGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK 262
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI N V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269
Query: 139 AGGGSKKEGRADGPAQNASF 158
G G + DG NASF
Sbjct: 270 LGNGIGTD--VDGNGTNASF 287
>gi|24215844|ref|NP_713325.1| hypothetical protein LA_3145 [Leptospira interrogans serovar Lai
str. 56601]
gi|45656856|ref|YP_000942.1| hypothetical protein LIC10968 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|386074980|ref|YP_005989298.1| hypothetical protein LIF_A2535 [Leptospira interrogans serovar Lai
str. IPAV]
gi|417765006|ref|ZP_12412972.1| putative lipoprotein [Leptospira interrogans serovar Bulgarica str.
Mallika]
gi|418666993|ref|ZP_13228411.1| putative lipoprotein [Leptospira interrogans serovar Pyrogenes str.
2006006960]
gi|418699338|ref|ZP_13260303.1| putative lipoprotein [Leptospira interrogans serovar Bataviae str.
L1111]
gi|418723579|ref|ZP_13282417.1| putative lipoprotein [Leptospira interrogans str. UI 12621]
gi|418727930|ref|ZP_13286513.1| putative lipoprotein [Leptospira interrogans str. UI 12758]
gi|421083673|ref|ZP_15544545.1| putative lipoprotein [Leptospira santarosai str. HAI1594]
gi|421102269|ref|ZP_15562874.1| putative lipoprotein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|421122244|ref|ZP_15582528.1| putative lipoprotein [Leptospira interrogans str. Brem 329]
gi|24197042|gb|AAN50343.1| hypothetical protein LA_3145 [Leptospira interrogans serovar Lai
str. 56601]
gi|45600092|gb|AAS69579.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|353458770|gb|AER03315.1| hypothetical protein LIF_A2535 [Leptospira interrogans serovar Lai
str. IPAV]
gi|400352606|gb|EJP04785.1| putative lipoprotein [Leptospira interrogans serovar Bulgarica str.
Mallika]
gi|409962948|gb|EKO26678.1| putative lipoprotein [Leptospira interrogans str. UI 12621]
gi|410344796|gb|EKO95953.1| putative lipoprotein [Leptospira interrogans str. Brem 329]
gi|410367934|gb|EKP23317.1| putative lipoprotein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|410433778|gb|EKP78117.1| putative lipoprotein [Leptospira santarosai str. HAI1594]
gi|410757315|gb|EKR18927.1| putative lipoprotein [Leptospira interrogans serovar Pyrogenes str.
2006006960]
gi|410761708|gb|EKR27881.1| putative lipoprotein [Leptospira interrogans serovar Bataviae str.
L1111]
gi|410777322|gb|EKR57287.1| putative lipoprotein [Leptospira interrogans str. UI 12758]
gi|456822102|gb|EMF70597.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
LT1962]
Length = 358
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKI 130
D+ N++IRKI
Sbjct: 96 DQINNLIRKI 105
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
+ G +G+ ++ FD P +D + N+YV + SNH IRKI N G V+T++GG S G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---G 220
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
DG +A F + + + + +LL +D + IR+I+LK + + S+ G+ +GA
Sbjct: 221 YLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI N V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269
Query: 139 AGGGSKKEGRADGPAQNASF 158
G G + DG NASF
Sbjct: 270 LGNGIGAD--VDGNGTNASF 287
>gi|290975268|ref|XP_002670365.1| predicted protein [Naegleria gruberi]
gi|284083923|gb|EFC37621.1| predicted protein [Naegleria gruberi]
Length = 1034
Score = 53.9 bits (128), Expect = 7e-05, Method: Composition-based stats.
Identities = 39/105 (37%), Positives = 58/105 (55%), Gaps = 13/105 (12%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVD 112
++D++ +D+ R LS + +V LAG G++ DG P SA+ + P ++
Sbjct: 15 ANNDILFIDNDR--LRKLS-----QGIVSTLAGTKEAGFNGDGIPAVSAQLNSPLGVSMS 67
Query: 113 MKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNA 156
G IY+ADK NH IRKI G +TTIAG G E + PA++A
Sbjct: 68 KGGEIYIADKLNHRIRKIDIFGKITTIAGTGQAGESQ---PAESA 109
Score = 43.9 bits (102), Expect = 0.070, Method: Composition-based stats.
Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 5/109 (4%)
Query: 80 SVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
S++ LAG G GY DG + A+ + P++ + G + ++D NH IR I + + T
Sbjct: 143 SLIVTLAGTGEPGYGGDGSLATQAQLNGPEAVILSGTGAVIISDTYNHAIRTILDNNLIT 202
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
IAG G +DG + + LT P+ ALLI+D N I+ +
Sbjct: 203 IAGTLGTN-GDSDGSLETKLYY-PMGLTITPYQ-ALLIADSWNDKIKML 248
>gi|418707795|ref|ZP_13268612.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. UI 08368]
gi|410771897|gb|EKR47094.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. UI 08368]
gi|456968671|gb|EMG09839.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. LT2186]
Length = 358
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKI 130
D+ N++IRKI
Sbjct: 96 DQINNLIRKI 105
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 62/115 (53%), Gaps = 5/115 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRAD 150
G +G+ ++ FD P +D + N+YV + SNH IRKI N G +I GG G D
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSILSGG--ISGYLD 223
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G +A F + + + + +LL +D + IR+I+LK + + S+ G+ +GA
Sbjct: 224 GDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 39/73 (53%), Gaps = 5/73 (6%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKK 145
G+ GY DG+ SARF P A + K N + AD +H IRKI N V+T+ G G
Sbjct: 217 GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTLLGNGIGA 276
Query: 146 EGRADGPAQNASF 158
+ DG NASF
Sbjct: 277 D--VDGNGTNASF 287
>gi|417763451|ref|ZP_12411429.1| putative lipoprotein [Leptospira interrogans str. 2002000624]
gi|417776933|ref|ZP_12424763.1| putative lipoprotein [Leptospira interrogans str. 2002000621]
gi|418671130|ref|ZP_13232484.1| putative lipoprotein [Leptospira interrogans str. 2002000623]
gi|409940657|gb|EKN86296.1| putative lipoprotein [Leptospira interrogans str. 2002000624]
gi|410573292|gb|EKQ36344.1| putative lipoprotein [Leptospira interrogans str. 2002000621]
gi|410581799|gb|EKQ49606.1| putative lipoprotein [Leptospira interrogans str. 2002000623]
Length = 358
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKI 130
D+ N++IRKI
Sbjct: 96 DQINNLIRKI 105
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
+ G +G+ ++ FD P +D + N+YV + +NH IRKI N G V+T++GG S G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELNNHTIRKINLNSGTVSTLSGGIS---G 220
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
DG +A F + + + + +LL +D + IR+I+LK + + S+ G+ +GA
Sbjct: 221 YLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI N V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269
Query: 139 AGGGSKKEGRADGPAQNASF 158
G G + DG NASF
Sbjct: 270 LGNGIGAD--VDGNGTNASF 287
>gi|417783289|ref|ZP_12431009.1| putative lipoprotein [Leptospira interrogans str. C10069]
gi|409953418|gb|EKO07917.1| putative lipoprotein [Leptospira interrogans str. C10069]
Length = 358
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKI 130
D+ N++IRKI
Sbjct: 96 DQINNLIRKI 105
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
+ G +G+ ++ FD P +D + N+YV + SNH IRKI N G V+T++GG S G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGMVSTLSGGIS---G 220
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
DG +A F + + + + +LL +D + IR+I+LK + + S+ G+ +GA
Sbjct: 221 YLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 44/81 (54%), Gaps = 6/81 (7%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTT 137
+V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI N V+T
Sbjct: 210 MVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVST 268
Query: 138 IAGGGSKKEGRADGPAQNASF 158
+ G G + DG NASF
Sbjct: 269 LLGNGIGAD--VDGNGTNASF 287
>gi|430744164|ref|YP_007203293.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
gi|430015884|gb|AGA27598.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
Length = 358
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 59/117 (50%), Gaps = 6/117 (5%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
+ ++ +AG G +G+S G+ G A+ D+P +D +GN+Y AD+ N +R++ +
Sbjct: 72 GKSGIITTVAGSGTKGFS-GDGGVALKAKLDEPYGIVLDSRGNLYFADRLNRRVRRVDAD 130
Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G +TTIAG GSK GP A + L I+D + +R ++L
Sbjct: 131 SGMITTIAGDGSKTYSGDGGPGARAGLVEPNGVALDSQEARLFIADVADHRVRVVDL 187
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 58/115 (50%), Gaps = 7/115 (6%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI---TN 132
+ ++ AG G +GYS DG P +A F+ PK FA+D G++ + D N IR+I T
Sbjct: 245 QTGLITTRAGTGAKGYSGDGGPATAATFNGPKEFAIDRAGDLLIVDTENQAIRRIDARTG 304
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
L + T+AG G + G A +A + P A+ I D GN IR++
Sbjct: 305 L-IRTLAGNGQRGGEGDGGAATSALLDRPHGVAVGPD-GAVYIGDTGNHRIRKVG 357
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 43/89 (48%), Gaps = 3/89 (3%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASF 158
A + P A D +GN+Y++D NH IR++ +TT+AG G+K G A A
Sbjct: 41 ATLNMPFDVAFDSRGNLYLSDTMNHCIRRVDGKSGIITTVAGSGTKGFSGDGGVALKAKL 100
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
+ + L +D N+ +R+++
Sbjct: 101 DEPYGIVLDSR-GNLYFADRLNRRVRRVD 128
>gi|75909875|ref|YP_324171.1| NHL repeat-containing protein [Anabaena variabilis ATCC 29413]
gi|75703600|gb|ABA23276.1| NHL repeat protein [Anabaena variabilis ATCC 29413]
Length = 503
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 76/175 (43%), Gaps = 14/175 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ PG L V DS F + L G+G G +DG A+F P
Sbjct: 175 PGKVLATPGG--LFVADSGHHRIVVSDF----NGEILHLIGNGKSGLTDGNFQEAQFSAP 228
Query: 107 KSFAVDMKGNI-YVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRA--DGPAQNASFSND 161
+ A DM+ I YVAD NHV+R+ I V TIAG G + G + ++
Sbjct: 229 QGMAFDMENQILYVADTDNHVVRRADIQQQTVETIAGTGEQSRNIQPHGGAGLETALNSP 288
Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLS 216
++L V + +L I+ G I Q++L P K+ A G ++ SV +
Sbjct: 289 WDLVKVGN--SLYIAMAGTHQIWQMDL-PSGFVKTYAGTGAEGCFDGYLTESVFA 340
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 35/62 (56%), Gaps = 3/62 (4%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
+ + GDG G +G+ + RF +P + M +Y++D +NHVIR++ L VTT+
Sbjct: 431 QTVLGDGSAGLQNGQGKNTRFFEPSGLSA-MDSYLYISDTNNHVIRRVDLRTLEVTTMQF 489
Query: 141 GG 142
G
Sbjct: 490 NG 491
>gi|298245110|ref|ZP_06968916.1| NHL repeat containing protein [Ktedonobacter racemifer DSM 44963]
gi|297552591|gb|EFH86456.1| NHL repeat containing protein [Ktedonobacter racemifer DSM 44963]
Length = 496
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 76/154 (49%), Gaps = 22/154 (14%)
Query: 27 LLEDGYTVTTVIDGHQLEIN----PHSVIDRPG-------SSDLIVLDSSRSAFYTLSFP 75
+LE+ + ++D QL + P S + PG + LI+ DS+ +
Sbjct: 148 MLEE-FEARGLLDHRQLRFSAATAPSSFLAFPGKLAVDAQADRLIISDSAHHRLVETNL- 205
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNL 133
+ V+ + G GVQG +DG A+F+ P+ A+ + +YVAD NH+IR++
Sbjct: 206 ---QGRVRTIIGSGVQGQADGSFAEAQFNHPQGVAL-VNDLLYVADTDNHLIRRVDLRTK 261
Query: 134 GVTTIAGGGSKK---EGRADGPAQNASFSNDFEL 164
V T+AG G + R GPA++ + S+ ++L
Sbjct: 262 QVETLAGTGEQNGMVRTRLQGPARSIALSSPWDL 295
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 3/48 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR 128
V+ LAG G G +G A+F++P A+ GN +Y+AD +NH IR
Sbjct: 435 VRTLAGTGEAGIHNGPAEQAQFNEPGDLAI--TGNTLYIADTNNHAIR 480
>gi|298388124|ref|ZP_06997668.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|298259116|gb|EFI01996.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 435
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 52/94 (55%), Gaps = 2/94 (2%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP 152
+++G ARF++P D GN+YV+D+ N+ IRKIT G TI G++ EG +G
Sbjct: 341 FANGIGEEARFNQPCQGVFDEDGNLYVSDRDNNCIRKITPDGNVTIY-AGNRSEGLVNGL 399
Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+SF LT + ++DH N +IR+I
Sbjct: 400 PLKSSFRRPEGLTRSKD-GVIYVADHDNHVIRKI 432
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 26/46 (56%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
AG+ +G +G P + F +P+ G IYVAD NHVIRKI
Sbjct: 387 YAGNRSEGLVNGLPLKSSFRRPEGLTRSKDGVIYVADHDNHVIRKI 432
>gi|290996889|ref|XP_002681014.1| predicted protein [Naegleria gruberi]
gi|284094637|gb|EFC48270.1| predicted protein [Naegleria gruberi]
Length = 1023
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 60/117 (51%), Gaps = 6/117 (5%)
Query: 72 LSFPLSEESVVKRLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S +++ES V +AG GV GY + A P S + G+IY+AD SNH +R++
Sbjct: 11 VSTTVAQESFVSTIAGGGVCDGYLATQ---ASLAYPGSPTIGPDGSIYIADSSNHRVRQV 67
Query: 131 T-NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
N +TTIAG G PA A N + V I + ISD+GN IR++
Sbjct: 68 YPNGTITTIAGTGISGYNGDVIPATRAQLKNPVSVA-VNSIGEVFISDNGNNRIRKV 123
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 56/111 (50%), Gaps = 4/111 (3%)
Query: 79 ESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV 135
+ +AG G+ GY+ D P + A+ P S AV+ G ++++D N+ IRK+ TN +
Sbjct: 70 NGTITTIAGTGISGYNGDVIPATRAQLKNPVSVAVNSIGEVFISDNGNNRIRKVLTNGTI 129
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T AG G G A NA + + + I L+ISD + IR++
Sbjct: 130 ITFAGSGQTTFSGDYGLATNAGINYPYGIALT-SIEELIISDVNHNRIRKV 179
Score = 40.8 bits (94), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 28/42 (66%), Gaps = 2/42 (4%)
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSK 144
KPK +D GNIY+AD +N+ +RKI+ L + TIAG GS
Sbjct: 356 KPKGAVIDSLGNIYIADTNNNRVRKISYLDGTINTIAGTGSN 397
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 51/109 (46%), Gaps = 13/109 (11%)
Query: 85 LAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGG 142
+AG G G++ D A+ P S +D N+Y+AD NH IRKI N +TTI G G
Sbjct: 427 VAGVGSSGFNGDILATDAKLSNPVSVTIDSNDNVYIADTYNHRIRKILQNGNLTTIVGLG 486
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQI 186
S G + SN +L + I + I+D N IR++
Sbjct: 487 SS------GFNGDYLLSNGTKLNYPQSIAFDSNGNMYIADMNNNRIRKM 529
Score = 40.4 bits (93), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 29/42 (69%), Gaps = 1/42 (2%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRK-ITNLGVTTIAGGG 142
+ + P+S A D GN+Y+AD +N+ IRK +TN + T+AG G
Sbjct: 501 KLNYPQSIAFDSNGNMYIADMNNNRIRKMLTNGTIITVAGTG 542
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 7/48 (14%)
Query: 96 GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
G PG DK G++Y ADK +H IRKI+N +T++AG GS
Sbjct: 299 GNPGQMFIDKNS-------GDVYFADKGSHRIRKISNGFITSVAGSGS 339
>gi|424667903|ref|ZP_18104928.1| hypothetical protein A1OC_01487 [Stenotrophomonas maltophilia
Ab55555]
gi|401068165|gb|EJP76689.1| hypothetical protein A1OC_01487 [Stenotrophomonas maltophilia
Ab55555]
Length = 470
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 66/147 (44%), Gaps = 11/147 (7%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S+ L + S+ R AG G DG A F +P + AV ++ +YVAD IR +
Sbjct: 301 SYHLGQRSLQWR-AGSGAIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358
Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
G V T+ G G + G DGP AS + P L I+D GN +R + L
Sbjct: 359 RGDLVQTLVGQGPWRFGDEDGPRAQASLQFPQAIALSPDAPLLWIADTGNGRLRTLRLGG 418
Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
+P +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLHGPAGLAVGAGAVWI 445
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 12/103 (11%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
+ R G G + DG A F +P++ ++ + +YVAD NH +R+I L G+
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDALYVADTGNHAVRRINLLTGIVDTLC 255
Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
G + +GP AQ S D H L I+D NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPVGLAIAD--NQL 289
>gi|283779991|ref|YP_003370746.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
gi|283438444|gb|ADB16886.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
Length = 364
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 37/83 (44%), Positives = 49/83 (59%), Gaps = 9/83 (10%)
Query: 77 SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI---T 131
++ ++ +AG G +G++ DG P A F P S A+D NIYV D NH IRKI T
Sbjct: 135 AKTGIISTIAGSGKEGFAGDGGPAKEAVFSIPHSIALDADDNIYVCDLGNHRIRKIDAKT 194
Query: 132 NLGVTTIAGGGSK---KEGRADG 151
L ++TIAG G K KEG+ DG
Sbjct: 195 GL-ISTIAGNGGKTLPKEGKIDG 216
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 61/118 (51%), Gaps = 23/118 (19%)
Query: 85 LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIA 139
+AG G GY+ DG P + A ++P D GN+ + N VIRK+ T + ++TIA
Sbjct: 86 VAGSGKMGYAGDGGPATEALLNEPYEVRFDSHGNMIFVEMKNFVIRKVDAKTGI-ISTIA 144
Query: 140 GGGSKKEGRA--DGPAQNASFSNDFELTFVPHICAL------LISDHGNQLIRQINLK 189
G G KEG A GPA+ A FS +PH AL + D GN IR+I+ K
Sbjct: 145 GSG--KEGFAGDGGPAKEAVFS-------IPHSIALDADDNIYVCDLGNHRIRKIDAK 193
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 49/96 (51%), Gaps = 9/96 (9%)
Query: 38 IDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
IDG + P +V ID+ + I L S + L+ SE + +AG G +G+S D
Sbjct: 214 IDGTMSLLGPRAVCIDK--NVMWIALREGHS-VWKLNLATSE---LTHIAGTGKKGFSGD 267
Query: 96 GEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G P A FD PK AV G + V D NHVIRKI
Sbjct: 268 GGPAKDATFDGPKGVAVCPDGGVVVVDTENHVIRKI 303
Score = 37.0 bits (84), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 58/134 (43%), Gaps = 6/134 (4%)
Query: 71 TLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
TL+ L E + +AG G G + G P + G +Y+ + NH +R
Sbjct: 16 TLALTL-EAGTIDTVAGTGKPADGPASGLGVETNVGDPFGVEIGPDGALYITEVRNHRVR 74
Query: 129 KI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
++ +TT+AG G GPA A + +E+ F H ++ + N +IR++
Sbjct: 75 RLDLKTGAMTTVAGSGKMGYAGDGGPATEALLNEPYEVRFDSH-GNMIFVEMKNFVIRKV 133
Query: 187 NLKPEDCSKSSQSG 200
+ K S + SG
Sbjct: 134 DAKTGIISTIAGSG 147
>gi|190573634|ref|YP_001971479.1| hypothetical protein Smlt1644 [Stenotrophomonas maltophilia K279a]
gi|190011556|emb|CAQ45175.1| conserved hypothetical protein [Stenotrophomonas maltophilia K279a]
Length = 470
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 66/147 (44%), Gaps = 11/147 (7%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S+ L + S+ R AG G DG A F +P + AV ++ +YVAD IR +
Sbjct: 301 SYHLGQRSLQWR-AGSGAIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358
Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
G V T+ G G + G DGP AS + P L I+D GN +R + L
Sbjct: 359 RGDLVQTLVGQGPWRFGDEDGPRAQASLQFPQAIALSPDAPLLWIADTGNGRLRTLRLGG 418
Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
+P +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLHGPAGLAVGAGAVWI 445
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 12/103 (11%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
+ R G G + DG A F +P++ ++ + +YVAD NH +R+I L G+
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDALYVADTGNHAVRRINLLTGIVDTLC 255
Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
G + +GP AQ S D H L I+D NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPVGLAIAD--NQL 289
>gi|284038675|ref|YP_003388605.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
gi|283817968|gb|ADB39806.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
Length = 359
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 46/97 (47%), Gaps = 5/97 (5%)
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
F ++ + + AG G GY DG A+F P A D GN+YVAD N IRKI+
Sbjct: 73 FKVTPQGNLSLYAGSGGTGYQDGSLDKAKFLWPYGLAFDRAGNLYVADSGNQAIRKISPD 132
Query: 134 G-VTTIAGG----GSKKEGRADGPAQNASFSNDFELT 165
G VTT AG S DG + A F N LT
Sbjct: 133 GQVTTFAGQPYDVTSITNVSVDGIGKEARFYNPLVLT 169
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 4/96 (4%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPA 153
+G SAR+ S D N+Y+ + ++RKI G V+ + GS+ G DGP
Sbjct: 267 NGPINSARYGLITSLRFDSNDNLYIGETGAGIVRKIATDGQVSDVT--GSRLGGYKDGPL 324
Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
Q A F + +L F P +L ++D+ N IR+I +
Sbjct: 325 QAAEFGSVEDLAFSPS-GSLYVADNRNGAIRKITFE 359
Score = 38.1 bits (87), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 9/60 (15%)
Query: 81 VVKRLAGDG---------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+V+++A DG + GY DG +A F + A G++YVAD N IRKIT
Sbjct: 298 IVRKIATDGQVSDVTGSRLGGYKDGPLQAAEFGSVEDLAFSPSGSLYVADNRNGAIRKIT 357
Score = 37.4 bits (85), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 3/82 (3%)
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
+P S A D N+Y AD + + ++ AG G G DG A F + L
Sbjct: 51 RPVSVAFDKANNLYFADGIARIFKVTPQGNLSLYAGSGG--TGYQDGSLDKAKFLWPYGL 108
Query: 165 TFVPHICALLISDHGNQLIRQI 186
F L ++D GNQ IR+I
Sbjct: 109 AF-DRAGNLYVADSGNQAIRKI 129
>gi|290973047|ref|XP_002669261.1| predicted protein [Naegleria gruberi]
gi|284082806|gb|EFC36517.1| predicted protein [Naegleria gruberi]
Length = 618
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 48/87 (55%), Gaps = 3/87 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G +A+ P S V + +Y+AD+ NH IRKI N
Sbjct: 148 ENGNIITIAGNGTDGFSGDNGPATNAQLYYPSSVFVSLTNEVYIADQHNHRIRKILENGN 207
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND 161
+ TIAG G+ +GPA NA +++
Sbjct: 208 IITIAGNGTYGFSGDNGPATNAQLNHN 234
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 42/87 (48%), Gaps = 2/87 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
A+ P V +Y+AD NH +RKI N + TIAG G+ +GPA NA F
Sbjct: 8 AQLFNPFGVFVSSNNEVYIADFCNHAVRKILENGNIVTIAGNGTAGFSGDNGPATNAQF- 66
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
N FV + I+D N IR+I
Sbjct: 67 NYPSSVFVSSNNEVCIADLHNHRIRKI 93
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 46/95 (48%), Gaps = 9/95 (9%)
Query: 61 VLDSSRSAFYTLSF------PLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVD 112
V SS + Y F + E + +AG+G G+S +G +A+F+ P S V
Sbjct: 16 VFVSSNNEVYIADFCNHAVRKILENGNIVTIAGNGTAGFSGDNGPATNAQFNYPSSVFVS 75
Query: 113 MKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKE 146
+ +AD NH IRKI G + TIAG G+ +
Sbjct: 76 SNNEVCIADLHNHRIRKILENGSIITIAGNGTHNQ 110
>gi|429737840|ref|ZP_19271682.1| IPT/TIG domain protein [Prevotella saccharolytica F0055]
gi|429161715|gb|EKY04093.1| IPT/TIG domain protein [Prevotella saccharolytica F0055]
Length = 438
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 54/110 (49%), Gaps = 2/110 (1%)
Query: 80 SVVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
+ KRL+ + G + DG A+F P D GNIYVAD+ NH IR+IT +
Sbjct: 329 TTYKRLSSSNISGGHRDGPLSEAQFRNPSQIFCDNDGNIYVADRGNHCIRRITPENMVET 388
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G +G DG ++A F N+ + ++ ++D N +R++ +
Sbjct: 389 VLGMPGTKGWKDGKKEDALF-NEPTGIGIAQDGSVYVADFKNGRVRKLTI 437
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ E++V+ + G G +G+ DG+ A F++P + G++YVAD N +RK+T
Sbjct: 380 ITPENMVETVLGMPGTKGWKDGKKEDALFNEPTGIGIAQDGSVYVADFKNGRVRKLT 436
>gi|456733989|gb|EMF58811.1| Hypothetical protein EPM1_4079 [Stenotrophomonas maltophilia EPM1]
Length = 470
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 66/147 (44%), Gaps = 11/147 (7%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S+ L + S+ R AG G DG A F +P + AV ++ +YVAD IR +
Sbjct: 301 SYHLGQRSLQWR-AGSGAIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358
Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
G V T+ G G + G DGP AS + P L I+D GN +R + L
Sbjct: 359 RGDLVQTLVGQGPWRFGDEDGPRAQASLQFPQAIALSPDAPLLWIADTGNGRLRTLRLGG 418
Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
+P +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLHGPAGLAVGAGAVWI 445
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 12/103 (11%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
+ R G G + DG A F +P++ ++ + +YVAD NH +R+I L G+
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDALYVADTGNHAVRRINLLTGIVDTLC 255
Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
G + +GP AQ S D H L I+D NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPVGLAIAD--NQL 289
>gi|29830929|ref|NP_825563.1| hypothetical protein SAV_4386 [Streptomyces avermitilis MA-4680]
gi|29608042|dbj|BAC72098.1| hypothetical protein SAV_4386 [Streptomyces avermitilis MA-4680]
Length = 341
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 63/114 (55%), Gaps = 7/114 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S + +V D++R L+ ESVV+R+ G G++G++DG +A F +P+ + +
Sbjct: 189 SGNFLVSDTTRHQLVELA--ADGESVVRRI-GSGIRGFADGPADAASFSEPQGMTLLDED 245
Query: 116 NIYVADKSNHVIRK--ITNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELT 165
+ VAD NH +R+ + VTT+AG G + + A GPA+ S+ +++
Sbjct: 246 VVVVADTVNHALRRLDLATGEVTTLAGTGRQWWQGSPASGPAREVDLSSPWDVA 299
Score = 41.6 bits (96), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 55/114 (48%), Gaps = 3/114 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
GDG + EP RF P + GN V+D + H + ++ G + + GS
Sbjct: 164 GDGPYVAPEPEPTVLRF--PGKALLLPSGNFLVSDTTRHQLVELAADGESVVRRIGSGIR 221
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
G ADGPA ASFS +T + ++++D N +R+++L + + + +G
Sbjct: 222 GFADGPADAASFSEPQGMTLLDEDV-VVVADTVNHALRRLDLATGEVTTLAGTG 274
>gi|292491957|ref|YP_003527396.1| hypothetical protein Nhal_1896 [Nitrosococcus halophilus Nc4]
gi|291580552|gb|ADE15009.1| YD repeat protein [Nitrosococcus halophilus Nc4]
Length = 2539
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 62/113 (54%), Gaps = 3/113 (2%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+ + VV AG+G+ G+S G+ + +P + A+D GN+Y+A+ N+VIR++ G
Sbjct: 1294 IGTDGVVSIAAGNGIAGFSGTGDLATQAMMRPFAIALDSDGNLYIAESQNNVIRRVGPDG 1353
Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+T +AG G++ G A A ++ +L P +L I+D+ N IR +
Sbjct: 1354 LITLVAGNGTRGFSGDGGLATQARLNHPNDLAIGPD-GSLYIADYDNHRIRLV 1405
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 73/141 (51%), Gaps = 17/141 (12%)
Query: 77 SEESVVKRL---------AGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSN 124
S+ +V++R+ AG+G +G+S G+ G AR + P A+ G++Y+AD N
Sbjct: 1341 SQNNVIRRVGPDGLITLVAGNGTRGFS-GDGGLATQARLNHPNDLAIGPDGSLYIADYDN 1399
Query: 125 HVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
H IR ++ G +TTIAG GS+ G GPA A N +T V +L I+D N I
Sbjct: 1400 HRIRLVSPNGIITTIAGNGSRGYGGDGGPATQAMLRNPQGIT-VASDSSLYIADRRNHRI 1458
Query: 184 RQINLKPEDCSKSSQSGSALG 204
R+++ PE + LG
Sbjct: 1459 RKVS--PEGIITTVAGNGILG 1477
Score = 41.2 bits (95), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 4/58 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S E ++ +AG+G+ GY DG+ G A+ + P + A+ N+Y+AD NH IR++
Sbjct: 1461 VSPEGIITTVAGNGILGY-DGDGGISTGAKLNLPIAVALSPNENLYIADYYNHRIRRV 1517
>gi|290991466|ref|XP_002678356.1| predicted protein [Naegleria gruberi]
gi|284091968|gb|EFC45612.1| predicted protein [Naegleria gruberi]
Length = 1017
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 53/94 (56%), Gaps = 4/94 (4%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG--GGSKKEGRADGPAQN 155
+A + P AV G++Y+ADKSNHVIRK++ L +TTIAG G ++ ++ A N
Sbjct: 108 NAELNFPSGVAVHSNGDVYIADKSNHVIRKVSALNGKITTIAGIAGETELNKYSNSLATN 167
Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ ++ L ++ISD N +IR++ L
Sbjct: 168 TTLNSPQYLAVNSSTAEVIISDTNNNVIRKVYLN 201
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 65/129 (50%), Gaps = 17/129 (13%)
Query: 68 AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSN 124
A Y SF S + + +AG G G G+ G SAR KP S ++ + N+Y+ D N
Sbjct: 447 AMYLDSFLGSLATKITVVAGTGKLGGYAGDGGLATSARIQKPTSVVLNDQ-NLYIVDTLN 505
Query: 125 HVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISD 177
H IRK+ T +TTIAG G+ G + + + +L + H+ + ISD
Sbjct: 506 HRIRKVSLTFGNITTIAGIGTA------GFSGDGGLATKAKLNYPTHMAISASGEIFISD 559
Query: 178 HGNQLIRQI 186
+GNQ IR+I
Sbjct: 560 NGNQRIRKI 568
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 59/119 (49%), Gaps = 8/119 (6%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--T 131
++ + +AG+G+ G+S +G A F+ AV G++YVAD NH IRKI +
Sbjct: 568 IATNGKISTIAGNGIVGFSGDNGLATKATFNSRNGIAVASNGDVYVADTRNHRIRKISVS 627
Query: 132 NLGVTTIAGGGS----KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
N ++T AG GS G G A +A + + + + I+D GN IR++
Sbjct: 628 NGFISTFAGNGSVAYQATFGGDGGLAVSAKLNLPYSVAINNATNEVYITDSGNHRIRKV 686
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 57/117 (48%), Gaps = 9/117 (7%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
+S + +AG G G S G SA+ + P S A+ + I + D NH IRK+ +N
Sbjct: 255 ISTNGKITTIAGTGTAGLSGDGGLATSAKLNYPNSVALGLNNEILIVDTLNHRIRKLFSN 314
Query: 133 LGVTTIAGGGSKKEGRAD-GPAQNA--SFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ +IAG G+ + D G A NA + ND +T ISD GN IR++
Sbjct: 315 GTIISIAGNGTTQGFSGDGGNALNALLNLPNDVVMTLNGE---YFISDFGNHRIRKV 368
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 8/102 (7%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ--GYSDGEPGSAR 102
+N +S P S + I+ DS+ + L +S ++ +AG G G + A+
Sbjct: 820 LNSNSFTTTP-SGEFIIADSNNN----LIRKISTSGIISTIAGTGAATFGGDNANATIAK 874
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGS 143
+ P + AV G I++AD +NH IRKI N +TTIAG G+
Sbjct: 875 LNNPLNVAVSSSGEIFIADTNNHRIRKIFLNGTITTIAGNGT 916
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 59/123 (47%), Gaps = 4/123 (3%)
Query: 82 VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
+ +AG G G+S G A+ + P A+ G I+++D N IRKI TN ++TI
Sbjct: 518 ITTIAGIGTAGFSGDGGLATKAKLNYPTHMAISASGEIFISDNGNQRIRKIATNGKISTI 577
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
AG G +G A A+F N V + ++D N IR+I++ S +
Sbjct: 578 AGNGIVGFSGDNGLATKATF-NSRNGIAVASNGDVYVADTRNHRIRKISVSNGFISTFAG 636
Query: 199 SGS 201
+GS
Sbjct: 637 NGS 639
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 7/70 (10%)
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
+Y AD N +RKI+N +TTIAG G DG A +++ N T P +I+
Sbjct: 784 VYFADSLNSRVRKISNGKITTIAG------GIGDGLAATSAYLNSNSFTTTPS-GEFIIA 836
Query: 177 DHGNQLIRQI 186
D N LIR+I
Sbjct: 837 DSNNNLIRKI 846
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 56/125 (44%), Gaps = 26/125 (20%)
Query: 79 ESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
+ +AG+G GYS DG + + + P + AV G I++ D NH IRKI G +
Sbjct: 905 NGTITTIAGNGTAGYSGDGLDSTKCQLNYPSAVAVSSGGEIFIVDTHNHRIRKIAIDGII 964
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFEL-----------TFVPHICALLISDHGNQLIR 184
+TIAG G A F+ D +L + IS+ GN+ IR
Sbjct: 965 STIAGNGI------------AGFNGDGKLPINTQLNYPTGIVIASSGEAYISEEGNRRIR 1012
Query: 185 QINLK 189
+I L+
Sbjct: 1013 KIYLQ 1017
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 2/99 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRAD 150
G DG ++ + SF G +AD +N++IRKI+ G ++TIAG G+ G +
Sbjct: 808 GIGDGLAATSAYLNSNSFTTTPSGEFIIADSNNNLIRKISTSGIISTIAGTGAATFGGDN 867
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
A A +N + V + I+D N IR+I L
Sbjct: 868 ANATIAKLNNPLNVA-VSSSGEIFIADTNNHRIRKIFLN 905
>gi|456988165|gb|EMG23306.1| hypothetical protein LEP1GSC150_2908 [Leptospira interrogans
serovar Copenhageni str. LT2050]
Length = 231
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKI 130
D+ N++IRKI
Sbjct: 96 DQINNLIRKI 105
>gi|428162807|gb|EKX31917.1| hypothetical protein GUITHDRAFT_57527, partial [Guillardia theta
CCMP2712]
Length = 203
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 58/113 (51%), Gaps = 5/113 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIR--KITNLGVTTI 138
V LAG G G++DGE +A F+ P + A +I +AD NH +R ++N+ ++T+
Sbjct: 54 VGTLAGSGYVGFADGEARAAMFNHPIALATSPDNSIVLIADGFNHRVRSFNVSNMSISTL 113
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
AG G G DG +A F+ L+F + ++D N IR I L +
Sbjct: 114 AGDGGA--GFQDGIGTDAKFNFPSALSFFGDGTKVAVTDMYNNKIRIITLDSK 164
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 56/115 (48%), Gaps = 7/115 (6%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGN---IYVADKSNHVIRKIT--NLGVTTIAGGGSKKEG 147
Y DG +A F+ P A+ G+ + V++ NHV+R I +L V T+AG G G
Sbjct: 7 YIDGNLSNACFNGPSGVAILQLGSDTLVLVSEVGNHVVRYINVISLQVGTLAGSG--YVG 64
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
ADG A+ A F++ L P +LI+D N +R N+ S + G A
Sbjct: 65 FADGEARAAMFNHPIALATSPDNSIVLIADGFNHRVRSFNVSNMSISTLAGDGGA 119
>gi|255070387|ref|XP_002507275.1| predicted protein [Micromonas sp. RCC299]
gi|226522550|gb|ACO68533.1| predicted protein [Micromonas sp. RCC299]
Length = 131
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 57/108 (52%), Gaps = 4/108 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
V ++G G+ G+ DG+ + F P AV G++YVAD N+ IR++++ G VT IAG
Sbjct: 6 VTSISGSGIPGFRDGQGEKSHFSSPAGVAVSSDGSVYVADSGNNRIRRVSSKGHVTWIAG 65
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G G +D A F+ + +P + ++D N IR ++L
Sbjct: 66 SG--LAGFSDDKLLRAEFNRPQGIVTIP-TGLIFVADTLNHRIRLVSL 110
Score = 38.1 bits (87), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 32/56 (57%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+S + V +AG G+ G+SD + A F++P+ G I+VAD NH IR ++
Sbjct: 54 VSSKGHVTWIAGSGLAGFSDDKLLRAEFNRPQGIVTIPTGLIFVADTLNHRIRLVS 109
>gi|421117009|ref|ZP_15577380.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
Fiocruz LV133]
gi|410011441|gb|EKO69561.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
Fiocruz LV133]
Length = 358
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTISTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKI 130
D+ N++IRKI
Sbjct: 96 DQINNLIRKI 105
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 57/100 (57%), Gaps = 5/100 (5%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEGRA 149
G +G+ ++ FD P +D + N+YV + SNH IRKI N G V+T++GG S G
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---GYL 222
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
DG +A F + + + + +LL +D + IR+I+LK
Sbjct: 223 DGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK 262
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI N V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269
Query: 139 AGGGSKKEGRADGPAQNASF 158
G G + DG NASF
Sbjct: 270 LGNGIGTD--VDGNGTNASF 287
>gi|418688943|ref|ZP_13250072.1| putative lipoprotein [Leptospira interrogans str. FPW2026]
gi|421128059|ref|ZP_15588277.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. 2006006986]
gi|421134520|ref|ZP_15594655.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. Andaman]
gi|400361935|gb|EJP17894.1| putative lipoprotein [Leptospira interrogans str. FPW2026]
gi|410021314|gb|EKO88104.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. Andaman]
gi|410434526|gb|EKP83664.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. 2006006986]
Length = 358
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 66/116 (56%), Gaps = 7/116 (6%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEGRA 149
G +G+ ++ FD P +D + N+YV + SNH IRKI N G V+T++GG S G
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---GYL 222
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
DG +A F + + + + +LL +D + IR+I+LK + + S+ G+ +GA
Sbjct: 223 DGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNIYV+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTISTASFKTPFGLEVDTSGNIYVS 95
Query: 121 DKSNHVIRKI 130
D+ N++IRKI
Sbjct: 96 DQINNLIRKI 105
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G G+ GY DG+ SARF P A + K N + AD +H IRKI N V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269
Query: 139 AGGGSKKEGRADGPAQNASF 158
G G + DG NASF
Sbjct: 270 LGNGIGAD--VDGNGTNASF 287
>gi|392945711|ref|ZP_10311353.1| serine/threonine protein kinase [Frankia sp. QA3]
gi|392289005|gb|EIV95029.1| serine/threonine protein kinase [Frankia sp. QA3]
Length = 859
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 34/89 (38%), Positives = 50/89 (56%), Gaps = 2/89 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
+A+ + P S D GNIY+ D N+ IRKI+ GV +TIAG G+ GPA A
Sbjct: 592 AAQLNGPGSTTRDKAGNIYIGDAQNNRIRKISPAGVISTIAGTGTAGYSGDGGPATAAQL 651
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
+N ++T P ++ SD+ N IR+I+
Sbjct: 652 NNAEQVTTGPD-GSVYFSDYENHRIRKID 679
Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats.
Identities = 34/110 (30%), Positives = 54/110 (49%), Gaps = 4/110 (3%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TT 137
++ G GV GY+ G AR D P + G +Y AD + I+K+T G+ ++
Sbjct: 683 IITTYVGTGVAGYTGAGGPATQARIDGPHEITMTDDGTLYFADLRSETIQKVTPDGIISS 742
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+AG G GPA++A N L+ P L ++D+ N IR+I+
Sbjct: 743 VAGTGEAGYAGDGGPARSAKL-NGPSLSIGPDGRTLYLADYHNNRIRKID 791
Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats.
Identities = 33/93 (35%), Positives = 45/93 (48%), Gaps = 3/93 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S V+ +AG G GYS DG P +A + + + G++Y +D NH IRKI
Sbjct: 622 ISPAGVISTIAGTGTAGYSGDGGPATAAQLNNAEQVTTGPDGSVYFSDYENHRIRKIDPA 681
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
G +TT G G A GPA A E+T
Sbjct: 682 GIITTYVGTGVAGYTGAGGPATQARIDGPHEIT 714
Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats.
Identities = 42/116 (36%), Positives = 65/116 (56%), Gaps = 6/116 (5%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKG-NIYVADKSNHVIRKITN 132
++ + ++ +AG G GY+ DG P SA+ + P S ++ G +Y+AD N+ IRKI
Sbjct: 734 VTPDGIISSVAGTGEAGYAGDGGPARSAKLNGP-SLSIGPDGRTLYLADYHNNRIRKIDP 792
Query: 133 LGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G+ TT+AG G+ G GPA A N + V AL I+D+GN +R+I+
Sbjct: 793 SGIITTVAGIGTAGSGGDGGPATAAQLKNPTSVV-VDGAGALYIADNGNARVRRID 847
>gi|116672456|ref|YP_833389.1| NHL repeat-containing protein [Arthrobacter sp. FB24]
gi|116612565|gb|ABK05289.1| NHL repeat containing protein [Arthrobacter sp. FB24]
Length = 672
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/166 (30%), Positives = 77/166 (46%), Gaps = 10/166 (6%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTL-SF-PLSEESVVKRLA 86
ED T + H L++ S D S+ L + + + + + SF P+S V LA
Sbjct: 323 EDAAGFTGRLSDHPLDVALSSPWDVVWSAKLNAVVVAMAGVHQIFSFDPIS--GAVSILA 380
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTI---AGG 141
G+G++G DG A F + A D GNI+VAD +RK I + G T+ G
Sbjct: 381 GNGLEGLLDGAAHEAWFAQSSGLAEDADGNIWVADSETSALRKLVIDDAGTVTVESAVGK 440
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G G DGPA A + +T +P ++ I+D N +R+ +
Sbjct: 441 GLFDFGFRDGPAAEARLQHPLGVTVLPD-GSVAIADTYNGAVRRYD 485
>gi|408824784|ref|ZP_11209674.1| NHL repeat containing protein [Pseudomonas geniculata N1]
Length = 470
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 11/147 (7%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S+ L + S+ R AG G DG A F +P + AV ++ +YVAD IR +
Sbjct: 301 SYHLGQRSLQWR-AGSGSIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358
Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
G V T+ G G + G DGP +AS + P L I+D GN +R + L
Sbjct: 359 RGDLVQTLVGQGPWRFGSEDGPRADASLQFPQAIALSPDAPLLWIADTGNGRLRTLRLGG 418
Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
+P +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLHGPAGLAVGAGAVWI 445
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 12/103 (11%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
+ R G G + DG A F +P++ ++ + +YVAD NH +R+I L G+
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDALYVADTGNHAVRRINLLTGIVDTLC 255
Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
G + +GP AQ S D H L I+D NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPVGLAIAD--NQL 289
>gi|290977429|ref|XP_002671440.1| predicted protein [Naegleria gruberi]
gi|284085009|gb|EFC38696.1| predicted protein [Naegleria gruberi]
Length = 1494
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 63/122 (51%), Gaps = 6/122 (4%)
Query: 81 VVKRLAGDGVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
+V +AG + DG A+ + P A+ G +Y++D SN VIRKI TN ++T+
Sbjct: 473 LVSTIAGTISKYTGDGSSAIQAQLNGPTGVAISPNGEVYISDSSNDVIRKIDTNGAISTV 532
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
G S G D A+ A +N + F+P+ L+ISD N IR+ ++ + + +
Sbjct: 533 VGSSS---GFLDASARRAQLANPMGIAFLPN-GDLIISDAKNNRIRKFSISSGNVTTIAG 588
Query: 199 SG 200
+G
Sbjct: 589 TG 590
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 55/130 (42%), Gaps = 1/130 (0%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG-SARFDKPKSFAVDMKGN 116
D+ ++DSS++ + + + + G + G P SA F P+ AV
Sbjct: 127 DVFIVDSSQNRVKKVFYNSTINAFNMITVAGGGSSLASGIPALSATFTAPRCIAVSPDDE 186
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
+YV D N++I+KI N + + G G K A N S+ + F P L S
Sbjct: 187 VYVCDAFNYLIKKIVNDTIYNVIGTGIKGNSVDGTLANNTKISSPSSIAFHPITKELFYS 246
Query: 177 DHGNQLIRQI 186
D +IR I
Sbjct: 247 DLLANVIRYI 256
>gi|389875076|ref|YP_006374432.1| NHL repeat-containing protein [Tistrella mobilis KA081020-065]
gi|388532256|gb|AFK57450.1| NHL repeat-containing protein [Tistrella mobilis KA081020-065]
Length = 487
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 60/109 (55%), Gaps = 6/109 (5%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGG 141
R G G G DG SARF +P+ + G I+VAD NH +R+I + VTTIAG
Sbjct: 210 RRVGSGEPGLVDGPAESARFQRPQGL-IAADGAIFVADTWNHAVRRIDVASGEVTTIAGT 268
Query: 142 GSKKE-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G + +A PA +++ ++ ++L + + L +++ G + +I+L+
Sbjct: 269 GRRGPILKAPAPAIDSALASPWDLEYREGV--LYVANAGTHQLARIDLE 315
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 55/112 (49%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--V 135
E V++ LAG G + DG A +P + A+ + ++ D +R++ G V
Sbjct: 315 ETGVLEPLAGTGAEALVDGPADQAALAQPSALAL-KEDRLWFIDAETSSLRRLDLAGGTV 373
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+T+ G G + G DGPA+ A F + L F + +L++D N IR ++
Sbjct: 374 STLVGDGLFEFGMVDGPARTARFQHPLGLCFDMN-GNILVADAYNDAIRVVD 424
>gi|423216521|ref|ZP_17203044.1| hypothetical protein HMPREF1074_04576 [Bacteroides xylanisolvens
CL03T12C04]
gi|392690731|gb|EIY83987.1| hypothetical protein HMPREF1074_04576 [Bacteroides xylanisolvens
CL03T12C04]
Length = 589
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 69/144 (47%), Gaps = 18/144 (12%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE---PGSARFDKPKSFAVD-MKG 115
+++ + R Y ++ V ++A DG G +DGE P +A+ +P+ VD +
Sbjct: 453 LIVGAQRDGLYAVA----PNGTVTKIACDG-NGNTDGEANKPMTAKLIQPEGIVVDKTRN 507
Query: 116 NIYVADKSNHVIRKI-------TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
+IY D N+ IR+I T+ V+TIAG G+K G DG A + +T
Sbjct: 508 DIYFTDGYNNKIRRIRPGKNGYTDATVSTIAGSGTK--GNQDGDGATAQLAMPHGITMTA 565
Query: 169 HICALLISDHGNQLIRQINLKPED 192
+ SD N +IR+I + +D
Sbjct: 566 DGNTIYFSDLDNYIIRKITITEKD 589
>gi|320107313|ref|YP_004182903.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319925834|gb|ADV82909.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 1821
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 52/95 (54%), Gaps = 3/95 (3%)
Query: 95 DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGP 152
DG P SA RF+ P A+D G++Y+AD++NH IR IT G ++T+AG G + G
Sbjct: 341 DGGPASAGRFNNPLGLAIDSGGSLYIADQANHRIRMITAGGSLSTVAGTGVPRYNGDGGL 400
Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
A A + V L I+D N L+R+++
Sbjct: 401 AVAAQLQQPAAVA-VDVARNLYIADSNNHLVRKVS 434
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 67/140 (47%), Gaps = 14/140 (10%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARF---DKPKSFA 110
G+ DL + D+ +A L+ ++ +AG GV G S G+ GSA + P A
Sbjct: 244 GAGDLYIADTRNNAIRKLTL---ATGILSTIAGRLGVPG-SSGDGGSATLATLNAPGGLA 299
Query: 111 VDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRAD-GPAQNASFSNDFELTF 166
+D G +Y+AD N IR+I T + +T AG + + D GPA F+N L
Sbjct: 300 MDAAGFLYIADTGNDTIRRIDPSTGM-ITLFAGIPTVADFAGDGGPASAGRFNNPLGLA- 357
Query: 167 VPHICALLISDHGNQLIRQI 186
+ +L I+D N IR I
Sbjct: 358 IDSGGSLYIADQANHRIRMI 377
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 62/113 (54%), Gaps = 10/113 (8%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
+ +AG GV Y +G+ G +A+ +P + AVD+ N+Y+AD +NH++RK+++ ++
Sbjct: 383 LSTVAGTGVPRY-NGDGGLAVAAQLQQPAAVAVDVARNLYIADSNNHLVRKVSSTTGLIS 441
Query: 137 TIAG--GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+I G G + G +GPA A + + L L +D N IR+++
Sbjct: 442 SIVGIPGAAAYSGD-NGPANVAKINGPYALAL-DSAGDLYFADLLNNRIRKVS 492
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 13/79 (16%)
Query: 89 GVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----LGVTTIAGGG 142
G YS +G A+ + P + A+D G++Y AD N+ IRK++N L T I
Sbjct: 448 GAAAYSGDNGPANVAKINGPYALALDSAGDLYFADLLNNRIRKVSNSYATLQYTPI---- 503
Query: 143 SKKEGRADGPAQNASFSND 161
+ GR P Q+ +F ND
Sbjct: 504 --RVGRTSAP-QSQTFEND 519
>gi|456865078|gb|EMF83443.1| putative lipoprotein [Leptospira weilii serovar Topaz str. LT2116]
Length = 357
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 66/119 (55%), Gaps = 9/119 (7%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
+ G+ +G+ ++ F P +D + N+YV + NH IRKI NL V+T++GG S
Sbjct: 163 IDGFQNGDRLNSLFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSETVSTLSGGVS--- 218
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + +T+ +LL++D + IR+I+LK + S+ G+ +GA
Sbjct: 219 GYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKTS--TVSTFLGNGIGA 275
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 47/94 (50%), Gaps = 2/94 (2%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + I LD + ++ V+ AG G+ DG +A F+
Sbjct: 22 NPSIFMSRAWLENTI-LDCILKECHLCKLKVTNNPVISLFAGTGINQSVDGTTQTASFNT 80
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
P +D GNI+V+D++ ++IRKI + G VTT+
Sbjct: 81 PFGLELDTFGNIFVSDQAANLIRKIDHFGNVTTL 114
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 70/133 (52%), Gaps = 8/133 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAG 140
V L+G GV GY DG+ SA+F P D K + + VAD +H IRKI +L +T++
Sbjct: 210 VSTLSG-GVSGYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKI-DLKTSTVST 267
Query: 141 G-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE-DCSKSSQ 198
G+ DG NASF N + + + +SD ++ IR ++ P+ + S +
Sbjct: 268 FLGNGIGASIDGKGLNASF-NGPAFISLDNSGYMFVSDTNSKKIRIVD--PDLNVSTITH 324
Query: 199 SGSALGAVSVWVL 211
+ +A+GAV V L
Sbjct: 325 TFAAIGAVKVDCL 337
>gi|359728310|ref|ZP_09267006.1| hypothetical protein Lwei2_15904 [Leptospira weilii str.
2006001855]
gi|417778551|ref|ZP_12426356.1| putative lipoprotein [Leptospira weilii str. 2006001853]
gi|410781344|gb|EKR65918.1| putative lipoprotein [Leptospira weilii str. 2006001853]
Length = 357
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 66/119 (55%), Gaps = 9/119 (7%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
+ G+ +G+ ++ F P +D + N+YV + NH IRKI NL V+T++GG S
Sbjct: 163 IDGFQNGDRLNSLFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSETVSTLSGGVS--- 218
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + +T+ +LL++D + IR+I+LK + S+ G+ +GA
Sbjct: 219 GYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKTS--TVSTFLGNGIGA 275
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 2/94 (2%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + R + I LD + ++ + V+ AG G+ DG +A F+
Sbjct: 22 NPSIFMSRAWLENAI-LDCILKECHLCKLKVTNKPVISLFAGTGINQSVDGTTQTASFNT 80
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
P +D GNI+V+D++ ++IRKI + G VTT+
Sbjct: 81 PFGLELDTFGNIFVSDQAANLIRKIDHFGNVTTL 114
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 70/133 (52%), Gaps = 8/133 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAG 140
V L+G GV GY DG+ SA+F P D K + + VAD +H IRKI +L +T++
Sbjct: 210 VSTLSG-GVSGYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKI-DLKTSTVST 267
Query: 141 G-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE-DCSKSSQ 198
G+ DG NASF N + + + +SD ++ IR ++ P+ + S +
Sbjct: 268 FLGNGIGASIDGKGLNASF-NGPAFISLDNSGYMFVSDTNSKKIRIVD--PDLNVSTITH 324
Query: 199 SGSALGAVSVWVL 211
+ +A+GAV + L
Sbjct: 325 TFAAIGAVKIDCL 337
>gi|425734502|ref|ZP_18852820.1| NHL repeat-containing protein [Brevibacterium casei S18]
gi|425481116|gb|EKU48277.1| NHL repeat-containing protein [Brevibacterium casei S18]
Length = 666
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 74/155 (47%), Gaps = 26/155 (16%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV---D 112
S +L+V DS + + + ++V++R+ G G +G +DG+ SA+F +P V D
Sbjct: 208 SGNLLVADSGHHSL--VEYTPDGQTVLRRI-GTGERGLTDGDFASAQFSEPGGITVLPED 264
Query: 113 MKG----NIYVADKSNHVIRKITNLG--VTTIAGGGSKKE--------------GRADGP 152
+ + VAD NHV+R I G V TIAG G + GR GP
Sbjct: 265 IAARAGYQLVVADTVNHVLRGIDLDGETVRTIAGTGEQHMVGAIDNVRGTHGALGRYSGP 324
Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
A + S+ +++ FVP ++++ GN I +
Sbjct: 325 ALDVKLSSPWDVLFVPATGEVVVAMAGNHTIWSFD 359
Score = 41.2 bits (95), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 68/151 (45%), Gaps = 6/151 (3%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
+P V+ P + +++V + ++ P++ + ++ L+G +G +G+ SA F +
Sbjct: 332 SPWDVLFVPATGEVVVAMAGNHTIWSFD-PVTGD--IRLLSGTMNEGLVNGDAESAWFAQ 388
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSNDFE 163
+ G + VAD IR++ TT+ G G G DGPA A +
Sbjct: 389 SSGLDLHPDGGVIVADSETSAIRRLDPASGEATTLVGTGLFDFGFRDGPAAEARLQHPLG 448
Query: 164 LTFVPHICALLISDHGNQLIRQINLKPEDCS 194
+ +P +L I+D N IR+ + + S
Sbjct: 449 VRTLPD-GSLAIADTYNGAIRRYDFTTNEVS 478
Score = 40.4 bits (93), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 47/101 (46%), Gaps = 6/101 (5%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P V GN+ VAD +H + + T G T + G+ + G DG +A FS +T
Sbjct: 200 PGKVTVLPSGNLLVADSGHHSLVEYTPDGQTVLRRIGTGERGLTDGDFASAQFSEPGGIT 259
Query: 166 FVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSG 200
+P A L+++D N ++R I+L E + +G
Sbjct: 260 VLPEDIAARAGYQLVVADTVNHVLRGIDLDGETVRTIAGTG 300
>gi|156399714|ref|XP_001638646.1| predicted protein [Nematostella vectensis]
gi|156225768|gb|EDO46583.1| predicted protein [Nematostella vectensis]
Length = 286
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 57/110 (51%), Gaps = 6/110 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT--NLGVTTI 138
V AG+G +G DGE A+F P+S ++D +G+ ++V D NH IR I+ + V T+
Sbjct: 131 VTTFAGNGSEGLVDGERTRAQFKGPQSLSLDDEGDRLFVGDTDNHAIRVISLKDGSVQTL 190
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
GG G DG + F + + + L +SDH N +IR + +
Sbjct: 191 VGG---SLGFKDGVGLKSKFYHPTGIAYDRENDILYVSDHYNHVIRAVKV 237
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 48/106 (45%), Gaps = 7/106 (6%)
Query: 102 RFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASF 158
R + A+D + + Y+AD N IRK+ VTT AG GS EG DG A F
Sbjct: 95 RINPNTGMAIDTRKRVMYLADVGNSRIRKVDMATAEVTTFAGNGS--EGLVDGERTRAQF 152
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
L+ L + D N IR I+LK D S + G +LG
Sbjct: 153 KGPQSLSLDDEGDRLFVGDTDNHAIRVISLK--DGSVQTLVGGSLG 196
>gi|359777541|ref|ZP_09280822.1| hypothetical protein ARGLB_073_01730 [Arthrobacter globiformis NBRC
12137]
gi|359305319|dbj|GAB14651.1| hypothetical protein ARGLB_073_01730 [Arthrobacter globiformis NBRC
12137]
Length = 666
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 77/166 (46%), Gaps = 10/166 (6%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDL-IVLDSSRSAFYTLSF-PLSEESVVKRLA 86
ED T + H L++ S D SS L V+ + S+ PL+ + V LA
Sbjct: 317 EDAAGFTGRLGDHPLDVALSSPWDVVWSSKLNAVVIAMAGVHQIFSYEPLTGD--VSILA 374
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTI---AGG 141
G+G++G DG A F +P A D GNI+VAD +RK I++ G T+ G
Sbjct: 375 GNGLEGLLDGPAHEAWFAQPSGLAEDADGNIWVADSETSALRKLVISDDGAVTVESAVGK 434
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G G DG A A + +T +P ++ I+D N +R+ +
Sbjct: 435 GLFDFGFRDGEASEARLQHPLGVTVLPD-GSVAIADTYNGAVRRYD 479
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 9/68 (13%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSNHVIRKIT--NLGVTT 137
G G +G++DG G ARF++P+ A ++ +AD NH +R ++ + VTT
Sbjct: 239 GSGEKGHADGAAGIARFNEPQGLVLLPEDVAAKTGYDVVIADSVNHRLRGLSLADGSVTT 298
Query: 138 IAGGGSKK 145
+AG G ++
Sbjct: 299 LAGSGVQR 306
>gi|351729451|ref|ZP_08947142.1| NHL repeat-containing protein [Acidovorax radicis N35]
Length = 663
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 35/66 (53%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP 152
Y D +ARF+ P++ AVD GN YVAD NH++RKI G T G G ADG
Sbjct: 48 YLDAAGTAARFNAPQAVAVDTAGNTYVADTGNHLVRKIDASGNVTTLAGQEGVCGNADGT 107
Query: 153 AQNASF 158
A+
Sbjct: 108 GTAATL 113
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 47/95 (49%), Gaps = 9/95 (9%)
Query: 74 FPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH------- 125
F ++ V AG+ +G++DG GSA F S A+D GN+YV D H
Sbjct: 480 FKINPTGAVSLFAGNPNARGHADGPAGSALFAGLGSMALDPAGNLYVVDGVYHFITGVGP 539
Query: 126 VIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
IRKIT G V+T+AG + G DGP A +
Sbjct: 540 TIRKITPAGMVSTLAGRANVPPGLVDGPVSQAQLT 574
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 50/99 (50%), Gaps = 7/99 (7%)
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG-SKKEGRADGP-----AQNASFSND 161
A+ G++ +AD NH IR+++ G +TT AGGG ++ + + GP A A+ N
Sbjct: 1 MALTPTGDVLIADPGNHTIRRLSPTGQLTTFAGGGPTRSDASSPGPRYLDAAGTAARFNA 60
Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
+ V ++D GN L+R+I+ + + Q G
Sbjct: 61 PQAVAVDTAGNTYVADTGNHLVRKIDASGNVTTLAGQEG 99
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 35/74 (47%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G++DG +ARFD P + A+ G +YV D++N IR I G+ G + G
Sbjct: 341 GWADGSGSTARFDGPGALAMGKSGTLYVRDRNNAAIRTIQPDGMVGTLGRVKNPCSQITG 400
Query: 152 PAQNASFSNDFELT 165
P +F LT
Sbjct: 401 PINEVAFCGPEALT 414
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 35/68 (51%), Gaps = 12/68 (17%)
Query: 82 VKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV-----------ADKSNHVIRK 129
V LAG +GV G +DG +A P S AVD GN+YV A+ + + IRK
Sbjct: 91 VTTLAGQEGVCGNADGTGTAATLCSPTSIAVDKDGNVYVSEWAPLTQTLPAEPTGNPIRK 150
Query: 130 ITNLGVTT 137
IT GV +
Sbjct: 151 ITPAGVVS 158
>gi|254524393|ref|ZP_05136448.1| NHL repeat protein [Stenotrophomonas sp. SKA14]
gi|219721984|gb|EED40509.1| NHL repeat protein [Stenotrophomonas sp. SKA14]
Length = 470
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 65/147 (44%), Gaps = 11/147 (7%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S+ L + S+ R AG G DG A F +P + AV ++ +YVAD IR +
Sbjct: 301 SYHLGQRSLQWR-AGSGAIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358
Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
G V T+ G G + G DGP A + P L I+D GN +R + L
Sbjct: 359 RGDLVQTLVGQGPWRFGNEDGPRAQARLQFPQAIALSPDAPLLWIADSGNGRLRTLRLGG 418
Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
+P +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLQGPAGLAVGAGAVWI 445
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 12/103 (11%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
+ R G G + DG A F +P++ ++ + +YVAD NH +R+I L G+
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDALYVADTGNHAVRRINLLTGIVDTLC 255
Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
G + +GP AQ S D H L I+D NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPIGLAIAD--NQL 289
>gi|421094122|ref|ZP_15554843.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
gi|410363263|gb|EKP14295.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
gi|456886734|gb|EMF97860.1| putative lipoprotein [Leptospira borgpetersenii str. 200701203]
Length = 358
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 59/109 (54%), Gaps = 7/109 (6%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKK 145
G+ G+ +G+ ++ F P +D + N+YV + NH IRKI NL V+T++GG S
Sbjct: 163 GIGGFQNGDRLNSLFKGPLFMDLDRERNLYVGELGNHTIRKI-NLNSETVSTLSGGVS-- 219
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
G DG +A F + + + +LL++D N IR+I+LK S
Sbjct: 220 -GYLDGDLTSAQFKSPSGIIYDQKTDSLLVADLQNHRIRKIDLKTSTVS 267
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 1/81 (1%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
++LD +T ++ VV AG G++ DG +A F P S +D GNI+V
Sbjct: 35 MILDCILKECHTCKLKVTNNPVVSPFAGTGIRESIDGTTQTASFKTPFSLELDTFGNIFV 94
Query: 120 ADKSNHVIRKITNLG-VTTIA 139
+D +++IRKI G VTT++
Sbjct: 95 SDSDSNLIRKIDRFGNVTTLS 115
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 42/80 (52%), Gaps = 6/80 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G GV GY DG+ SA+F P D K + + VAD NH IRKI V+T+
Sbjct: 211 VSTLSG-GVSGYLDGDLTSAQFKSPSGIIYDQKTDSLLVADLQNHRIRKIDLKTSTVSTL 269
Query: 139 AGGGSKKEGRADGPAQNASF 158
G G E DG NASF
Sbjct: 270 LGNG--IEASIDGKGLNASF 287
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 1/62 (1%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVT 136
+ S V L G+G++ DG+ +A FD P ++D G ++V+D S++ IR + +L V
Sbjct: 262 KTSTVSTLLGNGIEASIDGKGLNASFDGPAFISLDNSGYMFVSDTSSNKIRIVDPDLNVF 321
Query: 137 TI 138
TI
Sbjct: 322 TI 323
>gi|421091013|ref|ZP_15551797.1| putative lipoprotein [Leptospira kirschneri str. 200802841]
gi|410000210|gb|EKO50873.1| putative lipoprotein [Leptospira kirschneri str. 200802841]
Length = 358
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 65/119 (54%), Gaps = 9/119 (7%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
+ G +G+ ++ FD P +D++ N+YV + SNH IRKI NL V+T +GG S
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSGMVSTFSGGIS--- 219
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + + + +LL +D + IR+I+LK + S+ G+ +GA
Sbjct: 220 GYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKIDLKTS--TVSTLLGNGIGA 276
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++++ VV AG GV DG +A F P VD GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95
Query: 121 DKSNHVIRKI 130
D+ ++IRKI
Sbjct: 96 DQMTNLIRKI 105
>gi|429202583|ref|ZP_19193964.1| antioxidant, AhpC/TSA family [Streptomyces ipomoeae 91-03]
gi|428661888|gb|EKX61363.1| antioxidant, AhpC/TSA family [Streptomyces ipomoeae 91-03]
Length = 619
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/175 (28%), Positives = 83/175 (47%), Gaps = 20/175 (11%)
Query: 39 DGHQLEINPHSVIDR-PGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
DG + P + R PG + D +V D++R L+ E+VV+R G G +
Sbjct: 170 DGPYVPPEPDPTVLRFPGKALLLPDGDFLVSDTTRHRLVRLA--PDGETVVRRY-GTGER 226
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG-VTTIAGGGSK--KEG 147
G+ DG ARF +P+ A+ G++ VAD NH +R++ + G V T+AG G + +
Sbjct: 227 GFVDGPADHARFSEPQGLALLDSGDVVVADTVNHALRRLDPDTGHVATLAGTGRQWWQGS 286
Query: 148 RADGPAQNASFSNDFELT-FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
GPA S+ +++ F + + H QL P DC+ + +G+
Sbjct: 287 PTSGPAWQIDLSSPWDVAVFGGKVWIAMAGVH--QL---WTYDPADCTVAVAAGT 336
>gi|348175195|ref|ZP_08882089.1| NHL repeat-containing protein [Saccharopolyspora spinosa NRRL
18395]
Length = 630
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 77/147 (52%), Gaps = 16/147 (10%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P VI PG +L+V DS+ + + F E+VV+R+ G G +G +DG +A F +P
Sbjct: 194 PGKVIALPGG-NLLVSDSANHSL--VEFAADGETVVRRI-GTGQRGRADGV--AAEFAEP 247
Query: 107 KSFAV-------DMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSK-KEGRADGPAQNA 156
A+ + ++ VAD NH++R ++ + VTT+AG G + ++G GPA
Sbjct: 248 AGLALLPQRVAEQVGYDVVVADTVNHLLRGVRLVDGQVTTLAGTGEQWRDGSDSGPALET 307
Query: 157 SFSNDFELTFVPHICALLISDHGNQLI 183
++ ++L + ++I+ GN +
Sbjct: 308 PLTSPWDLAWWEPAGGVVIAMAGNHTL 334
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 41/80 (51%), Gaps = 8/80 (10%)
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-- 172
GN+ V+D +NH + + G T + G+ + GRADG A A F+ L +P A
Sbjct: 203 GNLLVSDSANHSLVEFAADGETVVRRIGTGQRGRADGVA--AEFAEPAGLALLPQRVAEQ 260
Query: 173 ----LLISDHGNQLIRQINL 188
++++D N L+R + L
Sbjct: 261 VGYDVVVADTVNHLLRGVRL 280
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 46/108 (42%), Gaps = 6/108 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT----NLGVTT 137
V+R AG V+G DG A F + A D +++AD +R + V T
Sbjct: 344 VRRFAGTTVEGLRDGAADEAFFAQTSGLA-DGGDRLWLADSETSALRWVEAADRGFAVRT 402
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
G G G ADG A A F + + +P ++ + D N +R+
Sbjct: 403 AVGTGLFDFGHADGTADKALFQHPLGVAVLPD-GSVAVCDTYNGAVRR 449
>gi|322788140|gb|EFZ13922.1| hypothetical protein SINV_02735 [Solenopsis invicta]
Length = 717
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 72/153 (47%), Gaps = 20/153 (13%)
Query: 49 SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKS 108
+V+ + L++ DS + + E V+ + G QG+ DG+ +ARF+ P+
Sbjct: 255 TVLQSEQGTKLVISDSGNNRIVITN----EHGRVEHVIGGCSQGFKDGDFKNARFNSPQG 310
Query: 109 FAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADG----------PAQNA 156
+ IY+AD +NH IRKI + V+TIAG G + R G P A
Sbjct: 311 VCA-LNNTIYIADNNNHAIRKINLSEKTVSTIAGTGLQGCDRNGGGHGTDQALSSPWDVA 369
Query: 157 SFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ ++++ T VP LLI+ G I + L+
Sbjct: 370 IYHHEYKGTSVP---VLLIAIAGTHQIWALFLE 399
>gi|116625316|ref|YP_827472.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116228478|gb|ABJ87187.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 981
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 42/68 (61%), Gaps = 3/68 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + ++ +AG+G +G+S DG P +A F P AVD GN+YV D SN +RKI
Sbjct: 226 VTPDGIITTIAGNGTRGFSGDGGPATAATFRGPIGLAVDAFGNVYVTDNSNGRVRKIDAA 285
Query: 134 G-VTTIAG 140
G +TT AG
Sbjct: 286 GTITTYAG 293
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S ++ AG+G G++ G SA+ + P A+D GN+Y+ D SN+ +RK+T
Sbjct: 116 ISAAGIISTYAGNGTAGFAGDSGAATSAQLNGPTDVAIDGNGNLYICDSSNNRVRKVTPG 175
Query: 134 G-VTTIAGGG 142
G +TT AG G
Sbjct: 176 GTITTFAGNG 185
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 88 DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKE 146
DGVQ + P P AVD +GN+Y+ S +RK+T G+ TTIAG G++
Sbjct: 192 DGVQAATTAVP------SPAGIAVDAQGNVYI--SSAVRVRKVTPDGIITTIAGNGTRGF 243
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
GPA A+F L V + ++D+ N +R+I+
Sbjct: 244 SGDGGPATAATFRGPIGLA-VDAFGNVYVTDNSNGRVRKID 283
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 8/116 (6%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPG-----SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +AG+G GY+ DG P + AVD GN Y AD SN+VIRKI+
Sbjct: 58 SSGTLSTIAGNGNFGYAGDGGPAINATLNLGAGGLAGLAVDGLGNAYFADSSNNVIRKIS 117
Query: 132 NLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G ++T AG G+ G A +A + ++ + L I D N +R++
Sbjct: 118 AAGIISTYAGNGTAGFAGDSGAATSAQLNGPTDVA-IDGNGNLYICDSSNNRVRKV 172
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 3/81 (3%)
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
A+D +G +Y+ D + +RKI++ ++TIAG G+ GPA NA+ +
Sbjct: 36 LAIDGQGTVYLTDSATARVRKISSGTLSTIAGNGNFGYAGDGGPAINATLNLGAGGLAGL 95
Query: 169 HICAL---LISDHGNQLIRQI 186
+ L +D N +IR+I
Sbjct: 96 AVDGLGNAYFADSSNNVIRKI 116
>gi|153806007|ref|ZP_01958675.1| hypothetical protein BACCAC_00252 [Bacteroides caccae ATCC 43185]
gi|149130684|gb|EDM21890.1| IPT/TIG domain protein [Bacteroides caccae ATCC 43185]
Length = 440
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 31/50 (62%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
VV+ +AG GY D GSA F P+ AVD G IYV DK N+VIRKI
Sbjct: 388 VVRLVAGGTQAGYVDANAGSAAFSGPQDLAVDKNGVIYVYDKKNNVIRKI 437
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 53/120 (44%), Gaps = 15/120 (12%)
Query: 55 GSSDLIVLDSSRSAFYTLS-FPLSEESVVKRLAG--DGVQGYSDGEPGSARF-------- 103
G +L + ++R T+ F S +VV LAG +G GY DG A F
Sbjct: 98 GDGELTITIANREPVRTIEKFTYSFSAVVTTLAGSANGEPGYQDGVGSEALFFFDAAKAE 157
Query: 104 ----DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
K S VD GN+YV D N+ +RKIT G T G + +G DG A F+
Sbjct: 158 PAEDWKKGSVCVDDDGNVYVGDCVNYCVRKITPDGTVTTLAGLAGNKGCIDGTGVQARFN 217
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 13/114 (11%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA--------DKSNHVIR--KITN 132
+ L G G GY +G A F P A+D G+IYVA + + IR +TN
Sbjct: 327 ENLIGKGTSGYLNGSFEDALFTFPSDLAIDSNGDIYVAGNGAWDGGENLDQSIRLLDMTN 386
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V +AGG + G D A +A+FS +L V + + D N +IR+I
Sbjct: 387 RVVRLVAGG--TQAGYVDANAGSAAFSGPQDLA-VDKNGVIYVYDKKNNVIRKI 437
>gi|398339828|ref|ZP_10524531.1| hypothetical protein LkirsB1_10395 [Leptospira kirschneri serovar
Bim str. 1051]
Length = 358
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 65/119 (54%), Gaps = 9/119 (7%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
+ G +G+ ++ FD P +D++ N+YV + SNH IRKI NL V+T +GG S
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSGMVSTFSGGIS--- 219
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + + + +LL +D + IR+I+LK + S+ G+ +GA
Sbjct: 220 GYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKIDLKTS--TVSTLLGNGIGA 276
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++++ VV AG GV DG +A F P VD GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95
Query: 121 DKSNHVIRKI 130
D+ ++IRKI
Sbjct: 96 DQMTNLIRKI 105
>gi|423298428|ref|ZP_17276486.1| hypothetical protein HMPREF1070_05151 [Bacteroides ovatus
CL03T12C18]
gi|392663340|gb|EIY56891.1| hypothetical protein HMPREF1070_05151 [Bacteroides ovatus
CL03T12C18]
Length = 507
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 36/67 (53%), Gaps = 2/67 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY DG ARF + + D GNIY+ D+ NH IR + N GV+T+ GG G +G
Sbjct: 413 GYWDGIFAEARFLRGEQMCSDDAGNIYIVDQDNHAIRMVNNTGVSTVIGGNGS--GAVNG 470
Query: 152 PAQNASF 158
++A
Sbjct: 471 VGKDAKL 477
>gi|158520859|ref|YP_001528729.1| YD repeat-containing protein [Desulfococcus oleovorans Hxd3]
gi|158509685|gb|ABW66652.1| YD repeat protein [Desulfococcus oleovorans Hxd3]
Length = 2831
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 54/116 (46%), Gaps = 7/116 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+S + +++ AG GV G+ G AR P AVD GN+++AD N IRK+
Sbjct: 1629 ISPDGIIEAFAGMGVDAGYSGDGGLAVDARLQSPTGLAVDKTGNLFIADSGNFSIRKVDP 1688
Query: 133 LGVTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
GV T GG+ DG PA +A + E+T L G IR+IN
Sbjct: 1689 KGVITTIAGGNGPGYSGDGWPAVDAQLQSISEITLDSSGNLYLT---GYDHIRKIN 1741
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 52/110 (47%), Gaps = 9/110 (8%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
+K AGDG G AR P+ +D GNIY+AD NH +R+I+ G + AG
Sbjct: 1587 MKVFAGDG------GPAIEARLYHPQGMEIDASGNIYIADTDNHCVRRISPDGIIEAFAG 1640
Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G DG A +A + L V L I+D GN IR+++ K
Sbjct: 1641 MGVDAGYSGDGGLAVDARLQSPTGLA-VDKTGNLFIADSGNFSIRKVDPK 1689
>gi|423219560|ref|ZP_17206056.1| hypothetical protein HMPREF1061_02829 [Bacteroides caccae
CL03T12C61]
gi|392624765|gb|EIY18843.1| hypothetical protein HMPREF1061_02829 [Bacteroides caccae
CL03T12C61]
Length = 436
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 31/50 (62%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
VV+ +AG GY D GSA F P+ AVD G IYV DK N+VIRKI
Sbjct: 384 VVRLVAGGTQAGYVDANAGSAAFSGPQDLAVDKNGVIYVYDKKNNVIRKI 433
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 53/120 (44%), Gaps = 15/120 (12%)
Query: 55 GSSDLIVLDSSRSAFYTL-SFPLSEESVVKRLAG--DGVQGYSDGEPGSARF-------- 103
G +L + ++R T+ F S +VV LAG +G GY DG A F
Sbjct: 94 GDGELTITIANREPVRTIDKFTYSFSAVVTTLAGSANGEPGYQDGVGSEALFFFDAAKAE 153
Query: 104 ----DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
K S VD GN+YV D N+ +RKIT G T G + +G DG A F+
Sbjct: 154 PAEDWKKGSVCVDDDGNVYVGDCVNYCVRKITPDGTVTTLAGLAGNKGCIDGTGVQARFN 213
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 13/114 (11%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA--------DKSNHVIR--KITN 132
+ L G G GY +G A F P A+D G+IYVA + + IR +TN
Sbjct: 323 ENLIGKGTSGYLNGSFEDALFTFPSDLAIDSNGDIYVAGNGAWDGGENLDQSIRLLDMTN 382
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V +AGG + G D A +A+FS +L V + + D N +IR+I
Sbjct: 383 RVVRLVAGG--TQAGYVDANAGSAAFSGPQDLA-VDKNGVIYVYDKKNNVIRKI 433
>gi|430741385|ref|YP_007200514.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
gi|430013105|gb|AGA24819.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
Length = 361
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 62/118 (52%), Gaps = 4/118 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
+ ++ V+ +AG G G S DG P + A F+ S AV G+I V+D N+ +RKI
Sbjct: 73 IDKDGRVQTVAGTGKNGDSGDGGPATKAEFNGMHSLAVMKNGDILVSDTWNNRVRKIDAR 132
Query: 133 LGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
GV TTIAG G K GPA A F + + AL ++D N+ IR ++LK
Sbjct: 133 TGVITTIAGTGKKGFSGDGGPATAAEFGGIYCIALDEPGQALYLADLDNRRIRAVDLK 190
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 71/147 (48%), Gaps = 12/147 (8%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SAR 102
I+P +V G ++ +LD +A + ++ L G G +G S DG PG A+
Sbjct: 218 IDPRAVA-TDGKGNVYILDRGGNALRVVD----RSGKIRTLVGTGEKGASGDGGPGRQAK 272
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSK-KEGRADGPAQNASFS 159
+ PK V+ +G++ +AD NH IR + + + ++AG G K G PA+ A F+
Sbjct: 273 LNGPKHLCVNPQGDVIIADTENHRIRLYRPSEGTIESLAGTGRKGTAGVGKAPAE-AEFN 331
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
+T P L ISD N I ++
Sbjct: 332 QPHGVTLAPD-GVLFISDSSNHRILKL 357
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 73/157 (46%), Gaps = 12/157 (7%)
Query: 50 VIDRPGSS-DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDG-EPGSARFDKP 106
+D PG + L LD+ R L + +V LAG+G +G DG + ++ P
Sbjct: 166 ALDEPGQALYLADLDNRRIRAVDL-----KTGLVSTLAGNGHKGIPQDGADARTSPMIDP 220
Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
++ A D KGN+Y+ D+ + +R + G + T+ G G K GP + A + L
Sbjct: 221 RAVATDGKGNVYILDRGGNALRVVDRSGKIRTLVGTGEKGASGDGGPGRQAKLNGPKHLC 280
Query: 166 FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
P ++I+D N IR +P + + S +G+
Sbjct: 281 VNPQ-GDVIIADTENHRIRL--YRPSEGTIESLAGTG 314
>gi|421129432|ref|ZP_15589632.1| putative lipoprotein [Leptospira kirschneri str. 2008720114]
gi|410358807|gb|EKP05916.1| putative lipoprotein [Leptospira kirschneri str. 2008720114]
Length = 358
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 65/119 (54%), Gaps = 9/119 (7%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
+ G +G+ ++ FD P +D++ N+YV + SNH IRKI NL V+T +GG S
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSGMVSTFSGGIS--- 219
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + + + +LL +D + IR+I+LK + S+ G+ +GA
Sbjct: 220 GYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKIDLKTS--TVSTLLGNGIGA 276
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 37/70 (52%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++++ VV AG GV DG +A F P VD GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95
Query: 121 DKSNHVIRKI 130
D+ ++IRKI
Sbjct: 96 DQMTNLIRKI 105
>gi|290975027|ref|XP_002670245.1| predicted protein [Naegleria gruberi]
gi|284083802|gb|EFC37501.1| predicted protein [Naegleria gruberi]
Length = 1363
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 78/165 (47%), Gaps = 18/165 (10%)
Query: 28 LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
L Y +TTV + +P I + + D+IV D+ + +S+ V+ R+AG
Sbjct: 393 LPSKYYITTVATDFK---DPQK-IAKLSNGDIIVSDTGDHSIKKISY---STGVISRIAG 445
Query: 88 DGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG---G 141
GV G+S DG S A+ +KP A+ IY+AD NH IR + N ++T+AG G
Sbjct: 446 TGVAGFSGDGGLASQAQLNKPYGIAITANDEIYIADNLNHRIRFVDVNGNISTVAGTSIG 505
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
S G A NA D L+ L I+D N IR++
Sbjct: 506 FSGDSGLATAAKLNAPM--DVSLSASGD---LYIADRDNYRIRRV 545
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 22/88 (25%), Positives = 48/88 (54%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
+A+ + P ++ G++Y+AD+ N+ IR++++ ++T AG G G A +A+ S
Sbjct: 515 AAKLNAPMDVSLSASGDLYIADRDNYRIRRVSSGVISTFAGNGQSGYSGDGGQATSAALS 574
Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
+ + + + ISD N +R+++
Sbjct: 575 QAYGVKVING--EVYISDSNNYKVRKVD 600
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 57/109 (52%), Gaps = 5/109 (4%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
V+ AG+G GYS G+ SA + V + G +Y++D +N+ +RK+ G +TT
Sbjct: 549 VISTFAGNGQSGYSGDGGQATSAALSQAYGVKV-INGEVYISDSNNYKVRKVDGSGIITT 607
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
IAG G+ A +++ ++ ++ +P ++I+D N IR +
Sbjct: 608 IAGSGAAPFNGDGLMATSSNMNHPTDVALLPS-GEMIIADTDNYRIRMV 655
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/121 (25%), Positives = 57/121 (47%), Gaps = 6/121 (4%)
Query: 12 ITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYT 71
+ ++++L +V+ L ++T G+ +P + R + D++V D+ A
Sbjct: 13 VVMMAMLVYMVNGAPLNTYSVKISTFAGGNGEFKDPQK-LARLSNGDILVSDALGHAIKK 71
Query: 72 LSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
++ V+ +AG+G G+ +G +A +KP AV IY AD N+ IRK
Sbjct: 72 INV---TSGVITTIAGNGTAGFGGDNGPAVNALVNKPFGIAVSQTDEIYFADSGNNRIRK 128
Query: 130 I 130
I
Sbjct: 129 I 129
>gi|290984233|ref|XP_002674832.1| predicted protein [Naegleria gruberi]
gi|284088424|gb|EFC42088.1| predicted protein [Naegleria gruberi]
Length = 2158
Score = 52.0 bits (123), Expect = 2e-04, Method: Composition-based stats.
Identities = 40/121 (33%), Positives = 57/121 (47%), Gaps = 5/121 (4%)
Query: 69 FYTLSFPLSEESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
FY E V +AG G Y+ DG + +A+ ++P V IY+ DK NH
Sbjct: 169 FYNHKVKKVESGTVTTIAGTGTANYNGDGIQASTAQLNEPHGVFV-RNDKIYIVDKKNHR 227
Query: 127 IRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
IR I G ++T+AG G+ G D A +S D +V + ISDH N IR+
Sbjct: 228 IRMIQPDGKISTVAGNGTLGFG-PDNVAAKSSMLQDPVFVYVTQDEVIYISDHENNRIRR 286
Query: 186 I 186
+
Sbjct: 287 V 287
Score = 43.5 bits (101), Expect = 0.098, Method: Composition-based stats.
Identities = 34/100 (34%), Positives = 47/100 (47%), Gaps = 4/100 (4%)
Query: 89 GVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
GV G D G G + D P + V G Y+AD NH ++K+ + VTTIAG G+
Sbjct: 134 GVPGCGDAIGATGVTQLDHPPAIFVPYTGLYYIADFYNHKVKKVESGTVTTIAGTGTANY 193
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
DG + + N+ FV + + I D N IR I
Sbjct: 194 -NGDGIQASTAQLNEPHGVFVRND-KIYIVDKKNHRIRMI 231
>gi|374988208|ref|YP_004963703.1| hypothetical protein SBI_05452 [Streptomyces bingchenggensis BCW-1]
gi|297158860|gb|ADI08572.1| hypothetical protein SBI_05452 [Streptomyces bingchenggensis BCW-1]
Length = 609
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 61/117 (52%), Gaps = 8/117 (6%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
PG + +V D++R L+ ESV++R+ G G +G DG ARF +P+ A+
Sbjct: 183 PGGT-FLVSDTTRHQLVELA--ADGESVLRRI-GSGERGLVDGPADRARFSEPQGLALLP 238
Query: 114 KGNIYVADKSNHVIRK--ITNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
G + VAD NH +R + V T+AG G + + A GPA+ S+ +++ +
Sbjct: 239 DGTVIVADTVNHALRSLDLATGEVRTVAGTGRQWWQGSPAAGPAREVDLSSPWDVAW 295
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 50/121 (41%), Gaps = 1/121 (0%)
Query: 68 AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
A TL L E K G Y EP + P + G V+D + H +
Sbjct: 138 ALETLVAELEAEHGAKGTLRRGDGPYVPPEPVATELRFPGKALLLPGGTFLVSDTTRHQL 197
Query: 128 RKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
++ G + + GS + G DGPA A FS L +P ++++D N +R ++
Sbjct: 198 VELAADGESVLRRIGSGERGLVDGPADRARFSEPQGLALLPD-GTVIVADTVNHALRSLD 256
Query: 188 L 188
L
Sbjct: 257 L 257
>gi|347756512|ref|YP_004864075.1| Thiol-disulfide isomerase / thioredoxin [Candidatus
Chloracidobacterium thermophilum B]
gi|347589029|gb|AEP13558.1| Thiol-disulfide isomerase / thioredoxin [Candidatus
Chloracidobacterium thermophilum B]
Length = 676
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 53/106 (50%), Gaps = 9/106 (8%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLG---VTTIAGGG 142
G G G +G A F P+ A+D G+ +YVAD NH++R++ NL V T+AG G
Sbjct: 255 GSGAPGTRNGPYNFAEFQYPQGMALD--GDFLYVADTGNHLLRRV-NLKTRMVETVAGTG 311
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ R GP + S+ ++L H L I+ G I Q NL
Sbjct: 312 KNERSRQTGPGTSVGLSSPWDLAI--HQRTLFIAMAGAHQIWQYNL 355
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 56/125 (44%), Gaps = 5/125 (4%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGV 135
+ VV AG G +G DG +A F +P + D K +YVAD +R I V
Sbjct: 356 DTGVVGPYAGTGAEGRQDGTLETAVFAQPSGLSTDGK-RLYVADSEISAVRAIDLATGQV 414
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
TT+AGG G +G +NA F + + L ++D N +R I+L+ S
Sbjct: 415 TTLAGGDLFDFGDTNGKGENARFQHPLGVAAAER--KLYVADTYNHKLRTIDLRTRFVSN 472
Query: 196 SSQSG 200
SG
Sbjct: 473 LIGSG 477
>gi|290995458|ref|XP_002680312.1| predicted protein [Naegleria gruberi]
gi|284093932|gb|EFC47568.1| predicted protein [Naegleria gruberi]
Length = 744
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 46/88 (52%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
+A F P S V G +Y+AD NH IRK+ TN + TIAG G +GPA NA
Sbjct: 26 NANFKGPSSVFVSSNGEVYIADIGNHKIRKVLTNGKMVTIAGNGQNGFSGDNGPATNAML 85
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
+ + FV + I+D+ N IR+I
Sbjct: 86 NYPIGV-FVSPYNEVYIADYFNNRIRKI 112
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 77/155 (49%), Gaps = 16/155 (10%)
Query: 41 HQLEINPHSVIDRPG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS 94
++L IN ++++RP + ++ + D +R+ + S ++V +AG+G +GY
Sbjct: 133 NRLAIN--AMLNRPSGVFVTLNDEVYIADQNRNKIRKIE---SNGNIVT-IAGNGKEGYG 186
Query: 95 DGE-PG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADG 151
D P +A ++P S V +Y+AD NH IRKI N + TIAG G +
Sbjct: 187 DDNVPAVNAMLNRPSSVFVTSNNEVYIADTGNHRIRKILRNGNIVTIAGNGLVGFHEDNI 246
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A N S N + FV + I+D GN IR+I
Sbjct: 247 LATNTSL-NFPKSVFVSSNGEVYIADTGNHRIRKI 280
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 53/105 (50%), Gaps = 5/105 (4%)
Query: 85 LAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
+AG+G+ G+ + + + PKS V G +Y+AD NH IRKI N + TIAG
Sbjct: 233 IAGNGLVGFHEDNILATNTSLNFPKSVFVSSNGEVYIADTGNHRIRKILANGNIATIAGN 292
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + G + PA + N FV + I+D NQ IR+I
Sbjct: 293 GKEGYGDDNVPAIDTML-NRPSSVFVSS-NEVYIADQSNQRIRKI 335
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 14/92 (15%)
Query: 82 VKRLAGDGVQGYSDGE-PG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
+ +AG+G +GY D P ++P S V +Y+AD+SN IRKI N + TI
Sbjct: 286 IATIAGNGKEGYGDDNVPAIDTMLNRPSSVFVS-SNEVYIADQSNQRIRKILQNGNIVTI 344
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
AG G +EG + SF F+++ PHI
Sbjct: 345 AGNG--REGNSGD-----SF---FDISRYPHI 366
Score = 40.8 bits (94), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 4/105 (3%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
+AG+G G+S +G +A + P V +Y+AD N+ IRKI TN + TIAG
Sbjct: 65 IAGNGQNGFSGDNGPATNAMLNYPIGVFVSPYNEVYIADYFNNRIRKILTNGNIITIAGN 124
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+G + A NA N FV + I+D IR+I
Sbjct: 125 NVNGKGVYNRLAINAML-NRPSGVFVTLNDEVYIADQNRNKIRKI 168
>gi|116754580|ref|YP_843698.1| NHL repeat-containing protein [Methanosaeta thermophila PT]
gi|116666031|gb|ABK15058.1| NHL repeat containing protein [Methanosaeta thermophila PT]
Length = 504
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 54/102 (52%), Gaps = 7/102 (6%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGP 152
DG+ G AR P +G+IYVAD NH I+K +++ + T AG GS G ADGP
Sbjct: 400 DGDFGRARLQHPLGLFYK-EGSIYVADTYNHRIKKADLSSGSIHTTAGTGSP--GFADGP 456
Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
A+F+ LTF+ +L I+D N +R + + D S
Sbjct: 457 GAQAAFNEPSGLTFLGD--SLFIADTNNHAVRIYDQRSGDVS 496
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 54/105 (51%), Gaps = 5/105 (4%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSK 144
G G +GYSDG+ AR +P+ A+ + +Y+AD NH++R + + +AG G
Sbjct: 230 GSGERGYSDGDFSEARLFRPQGIAI-VGDVVYIADTGNHMVRAADLRRRTLVRMAGTGKS 288
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ G S S+ ++L FV L I+ G+ I +++L+
Sbjct: 289 RHPGLGGRGAEVSLSSPWDLVFVQD--HLYIAMAGSHQIWRMDLE 331
>gi|290971764|ref|XP_002668649.1| predicted protein [Naegleria gruberi]
gi|284082135|gb|EFC35905.1| predicted protein [Naegleria gruberi]
Length = 556
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/166 (30%), Positives = 76/166 (45%), Gaps = 29/166 (17%)
Query: 48 HSVIDRPGSSDLIVLDS-----SRSAFYT-LSFPLSEESV---------VKRLAGD---- 88
+S++DR + +I LD+ + ++ YT +S SE S +K+L
Sbjct: 338 NSLVDRVAGTGIIELDTDGKLGNLTSLYTPISVTQSESSSDIYIGTTIGIKKLTDSNKLI 397
Query: 89 ----GVQGYS--DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAG 140
G Q YS DG P +A R P+ A+ G ++++DK +H IRKI + GV + +AG
Sbjct: 398 SSFAGSQAYSSGDGYPATAARLSSPEGIAISTSGEVFISDKGSHTIRKIDSKGVISNVAG 457
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
GS G DGPA D H L Q+IR++
Sbjct: 458 TGS--AGYVDGPALKPKQPIDTAALLRKHARNLYPDSSDRQIIRKL 501
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 37/87 (42%), Gaps = 2/87 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
SAR P AV K I++ D +N IRKI N + G G PA AS
Sbjct: 94 SARLSYPVCIAVSPKDEIFICDNNN--IRKIVNDTFYNVVGNGQLGNSPDGTPAVTASIK 151
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
+ + F P + SD N +IR +
Sbjct: 152 TAYGIAFHPVTGEMYFSDINNNVIRYV 178
>gi|398335921|ref|ZP_10520626.1| hypothetical protein LkmesMB_09964 [Leptospira kmetyi serovar
Malaysia str. Bejo-Iso9]
Length = 947
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 60/118 (50%), Gaps = 7/118 (5%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKE 146
G G +DG +ARF+ PK A D +YVAD NH IR KI++ VTTIAG G+
Sbjct: 722 GTSGNADGVGTAARFNAPKGIATDGT-YLYVADTGNHKIRKIKISDNTVTTIAGSGT--A 778
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
G DG A F+ L + + L ++D + I+ ++L + ++ +G G
Sbjct: 779 GTLDGLGTAAKFNQPSHLVYDSN--KLYVTDTNSNNIKLVDLTTSPVTVTTIAGDPAG 834
>gi|375146499|ref|YP_005008940.1| cell surface receptor IPT/TIG domain-containing protein [Niastella
koreensis GR20-10]
gi|361060545|gb|AEV99536.1| cell surface receptor IPT/TIG domain protein [Niastella koreensis
GR20-10]
Length = 500
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 42/76 (55%), Gaps = 4/76 (5%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
G ++ V D S + Y + P + S++ AG GY DG+P ARF P+ +D
Sbjct: 426 GVGNIYVTDDSTNGIYRVD-PNGKLSLI---AGGVRSGYIDGKPQDARFSGPRGIVIDAS 481
Query: 115 GNIYVADKSNHVIRKI 130
GN++VAD N+ IRKI
Sbjct: 482 GNLFVADIGNNCIRKI 497
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 53/110 (48%), Gaps = 14/110 (12%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
VV +AG G GY DG +A+F P+ +D GNIYVA+ N IRKI G VTT
Sbjct: 253 VVSTIAGSGTYGYLDGPGLTAQFHFPQGLTLDNAGNIYVAEYQNQSIRKIDPAGNVTTF- 311
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+GP ++ ND V + ++D N I +I+L+
Sbjct: 312 --------FINGPQGYVAYPND----VVVDASNVYVTDQSNNRICKISLQ 349
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 67/134 (50%), Gaps = 5/134 (3%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEE-SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
S + V+ ++ A SF + + V AG G D +A+F P A+D
Sbjct: 173 SGKVTVVSCNKMAVSDTSFIYQTKVNSVITFAGQQNPGAVDASGINAQFWSPDDIALDAA 232
Query: 115 GNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
GN+ V++ N IR+IT LG V+TIAG G+ G DGP A F LT + +
Sbjct: 233 GNLIVSEIGNSKIRRITPLGVVSTIAGSGT--YGYLDGPGLTAQFHFPQGLTL-DNAGNI 289
Query: 174 LISDHGNQLIRQIN 187
++++ NQ IR+I+
Sbjct: 290 YVAEYQNQSIRKID 303
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 6/101 (5%)
Query: 31 GYTVTTVIDGHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG 89
G T I+G Q + P+ V+ +S++ V D S + +S + ++ L+G+G
Sbjct: 306 GNVTTFFINGPQGYVAYPNDVVVD--ASNVYVTDQSNNRICKISL---QTGLLSVLSGNG 360
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G DG+P A+F +P A+D N+ +ADK N +RK+
Sbjct: 361 NWGMVDGDPQQAQFYQPAKMALDNNNNLIIADKINGRVRKV 401
>gi|290982010|ref|XP_002673724.1| NHL repeat-containing protein [Naegleria gruberi]
gi|284087309|gb|EFC40980.1| NHL repeat-containing protein [Naegleria gruberi]
Length = 456
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 62/126 (49%), Gaps = 7/126 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNL--GVT 136
+K + G+G+ GY G+ G+A P S AVD GNIY+AD NH IRK T L ++
Sbjct: 115 IKTIVGNGMPGYG-GDSGAATSTSLLFPTSIAVDSSGNIYIADTQNHRIRKFTALTGKLS 173
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKS 196
TIAG + + A A S+ + + I + I+D N IR+I + S
Sbjct: 174 TIAGTEEEGYNGDNIIATTAKLSSPTGVA-LDSIGNVYIADSQNHRIRKITISNGKISTI 232
Query: 197 SQSGSA 202
+ G A
Sbjct: 233 AGDGDA 238
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 6/108 (5%)
Query: 85 LAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAG 140
+AG +GY+ +A+ P A+D GN+Y+AD NH IRKIT N ++TIAG
Sbjct: 175 IAGTEEEGYNGDNIIATTAKLSSPTGVALDSIGNVYIADSQNHRIRKITISNGKISTIAG 234
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICA--LLISDHGNQLIRQI 186
G DG + AS N+ V + + +D N IR+I
Sbjct: 235 DGDAGSPLYDGLSAIASSLNNPTSVAVDQSSSNTVYFADTNNSKIRKI 282
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 49/96 (51%), Gaps = 6/96 (6%)
Query: 96 GEPGSARF---DKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRAD 150
G P + F D P A+ G++YVADKSN I+KI T ++TIAGG +
Sbjct: 361 GFPSTTEFACLDSPFGIAIGPNGDVYVADKSNARIKKINPTTKTISTIAGGSGAGFDGDN 420
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
GPA +S ND V A+ I+D N IR++
Sbjct: 421 GPAILSSL-NDPCGIAVQSNGAVYIADTKNNRIRKV 455
>gi|434403812|ref|YP_007146697.1| thiol-disulfide isomerase-like thioredoxin [Cylindrospermum
stagnale PCC 7417]
gi|428258067|gb|AFZ24017.1| thiol-disulfide isomerase-like thioredoxin [Cylindrospermum
stagnale PCC 7417]
Length = 506
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 67/148 (45%), Gaps = 13/148 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ P L + DS SF V + G G G++DG A+F P
Sbjct: 177 PGKVLATPAG--LFIADSGHHRLVMSSF----NGEVFHIIGTGKSGFTDGNFSEAQFSAP 230
Query: 107 KSFAVDMKGNI-YVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRA--DGPAQNASFSND 161
+ A D + I YVAD NH +RK I V TIAG G + G A + ++
Sbjct: 231 QGMAYDAENQILYVADTENHALRKVDIKRQVVGTIAGTGEQSHNTRPHSGAALETALNSP 290
Query: 162 FELTFVPHICALLISDHGNQLIRQINLK 189
++L + + +L I+ GN I Q++L+
Sbjct: 291 WDLEKIGN--SLFIAMAGNHQIWQLDLE 316
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 55/129 (42%), Gaps = 7/129 (5%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLG 134
E +VK AG G +G DG + F +P D K +++AD IR + N
Sbjct: 316 ENHIVKTYAGTGGEGCVDGNFTESAFAQPSGITTDGK-ELFIADSEISSIRGVELGENGK 374
Query: 135 VTTIAGGGSKKE-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
V T+ GGG + G DG + + + F + L ++D N I+ +N DC
Sbjct: 375 VRTVCGGGFLFDFGDIDGQYFDVRLQHCLGVDFFQN--HLWVTDTYNHKIKLVNPITGDC 432
Query: 194 SKSSQSGSA 202
GSA
Sbjct: 433 QTVLGDGSA 441
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 35/62 (56%), Gaps = 3/62 (4%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
+ + GDG G +G+ + RF +P + + +Y++D +NHVIR++ L VTT+
Sbjct: 433 QTVLGDGSAGLQNGQGKNTRFFEPSGLSA-IDSYLYISDTNNHVIRRVDLETLEVTTLQF 491
Query: 141 GG 142
G
Sbjct: 492 SG 493
>gi|356571070|ref|XP_003553704.1| PREDICTED: NHL repeat-containing protein 2-like [Glycine max]
Length = 1098
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 56/113 (49%), Gaps = 4/113 (3%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVT 136
+ + ++ G +G DG A F++P+ A + K NI YVAD NH +R+I N V
Sbjct: 642 NFIVQIGSSGEEGLQDGSFDDATFNRPQGLAYNAKKNILYVADTENHALREIDFVNEKVR 701
Query: 137 TIAGGGSKKEGRADGPAQNASFSND-FELTFVPHICALLISDHGNQLIRQINL 188
T+AG G+K G ++ N +++ F P + I+ G I + NL
Sbjct: 702 TLAGNGTKGSDYVGGGKGDSQLLNSPWDVCFHPFDEKIYIAMAGQHQIWEHNL 754
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 41/76 (53%), Gaps = 5/76 (6%)
Query: 115 GNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
G IY+AD NH I+K+ T+ V+TIAG G K G DG A A S + +
Sbjct: 857 GEIYIADSYNHKIKKLDPTSKRVSTIAGTG--KAGFKDGTAVKAQLSEPSGIV-EGNKGR 913
Query: 173 LLISDHGNQLIRQINL 188
L I+D N LIR ++L
Sbjct: 914 LFIADTNNSLIRYLDL 929
>gi|283781091|ref|YP_003371846.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
gi|283439544|gb|ADB17986.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
Length = 782
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/67 (46%), Positives = 39/67 (58%), Gaps = 3/67 (4%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
G GV G +DG A FD P+ A+D + +YVAD NH++RKI T VTTIAG G +
Sbjct: 332 GSGVIGKADGSFAEASFDHPQGCALDGE-TLYVADTENHLLRKIDLTKKTVTTIAGTGKQ 390
Query: 145 KEGRADG 151
E G
Sbjct: 391 AESAWPG 397
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 4/49 (8%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
VK +AG G G SD A FD+P A+ + +YVAD +NH+IRK+
Sbjct: 578 VKTIAGSGSPGTSDD---PATFDEPAGLAL-VGETLYVADTNNHLIRKV 622
>gi|290989149|ref|XP_002677205.1| predicted protein [Naegleria gruberi]
gi|284090811|gb|EFC44461.1| predicted protein [Naegleria gruberi]
Length = 691
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 43/68 (63%), Gaps = 5/68 (7%)
Query: 81 VVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VT 136
++ +AG G +GY DG+ G AR P AV+ +G++Y +D N+VIRKI+ G +T
Sbjct: 226 LIHTVAGFGEKGY-DGDGGFAFYARIRCPTGIAVNQRGDVYFSDSGNNVIRKISTCGIIT 284
Query: 137 TIAGGGSK 144
TIAG G K
Sbjct: 285 TIAGTGEK 292
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 63/115 (54%), Gaps = 8/115 (6%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVT 136
++ +AG G +G++ DG + ++ P A G++Y+AD+ NH IRK++ + ++
Sbjct: 111 IITTIAGCGERGFNRDGCVATVSKLSYPTRIARCENGDLYIADRHNHRIRKVSAVDGSIS 170
Query: 137 TIAGGGSKKEG-RADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
TIAG G +EG DG PA +A + L ISD+ N IR+++L+
Sbjct: 171 TIAGTG--REGYNGDGIPAIHADLYFPTGIAIDEREQELFISDYHNNRIRRVSLR 223
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 54/113 (47%), Gaps = 3/113 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S ++ +AG G +G++ + + + P A+ +G I D N+ I KI N
Sbjct: 277 ISTCGIITTIAGTGEKGFNGDQMDALDVMLNGPSGIAISQEGEILFTDIYNNRICKI-NR 335
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T + G+ +EG DGP + A F+N ++ + + D GN IR+I
Sbjct: 336 DRTLVTIAGTTEEGYQDGPVRMAKFNNPSDIAIDNETGDVYVVDGGNNYIRKI 388
>gi|441185716|ref|ZP_20970504.1| hypothetical protein SRIM_41417, partial [Streptomyces rimosus
subsp. rimosus ATCC 10970]
gi|440613960|gb|ELQ77297.1| hypothetical protein SRIM_41417, partial [Streptomyces rimosus
subsp. rimosus ATCC 10970]
Length = 108
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 47/78 (60%), Gaps = 3/78 (3%)
Query: 79 ESVVKRLAGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGV 135
+ + +AG+GV G+ SDG G+ R P+ VD GN+++AD+ NH IRK+T N +
Sbjct: 18 DGSIATVAGNGVAGFISDGGHGALTRVYYPQGVTVDKNGNLFIADRYNHRIRKVTPNGTI 77
Query: 136 TTIAGGGSKKEGRADGPA 153
TT+AG G+ GPA
Sbjct: 78 TTVAGDGNAGYISDGGPA 95
>gi|428169673|gb|EKX38605.1| hypothetical protein GUITHDRAFT_56349, partial [Guillardia theta
CCMP2712]
Length = 210
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 73/161 (45%), Gaps = 10/161 (6%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + S +++ D S L PL + V +AG GY DG A+FD
Sbjct: 26 NPVGISVSADSQYVVIADYLNSLIRLLFVPLLQ---VTTVAGTSGSGYEDGTCAMAKFDG 82
Query: 106 PKS--FAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
P + +D K + +AD N+ IR +++ V+TIAG G+ G+ DG A+F+
Sbjct: 83 PIDVVWGID-KSLVLIADFWNNRIRSLNLSSCSVSTIAGSGAF--GQLDGSTAAATFAGP 139
Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
L LL+++ +R I+L + + SG A
Sbjct: 140 CSLALTSQGSKLLVAEFSGHSVRMIDLSKNTVTTLAGSGIA 180
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V +AG G G DG +A F P S A+ +G+ + VA+ S H +R I + VTT+
Sbjct: 115 VSTIAGSGAFGQLDGSTAAATFAGPCSLALTSQGSKLLVAEFSGHSVRMIDLSKNTVTTL 174
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELT 165
AG G G DG A F+N ++LT
Sbjct: 175 AGSG--IAGLQDGVGLIAQFTNPWDLT 199
>gi|418737091|ref|ZP_13293489.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
str. 200801910]
gi|410747250|gb|EKR00156.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
str. 200801910]
Length = 358
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 58/109 (53%), Gaps = 7/109 (6%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKK 145
G+ G+ +G+ ++ F P +D + N+YV + NH IRKI NL V+T++GG S
Sbjct: 163 GIGGFQNGDRLNSLFKGPLFMDLDRERNLYVGELGNHTIRKI-NLNSETVSTLSGGVS-- 219
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
G DG +A F + + +LL++D N IR+I+LK S
Sbjct: 220 -GYLDGDLTSAQFKFPSGIIYDQKTDSLLVADLQNHRIRKIDLKTSTVS 267
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 1/81 (1%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
++LD +T ++ VV AG G++ DG +A F P S +D GNI+V
Sbjct: 35 MILDCILKECHTCKLKVTNNPVVSPFAGTGIRESIDGTTQTASFKTPFSLELDTFGNIFV 94
Query: 120 ADKSNHVIRKITNLG-VTTIA 139
+D +++IRKI G VTT++
Sbjct: 95 SDSDSNLIRKIDRFGNVTTLS 115
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 42/80 (52%), Gaps = 6/80 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V L+G GV GY DG+ SA+F P D K + + VAD NH IRKI V+T+
Sbjct: 211 VSTLSG-GVSGYLDGDLTSAQFKFPSGIIYDQKTDSLLVADLQNHRIRKIDLKTSTVSTL 269
Query: 139 AGGGSKKEGRADGPAQNASF 158
G G E DG NASF
Sbjct: 270 LGNG--IEASIDGKGLNASF 287
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 1/62 (1%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVT 136
+ S V L G+G++ DG+ +A FD P ++D G ++V+D S++ IR + +L V
Sbjct: 262 KTSTVSTLLGNGIEASIDGKGLNASFDGPAFISLDNSGYMFVSDTSSNKIRIVDPDLNVF 321
Query: 137 TI 138
TI
Sbjct: 322 TI 323
>gi|387219837|gb|AFJ69627.1| nhl repeat containing 2 [Nannochloropsis gaditana CCMP526]
Length = 295
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKI 130
V RLAG G +GY DG A FD+P+ VD K G +V D +NH IR +
Sbjct: 229 VSRLAGSGKRGYRDGAGAEAEFDEPRGLCVDGKGGQAFVCDSNNHCIRTV 278
>gi|297582981|ref|YP_003698761.1| NHL repeat containing protein [Bacillus selenitireducens MLS10]
gi|297141438|gb|ADH98195.1| NHL repeat containing protein [Bacillus selenitireducens MLS10]
Length = 414
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 48/97 (49%), Gaps = 4/97 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G DG A F P+ AV G I VAD NH +R IT V +AG G G +G
Sbjct: 320 GSDDGPVSEAGFHFPRGVAVLSSGAILVADTYNHRLRLITEDEVLPVAGHGVH--GMVNG 377
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
P ++A F + + +L++DH N +IR++ L
Sbjct: 378 PVEDALFDGPYHVAVFGE--RILVTDHWNHMIREVEL 412
Score = 40.0 bits (92), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 70/195 (35%), Gaps = 63/195 (32%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---------SARFD 104
P ++ DS +S E+ R DG Y GEP A FD
Sbjct: 111 PFDDGFLITDSGNHTLRYVSMSGQSETFAGRY--DGYDEY--GEPEGRFDHGTGEEAGFD 166
Query: 105 KPKSFAVDMK-GNIYVADKSNHVIRKITNLG-VTTIA----------------------- 139
P D G +YVAD N IR++T G V+T+A
Sbjct: 167 TPLGLVYDEDSGLLYVADAGNGAIRRVTEDGEVSTVAEDLDYPTDLILLDGSLIVSEARA 226
Query: 140 -----------------GGGSKKE-----GR-ADGPAQNASFSNDFELTFVPHICALLIS 176
GGG ++ GR ADG + A F+ F L + ++++
Sbjct: 227 HVLTLVDPESGETEHLAGGGYEEADGELVGRFADGSGEGAGFNEPFGLAVLEE--TIVVA 284
Query: 177 DHGNQLIRQINLKPE 191
D GNQ IRQ+ L E
Sbjct: 285 DSGNQRIRQVTLDGE 299
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/108 (25%), Positives = 43/108 (39%), Gaps = 9/108 (8%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG-------SK 144
GY +G P +F+ P V + D NH +R ++ G + G +
Sbjct: 92 GYQNGNPDDTKFNHPAGI-VPFDDGFLITDSGNHTLRYVSMSGQSETFAGRYDGYDEYGE 150
Query: 145 KEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
EGR D G + A F L + L ++D GN IR++ E
Sbjct: 151 PEGRFDHGTGEEAGFDTPLGLVYDEDSGLLYVADAGNGAIRRVTEDGE 198
>gi|290976760|ref|XP_002671107.1| predicted protein [Naegleria gruberi]
gi|284084673|gb|EFC38363.1| predicted protein [Naegleria gruberi]
Length = 1349
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 67/154 (43%), Gaps = 2/154 (1%)
Query: 34 VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQG 92
V T G+ + P S G L +S + F + + ++R+ D V
Sbjct: 454 VGTGTQGNSGDYGPASQCQLNGPQGLATPPTSMKGQFLFDFADTNNNRIRRIQNDLTVIP 513
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP 152
P + + P G++Y+A + NH I K +N ++T+AG G GP
Sbjct: 514 LLSNNPNNDGINGPNGIMYTPNGDLYIASQGNHKIMKYSNSQLSTVAGNGYAGYSGDGGP 573
Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A+NA + ++T V + I+D GN+ +R++
Sbjct: 574 ARNAKLNYPADVT-VSSTGEVYIADWGNRAVRKV 606
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 79 ESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-- 134
S + +AG+G GYS DG P +A+ + P V G +Y+AD N +RK+ G
Sbjct: 553 NSQLSTVAGNGYAGYSGDGGPARNAKLNYPADVTVSSTGEVYIADWGNRAVRKVFRNGTI 612
Query: 135 VTTIAGGG 142
+T I GG
Sbjct: 613 ITLIVSGG 620
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 72/165 (43%), Gaps = 36/165 (21%)
Query: 4 HFVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVID--RPGSSDLIV 61
+F+ S L T + L+ + SG + NPH+ + P + +
Sbjct: 325 NFIDSDLPSTSTTPLYGMYDSGFQFD----------------NPHNALSVYMPSPQQMRL 368
Query: 62 LDSSRSAFYTLSFPLSEESVVK---------RLAGDGVQ-GYS-DGEPGS-ARFDKPKSF 109
S++ L FP+ +VVK R+AG Q GY+ D P + A +P +
Sbjct: 369 TSSNQ-----LYFPIFNSAVVKVDLNSGLVTRIAGSITQFGYNQDSIPATIALMTRPNTV 423
Query: 110 AVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPA 153
G++ ++D N+ IRKI + G+ TTI G G++ GPA
Sbjct: 424 LQLDSGDLIISDTGNNRIRKIDSNGIITTIVGTGTQGNSGDYGPA 468
>gi|229822379|ref|YP_002883905.1| alkyl hydroperoxide reductase [Beutenbergia cavernae DSM 12333]
gi|229568292|gb|ACQ82143.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Beutenbergia cavernae DSM 12333]
Length = 641
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 59/132 (44%), Gaps = 3/132 (2%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S DL + + +TL E+ + RLAG +G DG A F +P AV G
Sbjct: 324 SPDLAAFVVAMAGNHTLWAFDDEQGSLTRLAGTEHEGLRDGPAAEAWFAQPSGLAVAQDG 383
Query: 116 NIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
I+VAD +R + G V T G G + G DG A A + + +P ++
Sbjct: 384 RIWVADSETSALRWLDPAGGDVHTAVGQGLFEFGHRDGAADQALLQHPLGVAALPD-ASV 442
Query: 174 LISDHGNQLIRQ 185
L++D N +R+
Sbjct: 443 LVADTYNGALRR 454
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 38/162 (23%), Positives = 72/162 (44%), Gaps = 36/162 (22%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA---VD 112
+ +L+V D+ + L E++V+R+ G G +G DG P ARF +P V+
Sbjct: 189 NGNLLVADAGHHSLAELG--PDGETLVRRV-GSGERGLVDGGPNVARFSEPNGLCLVPVE 245
Query: 113 MKG----NIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRADGPAQNAS--------- 157
++ ++ VAD NH +R ++ + V+T+AG G + G NAS
Sbjct: 246 LRARLGYDVLVADTVNHALRGVRLADGHVSTVAGTGQQY---VVGAPDNASDPHGGTHPV 302
Query: 158 ------------FSNDFELTFVPHICALLISDHGNQLIRQIN 187
S+ +++ + P + A +++ GN + +
Sbjct: 303 GFGDQFVPTSVKLSSPWDVAWSPDLAAFVVAMAGNHTLWAFD 344
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 52/109 (47%), Gaps = 8/109 (7%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNAS 157
PG+ RF K+ A++ GN+ VAD +H + ++ G T + GS + G DG A
Sbjct: 175 PGTLRFPA-KAIALE-NGNLLVADAGHHSLAELGPDGETLVRRVGSGERGLVDGGPNVAR 232
Query: 158 FSNDFELTFVP-HICA-----LLISDHGNQLIRQINLKPEDCSKSSQSG 200
FS L VP + A +L++D N +R + L S + +G
Sbjct: 233 FSEPNGLCLVPVELRARLGYDVLVADTVNHALRGVRLADGHVSTVAGTG 281
>gi|290985160|ref|XP_002675294.1| predicted protein [Naegleria gruberi]
gi|284088889|gb|EFC42550.1| predicted protein [Naegleria gruberi]
Length = 1229
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 54/105 (51%), Gaps = 5/105 (4%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLGVTTIA 139
+V +G + GY DG +ARF P +D G++YV+D N VIRKI+N V+T+A
Sbjct: 608 IVSTFSGGTISGYQDGPIQTARFYFPNGLDIDTTNGDLYVSDTFNCVIRKISNGVVSTVA 667
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
G + +G AS + L++ ++ + + N+LI
Sbjct: 668 -GQYQNTNYQNGNLSTASIGYPYYLSYYN---GMIFTVYDNELIE 708
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 65/134 (48%), Gaps = 9/134 (6%)
Query: 76 LSEESVVKRLAGDGVQ---GYSDGEPGSARFDKPKSFAV-DMKGNIYVADKSNHVIR--K 129
L V+ +AGDG GY DG A F S + D ++++D +N +R
Sbjct: 230 LLSNGVMSVIAGDGNNYNNGYRDGNATQALFYYIPSLSFKDSTQELFISDLNNQRVRVLN 289
Query: 130 ITNLGVTTIAGGGSKKEGRADGPAQNAS--FSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ + V+T+ G R P QN + +++ ++P + LLI+D N++IR+++
Sbjct: 290 MNSNQVSTLLGNDINSYERTYLP-QNGKDVYVRPYQIQYLPQLDQLLIADLDNRVIRKVD 348
Query: 188 LKPEDCSKSSQSGS 201
L+ + + +GS
Sbjct: 349 LQNQQAWTTIIAGS 362
Score = 37.0 bits (84), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 66/136 (48%), Gaps = 13/136 (9%)
Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICA 172
G+IY++ N++++ ++N ++ IAG G+ G DG A A F L+F
Sbjct: 216 NGDIYLS-AYNNIVKLLSNGVMSVIAGDGNNYNNGYRDGNATQALFYYIPSLSFKDSTQE 274
Query: 173 LLISDHGNQLIRQINLKPEDCSKSSQSGSALGA-VSVWVLVSVLSCLVSLVIGFVARPYI 231
L ISD NQ +R +N+ S+Q + LG ++ + + + + RPY
Sbjct: 275 LFISDLNNQRVRVLNM------NSNQVSTLLGNDINSYERTYLPQNGKDVYV----RPYQ 324
Query: 232 IRHTGRLDPPLQHDME 247
I++ +LD L D++
Sbjct: 325 IQYLPQLDQLLIADLD 340
>gi|74317309|ref|YP_315049.1| hypothetical protein Tbd_1291 [Thiobacillus denitrificans ATCC
25259]
gi|74056804|gb|AAZ97244.1| conserved hypothetical protein containing twin-arginine
translocation pathway signal [Thiobacillus denitrificans
ATCC 25259]
Length = 510
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 48/109 (44%), Gaps = 8/109 (7%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-------VTT 137
+ G + DG ARF+ D GN+YVAD+ NHVIRKIT V+T
Sbjct: 332 MVGQKQGAFVDGPANEARFNGCSQIDYDNAGNLYVADRFNHVIRKITPSADPMVGKMVST 391
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A G + GR GPA A F + L+ + I D N + I
Sbjct: 392 YA-GVPMQSGRISGPASKAKFYEPWGLSVDRKNNLIFIGDTANSRVAVI 439
Score = 43.5 bits (101), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 63/141 (44%), Gaps = 14/141 (9%)
Query: 59 LIVLDSSRSAFYTLSFPLS-------------EESVVKRLAGDGVQGYSDGEPGSARFDK 105
L + +++A TLS P S EE AG+ G +G SA F
Sbjct: 41 LTAVKGAQAALPTLSTPFSVATNEGFPLPKSPEEIWSLDYAGNLFPGMRNGPALSAFFYP 100
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
++ G+I VAD N +IR I+NLG TI G ++ DG A + +F++ ++
Sbjct: 101 ATGMYIEDNGSIKVADARNSLIRNISNLGDVTIFSGSMQQYPFKDGSAVDTNFNSPNDID 160
Query: 166 FVPHICALLISDHGNQLIRQI 186
+ ++ D N IR++
Sbjct: 161 KLSD-GTYVVGDRENNAIRRV 180
>gi|296124331|ref|YP_003632109.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
gi|296016671|gb|ADG69910.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
Length = 581
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 42/71 (59%), Gaps = 5/71 (7%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAG 140
++ G G G DG SA+FD P+ A+D GN +YVAD NH++R + T V+T+AG
Sbjct: 283 KVIGSGKIGAKDGPAESAQFDHPQGMALD--GNTLYVADTENHLLRTVNLTTWEVSTLAG 340
Query: 141 GGSKKEGRADG 151
G + GR G
Sbjct: 341 TGEQARGRDRG 351
>gi|290995346|ref|XP_002680256.1| predicted protein [Naegleria gruberi]
gi|284093876|gb|EFC47512.1| predicted protein [Naegleria gruberi]
Length = 1226
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 62/131 (47%), Gaps = 6/131 (4%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
DL DS+ A + + V+ L G+G G+ DG+ SA F+ P S A+ G+
Sbjct: 398 DLYFCDSNNHAIR----KIDTQGVITTLTGNGTAGFQDGDAASALFNSPASIALLPNGDF 453
Query: 118 YVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
VAD +N IRK + V TIAG + +G A A + + + +LI
Sbjct: 454 LVADFTNSRIRKYVASTKQVVTIAGNSTLGFYGDNGLASKALLNKPTGVFYNSTSGDILI 513
Query: 176 SDHGNQLIRQI 186
+D+ N IR+I
Sbjct: 514 ADYFNFRIRKI 524
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 46/102 (45%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
+AG G S+ +A+F +P+ V +Y+ D N++IRKI N V I G G +
Sbjct: 40 VAGGGSSLSSNISALNAKFSQPRCVVVSPNNEVYICDTGNYMIRKIVNGVVYNIVGNGQQ 99
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A NA S + F P L SD+ N I+ +
Sbjct: 100 GSSTDGTLAVNAKVSLVLSVAFHPVSGELFFSDYSNNAIKYV 141
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 4/82 (4%)
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFE 163
+P A G++Y D +NH IRKI GV TT+ G G+ G DG A +A F++
Sbjct: 387 QPAGMAFASNGDLYFCDSNNHAIRKIDTQGVITTLTGNGT--AGFQDGDAASALFNSPAS 444
Query: 164 LTFVPHICALLISDHGNQLIRQ 185
+ +P+ L++D N IR+
Sbjct: 445 IALLPN-GDFLVADFTNSRIRK 465
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 69/146 (47%), Gaps = 10/146 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY--SDGEPGSARFD 104
P S+ P + D +V D + S + S + VV +AG+ G+ +G A +
Sbjct: 442 PASIALLP-NGDFLVADFTNSRI--RKYVASTKQVVT-IAGNSTLGFYGDNGLASKALLN 497
Query: 105 KPKS-FAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSND 161
KP F G+I +AD N IRKI+N +TTIAG GS DG A ++
Sbjct: 498 KPTGVFYNSTSGDILIADYFNFRIRKISNNTGIITTIAGSGSTSY-NGDGLAATSTSMAP 556
Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
+ L P L +D+ N L+R+I+
Sbjct: 557 YGLAVNPISGELFFTDNANFLVRRID 582
>gi|357400623|ref|YP_004912548.1| hypothetical protein SCAT_3039 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|386356676|ref|YP_006054922.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|337767032|emb|CCB75743.1| conserved protein of unknown function [Streptomyces cattleya NRRL
8057 = DSM 46488]
gi|365807185|gb|AEW95401.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
Length = 613
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 59/133 (44%)
Query: 68 AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
A TL L E K G Y EP P + GN+ V+D + H +
Sbjct: 146 AIATLVEELEAEHAAKGTLRRGDGPYVPPEPVPTDLRFPGKALLLPTGNLLVSDSTRHAL 205
Query: 128 RKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
++ G T + GS + G ADGPA A FS L +P +++++D N +R+++
Sbjct: 206 VELAPDGETVVRRIGSGERGLADGPADRARFSEPQGLALLPDGESVIVADTVNHALRRVD 265
Query: 188 LKPEDCSKSSQSG 200
L + + + +G
Sbjct: 266 LASGEVTTVAGTG 278
>gi|302551068|ref|ZP_07303410.1| NHL repeat-containing protein [Streptomyces viridochromogenes DSM
40736]
gi|302468686|gb|EFL31779.1| NHL repeat-containing protein [Streptomyces viridochromogenes DSM
40736]
Length = 498
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 59/109 (54%), Gaps = 5/109 (4%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTT 137
+ +AG G+ G++ D +P +A+ + P + AVD GN+++AD SNH +R++ +TT
Sbjct: 244 ITTVAGTGIAGFNGDNQPAVNAQLNAPHTVAVDSGGNVFIADTSNHRVRRVDAATRQITT 303
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+AG G+ GPA ++ + V L I+D N +R++
Sbjct: 304 VAGIGTAGFNGDGGPAIGVQLNSPVGVA-VDSGGGLFIADASNYRVRKV 351
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 66/136 (48%), Gaps = 15/136 (11%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--T 131
++ E + +AG+G G+S G A+ + P+ AVD GN++ +D SNH +R++
Sbjct: 10 VTREPAIYIVAGNGTAGFSSDYGPAVCAQLNYPRKVAVDSSGNLFFSDYSNHRVRRVDAA 69
Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQI 186
+TT+AG G+ +G Q A +S L + + L I+D N +R++
Sbjct: 70 TQQITTVAGNGTAGF---NGDNQPAVYS---RLHYPRGVAVDAGGNLFIADCNNHRVRRV 123
Query: 187 NLKPEDCSKSSQSGSA 202
+ + + +G A
Sbjct: 124 DAATRQITTVAGTGIA 139
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 31/125 (24%), Positives = 60/125 (48%), Gaps = 5/125 (4%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTT 137
+ +AG G+ G++ D +P +A+ + P AVD GN+++ D +N +R++ +TT
Sbjct: 130 ITTVAGTGIAGFNGDNQPAVNAQLNSPIGVAVDAGGNLFITDFNNQRVRRVDAATRQITT 189
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS 197
+AG G+ + A NA + L I D+ N +R+++ + +
Sbjct: 190 VAGTGTAGFNGDNQSAVNAQLYGPVGVA-ADAGGNLFIGDYANHRVRRVDAATRQITTVA 248
Query: 198 QSGSA 202
+G A
Sbjct: 249 GTGIA 253
Score = 37.0 bits (84), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 39/66 (59%), Gaps = 5/66 (7%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT--- 136
+ +AG G G++ DG P + + P AVD G +++AD SN+ +RK++++ T
Sbjct: 301 ITTVAGIGTAGFNGDGGPAIGVQLNSPVGVAVDSGGGLFIADASNYRVRKVSDVASTARF 360
Query: 137 TIAGGG 142
++A GG
Sbjct: 361 SVAPGG 366
>gi|384568064|ref|ZP_10015168.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
glauca K62]
gi|384523918|gb|EIF01114.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
glauca K62]
Length = 634
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 61/138 (44%), Gaps = 15/138 (10%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
DL+ L + +F SE + R A D G+ DG RF P V +G +
Sbjct: 156 DLVALHEEKGTLRRGAFDPSEAA---RAAADNRGGWLDG----LRF--PSKAVVTAEGRV 206
Query: 118 YVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA----- 172
VAD +NH I + + T + G + G DG A F+ LT +P A
Sbjct: 207 LVADTANHSIAEFASDVETLLRRFGDGRRGSVDGAFDVARFAEPSGLTLLPTHVADRVGY 266
Query: 173 -LLISDHGNQLIRQINLK 189
LLI+D N L+R ++L+
Sbjct: 267 HLLIADTANHLLRGVDLR 284
>gi|336399828|ref|ZP_08580628.1| NHL repeat containing protein [Prevotella multisaccharivorax DSM
17128]
gi|336069564|gb|EGN58198.1| NHL repeat containing protein [Prevotella multisaccharivorax DSM
17128]
Length = 494
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 53/106 (50%), Gaps = 10/106 (9%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-----TNLGVTTIAG--GGS 143
+G+ DG +A+F+ P+ G +Y+AD NH IR I N VTT G G S
Sbjct: 392 KGWEDGSLKNAKFNYPRQICFTKDGRLYIADSQNHCIRSIDTTQGANARVTTPIGVPGSS 451
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G ADG + A F+ + + I+D NQ+IR++++K
Sbjct: 452 ---GYADGGVELAKFNIPMGVAVSSDGEKVYIADMKNQVIRELSIK 494
>gi|324997987|ref|ZP_08119099.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Pseudonocardia sp. P1]
Length = 614
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 47/89 (52%), Gaps = 3/89 (3%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRAD 150
Y+DG GSARF +P+ V + VAD NH +R+++ + V+T+AG GS+ R D
Sbjct: 231 YTDGPAGSARFSEPQGLLVLDPSTVLVADTVNHAVRRVSLDDGTVSTVAGTGSQLRERVD 290
Query: 151 GPAQNASFSNDFELT-FVPHICALLISDH 178
A S+ ++L + H+ + H
Sbjct: 291 PGGTAAELSSPWDLAPWDGHVVVAMAGSH 319
Score = 40.8 bits (94), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 42/175 (24%), Positives = 72/175 (41%), Gaps = 19/175 (10%)
Query: 28 LEDGYTVTTVIDGHQLE--INPHSVIDR--------PGSSDLIVLDSSRSAFYTLSFPLS 77
L+DG T G QL ++P P ++V + +T+ P+S
Sbjct: 270 LDDGTVSTVAGTGSQLRERVDPGGTAAELSSPWDLAPWDGHVVVAMAGSHQLWTVD-PVS 328
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-----TN 132
++ V LAG +G DG A +P A G ++VAD +R++
Sbjct: 329 GQARV--LAGTTNEGLRDGSFAEAFLAQPSGLATGPDGTLWVADSEISALRRVDVDPGAG 386
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
V+T G G + G DGPA A + + +P ++ ++D N +R+ +
Sbjct: 387 PAVSTAVGQGLFEFGHRDGPAAEALLQHPLGVAVLPD-GSVAVADTYNGAVRRFD 440
>gi|108707787|gb|ABF95582.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
protein, expressed [Oryza sativa Japonica Group]
Length = 772
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 54/105 (51%), Gaps = 4/105 (3%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAGGGS 143
G +G DG +A F++P+ A + K NI YVAD NH +R+I + V T+AG G+
Sbjct: 610 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINFVSETVKTLAGNGT 669
Query: 144 K-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
K + R G N ++ +++ + P L I+ G I + N
Sbjct: 670 KGSDYRGGGQGTNQVLNSPWDVCYDPSKETLYIAMAGQHQIWKHN 714
Score = 41.6 bits (96), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 42/96 (43%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P A+D+ N SNH +TNL I GS +EG DG ASF+ L
Sbjct: 574 PGKLAIDVLNNRLFISDSNHNRIVVTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQGLA 633
Query: 166 FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
+ L ++D N +R+IN E + +G+
Sbjct: 634 YNSKKNILYVADTENHALREINFVSETVKTLAGNGT 669
>gi|290997788|ref|XP_002681463.1| predicted protein [Naegleria gruberi]
gi|284095087|gb|EFC48719.1| predicted protein [Naegleria gruberi]
Length = 2297
Score = 50.8 bits (120), Expect = 6e-04, Method: Composition-based stats.
Identities = 39/113 (34%), Positives = 54/113 (47%), Gaps = 3/113 (2%)
Query: 78 EESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
E + GDG +G S DG P A + V G++Y+ADK+ H IRK++N +
Sbjct: 42 ERFTITSYVGDGNEGNSGDGVSPSHASIGYVDAIEVGSNGDLYIADKTYHRIRKVSNGII 101
Query: 136 TTIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
TIAG GS D G A +A + L F L + D N IR+I+
Sbjct: 102 KTIAGTGSSGGYFGDNGVATSAKLNKPQALAFYSGDGTLYLGDSLNYRIRKIS 154
Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats.
Identities = 34/145 (23%), Positives = 67/145 (46%), Gaps = 8/145 (5%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY--SDGEPGSARF 103
+P+ V P + +L + D+ + + ++ + + G GV GY G+ A+
Sbjct: 239 SPNDVAISPTTGELFIADTGNNVIRKVGL----DNKIITVVGTGVSGYLGDGGQAKQAQL 294
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAGGGSKKEGRADGP-AQNASFSND 161
P S A G + ++D N+VIRK+ + G+ TIAG DG + +A +
Sbjct: 295 SSPTSIAFTSAGEMLISDSDNYVIRKVYSNGIIRTIAGSARNSGSVGDGTDSLSAQIDSV 354
Query: 162 FELTFVPHICALLISDHGNQLIRQI 186
+ +++ + I+D N +R++
Sbjct: 355 YSISYSNISNEVFIADTSNFRVRKL 379
Score = 46.2 bits (108), Expect = 0.014, Method: Composition-based stats.
Identities = 32/115 (27%), Positives = 56/115 (48%), Gaps = 2/115 (1%)
Query: 76 LSEESVVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S ++ + G G +G ++G S+ + AV G +Y +D + H I KI+
Sbjct: 153 ISTNQIISTVIGKGTKGALTEGTATSSTIGQVLGLAVSSSGVLYFSDYTYHCIGKISGTV 212
Query: 135 VTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
V+ AG S G +G A +A ++ ++ P L I+D GN +IR++ L
Sbjct: 213 VSVAAGTCSLLGYGGDNGVATSAKLNSPNDVAISPTTGELFIADTGNNVIRKVGL 267
>gi|290985802|ref|XP_002675614.1| predicted protein [Naegleria gruberi]
gi|284089211|gb|EFC42870.1| predicted protein [Naegleria gruberi]
Length = 848
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 61/113 (53%), Gaps = 9/113 (7%)
Query: 32 YTVTTVIDG-HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
+T+ T+I +Q++ P +I P S DL + + + + S S E V +AG+G
Sbjct: 521 FTIQTIIQAPNQVQFLPQQLIYGP-SCDLYIACTLENQIFKYS---SSEGFV-LIAGNGK 575
Query: 91 QGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
GY+ G AR + PK+ AV+ G +Y+AD N V+RKI +N + T+ G
Sbjct: 576 VGYTGDYGPAIYARLNNPKAIAVNKFGEVYIADSFNKVVRKIFSNETIVTLIG 628
>gi|158319005|ref|YP_001511513.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158114410|gb|ABW16607.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 807
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 48/96 (50%), Gaps = 4/96 (4%)
Query: 95 DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA--DG 151
DG P +A P A+ G +YVAD S H +R+IT G T+ G ++ G DG
Sbjct: 642 DGGPAVNATLTSPSGLALADDGTLYVADSSAHTVRRITKDGKITLVAGKPEQSGYEGDDG 701
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
PA +A ++ +L P L I+D N +IR +
Sbjct: 702 PAVDALLASPQDLALGPA-GELYIADTYNNVIRMVT 736
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 56/118 (47%), Gaps = 6/118 (5%)
Query: 76 LSEESVVKRLAGDGVQ-GYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++++ + +AG Q GY DG A P+ A+ G +Y+AD N+VIR +T
Sbjct: 678 ITKDGKITLVAGKPEQSGYEGDDGPAVDALLASPQDLALGPAGELYIADTYNNVIRMVTP 737
Query: 133 LG-VTTIAGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G +TT AG ADG A S+ L V A+ +S+ ++R+I L
Sbjct: 738 DGKITTFAGSDESTADDADGALATETRISSPTGLA-VDTSGAVYVSEGSYGVVRRIAL 794
>gi|290971766|ref|XP_002668650.1| predicted protein [Naegleria gruberi]
gi|284082136|gb|EFC35906.1| predicted protein [Naegleria gruberi]
Length = 728
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 56/113 (49%), Gaps = 6/113 (5%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKI-TNL 133
E + +AG G G+S G+ G A +K P+ V +Y+AD+ NH IRKI N
Sbjct: 207 ENGNIVTIAGKGTAGFS-GDNGLATKEKLNFPRCVFVSSNNEVYIADQINHRIRKILENG 265
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G +G A NA N FV + I+++GN IR+I
Sbjct: 266 NIVTIAGNGPYGFCGDNGLATNAQL-NSPAGVFVSSNNEIYIAEYGNHRIRKI 317
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 6/106 (5%)
Query: 85 LAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
+AG G G+S G+ G A +K P+ V +Y+AD+ NH IRKI N + TIAG
Sbjct: 102 IAGKGTAGFS-GDNGLATKEKLNFPRCVFVSSNNEVYIADQINHRIRKILENGNIVTIAG 160
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G +G A NA N FV + I+D+ N IR+I
Sbjct: 161 NGPYGFCGDNGLATNAQL-NSPAGVFVSSNNEIYIADYDNHRIRKI 205
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 51/112 (45%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+ +G +A+ + P V IY+AD NH IRKI N
Sbjct: 151 ENGNIVTIAGNGPYGFCGDNGLATNAQLNSPAGVFVSSNNEIYIADYDNHRIRKILENGN 210
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G+ +G A N FV + I+D N IR+I
Sbjct: 211 IVTIAGKGTAGFSGDNGLATKEKL-NFPRCVFVSSNNEVYIADQINHRIRKI 261
Score = 43.9 bits (102), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 44/88 (50%), Gaps = 2/88 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
SA+ P+S V +Y+AD NH IRKI N + TIAG G+ +G A A
Sbjct: 7 SAQLYSPESVFVSSNNEVYIADYGNHRIRKILENGNIVTIAGNGTAGFSGDNGIATKAQL 66
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
+ + FV + I+D+ N IR+I
Sbjct: 67 NGPVGV-FVSSNNEVYIADYDNHRIRKI 93
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 50/112 (44%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+S +G A+ + P V +Y+AD NH IRKI N
Sbjct: 39 ENGNIVTIAGNGTAGFSGDNGIATKAQLNGPVGVFVSSNNEVYIADYDNHRIRKILENGN 98
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ IAG G+ +G A N FV + I+D N IR+I
Sbjct: 99 IVIIAGKGTAGFSGDNGLATKEKL-NFPRCVFVSSNNEVYIADQINHRIRKI 149
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 45/98 (45%), Gaps = 17/98 (17%)
Query: 78 EESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
E + +AG+G G+ +G +A+ + P V IY+A+ NH IRKI N
Sbjct: 263 ENGNIVTIAGNGPYGFCGDNGLATNAQLNSPAGVFVSSNNEIYIAEYGNHRIRKILENGN 322
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND--FELTFVPHI 170
+ T+AG G+ A FS D F++ PH
Sbjct: 323 IVTVAGNGT------------AGFSGDSPFDIRMYPHF 348
>gi|428182271|gb|EKX51132.1| hypothetical protein GUITHDRAFT_103053 [Guillardia theta CCMP2712]
Length = 1472
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 50/97 (51%), Gaps = 4/97 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIA 139
V+ LAG G G+ DG SA F P+ +D +Y+AD +H IR I+ V+T+A
Sbjct: 1328 VQTLAGSGANGFVDGVGTSAEFHSPRDIVLD-GDKLYIADSGSHRIRTLSISTRDVSTLA 1386
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
G + K+G DG A F + F L + +L +S
Sbjct: 1387 GY-AWKQGTQDGKGTFARFLSPFSLALSSAVNSLALS 1422
>gi|418693423|ref|ZP_13254475.1| putative lipoprotein [Leptospira kirschneri str. H1]
gi|409958780|gb|EKO17669.1| putative lipoprotein [Leptospira kirschneri str. H1]
Length = 358
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 63/119 (52%), Gaps = 9/119 (7%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
+ G +G+ + FD P +D + N+YV + SNH IRKI NL V+T +GG S
Sbjct: 164 LSGLQNGDRLDSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLDSGTVSTFSGGIS--- 219
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + + + +LL +D + IR+I+LK + S+ G+ +GA
Sbjct: 220 GYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKIDLKTS--TVSTLLGNGIGA 276
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 36/70 (51%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95
Query: 121 DKSNHVIRKI 130
D+ ++IRKI
Sbjct: 96 DQITNLIRKI 105
>gi|424835068|ref|ZP_18259738.1| NHL repeat-containing protein [Clostridium sporogenes PA 3679]
gi|365978195|gb|EHN14287.1| NHL repeat-containing protein [Clostridium sporogenes PA 3679]
Length = 467
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 75/172 (43%), Gaps = 22/172 (12%)
Query: 23 SSGLLLEDGYTVTTVIDGHQLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
++LE+ I ++ IN + +I G + +++LD + T+
Sbjct: 162 EENIILENKCEKEEYIYPSKISINENGLIAFSEKGKNRIVILDQNLEKIKTI-------- 213
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTI 138
G G DG+ A+F P+ +K IYVAD NH IR+ + V TI
Sbjct: 214 ------GSSEYGLKDGDFREAQFKAPEGLRF-IKNKIYVADTENHFIRECDLEKEMVKTI 266
Query: 139 AGGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
AG G K+ A G A N S ++ ++L + + I+ GN I + N+K
Sbjct: 267 AGNGQKEYSPMAKGDALNVSLNSPWDLEILED--CIYIAMAGNHQIWKYNMK 316
>gi|290975266|ref|XP_002670364.1| predicted protein [Naegleria gruberi]
gi|284083922|gb|EFC37620.1| predicted protein [Naegleria gruberi]
Length = 1584
Score = 50.4 bits (119), Expect = 7e-04, Method: Composition-based stats.
Identities = 38/112 (33%), Positives = 51/112 (45%), Gaps = 8/112 (7%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV- 135
+E S +GDG G A+ + P ++ G IY+AD NH IRKI + GV
Sbjct: 158 NEHSDCDGFSGDG------GLASRAKLNSPNGLSISQNGEIYIADTLNHRIRKINSYGVI 211
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+TIAG G G DG S N + + I+D N IR+IN
Sbjct: 212 STIAGTGRASFG-GDGGFAVLSQLNSPNGVHISQNGEIYIADTLNHRIRKIN 262
Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 47/101 (46%), Gaps = 3/101 (2%)
Query: 88 DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKE 146
DG G G A+ + P ++ G IY+AD NH IRKI + GV +TIAG G
Sbjct: 559 DGFSG-DGGLASRAKLNSPNGLSISQNGEIYIADTLNHRIRKINSYGVISTIAGTGRASF 617
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G DG S N + + I+D N IR+IN
Sbjct: 618 G-GDGGFAVLSQLNSPNGVHISQNGEIYIADTLNHRIRKIN 657
Score = 48.5 bits (114), Expect = 0.003, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 55/110 (50%), Gaps = 6/110 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-T 136
V+ +AG G + G+ G A + + P + G IY+AD NH IRKI + GV +
Sbjct: 210 VISTIAGTGRASFG-GDGGFAVLSQLNSPNGVHISQNGEIYIADTLNHRIRKINSYGVIS 268
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T+AG G G G A N+ ++ + + V + I+D N IR+I
Sbjct: 269 TVAGTGRASFGGDGGLAINSQLNSPYGV-HVSQNGEIYIADTLNHRIRKI 317
Score = 48.5 bits (114), Expect = 0.003, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 55/110 (50%), Gaps = 6/110 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-T 136
V+ +AG G + G+ G A + + P + G IY+AD NH IRKI + GV +
Sbjct: 605 VISTIAGTGRASFG-GDGGFAVLSQLNSPNGVHISQNGEIYIADTLNHRIRKINSYGVIS 663
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T+AG G G G A N+ ++ + + V + I+D N IR+I
Sbjct: 664 TVAGTGRASFGGDGGLAINSQLNSPYGV-HVSQNGEIYIADTLNHRIRKI 712
Score = 43.1 bits (100), Expect = 0.11, Method: Composition-based stats.
Identities = 41/111 (36%), Positives = 56/111 (50%), Gaps = 6/111 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
V+ +AG G + G+ G +++ + P V G IY+AD NH IRKI N +T
Sbjct: 266 VISTVAGTGRASFG-GDGGLAINSQLNSPYGVHVSQNGEIYIADTLNHRIRKIFVNGTIT 324
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICALLISDHGNQLIRQI 186
TIAG S DG AS N + FV P+ +LI+D N IR+I
Sbjct: 325 TIAGSSSDGSFGGDGGLSIASRLNSPKGVFVSPNNNEILIADTSNNRIRKI 375
Score = 43.1 bits (100), Expect = 0.11, Method: Composition-based stats.
Identities = 41/111 (36%), Positives = 56/111 (50%), Gaps = 6/111 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
V+ +AG G + G+ G +++ + P V G IY+AD NH IRKI N +T
Sbjct: 661 VISTVAGTGRASFG-GDGGLAINSQLNSPYGVHVSQNGEIYIADTLNHRIRKIFVNGTIT 719
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICALLISDHGNQLIRQI 186
TIAG S DG AS N + FV P+ +LI+D N IR+I
Sbjct: 720 TIAGSSSDGSFGGDGGLSIASRLNSPKGVFVSPNNNEILIADTSNNRIRKI 770
>gi|421106984|ref|ZP_15567544.1| putative lipoprotein [Leptospira kirschneri str. H2]
gi|410007898|gb|EKO61577.1| putative lipoprotein [Leptospira kirschneri str. H2]
Length = 358
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 63/119 (52%), Gaps = 9/119 (7%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
+ G +G+ + FD P +D + N+YV + SNH IRKI NL V+T +GG S
Sbjct: 164 LSGLQNGDRLDSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLDSGTVSTFSGGIS--- 219
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
G DG +A F + + + +LL +D + IR+I+LK + S+ G+ +GA
Sbjct: 220 GYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKIDLKTS--TVSTLLGNGIGA 276
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 36/70 (51%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + VV AG GV DG +A F P VD GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITNKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95
Query: 121 DKSNHVIRKI 130
D+ ++IRKI
Sbjct: 96 DQITNLIRKI 105
>gi|320107007|ref|YP_004182597.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319925528|gb|ADV82603.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 710
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 65/129 (50%), Gaps = 17/129 (13%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
LS +AGDG QG++ DG P SA D P AV +G +++AD NH +R ++
Sbjct: 73 LSTTGTWTTVAGDGRQGFAGDGGPAISAELDTPMGIAVSAEGTLWIADAHNHRVRLVSPN 132
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
G ++TIAG G Q + + D + AL I+D + IR+++ E+
Sbjct: 133 GIISTIAG----SSGSGTVLRQPVALAFDVD-------GALYIADAADHRIRRVD---EN 178
Query: 193 CSKSSQSGS 201
S S+ +G+
Sbjct: 179 GSISTVAGT 187
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 43/81 (53%), Gaps = 4/81 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ E + +AG G QGYS E + AR D P S A D G + AD+ NH IR++
Sbjct: 175 VDENGSISTVAGTGDQGYSGDEGPAILARIDSPSSLAFDAAGRLLFADRRNHRIRRVDVD 234
Query: 134 G-VTTIAGGGSKKEGRADGPA 153
G V T+AG GS G DG A
Sbjct: 235 GTVHTVAGDGS-ASGEGDGMA 254
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 4/102 (3%)
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFE 163
KP A D GN+YVAD HVIR+++ G TT+AG G + GPA +A
Sbjct: 48 KPAGMAYDSAGNLYVADARGHVIRRLSTTGTWTTVAGDGRQGFAGDGGPAISAELDTPMG 107
Query: 164 LTFVPHICALLISDHGNQLIRQI--NLKPEDCSKSSQSGSAL 203
+ V L I+D N +R + N + SS SG+ L
Sbjct: 108 IA-VSAEGTLWIADAHNHRVRLVSPNGIISTIAGSSGSGTVL 148
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 2/85 (2%)
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFE 163
+P + A D+ G +Y+AD ++H IR++ G ++T+AG G + +GPA A +
Sbjct: 150 QPVALAFDVDGALYIADAADHRIRRVDENGSISTVAGTGDQGYSGDEGPAILARIDSPSS 209
Query: 164 LTFVPHICALLISDHGNQLIRQINL 188
L F LL +D N IR++++
Sbjct: 210 LAF-DAAGRLLFADRRNHRIRRVDV 233
>gi|325110177|ref|YP_004271245.1| hypothetical protein Plabr_3626 [Planctomyces brasiliensis DSM
5305]
gi|324970445|gb|ADY61223.1| NHL repeat containing protein [Planctomyces brasiliensis DSM 5305]
Length = 370
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 9/128 (7%)
Query: 69 FYTLSFP--LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
+ L P + E V + GDG GY+ +G A+ ++P + G +YV + +N
Sbjct: 17 LFVLQLPATVQAEGKVITILGDGKAGYTGDNGSADEAQCEQPFGLVIGPDGALYVCETAN 76
Query: 125 HVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
HVIR+I TN VTT+AG G K G A A +E+ F + + N
Sbjct: 77 HVIRRIDLKTNK-VTTVAGTGEKGYSGDGGSALEAKLDEPYEIRFDKQ-GNMFFVEMLNN 134
Query: 182 LIRQINLK 189
++R+++ K
Sbjct: 135 IVRRVDAK 142
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 52/114 (45%), Gaps = 11/114 (9%)
Query: 78 EESVVKRLAG-DGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
E ++ +AG G Y DG P AR PK +VD +GNI++AD +H IR+I
Sbjct: 255 ERGLLHHVAGLGGKNAYEGDGRPARLARLAGPKGISVDKQGNIFLADTESHTIRRIDGQT 314
Query: 135 --VTTIAGGGSKKEGRADGPAQN--ASFSNDFELTFVPHICALLISDHGNQLIR 184
+TT+ G G K DGP N N FV + I D N +R
Sbjct: 315 GIITTLVGNGKK----GDGPDGNPLECKLNRPHGVFVAADGKVYIGDSSNHRVR 364
Score = 44.3 bits (103), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 57/113 (50%), Gaps = 7/113 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV-T 136
V +AG G +GYS G+ GSA + D+P D +GN++ + N+++R++ GV +
Sbjct: 89 VTTVAGTGEKGYS-GDGGSALEAKLDEPYEIRFDKQGNMFFVEMLNNIVRRVDAKTGVIS 147
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
T+AG G K A A+F+ + F L I D GN +R + +
Sbjct: 148 TVAGTGEKGFSGDGKAATEATFNRPHSICF-DSAGHLYICDIGNHRVRVVEAR 199
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 34/56 (60%), Gaps = 2/56 (3%)
Query: 77 SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
++ V+ +AG G +G+S DG+ + A F++P S D G++Y+ D NH +R +
Sbjct: 141 AKTGVISTVAGTGEKGFSGDGKAATEATFNRPHSICFDSAGHLYICDIGNHRVRVV 196
>gi|290973007|ref|XP_002669241.1| predicted protein [Naegleria gruberi]
gi|284082786|gb|EFC36497.1| predicted protein [Naegleria gruberi]
Length = 747
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 63/126 (50%), Gaps = 7/126 (5%)
Query: 82 VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TI 138
+ +AG GV G S DG+ + A+ + P+ A+ G IY AD SN IR+I G+ TI
Sbjct: 244 INTIAGTGVLGLSGDGDLATKAQLNTPRGIAISSTGEIYFADTSNQRIRRIALNGIIDTI 303
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
AG G + G DG + N + + I+D NQ IR+I L + + S+
Sbjct: 304 AGTGDPRFG-GDGDLATKAQLNSPRGIAISSTGEIYIADTYNQRIRRIAL---NTNISTF 359
Query: 199 SGSALG 204
+GS G
Sbjct: 360 AGSGFG 365
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 4/115 (3%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG 134
E ++ +AG G + DG+ + A+ + P+ A+ G IY+AD NH IR+I N
Sbjct: 128 ENGIITTIAGTGDSRFGGDGDLATKAQLNSPRGIAISSTGEIYIADTYNHRIRRIALNGT 187
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ T+AG G + G A A + + + + I+D N+ IR+I L
Sbjct: 188 INTVAGTGDSRFGGDGDLATKAQLNYPMGIA-ISSTGEIYIADTFNERIRRIALN 241
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 50/104 (48%), Gaps = 4/104 (3%)
Query: 86 AGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGG 142
+G G GY G A + P S A G IY+AD NH IRKI+ N +TTIAG G
Sbjct: 362 SGFGYSGYVGDGGLSTDALLNTPLSVACSSNGEIYIADTYNHRIRKISLNNTITTIAGTG 421
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G A NA S+ ++ V + SD+ N IR+I
Sbjct: 422 DSGFSGDGGLAINARLSSPADI-VVNSNGVIYFSDYDNNRIRKI 464
>gi|405960880|gb|EKC26754.1| NHL repeat-containing protein 2 [Crassostrea gigas]
Length = 699
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TN 132
LS++ VV+ G +G+ +G RFD P+ V MK N IY+AD NH IRKI
Sbjct: 240 LSKDGVVQHCIGGKERGFKNGTFTQCRFDSPQ--GVVMKENVIYLADTENHAIRKIDLQK 297
Query: 133 LGVTTIAGGGSKKEGRADG 151
VTTIAG G + + G
Sbjct: 298 QEVTTIAGNGKQGNDKFGG 316
>gi|430749689|ref|YP_007212597.1| copper amine oxidase family protein [Thermobacillus composti KWC4]
gi|430733654|gb|AGA57599.1| copper amine oxidase family protein [Thermobacillus composti KWC4]
Length = 187
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 45/104 (43%), Gaps = 17/104 (16%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
+AG G+ G DG F P AV G IYVAD N+VIRKI N VTT+
Sbjct: 4 IAGTGILGLKDGSASQENFYYPMDVAVSPSGVIYVADTLNYVIRKIENGRVTTLN----- 58
Query: 145 KEGRADGPAQNASFSNDFELTFVP----HICALLISDHGNQLIR 184
AQ+ + F P A LI+D GN IR
Sbjct: 59 --------AQSGRVAEVFPGVVEPAGDYRDGAFLIADGGNNRIR 94
>gi|168183170|ref|ZP_02617834.1| NHL repeat protein [Clostridium botulinum Bf]
gi|237796201|ref|YP_002863753.1| NHL repeat-containing protein [Clostridium botulinum Ba4 str. 657]
gi|182673679|gb|EDT85640.1| NHL repeat protein [Clostridium botulinum Bf]
gi|229261965|gb|ACQ52998.1| NHL repeat protein [Clostridium botulinum Ba4 str. 657]
Length = 467
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 76/171 (44%), Gaps = 22/171 (12%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEESV 81
++LE+ I ++ IN + +I G + +++LD +
Sbjct: 163 KNIILENKCEKEEYIYPSKISINENGLIAFSEKGKNRIVILDQNLEKI------------ 210
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIA 139
++ G G G DG+ A+F P+ ++ IYVAD NH IR+ + V TIA
Sbjct: 211 --KIIGSGKYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHSIRECDLEKEIVKTIA 267
Query: 140 GGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G G K+ A G A N S ++ ++L + + I+ GN I + N+K
Sbjct: 268 GNGQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMK 316
>gi|303248818|ref|ZP_07335068.1| YD repeat protein [Desulfovibrio fructosovorans JJ]
gi|302489759|gb|EFL49690.1| YD repeat protein [Desulfovibrio fructosovorans JJ]
Length = 2115
Score = 50.1 bits (118), Expect = 9e-04, Method: Composition-based stats.
Identities = 30/76 (39%), Positives = 45/76 (59%), Gaps = 3/76 (3%)
Query: 71 TLSFPLSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIR 128
T F L + ++ +AG+G Y+ DG P +A P AVD +GN+Y+AD+ N+ IR
Sbjct: 846 TKVFRLDTDGTIRVVAGNGSTHYNGDGIPATAASLYHPNGLAVDAQGNLYIADQYNNRIR 905
Query: 129 KITNLG-VTTIAGGGS 143
K+ G +TT AG G+
Sbjct: 906 KVDQNGIITTFAGTGA 921
>gi|290983036|ref|XP_002674235.1| predicted protein [Naegleria gruberi]
gi|284087824|gb|EFC41491.1| predicted protein [Naegleria gruberi]
Length = 737
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 61/113 (53%), Gaps = 7/113 (6%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ +AG G G+S G+ G +A+ P++ V +G I +AD SNHVIRKIT G + T
Sbjct: 90 IHTIAGTGTSGHS-GDGGLAVNAKLFHPRTVTVTSEGEILIADTSNHVIRKITKDGYIFT 148
Query: 138 IAGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
IAG S DG A++A S+ + P + I ++G IR+IN K
Sbjct: 149 IAGKPSTSGYGGDGGLAKDALLSSPQCVRISPSG-DIFIVEYGGARIRKINSK 200
Score = 40.4 bits (93), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 50/106 (47%), Gaps = 6/106 (5%)
Query: 85 LAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
+AG GYS G+ G +A + IY+AD N VIRKI + G + TIAG
Sbjct: 37 IAGSLTSGYS-GDGGLAINALLATSMGVTISPNREIYIADTGNSVIRKINSTGYIHTIAG 95
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G+ G A NA + +T +LI+D N +IR+I
Sbjct: 96 TGTSGHSGDGGLAVNAKLFHPRTVTVTSEG-EILIADTSNHVIRKI 140
>gi|320103824|ref|YP_004179415.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
gi|319751106|gb|ADV62866.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
Length = 417
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 54/113 (47%), Gaps = 4/113 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
V+ +AG G +G+S DG P +A D+P A G+IY D+ N +RKI +T
Sbjct: 127 TVEVIAGTGRKGFSGDGGPARAAELDEPYGVAPAPDGSIYFVDRLNRRVRKIDGATGIIT 186
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
T+AG G + GPA A L P LLI+D IR ++L
Sbjct: 187 TVAGTGRPETSGDGGPANRAGLVEPNGLALDPKAEDLLIADVQACRIRVVDLN 239
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 9/119 (7%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNL 133
+ ++ +AG+G +GYS DG P A F+ PK V +G + + D N IR
Sbjct: 296 RDGIITTVAGNGRKGYSGDGGPALEATFNGPKELDVSPEGLVAIVDTENQAIRLYNPKTR 355
Query: 134 GVTTIAGGGSK-KEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
VTT+AG G + +EG DG PA A + F P + ISD N IR + P
Sbjct: 356 VVTTLAGNGQRGREG--DGVPATAARLDRPHGIAFGPG-GVIAISDTNNHRIRLVKSVP 411
Score = 40.4 bits (93), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 62/149 (41%), Gaps = 8/149 (5%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPGS-AR 102
+ P+ + P + DL++ D + V+ AG G + DG P A
Sbjct: 209 VEPNGLALDPKAEDLLIADVQACRIRVVDL---NTGVISTFAGTGRKARDGDGGPAQQAS 265
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSN 160
++ V G +Y+ ++ + +R + +TT+AG G K GPA A+F+
Sbjct: 266 ILGARAVRVAPDGTVYILEREGNSLRAVDPRDGIITTVAGNGRKGYSGDGGPALEATFNG 325
Query: 161 DFELTFVPHICALLISDHGNQLIRQINLK 189
EL P ++ D NQ IR N K
Sbjct: 326 PKELDVSPEGLVAIV-DTENQAIRLYNPK 353
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 3/90 (3%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNAS 157
SAR ++P D GN+ +D +NH +++ N V IAG G K GPA+ A
Sbjct: 91 SARLNQPFDCRYDAVGNLVFSDTANHQLKRWNQANDTVEVIAGTGRKGFSGDGGPARAAE 150
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ + P ++ D N+ +R+I+
Sbjct: 151 LDEPYGVAPAPD-GSIYFVDRLNRRVRKID 179
Score = 37.0 bits (84), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 48/97 (49%), Gaps = 13/97 (13%)
Query: 41 HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPG 99
+L+++P ++ ++D+ A + + VV LAG+G +G DG P
Sbjct: 327 KELDVSPEGLV--------AIVDTENQAIRLYN---PKTRVVTTLAGNGQRGREGDGVPA 375
Query: 100 SA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+A R D+P A G I ++D +NH IR + ++ V
Sbjct: 376 TAARLDRPHGIAFGPGGVIAISDTNNHRIRLVKSVPV 412
>gi|348508723|ref|XP_003441903.1| PREDICTED: NHL repeat-containing protein 2-like [Oreochromis
niloticus]
Length = 719
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 80/177 (45%), Gaps = 34/177 (19%)
Query: 16 SLLFSLVSSG-----LLLEDG----YTVTTVIDGHQLEIN------PHSVIDRPG----- 55
+LLFSLV G +L D Y ++ H + I P S++ PG
Sbjct: 168 NLLFSLVGEGHRDRLMLFTDCALRYYGGRGLLKAHTVGIKLYKDALPPSILSFPGKVAID 227
Query: 56 --SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
+ L++ D+ +S + + G G DG+ A F+ P+ A+
Sbjct: 228 NSTKRLVIADTGHHRILVVS----STGDLLHVIGGPESGRKDGDVSEASFNSPQGVAI-- 281
Query: 114 KGN-IYVADKSNHVIRKITNL--GVTTIAGGGSK---KEGRADGPAQNASFSNDFEL 164
KG+ +YVAD NH+IRKI L V+T+AG G++ KEG A GP Q S D L
Sbjct: 282 KGDTVYVADTENHLIRKIDLLEGKVSTLAGVGTQGTDKEGGAMGPQQPISSPWDVML 338
>gi|72094692|ref|XP_795849.1| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
purpuratus]
Length = 706
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 60/140 (42%), Gaps = 12/140 (8%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-------GYSDGEPGSARFDKPKSFAV 111
+ V DS S+ +SF ++ VK + G + G SDG+ A+ P A
Sbjct: 407 MFVADSESSSIRRVSF---KDGAVKNVVGGEMDPMNLFAYGDSDGKGLEAKLQHPLGVAW 463
Query: 112 DMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
D ++VAD NH I+ I T AG G +G D A F+ L P
Sbjct: 464 DHSKLLFVADSYNHKIKMIDPEERYCATYAGTGEPGKGGDDEHILKAQFNEPGGLAISPC 523
Query: 170 ICALLISDHGNQLIRQINLK 189
C + ++D N IR I++K
Sbjct: 524 GCKIYVADTNNHTIRCIDIK 543
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 11/102 (10%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V P + L V D+ + ++ + VV+ G G++DG ARF P
Sbjct: 212 PGKVTTNPEGTLLAVSDTGHNRVIIVAL----DGVVQHCIGGPETGFNDGLYREARFHSP 267
Query: 107 K--SFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
+ +A D+ IYVAD NH IRKI VTTIAG G +
Sbjct: 268 QGLCWAQDV---IYVADTENHAIRKIDLKEQCVTTIAGTGEQ 306
>gi|348689157|gb|EGZ28971.1| hypothetical protein PHYSODRAFT_474561 [Phytophthora sojae]
Length = 98
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 39/69 (56%), Gaps = 2/69 (2%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
V GY+DG + F P+S AV G++YVAD +N VI K T+ GV T+ GS G
Sbjct: 11 VSGYADGYGNAVLFKNPRSLAVGSDGHLYVADDNNRVI-KFTSGGVMTVL-AGSTSSGLV 68
Query: 150 DGPAQNASF 158
DG +A F
Sbjct: 69 DGTGTSARF 77
>gi|381205317|ref|ZP_09912388.1| NHL repeat containing protein [SAR324 cluster bacterium JCVI-SC
AAA005]
Length = 458
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 62/122 (50%), Gaps = 6/122 (4%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
+AG G GY+DG SA+F+ P +F + K +++VAD N+ IR+I +L +
Sbjct: 271 IAGGGSIGYTDGSGISAQFNAPNAFT-NTKDSLFVADTCNNTIRRI-DLDTNIVTSLLPP 328
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
+ G G + + ++ +LT + L I+D GN I++I+L DC + +
Sbjct: 329 EPGCGSGSSDSDKLNSPTDLTTNGY--QLYIADKGNSAIKRIDLN--DCKEQEWTNCTFD 384
Query: 205 AV 206
+
Sbjct: 385 KI 386
>gi|290995833|ref|XP_002680487.1| serine/threonine kinase [Naegleria gruberi]
gi|284094108|gb|EFC47743.1| serine/threonine kinase [Naegleria gruberi]
Length = 1078
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 8/120 (6%)
Query: 39 DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP 98
D + +IN H + R SSD ++ + + TL+ P+S + + +G DG
Sbjct: 133 DVYICDINNHRI--RKVSSDGVISTIAGTGRTTLN-PISPNTYRQEGGYNG-----DGLA 184
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
+ + PK VD KG++Y AD N +IRK++N +TTIAG G A N +F
Sbjct: 185 TATDLNYPKYAQVDSKGDVYFADTYNGLIRKVSNGYITTIAGSKQNSTVNTFGMALNTTF 244
>gi|290973162|ref|XP_002669318.1| predicted protein [Naegleria gruberi]
gi|284082864|gb|EFC36574.1| predicted protein [Naegleria gruberi]
Length = 832
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 61/117 (52%), Gaps = 6/117 (5%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITN 132
+ + ++ + G G +GYS +G A D P S A + G++ D +N+ IRKI+N
Sbjct: 255 IDKNGIISTIIG-GSKGYSGDEGNAADAMIDGPYSLAFHPVSGDLTFVDINNYRIRKISN 313
Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G ++TIAG G K G A NA ++F P+ L I++ N IR+I+L
Sbjct: 314 KGIISTIAGNGEKGSIGDGGSALNAQIYYSVSISFSPN-GELYIANEWNNRIRKISL 369
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 56/116 (48%), Gaps = 2/116 (1%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S V +AG G G+S DG P A+ P S A + G ++++D N+ IRKI
Sbjct: 199 ISTNGNVYTVAGTGEFGFSGDGGPAFDAQLFFPSSIAFNNGGELFISDLGNNRIRKIDKN 258
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G+ + GGSK +G A +A + L F P L D N IR+I+ K
Sbjct: 259 GIISTIIGGSKGYSGDEGNAADAMIDGPYSLAFHPVSGDLTFVDINNYRIRKISNK 314
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 66/146 (45%), Gaps = 21/146 (14%)
Query: 55 GSSDLIVLDSSRSAFYTLSFP----LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSF 109
G SDL+ L SS + P + + + LAG+G + DG P RF S
Sbjct: 119 GPSDLL-LTSSGELLFMDGGPRLKKIDKNGFLTCLAGNGSEILPDGFPAKDTRFTSIASI 177
Query: 110 AVD-MKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
A + G IY+++ NH IR+I TN V T+AG G GPA +A F
Sbjct: 178 AQHPVTGEIYLSETGNHTIRRISTNGNVYTVAGTGEFGFSGDGGPAFDAQL-------FF 230
Query: 168 PHICA------LLISDHGNQLIRQIN 187
P A L ISD GN IR+I+
Sbjct: 231 PSSIAFNNGGELFISDLGNNRIRKID 256
Score = 41.6 bits (96), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 84/173 (48%), Gaps = 29/173 (16%)
Query: 23 SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
S G ++G +IDG P+S+ P S DL +D + Y + +S + ++
Sbjct: 268 SKGYSGDEGNAADAMIDG------PYSLAFHPVSGDLTFVDINN---YRIR-KISNKGII 317
Query: 83 KRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
+AG+G +G S G+ GSA + S + G +Y+A++ N+ IRKI+ G+ +
Sbjct: 318 STIAGNGEKG-SIGDGGSALNAQIYYSVSISFSPNGELYIANEWNNRIRKISLSGIISTY 376
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICAL------LISDHGNQLIRQI 186
GG+ +G ++ + + F+P ++ LI+D + LIR++
Sbjct: 377 SGGTFGDGYD---------ASSWGVLFLPQGVSITPNGDVLIADSKHALIRKL 420
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 47/98 (47%), Gaps = 3/98 (3%)
Query: 89 GVQGYSDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
G DG P A + +P S + G++++AD ++ IRK++N V+TIAG +
Sbjct: 46 GTNSVGDGLPPDGATYVRPVSVKIAQNGDVFIADHDHYTIRKVSNGKVSTIAGKNLVQGD 105
Query: 148 RADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIR 184
DG A NA S +L LL D G +L +
Sbjct: 106 SGDGTLAANALLSGPSDLLLTSS-GELLFMDGGPRLKK 142
>gi|406947628|gb|EKD78528.1| NHL repeat containing protein [uncultured bacterium]
Length = 810
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 52/113 (46%), Gaps = 15/113 (13%)
Query: 91 QGYSDGEP-----GSAR-----FDKPKSFAVDMKGNI-YVADKSNHVIRKIT----NLGV 135
GY +GE G +R F +P AVD G YVAD N VIR++T N G
Sbjct: 488 NGYQEGEACPNQFGQSRKNCAYFSRPGGIAVDSNGKYAYVADTGNEVIRRVTLYGKNKGK 547
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
T + G ++ G D ++A F+ +T L ++D N IRQ+ +
Sbjct: 548 TKLIAGSPQESGYQDATRKDAQFNVPIAVTIDNADNYLYVADRDNHTIRQVRI 600
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 44/93 (47%), Gaps = 5/93 (5%)
Query: 60 IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
IV D + S + +F ++ VV + G G G + G S +F P + +D
Sbjct: 51 IVSDGAGSFYLADTFNGVIRKINSNGVVSTVVGAGGYGDTTGSGSSTKFALPAAVGLDSS 110
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
GN+Y+AD N I+K V+TIA + EG
Sbjct: 111 GNVYIADTGNGKIKKFNGSTVSTIASDFDRPEG 143
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 50/96 (52%), Gaps = 6/96 (6%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR--KITNLGVTTIAGGGSKKEGR 148
GY D A+F+ P + +D N +YVAD+ NH IR +I++ VTT+ G SKK G
Sbjct: 559 GYQDATRKDAQFNVPIAVTIDNADNYLYVADRDNHTIRQVRISDGQVTTLVGNPSKK-GY 617
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
DG A + E+ + + + S+ G ++R
Sbjct: 618 LDGKFAEAYLNLPVEVFY--NRGNVYFSEAGTHMVR 651
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 34/53 (64%), Gaps = 3/53 (5%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
+ VK +AGDG +GY++G+ + +FD P A KG N+ +AD N +IRK+
Sbjct: 657 DEAVKLVAGDGNRGYNNGDRDNTQFDNPVGLA--RKGDNLLIADSLNDLIRKV 707
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 43/94 (45%), Gaps = 16/94 (17%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
A FD P+ D G+ Y+AD N VIRKI + G V+T+ G G G G + F+
Sbjct: 43 AYFDFPEDIVSDGAGSFYLADTFNGVIRKINSNGVVSTVVGAGGY--GDTTGSGSSTKFA 100
Query: 160 NDFELTFVPHICAL------LISDHGNQLIRQIN 187
+P L I+D GN I++ N
Sbjct: 101 -------LPAAVGLDSSGNVYIADTGNGKIKKFN 127
Score = 37.4 bits (85), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 58/120 (48%), Gaps = 9/120 (7%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKEG 147
+GY DG+ A + P + +GN+Y ++ H++R + + G V +AG G++ G
Sbjct: 615 KGYLDGKFAEAYLNLPVEVFYN-RGNVYFSEAGTHMVR-VADRGDEAVKLVAGDGNR--G 670
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVS 207
+G N F N L LLI+D N LIR+++L + ++ A+ VS
Sbjct: 671 YNNGDRDNTQFDNPVGLARKGD--NLLIADSLNDLIRKVDLGDGENIPYTEDAPAVSGVS 728
>gi|290970060|ref|XP_002668023.1| predicted protein [Naegleria gruberi]
gi|284081057|gb|EFC35279.1| predicted protein [Naegleria gruberi]
Length = 318
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 56/112 (50%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
+ +AG G+ GY+ DG + +A+ P +D K +Y +D SNH IRKI +N
Sbjct: 81 NNGTIVTIAGTGIAGYNGDGIDATTAQLFSPYGIGIDSKDLVYFSDSSNHRIRKILSNGT 140
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG GS G A A F LT P+ ++ +D N IR+I
Sbjct: 141 IVTIAGIGSNGFSGDGGLATKAKIYLPFGLTISPN-DDVIFADVNNNRIRKI 191
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 42/66 (63%), Gaps = 5/66 (7%)
Query: 79 ESVVKRLAGDGVQGYSDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VT 136
+S++K +AG QG DG SA + P S A++ G +Y+ADKS+H IRK+ N G +
Sbjct: 30 KSIIKTIAG---QGVCDGISATSASLNGPSSTAINSLGEVYIADKSHHRIRKVYNNGTIV 86
Query: 137 TIAGGG 142
TIAG G
Sbjct: 87 TIAGTG 92
>gi|290986699|ref|XP_002676061.1| predicted protein [Naegleria gruberi]
gi|284089661|gb|EFC43317.1| predicted protein [Naegleria gruberi]
Length = 819
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 41/64 (64%), Gaps = 4/64 (6%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
+AG+G QGY+ +G G + +P S +D GNIY ADK N +IRKI +N V T+AG
Sbjct: 207 IAGNGKQGYNGEGLSGLDSALKEPYSVDMDSDGNIYFADKGNSIIRKILASNGAVITVAG 266
Query: 141 GGSK 144
G+K
Sbjct: 267 NGTK 270
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 57/112 (50%), Gaps = 3/112 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK-ITNLG 134
LS +V+ + G +DG + P D+ G++ +AD+ NH IRK ++N
Sbjct: 477 LSNGTVIT-VVGSTTGSTADGPALQTKLTIPNGIGFDLDGSLLIADRYNHRIRKLLSNGT 535
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G +G D A N+S + + +P+ +LI+D N IR++
Sbjct: 536 LITIAGTGVSGKGGDDILATNSSLNQPSGVCTMPN-GDILIADRANHQIRRL 586
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 56/116 (48%), Gaps = 7/116 (6%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+K +AG G GY+ DG P S+ + P++ + IY+ D SN IRKI N G + T
Sbjct: 591 TIKTIAGTGSSGYNGDGLPATSSTLNYPQTVTM-FHNEIYMVDTSNSRIRKILNNGTMIT 649
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI--NLKPE 191
I G G + + + + + +TF + ISD+ N I+ + N PE
Sbjct: 650 IVGNGIRGDSLDGLDPLESKLKSPYHVTFDSR-GRMYISDYSNHKIKVVDFNSSPE 704
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 50/106 (47%), Gaps = 6/106 (5%)
Query: 84 RLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
R AG G QGY+ DG+ + P S + G +Y +D SN+++RK+ G G
Sbjct: 96 RFAGTGFQGYNGDGKLALETDLNYPGSLMM-YDGELYFSDLSNYLVRKVDRNGYVRTVAG 154
Query: 142 GSKKEGRADGPAQNASFSNDFEL-TFVPHICALLISDHGNQLIRQI 186
K G AD N +N + +V + +LI+D IR++
Sbjct: 155 SIKSYGYADNVLANE--TNIYPTGIYVTKLGEVLIADSSIHRIRKV 198
>gi|289774196|ref|ZP_06533574.1| NHL repeat containing protein [Streptomyces lividans TK24]
gi|289704395|gb|EFD71824.1| NHL repeat containing protein [Streptomyces lividans TK24]
Length = 605
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 62/115 (53%), Gaps = 7/115 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +V D++R L+ ES V+R+ G G +G++DG A F +P+ A+ G
Sbjct: 189 SGTFLVSDTTRHQLVELAE--DGESPVRRI-GSGRRGFTDGPADRAEFSEPQGLALLGDG 245
Query: 116 NIYVADKSNHVIRKITNLG--VTTIAGGGSK-KEGR-ADGPAQNASFSNDFELTF 166
++ VAD NH +R+ + VTT+AG G + + G + G A+ S S+ +++
Sbjct: 246 SVVVADTVNHALRRYEPVTGEVTTLAGTGRQHRPGEPSRGAARAVSLSSPWDVAL 300
>gi|21218701|ref|NP_624480.1| hypothetical protein SCO0142 [Streptomyces coelicolor A3(2)]
gi|6066645|emb|CAB58305.1| conserved hypothetical protein SCJ33.06c [Streptomyces coelicolor
A3(2)]
Length = 601
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 62/115 (53%), Gaps = 7/115 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +V D++R L+ ES V+R+ G G +G++DG A F +P+ A+ G
Sbjct: 185 SGTFLVSDTTRHQLVELAE--DGESPVRRI-GSGRRGFTDGPADRAEFSEPQGLALLGDG 241
Query: 116 NIYVADKSNHVIRKITNLG--VTTIAGGGSK-KEGR-ADGPAQNASFSNDFELTF 166
++ VAD NH +R+ + VTT+AG G + + G + G A+ S S+ +++
Sbjct: 242 SVVVADTVNHALRRYEPVTGEVTTLAGTGRQHRPGEPSRGAARAVSLSSPWDVAL 296
>gi|418721636|ref|ZP_13280812.1| hypothetical protein LEP1GSC101_0633 [Leptospira borgpetersenii
str. UI 09149]
gi|410741989|gb|EKQ90740.1| hypothetical protein LEP1GSC101_0633 [Leptospira borgpetersenii
str. UI 09149]
Length = 246
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 1/81 (1%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
++LD +T ++ VV AG G++ DG +A F P S +D GNI+V
Sbjct: 35 MILDCILKECHTCKLKVTNNPVVSPFAGTGIRESIDGTTQTASFKTPFSLELDTFGNIFV 94
Query: 120 ADKSNHVIRKITNLG-VTTIA 139
+D +++IRKI G VTT++
Sbjct: 95 SDSDSNLIRKIDRFGNVTTLS 115
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 41/73 (56%), Gaps = 7/73 (9%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKK 145
G+ G+ +G+ ++ F P +D + N+YV + NH IRKI NL V+T++GG S
Sbjct: 163 GIGGFQNGDRLNSLFKGPLFMDLDRERNLYVGELGNHTIRKI-NLNSETVSTLSGGVS-- 219
Query: 146 EGRADGPAQNASF 158
G DG +A F
Sbjct: 220 -GYLDGDLTSAQF 231
>gi|348681648|gb|EGZ21464.1| hypothetical protein PHYSODRAFT_329405 [Phytophthora sojae]
Length = 145
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 52/108 (48%), Gaps = 8/108 (7%)
Query: 60 IVLDSSRSAFY-----TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
I +DSS S + L ++ + VK +AG G +G+ +G SARF P +
Sbjct: 3 IAIDSSDSIYVLDMHRILKVTVTLDGDVKVVAGSGAKGFVNGFGESARFSTPWALTFGSD 62
Query: 115 GNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
G +YV D N IRK IT V T A G + G +G NA+F N
Sbjct: 63 GYLYVPDLDNDCIRKVDITTTEVMTYA-GICQTSGAVNGLTTNATFDN 109
>gi|392403177|ref|YP_006439789.1| NHL repeat containing protein [Turneriella parva DSM 21527]
gi|390611131|gb|AFM12283.1| NHL repeat containing protein [Turneriella parva DSM 21527]
Length = 392
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 58/114 (50%), Gaps = 11/114 (9%)
Query: 81 VVKRLAGDG----VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLG 134
VV LAG G + G DG +ARF ++ D N+YVAD SN+ IRK I +
Sbjct: 107 VVSTLAGPGPGTSLSGDVDGIGEAARFSSMRAITTDGT-NLYVADNSNNKIRKIVIASRA 165
Query: 135 VTTIAG--GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
VT +AG G G DG A +A F+N + V L ++D N+ +R+I
Sbjct: 166 VTVLAGPAAGDTTSGDTDGTANDARFNNPQGI--VTDGTNLFVADSLNRKVRKI 217
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 48/105 (45%), Gaps = 7/105 (6%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG--G 141
AGD G +DG ARF+ P+ D N++VAD N +RKI V+T+AG
Sbjct: 174 AGDTTSGDTDGTANDARFNNPQGIVTDGT-NLFVADSLNRKVRKIVIASGIVSTLAGPAQ 232
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G DG A A F +T L + D N IR+I
Sbjct: 233 GVTGSGDTDGSANTARFGLPGAMT--TDGVNLYLCDSSNHKIRKI 275
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 46/101 (45%), Gaps = 25/101 (24%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNAS 157
SARF+ P D ++YVAD SN IRK I VTT+A GPAQ A+
Sbjct: 304 SARFNNPLGITTDGT-SLYVADTSNQKIRKIVIATGAVTTVA-----------GPAQGAT 351
Query: 158 FSNDFELT------FVPHICA-----LLISDHGNQLIRQIN 187
S D E F PH L + ++ N IR+IN
Sbjct: 352 TSGDTEGVGTSARFFNPHGITTDGYRLFVGEYWNHKIRRIN 392
Score = 43.5 bits (101), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 62/127 (48%), Gaps = 7/127 (5%)
Query: 64 SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
+ RS+ L+ L+ +++ G G +DG +ARF+ D + NIY D
Sbjct: 36 AGRSSNCALTLSLTVDTLAGPSQGTAASGDTDGTGSAARFNAVTGVTTDGQ-NIYSTDYF 94
Query: 124 NHVIRK--ITNLGVTTIA--GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHG 179
H IRK I++ V+T+A G G+ G DG + A FS+ +T L ++D+
Sbjct: 95 GHKIRKTVISSGVVSTLAGPGPGTSLSGDVDGIGEAARFSSMRAIT--TDGTNLYVADNS 152
Query: 180 NQLIRQI 186
N IR+I
Sbjct: 153 NNKIRKI 159
>gi|290983716|ref|XP_002674574.1| predicted protein [Naegleria gruberi]
gi|284088165|gb|EFC41830.1| predicted protein [Naegleria gruberi]
Length = 2271
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 6/107 (5%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
AG G GYS DG G SA + P + A D GN+ ++D N+ IRK+ N ++T+AG
Sbjct: 291 FAGSGTSGYSGDGGLGTSALLNGPSALAFDSSGNLLISDSFNNRIRKVANGTISTLAGNS 350
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPH---ICALLISDHGNQLIRQI 186
++ G ASFS+ + + + +LISD N ++R++
Sbjct: 351 NRNFGNG-VLGTLASFSSPNSVYYTGNDDSAGGILISDTNNHVLRRL 396
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 54/108 (50%), Gaps = 6/108 (5%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
++ +AG GV YS G +A+ + P+ + G I +D NH IRKI N ++T+
Sbjct: 84 IMSTIAGTGVANYSGDGGAAVNAQLNSPQGIGILSTGAIVFSDTLNHCIRKIENGIISTL 143
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG GS G G A +A + L + + ++ GN LIR+I
Sbjct: 144 AGNGSP--GLTVGSAISAQLNTPTALIVASN--DIYFAESGNHLIRRI 187
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 54/123 (43%), Gaps = 19/123 (15%)
Query: 77 SEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+ ++V LAGD G+ DG P S+R D P + + G I D+ N IRKIT G
Sbjct: 557 TSSALVTTLAGDRFNGFLGDGGPAKSSRLDSPGPIQLTLGGEIIFMDRGNQRIRKITKYG 616
Query: 135 -VTTIAGGGSKKEGR-ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
+ TI G GS + A+G S + I+D GN LI+ L
Sbjct: 617 NIFTIYGNGSTSVFKDANGLTLGGS-------------GEIYIADSGNNLIKM--LSAAK 661
Query: 193 CSK 195
CS
Sbjct: 662 CSN 664
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 2/107 (1%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
E ++ LAG+G G + G SA+ + P + V +IY A+ NH+IR+I++ G+ T
Sbjct: 136 ENGIISTLAGNGSPGLTVGSAISAQLNTPTALIV-ASNDIYFAESGNHLIRRISSSGMLT 194
Query: 138 IA-GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
I G S G +D ASF + + V H L SD N I
Sbjct: 195 IYIGESSGTSGSSDTGTAGASFKMNTPMGIVFHAGNLYFSDSKNYRI 241
>gi|348690525|gb|EGZ30339.1| hypothetical protein PHYSODRAFT_295167 [Phytophthora sojae]
Length = 383
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 39/71 (54%), Gaps = 3/71 (4%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEG 147
V GY+DG + F KP+S AV ++YVAD N+ IRK+ T VTT A G G
Sbjct: 11 VSGYADGYGDAVFFKKPRSLAVGSDADLYVADVGNYYIRKVRLTTKEVTTYA-GTCTSSG 69
Query: 148 RADGPAQNASF 158
DG A +A F
Sbjct: 70 ATDGVATSALF 80
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 60 IVLDSSRSAFYTLS---FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
IV DSS + + T + + + LAG G DG SARF P + A+ G+
Sbjct: 261 IVTDSSDNIYVTDNNRVMKFTSGGAMTVLAGSTSSGLVDGTGTSARFYHPDALAIGSDGD 320
Query: 117 IYVADKSNHVIRK---ITNLGVTTIAGG 141
+YVADK+N+ IR+ IT T++A G
Sbjct: 321 LYVADKANYCIRRLNLITKEVTTSVASG 348
>gi|386717879|ref|YP_006184205.1| hypothetical protein SMD_1470 [Stenotrophomonas maltophilia D457]
gi|384077441|emb|CCH12027.1| Alr1013 protein [Stenotrophomonas maltophilia D457]
Length = 470
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 65/147 (44%), Gaps = 11/147 (7%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S+ L + S+ R AG G DG A F +P + AV ++ +YVAD IR +
Sbjct: 301 SYHLGQRSLQWR-AGSGSIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358
Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
G V T+ G G + G DGP AS + L I+D GN +R + L
Sbjct: 359 RGDLVQTLVGQGPWRFGSEDGPRAQASLQYPQAIALSADAPLLWIADTGNGRLRTLRLGG 418
Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
+P +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLHGPAGLAVGAGAVWI 445
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 48/103 (46%), Gaps = 12/103 (11%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
+ R G G + DG A F +P++ ++ + ++YVAD NH +R+I L G+
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDSLYVADTGNHAVRRINLLTGIVDTLC 255
Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
G + +GP AQ S D H L I+D NQL
Sbjct: 256 GNGRPGAPMEGPVAQARQVSLD-------HPVGLAIAD--NQL 289
>gi|443288042|ref|ZP_21027136.1| Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Micromonospora lupini str. Lupac 08]
gi|385881808|emb|CCH22229.1| Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Micromonospora lupini str. Lupac 08]
Length = 612
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 49/92 (53%), Gaps = 6/92 (6%)
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP-HICA- 172
GN+ V+D + H + ++ G T + GS + GRADGPA A+FS L +P H+
Sbjct: 185 GNLLVSDSARHSLVEVAPDGETPVRRIGSGERGRADGPAAAATFSEPQGLCLLPTHVAEV 244
Query: 173 ----LLISDHGNQLIRQINLKPEDCSKSSQSG 200
L+++D N L+R + L+ + + SG
Sbjct: 245 AGYDLVVADTVNHLLRGVRLESGEVVTVAGSG 276
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 42/174 (24%), Positives = 72/174 (41%), Gaps = 12/174 (6%)
Query: 25 GLLLEDGYTVTTVIDGHQLE--INPHS----VIDRPGSSDLIVLDS----SRSAFYTLSF 74
G+ LE G VT G Q ++ H+ +D DL D + + + L +
Sbjct: 261 GVRLESGEVVTVAGSGRQWRASVDDHAHDALAVDLSSPWDLAWYDDKLIIAMAGIHQLWW 320
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNL 133
+ AG V+ DG A +P +V G +++AD +R + N
Sbjct: 321 FDPIKRTAGMYAGTTVEALRDGPLAEAWMAQPSGLSVSADGVRLWIADSETSAVRYVENG 380
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ T G G G DGPA++A + + +P ++LI+D N +R+ +
Sbjct: 381 VLGTAVGQGLFDFGHVDGPAESALLQHPLGVCALPD-GSVLIADTYNGAVRRFD 433
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 70/138 (50%), Gaps = 21/138 (15%)
Query: 47 PHSVIDRPGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS 100
P + + PG + +L+V DS+R + ++ E+ V+R+ G G +G +DG +
Sbjct: 169 PETALRFPGKAVLLPDGNLLVSDSARHSLVEVAP--DGETPVRRI-GSGERGRADGPAAA 225
Query: 101 ARFDKPKSFAV------DMKG-NIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRADG 151
A F +P+ + ++ G ++ VAD NH++R ++ + V T+AG G + D
Sbjct: 226 ATFSEPQGLCLLPTHVAEVAGYDLVVADTVNHLLRGVRLESGEVVTVAGSGRQWRASVDD 285
Query: 152 PAQNA---SFSNDFELTF 166
A +A S+ ++L +
Sbjct: 286 HAHDALAVDLSSPWDLAW 303
>gi|390342982|ref|XP_001201679.2| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
purpuratus]
Length = 338
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 60/140 (42%), Gaps = 12/140 (8%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-------GYSDGEPGSARFDKPKSFAV 111
+ V DS S+ ++F ++ VK + G + G SDG+ AR P A
Sbjct: 39 MFVADSESSSIRRVAF---KDGAVKNVVGGEMDPMNLFAYGDSDGKGLEARLQHPLGVAW 95
Query: 112 DMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
D ++VAD NH I+ I T AG G +G D A F+ L P
Sbjct: 96 DHSKLLFVADSYNHKIKMIDPEERYCATYAGTGEPGKGGDDEHILKAQFNEPGGLAISPC 155
Query: 170 ICALLISDHGNQLIRQINLK 189
C + ++D N IR I++K
Sbjct: 156 GCKIYVADTNNHTIRCIDIK 175
>gi|297848174|ref|XP_002891968.1| hypothetical protein ARALYDRAFT_892829 [Arabidopsis lyrata subsp.
lyrata]
gi|297337810|gb|EFH68227.1| hypothetical protein ARALYDRAFT_892829 [Arabidopsis lyrata subsp.
lyrata]
Length = 1058
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 4/115 (3%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLG 134
E + + ++ G +G+ DG A F++P+ A + K N +YVAD NH +R+I N
Sbjct: 600 EGNFIVQIGSTGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREIDFVNER 659
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND-FELTFVPHICALLISDHGNQLIRQINL 188
V T+AG G+K G N +++ F P + ++ G I + N+
Sbjct: 660 VRTLAGNGTKGSDYQGGRKGTKQLLNSPWDVCFEPVNEKVYVAMAGQHQIWEYNV 714
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 43/103 (41%), Gaps = 2/103 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G +DG P G IY+ D NH I+K+ + + G+ K G DG
Sbjct: 794 GDNDGVGAEVLLQHPLGVLCAKDGQIYLTDSYNHKIKKLDPITKRVVTVAGTGKAGFKDG 853
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDC 193
+ A S L + L ++D N LIR I+L K ED
Sbjct: 854 KVKGAQLSEPAGLAITEN-GRLFVADTNNSLIRYIDLNKGEDA 895
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
P+++ V +AG G G+ DG+ A+ +P A+ G ++VAD +N +IR I
Sbjct: 834 PITKRVVT--VAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI 887
>gi|441163237|ref|ZP_20968264.1| NHL repeat-containing protein [Streptomyces rimosus subsp. rimosus
ATCC 10970]
gi|440616376|gb|ELQ79518.1| NHL repeat-containing protein [Streptomyces rimosus subsp. rimosus
ATCC 10970]
Length = 551
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 45/79 (56%), Gaps = 2/79 (2%)
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
AVD GN+Y+AD +N +RK+T G +TT+AG G+ GPA ++ + +T
Sbjct: 1 MAVDAAGNLYIADSANQRVRKVTPQGTITTVAGTGTAGYTSDGGPATSSQLNTPAYVTL- 59
Query: 168 PHICALLISDHGNQLIRQI 186
L I++ G+Q IR++
Sbjct: 60 DDAGNLYIAESGSQRIRKV 78
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 46/81 (56%), Gaps = 3/81 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ + + +AG G GY SDG P S++ + P +D GN+Y+A+ + IRK+T
Sbjct: 22 VTPQGTITTVAGTGTAGYTSDGGPATSSQLNTPAYVTLDDAGNLYIAESGSQRIRKVTTD 81
Query: 134 G-VTTIAGGGSKKEGRADGPA 153
G +TT+AG G+ GPA
Sbjct: 82 GIITTVAGNGTAGYVDDGGPA 102
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 2/58 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
++ + ++ +AG+G GY DG P +A R P+ A+D GN+Y+AD N+ +R +T
Sbjct: 78 VTTDGIITTVAGNGTAGYVDDGGPATATRLYGPRGVALDRAGNLYIADGDNNRVRGVT 135
>gi|344206806|ref|YP_004791947.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
gi|343778168|gb|AEM50721.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
Length = 470
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 65/147 (44%), Gaps = 11/147 (7%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S+ L + S+ R AG G DG A F +P + AV ++ +YVAD IR +
Sbjct: 301 SYHLGQRSLQWR-AGSGSIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358
Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
G V T+ G G + G DGP AS + L I+D GN +R + L
Sbjct: 359 RGDLVQTLVGQGPWRFGSEDGPRAQASLQYPQAIALSADAPLLWIADTGNGRLRTLRLGG 418
Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
+P +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLHGPAGLAVGAGAVWI 445
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 48/103 (46%), Gaps = 12/103 (11%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
+ R G G + DG A F +P++ ++ + ++YVAD NH +R+I L G+
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDSLYVADTGNHAVRRINLLTGIVDTLC 255
Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
G + +GP AQ S D H L I+D NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPVGLAIAD--NQL 289
>gi|392404857|ref|YP_006441469.1| NHL repeat containing protein [Turneriella parva DSM 21527]
gi|390612811|gb|AFM13963.1| NHL repeat containing protein [Turneriella parva DSM 21527]
Length = 697
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 7/100 (7%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAG--GGSKKE 146
+G +DG +ARF+ P+ D N+YVAD +N IRK I+ VTTIAG GS
Sbjct: 478 RGDTDGTGTAARFNVPEGITTDGT-NLYVADSTNSKIRKIVISTRVVTTIAGPAQGSAAT 536
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G D A F+ +T+ L I+D N IR++
Sbjct: 537 GDTDATGNAARFNKPTGITY--DGTNLFIADGNNNKIRKL 574
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 44/97 (45%), Gaps = 14/97 (14%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPG-----------SARFDKPKSFAVDMKGNIYVADKS 123
P++ + V AG G Y PG SARF + K D N+YV D
Sbjct: 333 PITLSANVSTFAGPGPGCYFPYNPGCPEGDLDATGTSARFRQIKGVTTDGT-NLYVVDTG 391
Query: 124 NHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASF 158
N+ IRKI + VT +AGG S + G ADG A F
Sbjct: 392 NNNIRKIVISTGAVTKLAGGTSTEFGDADGTGSTARF 428
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 65/141 (46%), Gaps = 17/141 (12%)
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG----DGVQGYSDGEPGSARFDKPKSFAVD 112
++L V DS+ S + + VV +AG G +D +ARF+KP D
Sbjct: 501 TNLYVADSTNSKIRKI---VISTRVVTTIAGPAQGSAATGDTDATGNAARFNKPTGITYD 557
Query: 113 MKGNIYVADKSNHVIRK--ITNLGVTTIAG--GGSKKEGRADGPAQNASFSNDFELTFVP 168
N+++AD +N+ IRK I+ VTTIAG G+ G D A F + +T
Sbjct: 558 GT-NLFIADGNNNKIRKLVISTGVVTTIAGPSQGTITSGDTDAVGNAARFYSPVGIT--T 614
Query: 169 HICALLISD---HGNQLIRQI 186
L ++D + N IR+I
Sbjct: 615 DRTNLFVADGTGNRNNKIRKI 635
>gi|336313754|ref|ZP_08568676.1| thiol-disulfide isomerase-like thioredoxin [Rheinheimera sp. A13L]
gi|335881693|gb|EGM79570.1| thiol-disulfide isomerase-like thioredoxin [Rheinheimera sp. A13L]
Length = 498
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 62/127 (48%), Gaps = 10/127 (7%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTT 137
+VKRL G G+ DG S+ F P+ + K +YVAD NH IR+I + V T
Sbjct: 222 KLVKRL-GSGIAELKDGHSDSSAFSSPQGLVLTDKA-LYVADTGNHAIRRIDLSTFQVNT 279
Query: 138 IAGGGSKKEGR---ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
IAG G +GR PAQ S + ++L + L I+ G+ I ++LK + +
Sbjct: 280 IAGNGELAQGRLISGSTPAQ-VSLRSPWDLALDNN--TLYIAMAGSHQIWTLDLKSSELN 336
Query: 195 KSSQSGS 201
+ +G
Sbjct: 337 LFAGTGQ 343
Score = 37.4 bits (85), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 62/128 (48%), Gaps = 9/128 (7%)
Query: 69 FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVI 127
+TL SE ++ AG G + DG+ A F++P A ++GN ++VAD +
Sbjct: 325 IWTLDLKSSELNL---FAGTGQEALLDGKRRDAAFNQPSGLA--LRGNKLWVADAEASAV 379
Query: 128 RKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
R+I ++ V T+ G G + G DG + A ++ ++ + L ++D N IR
Sbjct: 380 RQIDLSSGKVDTLVGQGLFEFGLKDGGFKRALLQHNKDVVALDK-NTLAVADTYNHKIRL 438
Query: 186 INLKPEDC 193
++L +
Sbjct: 439 LDLDKQQV 446
>gi|359493626|ref|XP_003634638.1| PREDICTED: LOW QUALITY PROTEIN: protein argonaute 1B-like [Vitis
vinifera]
Length = 867
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 30/45 (66%)
Query: 123 SNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
SN IRKI + GVTTI GG S G +GP ++A FS DF++ +V
Sbjct: 10 SNLAIRKIGDSGVTTIVGGKSNVAGYGNGPCEDAKFSRDFDVVYV 54
>gi|290972869|ref|XP_002669173.1| predicted protein [Naegleria gruberi]
gi|284082716|gb|EFC36429.1| predicted protein [Naegleria gruberi]
Length = 1010
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 58/132 (43%), Gaps = 23/132 (17%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDM 113
S DL + DSSR F + + AG G GYS DG + F+ P A+
Sbjct: 75 SDDLFISDSSRHVI----FRMFSNGTISLFAGIGFAGYSKDGYSALDSLFNSPNGIAMSP 130
Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
GN+Y+AD N IR ++N V++I G SK G A V +
Sbjct: 131 NGNLYIADSQNDKIRIVSNGIVSSIDGTFSKPLGVA-----------------VSANNLV 173
Query: 174 LISDHGNQLIRQ 185
I+D GN LI++
Sbjct: 174 YIADTGNNLIKK 185
>gi|119961614|ref|YP_949427.1| NHL repeat-containing protein [Arthrobacter aurescens TC1]
gi|403528903|ref|YP_006663790.1| NHL repeat-containing protein [Arthrobacter sp. Rue61a]
gi|119948473|gb|ABM07384.1| putative NHL repeat protein [Arthrobacter aurescens TC1]
gi|403231330|gb|AFR30752.1| NHL repeat-containing protein [Arthrobacter sp. Rue61a]
Length = 657
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 58/117 (49%), Gaps = 7/117 (5%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---- 130
PL+ E V LAG G++G DG+ A F + A+D + NI+VAD +R++
Sbjct: 359 PLANE--VSILAGSGLEGLLDGKAEEAWFAQSSGLAIDGEDNIWVADSETSSLRRLVISD 416
Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ + V T G G G DG A A + +T +P ++ I+D N +R+ +
Sbjct: 417 SGVSVETAIGKGLFDFGFRDGEASEARLQHPLGVTVLPD-NSVAIADTYNGAVRRYD 472
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 54/108 (50%), Gaps = 13/108 (12%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK-------S 108
+ + +V+DS L L V+R+ G G +GY DG+ A+F++P+
Sbjct: 188 NGNYLVVDSGHHRLVELRPDLE---TVERVIGSGTKGYLDGQSEIAQFNEPQGVTLLPSE 244
Query: 109 FAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRADGPAQ 154
A + + VAD NH +R +T + V T+AG G ++ A GPA+
Sbjct: 245 LAWKLGYDAVVADTVNHRLRGVTLSSGYVQTLAGNGVQRLLDA-GPAR 291
>gi|428166914|gb|EKX35881.1| hypothetical protein GUITHDRAFT_79273, partial [Guillardia theta
CCMP2712]
Length = 318
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 78/174 (44%), Gaps = 14/174 (8%)
Query: 40 GHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP 98
G + IN P+ V P + +IV D+ + + E LAG + G ++G
Sbjct: 107 GTKARINTPYGVKVSPCGNYVIVSDTGNNMIRKVDI---ESGYTNTLAGQSLAGTANGVG 163
Query: 99 GSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI---------TNLGVTTIAGGGSKKEGR 148
A+F+ P VD + YV+D+ N+ IRKI T+ + + GS G
Sbjct: 164 TLAQFNMPVDVTVDWNETVAYVSDQGNNCIRKIDLLTAALDWTSATPSLVVVAGSGVAGL 223
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
D +A F N + + +LL++D + IR+I+L + + + +G+A
Sbjct: 224 TDAVGLSAQFYNPTGVAVDWYGASLLVADSMDSTIRRIDLMTSEVTTLAGNGNA 277
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 55/119 (46%), Gaps = 5/119 (4%)
Query: 71 TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRK 129
T +FP++ + + GDG G G AR + P V GN + V+D N++IRK
Sbjct: 84 TATFPITAVAWI----GDGTAGNVQGYGTKARINTPYGVKVSPCGNYVIVSDTGNNMIRK 139
Query: 130 ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ T G G A+G A F+ ++T + +SD GN IR+I+L
Sbjct: 140 VDIESGYTNTLAGQSLAGTANGVGTLAQFNMPVDVTVDWNETVAYVSDQGNNCIRKIDL 198
Score = 43.5 bits (101), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 65/143 (45%), Gaps = 15/143 (10%)
Query: 60 IVLDSSRSAFYTLSFP---LSE----ESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAV 111
IV SR+ Y+ ++ + E +V +AG G Y D +ARF V
Sbjct: 3 IVPSKSRNVLYSTTWDNYCVKEIDLTTKIVTTIAGLCGTNQYVDSTGTNARF--SNMIGV 60
Query: 112 DMKGN---IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
D+ N +YVAD N+ +R+I +T +A G G G A + + +
Sbjct: 61 DLTKNEQYLYVADVGNNKLRRIDTATFPITAVAWIGDGTAGNVQGYGTKARINTPYGVKV 120
Query: 167 VPHICALLISDHGNQLIRQINLK 189
P +++SD GN +IR+++++
Sbjct: 121 SPCGNYVIVSDTGNNMIRKVDIE 143
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 55/112 (49%), Gaps = 18/112 (16%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNLG--VTTIAGG 141
+AG GV G +D SA+F P AVD G ++ VAD + IR+I + VTT+AG
Sbjct: 215 VAGSGVAGLTDAVGLSAQFYNPTGVAVDWYGASLLVADSMDSTIRRIDLMTSEVTTLAGN 274
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICAL-------LISDHGNQLIRQI 186
G+ G N ++ND E T VP AL +SD IR++
Sbjct: 275 GNA------GFIDNL-YANDAEFT-VPFGVALSRDGKYVFVSDQNRNNIRKM 318
>gi|428174200|gb|EKX43097.1| hypothetical protein GUITHDRAFT_153351 [Guillardia theta CCMP2712]
Length = 392
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 63/144 (43%), Gaps = 7/144 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P ++ P + LIV D T + + S + G+G +GY DG +A F+ P
Sbjct: 150 PSALALAPNQATLIVSDLELHQLLTFNMADGKTS---KFVGNGQRGYQDGA--TASFNGP 204
Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
+ G I VAD NHV+RKI +T+ G + G DG +A F + + +
Sbjct: 205 RGLTFSPDGTYILVADTGNHVVRKIDVGSAMTSTLAGAAGMAGYQDGMGVDARFQSPYGI 264
Query: 165 TFVPHICALLISDHGNQLIRQINL 188
T +S+ IR I+L
Sbjct: 265 TIDSVGQFAYVSETDGNRIRMIDL 288
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 49/94 (52%), Gaps = 7/94 (7%)
Query: 103 FDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
F P + GN I V D N+V+R + + V+TIAG G ++G ADG A +A F++
Sbjct: 46 FSGPSGISTWANGNSILVVDTGNNVVRLLASAQVSTIAGNG--RQGVADGQATSAEFNSP 103
Query: 162 FELT----FVPHICALLISDHGNQLIRQINLKPE 191
++ F + L+SD N+ IR + + E
Sbjct: 104 SDVATKTDFNTGMVTALVSDTNNKKIRFLYVGQE 137
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 72/167 (43%), Gaps = 13/167 (7%)
Query: 39 DGHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE 97
DG N P + P + ++V D+ + + S + AG + GY DG
Sbjct: 195 DGATASFNGPRGLTFSPDGTYILVADTGNHVVRKIDVGSAMTSTLAGAAG--MAGYQDGM 252
Query: 98 PGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPA 153
ARF P +D G YV++ + IR I TN+ V++I+G G DG
Sbjct: 253 GVDARFQSPYGITIDSVGQFAYVSETDGNRIRMIDLATNV-VSSISGSSLGLAGSQDGEG 311
Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQI----NLKPEDCSKS 196
A FS +T + + LL+SD N IR + N+K + C S
Sbjct: 312 YTARFSGPKGVTLM-NDQVLLVSDSNNNRIRAVATKDNIKIDLCVSS 357
>gi|423301816|ref|ZP_17279839.1| hypothetical protein HMPREF1057_02980 [Bacteroides finegoldii
CL09T03C10]
gi|408470907|gb|EKJ89439.1| hypothetical protein HMPREF1057_02980 [Bacteroides finegoldii
CL09T03C10]
Length = 469
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 49/96 (51%), Gaps = 2/96 (2%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPA 153
DG+ G + + P +D + NIYVAD+ N IRKI+ + G T G ++ EG DG
Sbjct: 375 DGDLGGSTLNHPCQICMDSQNNIYVADRYNCCIRKISLDAGTITTFAGKAQTEGYQDGLI 434
Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
A F + + V + ++D N IR+I L+
Sbjct: 435 SEALFDDPMGIA-VDKNGVIYVADSDNYAIRRIALE 469
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 24/41 (58%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+GY DG A FD P AVD G IYVAD N+ IR+I
Sbjct: 426 TEGYQDGLISEALFDDPMGIAVDKNGVIYVADSDNYAIRRI 466
>gi|290987128|ref|XP_002676275.1| predicted protein [Naegleria gruberi]
gi|284089876|gb|EFC43531.1| predicted protein [Naegleria gruberi]
Length = 1182
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 65/136 (47%), Gaps = 9/136 (6%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAV 111
P + +DSS T+ + +S +V AG G GY +D P SA + PK A
Sbjct: 441 PNVGEYYFVDSS-----TVKYLISSSGIVITKAGSGTTGYNADNIPATSAHLNLPKGIAR 495
Query: 112 DMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
G++Y+ D NH IRK+ N + T+AG G+ A +A+ +N + + +
Sbjct: 496 STNGDLYITDSGNHRIRKVFFNQTIITVAGTGTSTYNGDGLLATSANINNPYGIVLTSN- 554
Query: 171 CALLISDHGNQLIRQI 186
++ S++ +R+I
Sbjct: 555 NEVIFSEYEGHRVRKI 570
Score = 37.7 bits (86), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 54/110 (49%), Gaps = 6/110 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTT 137
+K +AG G+ G + ARF+ P + A +Y++D ++HVIRK+ +NL ++T
Sbjct: 100 IKTIAGVGIPGSVTTTIATLARFNYPSALAFQNSDVMYLSDTNSHVIRKVILSSNL-IST 158
Query: 138 IAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG G DG A NA + + P L I++ N IR +
Sbjct: 159 YAGQVGVPGSIGDDGVATNALLNTPGGIVISPVNNNLYIAETSNNKIRVV 208
>gi|86738767|ref|YP_479167.1| serine/threonine protein kinase [Frankia sp. CcI3]
gi|86565629|gb|ABD09438.1| serine/threonine protein kinase [Frankia sp. CcI3]
Length = 863
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 59/115 (51%), Gaps = 4/115 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S + ++ G GV GY+ DG P +A + + P + G +Y AD ++ I+K+T
Sbjct: 682 ISPQGIITTYVGTGVAGYTGDGGPATAAKINGPNDLQMTDDGTLYFADLASDTIQKVTPD 741
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G +TT+AG G GPA A N LT P L ++D+ N IR+++
Sbjct: 742 GIITTVAGTGEGGFSGDGGPATRARL-NVPSLTVGPDGRTLYLADYRNHRIRRVD 795
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 52/89 (58%), Gaps = 2/89 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
+A+ D P S A D GNIY+ D N+ IRKI+ G +TTIAG G G GPA A
Sbjct: 596 AAQLDGPGSTARDKAGNIYIGDAKNNRIRKISPAGIITTIAGTGDAGYGGDGGPATAAKI 655
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
++ ++T P ++ +SD+ N IR+I+
Sbjct: 656 NSAEKVTTGPD-GSVYLSDYENHRIRKIS 683
Score = 40.4 bits (93), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 49/89 (55%), Gaps = 4/89 (4%)
Query: 101 ARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
AR + P S V G +Y+AD NH IR++ GV TTIAG G + G GPA A F
Sbjct: 765 ARLNVP-SLTVGPDGRTLYLADYRNHRIRRVDPNGVITTIAGTGGEGSGGDGGPATAAQF 823
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
N + V AL I+D+GN +R+I+
Sbjct: 824 KNPSSVA-VDGSGALYIADNGNDRVRRID 851
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIA 139
+A+F P S AVD G +Y+AD N +R+I N +TT+A
Sbjct: 820 AAQFKNPSSVAVDGSGALYIADNGNDRVRRIDPNGTITTVA 860
>gi|428214156|ref|YP_007087300.1| thiol-disulfide isomerase-like thioredoxin [Oscillatoria acuminata
PCC 6304]
gi|428002537|gb|AFY83380.1| thiol-disulfide isomerase-like thioredoxin [Oscillatoria acuminata
PCC 6304]
Length = 506
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 51/109 (46%), Gaps = 9/109 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ SS L + DS + V+ + G+G+ G +DG A+F P
Sbjct: 178 PGKVLADAASSRLFIADSGHHRIIVTTL----NGTVQSIIGNGIPGLTDGSFSEAQFFGP 233
Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGP 152
+ + D +YVAD NH IR+I T V T+AG G ++ R GP
Sbjct: 234 QGMSWDANSQRLYVADTENHAIRQIDFTTEQVQTLAGTG--EQNRTQGP 280
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 30/48 (62%), Gaps = 1/48 (2%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
K + GDG G DG+ ++RF +P ++ G +Y+AD +NH IR++
Sbjct: 437 KTVLGDGTPGLLDGKNTTSRFFEPGGLSI-ASGILYIADTNNHRIRRV 483
>gi|12323029|gb|AAG51506.1|AC058785_9 hypothetical protein [Arabidopsis thaliana]
Length = 1041
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 65/140 (46%), Gaps = 16/140 (11%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLG 134
E + + ++ G +G+ DG A F++P+ A + K N +YVAD NH +R+I N
Sbjct: 597 EGNFIVQIGSSGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREIDFVNER 656
Query: 135 VTTIAG---------GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
V T+AG GG K +A QN +E + + I + GN R
Sbjct: 657 VQTLAGNGTKGSDYQGGRKGTKQASDLIQNIGQHQIWEYSVLDGITRVF---SGNGYERN 713
Query: 186 IN-LKPEDCSKSSQSGSALG 204
+N P+ S + SG +LG
Sbjct: 714 LNGSTPQTTSFAQPSGISLG 733
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 38/79 (48%), Gaps = 2/79 (2%)
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
G IY+ D NH I+K+ + + G+ K G DG + A S L + L
Sbjct: 800 GQIYLTDSYNHKIKKLDPVTKRVVTLAGTGKAGFKDGKVKGAQLSEPAGLAITEN-GRLF 858
Query: 175 ISDHGNQLIRQINL-KPED 192
++D N LIR I+L K ED
Sbjct: 859 VADTNNSLIRYIDLNKGED 877
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 3/55 (5%)
Query: 79 ESVVKR---LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V KR LAG G G+ DG+ A+ +P A+ G ++VAD +N +IR I
Sbjct: 816 DPVTKRVVTLAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI 870
>gi|428172619|gb|EKX41527.1| hypothetical protein GUITHDRAFT_153825 [Guillardia theta CCMP2712]
Length = 285
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 65/144 (45%), Gaps = 14/144 (9%)
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMK 114
++ + D+ A ++ V +AG G G+ DGE SARF+ P AV +
Sbjct: 133 ENIFIADTYNHAIRHITRDEDRNWTVTLIAGSRSGQSGFEDGEGESARFNCPTGLAVVNE 192
Query: 115 GNIYVADKSNHVIRKITNLG----VTTIAGGGSK-KEGR-------ADGPAQNASFSNDF 162
I ++D SN IR + + VTT+ G SK EGR DGP A +
Sbjct: 193 QEILISDFSNSAIRLLRRVRGDWLVTTVVGKTSKDSEGRMLPESGYVDGPVDQARLNRPH 252
Query: 163 ELTFVPHICALLISDHGNQLIRQI 186
+ + ++LI+D N IR++
Sbjct: 253 GVAWDEASRSVLIADCMNHRIRRV 276
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 63/133 (47%), Gaps = 18/133 (13%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAV-DMKG--NIYVADKSNHVIRK--ITN-LG--VT 136
+AG G +G+ DG G +F P DM G ++ VAD N+ IRK ITN +G V
Sbjct: 39 IAGHGAKGHVDGPVGECKFCYPHDVLFQDMDGAEHLLVADFDNNCIRKVFITNGVGEHVV 98
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED---- 192
T AG K G DG + A F++ ++ + I+D N IR I + ED
Sbjct: 99 TWAGSNLSKPGLRDGSIKEALFNHPGGISADQSGENIFIADTYNHAIRHIT-RDEDRNWT 157
Query: 193 -----CSKSSQSG 200
S+S QSG
Sbjct: 158 VTLIAGSRSGQSG 170
>gi|198421727|ref|XP_002130106.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 454
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/146 (24%), Positives = 62/146 (42%), Gaps = 3/146 (2%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP + + L + D+ A ++ + V G ++G+ DG A +
Sbjct: 176 NPQQLEYDDTNDRLFIADTDNHAIRIINVDSATPKVTTLTGGPRMEGFKDGSFAEAMWRH 235
Query: 106 PKSFAVDMKGNI-YVADKSNHVIR--KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
P A D++ ++ YV+D NH +R + VTTIAGGG + + +
Sbjct: 236 PTGMAYDVRKDVLYVSDHYNHAVRMLDLKKKTVTTIAGGGEPGDVNQRIFGHGLAMNYPE 295
Query: 163 ELTFVPHICALLISDHGNQLIRQINL 188
+ P L +++ GN IR I+L
Sbjct: 296 GIAIDPEHEVLYVAEFGNNCIRMISL 321
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 62/147 (42%), Gaps = 6/147 (4%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNI 117
+ + DS +A + +E + GD + + DG +A + P+ D +
Sbjct: 132 MYIADSGNNAIRLMDLGTNEVTTFAGTNGD--KAFKDGSLLTAGLNNPQQLEYDDTNDRL 189
Query: 118 YVADKSNHVIRKITNLGVT---TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
++AD NH IR I T T GG + EG DG A + + + + L
Sbjct: 190 FIADTDNHAIRIINVDSATPKVTTLTGGPRMEGFKDGSFAEAMWRHPTGMAYDVRKDVLY 249
Query: 175 ISDHGNQLIRQINLKPEDCSKSSQSGS 201
+SDH N +R ++LK + + + G
Sbjct: 250 VSDHYNHAVRMLDLKKKTVTTIAGGGE 276
>gi|317419409|emb|CBN81446.1| NHL repeat-containing protein 2 [Dicentrarchus labrax]
Length = 719
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 78/176 (44%), Gaps = 32/176 (18%)
Query: 16 SLLFSLVSSG-----LLLEDG----YTVTTVIDGHQLEIN------PHSVIDRPGSSDLI 60
+LLFS+V G +L D Y ++ H ++I P S + PG +
Sbjct: 168 NLLFSIVGEGHRDRLMLFTDCALRYYGEQGLLKTHAVKIKLYRDSLPTSFLSFPGK---V 224
Query: 61 VLDSSRSAFYTLS------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
+DSS LS + + G G DG+ A F+ P+ A+ K
Sbjct: 225 AIDSSEKRLAIADTGHHRILLLSSTGQLLHVIGGPESGRRDGDLSEASFNSPQGVAI--K 282
Query: 115 GN-IYVADKSNHVIRKITNLG--VTTIAG---GGSKKEGRADGPAQNASFSNDFEL 164
G+ +YVAD NH+IRKI L V+T+AG G+ K+G A GP Q S D L
Sbjct: 283 GDTVYVADTENHLIRKIDLLEGRVSTLAGVGIQGTDKDGGAMGPQQPISSPWDVTL 338
>gi|348681654|gb|EGZ21470.1| hypothetical protein PHYSODRAFT_329412 [Phytophthora sojae]
Length = 251
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 79/162 (48%), Gaps = 19/162 (11%)
Query: 12 ITLLSLLFSLVSSGLLLE--DGYTVTTVIDGHQLEINPHSVIDRPGSSDL------IVLD 63
IT+ SL SLV G+ ++ + V+ + + ++I P + +S + I +D
Sbjct: 61 ITIASLADSLVLQGITVDSSENVYVSRLTEFDVVKIAPDGSVSIFANSSMFTLAMGIAID 120
Query: 64 SSRSAFYT-----LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
SS + + T L F L+ E V +AG VQG+ + ARF P + + GN+Y
Sbjct: 121 SSDNIYITDQHRILKFTLAGEMGV--VAGR-VQGFLNALGELARFSTPWALTIGSDGNLY 177
Query: 119 VADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASF 158
V D N+ IRK+ T VTT A G G DG A +A+F
Sbjct: 178 VVDSDNNCIRKVDLTTREVTTYA-GICLTSGTTDGLATDATF 218
>gi|168698189|ref|ZP_02730466.1| NHL repeat containing protein [Gemmata obscuriglobus UQM 2246]
Length = 358
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 2/103 (1%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASF 158
A F+ S AV G +Y+AD N+ +R V AG G+K GPA++A F
Sbjct: 99 ATFNGMHSLAVGPGGTVYLADTWNNRVRVFDPEAGTVKAFAGTGTKGFSGDGGPAKDAKF 158
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
F + F P L ++D N+ IR+I++K + + ++ +G
Sbjct: 159 GGVFCVAFDPAQKNLYVTDLDNRRIRKIDMKTQTVTTAAGNGE 201
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 65/148 (43%), Gaps = 9/148 (6%)
Query: 42 QLEINPHS-VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS 100
Q ++P + +D+ G +L VL+ A + + ++ +AG G G G+
Sbjct: 214 QPLVDPRAHAVDKDG--NLWVLERGGHALRVVD----PKGTIRTVAGTGKAGRGTGKALE 267
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNH-VIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
A + PK VD G + +AD NH V+R G +T +AG G K G A F
Sbjct: 268 AAMNGPKHLCVDKDGTVLIADTENHRVVRFDPKAGTLTAVAGTGKKGNSLGGGDPLKAEF 327
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
+ +T P + ISD N + +I
Sbjct: 328 NQPHGVTVHPKTGDIYISDANNGRVLKI 355
Score = 40.8 bits (94), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 71/159 (44%), Gaps = 25/159 (15%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPGSARFD---KPKSF 109
P +L V D + + V AG+G +G DGE +AR P++
Sbjct: 168 PAQKNLYVTDLDNRRIRKIDM---KTQTVTTAAGNGEKGVPKDGE--NARTQPLVDPRAH 222
Query: 110 AVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
AVD GN++V ++ H +R + G + T+AG G K GR G A A+ +
Sbjct: 223 AVDKDGNLWVLERGGHALRVVDPKGTIRTVAGTG--KAGRGTGKALEAAMNGP------K 274
Query: 169 HIC-----ALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
H+C +LI+D N R + P+ + ++ +G+
Sbjct: 275 HLCVDKDGTVLIADTENH--RVVRFDPKAGTLTAVAGTG 311
Score = 37.7 bits (86), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 36/72 (50%), Gaps = 5/72 (6%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVD-MKGNIYVADKSNHVIRKI--TN 132
E VK AG G +G+S DG P A+F A D + N+YV D N IRKI
Sbjct: 131 EAGTVKAFAGTGTKGFSGDGGPAKDAKFGGVFCVAFDPAQKNLYVTDLDNRRIRKIDMKT 190
Query: 133 LGVTTIAGGGSK 144
VTT AG G K
Sbjct: 191 QTVTTAAGNGEK 202
>gi|170066897|ref|XP_001868267.1| NHL repeat containing 2 [Culex quinquefasciatus]
gi|167863075|gb|EDS26458.1| NHL repeat containing 2 [Culex quinquefasciatus]
Length = 734
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLG 134
SE +V+ R+ G G+ DG+ ARF+ P+ A I+VAD NH IR+I +
Sbjct: 260 SEGTVLHRIGGKK-SGFVDGDFRKARFNAPQGLAFQNDDVIFVADNENHAIRRIDLKSKQ 318
Query: 135 VTTIAGGGSKKEGRADG 151
VTT+AG G + R G
Sbjct: 319 VTTVAGSGQQGCDRIGG 335
>gi|430743901|ref|YP_007203030.1| thiol-disulfide isomerase-like thioredoxin [Singulisphaera
acidiphila DSM 18658]
gi|430015621|gb|AGA27335.1| thiol-disulfide isomerase-like thioredoxin [Singulisphaera
acidiphila DSM 18658]
Length = 696
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 67/149 (44%), Gaps = 9/149 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ S L + D++ + + VV G G G DG+ A F++P
Sbjct: 229 PGKVLADEKSKQLFIADTAHNRIVLTDLDGRKSVVV----GSGGIGMVDGDYAKAEFNRP 284
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRAD-GPAQNASFSNDFE 163
+ + + +YVAD NH IR I V+T+AG G + R+ G + S S+ ++
Sbjct: 285 QGLCL-VDDTLYVADTENHAIRAIHLKTKQVSTVAGTGQQGHRRSGAGAGKATSLSSPWD 343
Query: 164 LTFVPHICALLISDHGNQLIRQINLKPED 192
L +P L I+ G I + ++ P D
Sbjct: 344 LVLIPGTKTLAIAMAGTHQIWRYDI-PSD 371
>gi|153864226|ref|ZP_01997193.1| NHL repeat protein [Beggiatoa sp. SS]
gi|152146280|gb|EDN72808.1| NHL repeat protein [Beggiatoa sp. SS]
Length = 313
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 53/107 (49%), Gaps = 5/107 (4%)
Query: 84 RLAGD-GVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
R+AG GV G+S G A+ + PK V G +Y+ DK NH IRK+ G +TTI
Sbjct: 45 RIAGTLGVAGFSGDGGRAVEAKLNNPKRIVVADDGVVYITDKGNHRIRKVDTSGIITTIV 104
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G G G A A ++ ++ + +L I D N IR++
Sbjct: 105 GNGVASFGGDGGEAPGAQLNSPSDIA-IDDSGSLYIIDTNNHRIRKV 150
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 40 GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGE 97
G QL ID GS L ++D++ + +++ LAG GV G+S DG+
Sbjct: 120 GAQLNSPSDIAIDDSGS--LYIIDTNNHRIRKVE---KATNIITTLAG-GVSGFSGDDGQ 173
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
SA+F+ P A+ +G++ +AD NH +R IT +
Sbjct: 174 ALSAQFNTPDGIAISDEGHLLIADSENHRVRMITGV 209
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 22/94 (23%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT-----------NLGVTTIAG-GGSKKEGRADGPA 153
P A+D GN+Y+++ NHVIRK+ LGV +G GG E + + P
Sbjct: 12 PNGVAIDKAGNLYISEAGNHVIRKVDANTQSIFRIAGTLGVAGFSGDGGRAVEAKLNNPK 71
Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ V + I+D GN IR+++
Sbjct: 72 R----------IVVADDGVVYITDKGNHRIRKVD 95
>gi|395226792|ref|ZP_10405227.1| Cadherin domain-containing protein, partial [Thiovulum sp. ES]
gi|394444937|gb|EJF05961.1| Cadherin domain-containing protein, partial [Thiovulum sp. ES]
Length = 1326
Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 41/68 (60%), Gaps = 2/68 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG 134
++++ + ++ G G G++DG+ A+ P F D NI+VAD +N +RKIT N
Sbjct: 205 INKDGEISKIVG-GSYGFADGDSSIAQLKSPSGFDFDKNWNIFVADTNNGRVRKITPNGD 263
Query: 135 VTTIAGGG 142
VTTIAG G
Sbjct: 264 VTTIAGDG 271
>gi|428171420|gb|EKX40337.1| hypothetical protein GUITHDRAFT_75685, partial [Guillardia theta
CCMP2712]
Length = 284
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 42/77 (54%), Gaps = 5/77 (6%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGG 141
LAGDG G +G +A F P+ A K + IYV+D N +IRKI VTT AG
Sbjct: 181 LAGDGNVGSKNGRGTAATFYTPQGIAYSSKQSAIYVSDYQNSLIRKIEVNTREVTTAAG- 239
Query: 142 GSKKEGRADGPAQNASF 158
S G ADGPA +A F
Sbjct: 240 -SLSVGCADGPATSAQF 255
>gi|288923318|ref|ZP_06417452.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288345334|gb|EFC79729.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 849
Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats.
Identities = 37/115 (32%), Positives = 60/115 (52%), Gaps = 4/115 (3%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + ++ LAG DG + DG P S A P + A+D G +Y+AD SN+ IR+I
Sbjct: 678 VGSDGIITTLAGQDGEGSFGDGGPASKALLAFPLAVALDRFGRLYIADTSNNRIRRIGLD 737
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G + T+AG G GPA A+ + +T V + I+D N+ +R+++
Sbjct: 738 GRIETVAGNGRPGLSGDGGPATKATLRSPRGVT-VDAAGTIYITDRTNRRVRRVD 791
Score = 47.0 bits (110), Expect = 0.008, Method: Composition-based stats.
Identities = 42/141 (29%), Positives = 69/141 (48%), Gaps = 10/141 (7%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSAR-FDKPKS 108
+DR G DL++ D+ F + ++ +AG+G +S DG P +A P
Sbjct: 603 LDRDG--DLLIADT----FNQRIRRVDAAGIITTVAGNGEHAFSGDGGPATAAALWYPGG 656
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
AVD G +++AD +N+ IR++ + G +TT+AG + GPA A + + +
Sbjct: 657 VAVDGNGTVFIADTANNRIRRVGSDGIITTLAGQDGEGSFGDGGPASKALLAFPLAVA-L 715
Query: 168 PHICALLISDHGNQLIRQINL 188
L I+D N IR+I L
Sbjct: 716 DRFGRLYIADTSNNRIRRIGL 736
Score = 41.6 bits (96), Expect = 0.36, Method: Composition-based stats.
Identities = 32/90 (35%), Positives = 48/90 (53%), Gaps = 2/90 (2%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNAS 157
+A + P D GN+YVAD N+ +R+I+ G + T+AG G K G GPA +A
Sbjct: 535 ATASVNAPFGLVTDRAGNVYVADSGNNRVRRISPDGTIVTVAGTGVKGFGGDGGPAVDAQ 594
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQIN 187
+T + LLI+D NQ IR+++
Sbjct: 595 LYQPSAVT-LDRDGDLLIADTFNQRIRRVD 623
>gi|290971380|ref|XP_002668485.1| predicted protein [Naegleria gruberi]
gi|284081915|gb|EFC35741.1| predicted protein [Naegleria gruberi]
Length = 277
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 8/120 (6%)
Query: 39 DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP 98
D + +IN H + R SSD ++ + + TL+ P+S + + +G DG
Sbjct: 133 DVYICDINNHRI--RKVSSDGVISTIAGTGRTTLN-PISPNTYRQEGGYNG-----DGLA 184
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
+ + PK VD KG++Y AD N +IRK++N +TTIAG G A N +F
Sbjct: 185 TATDLNYPKYAQVDSKGDVYFADTYNGLIRKVSNGYITTIAGSKQNSTVNTFGMALNTTF 244
>gi|345482264|ref|XP_001607897.2| PREDICTED: NHL repeat-containing protein 2-like [Nasonia
vitripennis]
Length = 768
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 42/73 (57%), Gaps = 1/73 (1%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+ ++ V+ + G G+ DG+ +ARF+ P+ V +K +++VAD NH IRKI
Sbjct: 329 MGKDGKVEHVVGGYSPGFKDGDFETARFNAPQGVCV-LKDDVFVADNENHAIRKINLKTK 387
Query: 136 TTIAGGGSKKEGR 148
T I+ G+ +GR
Sbjct: 388 TVISIAGTGSQGR 400
>gi|170759724|ref|YP_001788082.1| NHL repeat-containing protein [Clostridium botulinum A3 str. Loch
Maree]
gi|169406713|gb|ACA55124.1| NHL repeat protein [Clostridium botulinum A3 str. Loch Maree]
Length = 464
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 75/171 (43%), Gaps = 22/171 (12%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEESV 81
++LE+ Y + ++ IN + +I G + +++LD + T+
Sbjct: 160 ENIILENKYEKEEYMYPSKISINENGLIAFSEKGKNRIVILDQNLEKIKTI--------- 210
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIA 139
G G DG+ +F P+ ++ IYVAD NH IR+ + V TIA
Sbjct: 211 -----GSSEYGLKDGDFKETQFKAPEGLRF-IENKIYVADTENHSIRECDLEKEIVKTIA 264
Query: 140 GGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G G K+ A G A N S ++ ++L + + I+ GN I + N+K
Sbjct: 265 GNGQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMK 313
>gi|30696124|ref|NP_564718.2| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|17529226|gb|AAL38840.1| unknown protein [Arabidopsis thaliana]
gi|332195279|gb|AEE33400.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 1055
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLG 134
E + + ++ G +G+ DG A F++P+ A + K N +YVAD NH +R+I N
Sbjct: 597 EGNFIVQIGSSGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREIDFVNER 656
Query: 135 VTTIAGGGSK 144
V T+AG G+K
Sbjct: 657 VQTLAGNGTK 666
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 38/79 (48%), Gaps = 2/79 (2%)
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
G IY+ D NH I+K+ + + G+ K G DG + A S L + L
Sbjct: 814 GQIYLTDSYNHKIKKLDPVTKRVVTLAGTGKAGFKDGKVKGAQLSEPAGLAITEN-GRLF 872
Query: 175 ISDHGNQLIRQINL-KPED 192
++D N LIR I+L K ED
Sbjct: 873 VADTNNSLIRYIDLNKGED 891
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 3/55 (5%)
Query: 79 ESVVKR---LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V KR LAG G G+ DG+ A+ +P A+ G ++VAD +N +IR I
Sbjct: 830 DPVTKRVVTLAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI 884
>gi|290975356|ref|XP_002670409.1| predicted protein [Naegleria gruberi]
gi|284083967|gb|EFC37665.1| predicted protein [Naegleria gruberi]
Length = 2200
Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats.
Identities = 37/111 (33%), Positives = 50/111 (45%), Gaps = 8/111 (7%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV- 135
+E S +GDG G A+ + P ++ G IY+AD NH IRKI + GV
Sbjct: 158 NEHSDCDGFSGDG------GLASRAKLNSPNGLSISQNGEIYIADTLNHRIRKINSYGVI 211
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+TIAG G G DG S N + + I+D N IR+I
Sbjct: 212 STIAGTGRASFG-GDGGFAVLSPLNSPNGVHISQNGEIYIADTLNHRIRKI 261
Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 53/110 (48%), Gaps = 6/110 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
V+ +AG G + G+ G A + P + G IY+AD NH IRKI N +T
Sbjct: 210 VISTIAGTGRASFG-GDGGFAVLSPLNSPNGVHISQNGEIYIADTLNHRIRKIFVNGLIT 268
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T+AG G G G A N+ ++ + + V + I+D N IR+I
Sbjct: 269 TVAGTGRASFGGDGGLAINSQLNSPYGV-HVSQSGEIYIADTLNHRIRKI 317
Score = 42.4 bits (98), Expect = 0.21, Method: Composition-based stats.
Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 6/111 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
++ +AG G + G+ G +++ + P V G IY+AD NH IRKI N +T
Sbjct: 266 LITTVAGTGRASFG-GDGGLAINSQLNSPYGVHVSQSGEIYIADTLNHRIRKIFVNGTIT 324
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICALLISDHGNQLIRQI 186
TIAG S DG AS N + FV P+ +LI+D N IR+I
Sbjct: 325 TIAGSSSDGSFGGDGGLSIASRLNSPKGVFVSPNNNEILIADTSNNRIRKI 375
>gi|290978083|ref|XP_002671766.1| predicted protein [Naegleria gruberi]
gi|284085337|gb|EFC39022.1| predicted protein [Naegleria gruberi]
Length = 258
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 57/108 (52%), Gaps = 5/108 (4%)
Query: 82 VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ +AG+G G+S G +A+ + P V +K +Y+AD NH IRK++N G + TI
Sbjct: 38 IMTVAGNGELGFSSDGGLAINAKLNYPNGVFV-VKEEVYIADYHNHRIRKVSNNGIIETI 96
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG G +GPA NA N FV + ISD+ N +R++
Sbjct: 97 AGNGEGGFEGDNGPAINAKL-NYPTSVFVSENGEVYISDYLNNRVRKV 143
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 57/115 (49%), Gaps = 7/115 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-T 131
+S +++ +AG+G G+ +G+ G +A+ + P S V G +Y++D N+ +RK+
Sbjct: 87 VSNNGIIETIAGNGEGGF-EGDNGPAINAKLNYPTSVFVSENGEVYISDYLNNRVRKVLQ 145
Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
N + TIAG G G A NA + + + I+D N IR++
Sbjct: 146 NGNIVTIAGNGKLGCSGDGGLAINAELNCPMNVFVFNE--EVYITDSANHRIRKV 198
Score = 40.0 bits (92), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S+ +++ +AG+G +G+S DG A+ + P S V+ G IY+ D +N IRK+
Sbjct: 198 VSKSGIIETIAGNGNEGFSGDDGLATQAQLNCPMSTFVNSNGEIYITDSNNFRIRKV 254
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 57/111 (51%), Gaps = 8/111 (7%)
Query: 85 LAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
+AG+G G S G+ G +A + P + V +Y+ D +NH IRK++ G + TIAG
Sbjct: 152 IAGNGKLGCS-GDGGLAINAELNCPMNVFV-FNEEVYITDSANHRIRKVSKSGIIETIAG 209
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
G++ DG A A + TFV + I+D N IR++ LK E
Sbjct: 210 NGNEGFSGDDGLATQAQLNCPMS-TFVNSNGEIYITDSNNFRIRKV-LKVE 258
>gi|406901359|gb|EKD44040.1| NHL repeat containing protein, partial [uncultured bacterium]
Length = 667
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 68/135 (50%), Gaps = 7/135 (5%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGN-IY 118
V D+ ++ S++ K +AG + G+ DG +A F+ P + +D N +Y
Sbjct: 531 VADTGNEVIRRVTLTGSKQGQTKLIAGSPTESGFKDGTKTAAEFNVPIALTIDSADNYLY 590
Query: 119 VADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
VAD+ NH IRK I++ VTT+ G S G DG ++A + E+ + + + S
Sbjct: 591 VADRDNHAIRKVRISDGKVTTVTGNPS-TPGYLDGRLEDAYLNYPVEVYY--NRGNIYFS 647
Query: 177 DHGNQLIRQINLKPE 191
+ G Q +R +++ +
Sbjct: 648 ESGTQRVRVVDMADD 662
Score = 44.3 bits (103), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 42/98 (42%), Gaps = 5/98 (5%)
Query: 96 GEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT----NLGVTTIAGGGSKKEGRAD 150
G A F +P VD G YVAD N VIR++T G T + G + G D
Sbjct: 507 GRKNCAYFSRPGGIVVDGSGKYAYVADTGNEVIRRVTLTGSKQGQTKLIAGSPTESGFKD 566
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G A F+ LT L ++D N IR++ +
Sbjct: 567 GTKTAAEFNVPIALTIDSADNYLYVADRDNHAIRKVRI 604
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 46/93 (49%), Gaps = 5/93 (5%)
Query: 60 IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
I+ D + + + T +F + +V + G G G +G +++F P + AVD
Sbjct: 51 IIADGAGNFYVTDTFNGVIRKIDANGIVSTVVGQGGYGDVNGSATTSKFAHPSAVAVDDS 110
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
GN+Y+AD N I+K + VTT+ + EG
Sbjct: 111 GNVYIADAGNGKIKKFSGGRVTTLKSDLDRPEG 143
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 39/84 (46%), Gaps = 12/84 (14%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
A FD P+ D GN YV D N VIRKI G V+T+ G G G +G A + F+
Sbjct: 43 AYFDFPEDIIADGAGNFYVTDTFNGVIRKIDANGIVSTVVGQGG--YGDVNGSATTSKFA 100
Query: 160 NDFELTFVPHICALLISDHGNQLI 183
H A+ + D GN I
Sbjct: 101 ---------HPSAVAVDDSGNVYI 115
>gi|290976488|ref|XP_002670972.1| predicted protein [Naegleria gruberi]
gi|284084536|gb|EFC38228.1| predicted protein [Naegleria gruberi]
Length = 704
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 60/113 (53%), Gaps = 5/113 (4%)
Query: 81 VVKRLAGD-GVQGYSDGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
++ +AG G G++D SA+ + P++ A+ G+IY+AD N+ IRKIT ++
Sbjct: 211 IITTIAGTAGSSGFTDNVLSTSAKLNGPQAVAIMSNGDIYIADTQNNRIRKITAATGIIS 270
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
TI G G+ DG A ++ N F+ L I+D N +R+I+L+
Sbjct: 271 TICGTGNGGIA-GDGSAATSAMINSPRDLFLGLQNDLYIADSWNHRLRRIDLR 322
>gi|340370782|ref|XP_003383925.1| PREDICTED: NHL repeat-containing protein 2-like [Amphimedon
queenslandica]
Length = 730
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 47/90 (52%), Gaps = 6/90 (6%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
+VK++ G G G+ DG A F+ P+ + + +YVAD NH+IRKI ++ V T+
Sbjct: 259 LVKQVYGSGSPGFKDGRGKEAEFNCPQGLVI-CEECVYVADTENHLIRKISLSDDFVLTV 317
Query: 139 AGGGSK---KEGRADGPAQNASFSNDFELT 165
AG G + KEG G Q S D +
Sbjct: 318 AGTGYQGNDKEGGKVGKEQEISSPWDLAIN 347
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 47/107 (43%), Gaps = 5/107 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT--NLGVTTIAGGGSKKEGR 148
G DG+ + P A D + Y+AD NH I+ + +T+AG GS G
Sbjct: 459 GDVDGKGRDVKLQHPMGVAWDDTNQLLYIADSFNHKIKVVNPKTKVCSTLAGTGSP--GL 516
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
DG + A FS L L ++D N IR ++LK + S+
Sbjct: 517 VDGSFEVAQFSEPAGLCMSQEGDTLYVADTNNHAIRILDLKEKKVSQ 563
>gi|194365180|ref|YP_002027790.1| NHL repeat containing protein [Stenotrophomonas maltophilia R551-3]
gi|194347984|gb|ACF51107.1| NHL repeat containing protein [Stenotrophomonas maltophilia R551-3]
Length = 470
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 4/118 (3%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S+ L + S+ R AG G DG A F +P + AV ++ +YVAD IR +
Sbjct: 301 SYHLGQRSLQWR-AGSGSIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358
Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G V T+ G G + G DGP AS + P L I+D GN +R + L
Sbjct: 359 RGDLVQTLVGQGPWRFGNEDGPRTQASLQFPQAMALSPDAPLLWIADTGNGRLRTLRL 416
Score = 40.4 bits (93), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 48/103 (46%), Gaps = 12/103 (11%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
+ R G G + DG A F +P++ ++ + ++YVAD NH +R+I L G+
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDSLYVADTGNHAVRRINLLTGIVDTLC 255
Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
G + +GP AQ S D H L I+D NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPVGLAIAD--NQL 289
>gi|116620267|ref|YP_822423.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223429|gb|ABJ82138.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 380
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 56/106 (52%), Gaps = 6/106 (5%)
Query: 85 LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
+AG G GYS DG P A+ PK A G++Y+AD +H IR++ + +TT+AG
Sbjct: 276 VAGTGESGYSGDGGPAKLAKLSGPKGIAWAPDGSLYLADTESHTIRRVDLKSGVITTVAG 335
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + +G DG A+ S + FV A+ I+D + +R +
Sbjct: 336 TGKRGDG-PDGDARMCQLSRPHGI-FVSAAGAVFIADSESHRVRAL 379
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 61/116 (52%), Gaps = 5/116 (4%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNL 133
+ +V+ AG G +GYS DG P A ++P D GN+++A+ NHV+R++
Sbjct: 98 KTNVISTAAGSGEKGYSGDGGPALGAALNEPYEVRFDRVGNMFIAEMQNHVVRRVDAKTR 157
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
++T+AG G+ G GPA A + F LLI D GN IR+++LK
Sbjct: 158 VISTVAGTGTAGFGGDGGPATAALLRQPHSIAFDGE-GRLLICDIGNHRIRRVDLK 212
>gi|407780651|ref|ZP_11127872.1| NHL repeat-containing protein [Oceanibaculum indicum P24]
gi|407208878|gb|EKE78785.1| NHL repeat-containing protein [Oceanibaculum indicum P24]
Length = 498
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 33/60 (55%), Gaps = 3/60 (5%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVTTIAGGGSK 144
G G G++ G ARF P+ A D G IYVAD NH IR+I VTT+AG G +
Sbjct: 217 GSGEAGFASGPADRARFRDPQGLAADT-GAIYVADTGNHAIRRIDRQTGQVTTLAGNGRR 275
>gi|222053203|ref|YP_002535565.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
gi|221562492|gb|ACM18464.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
Length = 155
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 55/108 (50%), Gaps = 10/108 (9%)
Query: 82 VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTI 138
V LAG GV G++DG +A F + D N+YVAD SN+ IRK I + VTT+
Sbjct: 54 VTTLAGTAGVYGWADGIGTAALFGRISDITTDGA-NLYVAD-SNNTIRKIVIASGAVTTL 111
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AGG G DG +ASF +T +L I+D N IR I
Sbjct: 112 AGG---STGSTDGVGVSASFGYPSGIT--TDGASLFITDTFNDTIRAI 154
>gi|17228508|ref|NP_485056.1| hypothetical protein alr1013 [Nostoc sp. PCC 7120]
gi|17130359|dbj|BAB72970.1| alr1013 [Nostoc sp. PCC 7120]
Length = 503
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 46/101 (45%), Gaps = 9/101 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ P S L V DS F + L G+G G +DG A+F P
Sbjct: 175 PGKVLATP--SGLFVADSGHHRIVVSDF----NGEILHLIGNGKSGLTDGNFQEAQFSAP 228
Query: 107 KSFAVDMKGNI-YVADKSNHVIRK--ITNLGVTTIAGGGSK 144
+ A DM+ I Y+AD NH +R+ I V TIAG G +
Sbjct: 229 QGMAFDMENQILYLADTENHALRRVDINQQTVETIAGTGEQ 269
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
+ + GDG+ G +G+ + RF +P + + +Y++D +NHVIR++ L VTT+
Sbjct: 431 QTVLGDGLAGLQNGQGKNTRFFEPSGLSA-IDSYLYISDTNNHVIRRVDLRTLEVTTMQF 489
Query: 141 GG 142
G
Sbjct: 490 NG 491
>gi|428225769|ref|YP_007109866.1| NHL repeat containing protein [Geitlerinema sp. PCC 7407]
gi|427985670|gb|AFY66814.1| NHL repeat containing protein [Geitlerinema sp. PCC 7407]
Length = 498
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 3/71 (4%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGV 135
E V+ G G GY DG +A+F P+ D G+ +YVAD NH+IR + + V
Sbjct: 197 EGKVRATVGSGTPGYQDGPWETAQFRAPQGLTYDPAGDRLYVADTGNHLIRCVHGRSRTV 256
Query: 136 TTIAGGGSKKE 146
TT+AG G +
Sbjct: 257 TTLAGTGQQNR 267
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 53/130 (40%), Gaps = 15/130 (11%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT---NLGV 135
E V L G G + DG A F +P A+D +YVAD + +R I+ V
Sbjct: 311 EDAVGTLLGTGAEACIDGTASEATFAQPSGLAIDGD-TLYVADSESSSVRAISLADPPSV 369
Query: 136 TTIAGGG-----SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
T+ G G +EGR D A + L P L I+D N I+++NL
Sbjct: 370 QTLCGSGGLFDFGDREGRGD----RARLQHCLGLAHGP--GNLWIADTYNHKIKRLNLAE 423
Query: 191 EDCSKSSQSG 200
C SG
Sbjct: 424 GHCVNIVGSG 433
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 1/78 (1%)
Query: 110 AVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
A+ + +++VAD +H + I L A GS G DGP + A F LT+ P
Sbjct: 174 AIAQENHLFVADTGHHRL-VIATLEGKVRATVGSGTPGYQDGPWETAQFRAPQGLTYDPA 232
Query: 170 ICALLISDHGNQLIRQIN 187
L ++D GN LIR ++
Sbjct: 233 GDRLYVADTGNHLIRCVH 250
>gi|296122042|ref|YP_003629820.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
gi|296014382|gb|ADG67621.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
Length = 425
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 63/143 (44%), Gaps = 10/143 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFD 104
PH + P + D+++ D + + ++ LAG G G+S DG P A+ +
Sbjct: 170 PHELRFTP-AGDIVIADMRTHTIRKID---GKTGMISTLAGTGTAGFSGDGGPAEKAQLN 225
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
P S +D G++ + D NH +RK+ T L ++T G G +K + P +
Sbjct: 226 MPHSIQIDPAGDLLICDTGNHRVRKVDMKTGL-ISTAYGTGERKPAKDGDPQVGTPLNGP 284
Query: 162 FELTFVPHICALLISDHGNQLIR 184
+ F P +L GN + R
Sbjct: 285 RSIDFTPEGDMILALREGNAVYR 307
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 60/118 (50%), Gaps = 8/118 (6%)
Query: 78 EESVVKRLAGD-GVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI---T 131
E+ +AG G +G++ DG P + A ++P G+I +AD H IRKI T
Sbjct: 139 EKQTATTIAGTPGKKGFAGDGGPATKALMNEPHELRFTPAGDIVIADMRTHTIRKIDGKT 198
Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ ++T+AG G+ GPA+ A + + P LLI D GN +R++++K
Sbjct: 199 GM-ISTLAGTGTAGFSGDGGPAEKAQLNMPHSIQIDP-AGDLLICDTGNHRVRKVDMK 254
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 64/156 (41%), Gaps = 22/156 (14%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK--------RLAGDGVQGYSDG 96
+N ID D+I+ +A Y FP E ++ L GDG+
Sbjct: 281 LNGPRSIDFTPEGDMILALREGNAVY--RFPKGEAKLIHIAGVGGKPSLVGDGIDARK-- 336
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPA 153
A PK AVD G+IY+AD H IR I T L + T+ G G +G DG A
Sbjct: 337 ----AILGAPKGAAVDANGDIYLADTETHTIRVIRAKTGL-IETVIGDGKAGDG-PDGEA 390
Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ N F+ LL+ D N +R + L+
Sbjct: 391 KTCRL-NRPHGVFITKEGLLLVGDSSNNKVRVLPLR 425
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 52/170 (30%), Positives = 78/170 (45%), Gaps = 18/170 (10%)
Query: 48 HSVIDRPGS-----SDLIVLDS-----SRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DG 96
SVI RPG S+L+ L + S AF L+ + E VK + G G G++ D
Sbjct: 44 RSVIMRPGQTSFGPSNLMRLAALCVLLSLFAFAELNVAHAGE--VKTIFGSGKDGFNGDQ 101
Query: 97 EPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRAD-GP 152
+P +P + G +Y + + H+IR+ + TTIAG KK D GP
Sbjct: 102 QPFLETHSGQPFGLVIGPDGALYFCEYTGHIIRRLDLEKQTATTIAGTPGKKGFAGDGGP 161
Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
A A + EL F P ++I+D IR+I+ K S + +G+A
Sbjct: 162 ATKALMNEPHELRFTP-AGDIVIADMRTHTIRKIDGKTGMISTLAGTGTA 210
>gi|148380724|ref|YP_001255265.1| NHL repeat protein [Clostridium botulinum A str. ATCC 3502]
gi|153933286|ref|YP_001385008.1| NHL repeat-containing protein [Clostridium botulinum A str. ATCC
19397]
gi|153935024|ref|YP_001388478.1| NHL repeat-containing protein [Clostridium botulinum A str. Hall]
gi|148290208|emb|CAL84327.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
3502]
gi|152929330|gb|ABS34830.1| NHL repeat protein [Clostridium botulinum A str. ATCC 19397]
gi|152930938|gb|ABS36437.1| NHL repeat protein [Clostridium botulinum A str. Hall]
Length = 467
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 6/109 (5%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGG 141
++ G G G DG+ A+F P+ ++ IYVAD NH IR+ + V TIAG
Sbjct: 211 KIIGSGEYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHSIRECDLEKEIVKTIAGN 269
Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G K+ A G A N S ++ ++L + + I+ GN I + N++
Sbjct: 270 GQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMR 316
>gi|387819032|ref|YP_005679379.1| hypothetical protein H04402_02846 [Clostridium botulinum H04402
065]
gi|322807076|emb|CBZ04650.1| gll0387 protein [Clostridium botulinum H04402 065]
Length = 467
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 6/109 (5%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGG 141
++ G G G DG+ A+F P+ ++ IYVAD NH IR+ + V TIAG
Sbjct: 211 KIIGSGEYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHSIRECDLEKEIVKTIAGN 269
Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G K+ A G A N S ++ ++L + + I+ GN I + N++
Sbjct: 270 GQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMR 316
>gi|453054414|gb|EMF01867.1| redoxin domain-containing protein [Streptomyces mobaraensis NBRC
13819 = DSM 40847]
Length = 608
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 64/124 (51%), Gaps = 11/124 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG +V D++R L+ E+VV+R+ G G +G+ D ARF +P
Sbjct: 183 PGKALALPGGG-YLVSDTTRHQLVELA--ADGETVVRRI-GTGERGFGDS---PARFSEP 235
Query: 107 KSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSK-KEGRA-DGPAQNASFSNDF 162
+ A+ G + VAD NH +R V T+AG G + +G A +GPA+ + S+ +
Sbjct: 236 QGLALLPDGRVAVADTVNHALRVYDPETHEVGTLAGTGRQWWQGAATEGPAREVALSSPW 295
Query: 163 ELTF 166
+L +
Sbjct: 296 DLAW 299
>gi|392945712|ref|ZP_10311354.1| protein kinase family protein,NHL repeat protein [Frankia sp. QA3]
gi|392289006|gb|EIV95030.1| protein kinase family protein,NHL repeat protein [Frankia sp. QA3]
Length = 770
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 48/96 (50%), Gaps = 2/96 (2%)
Query: 94 SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG-RADG 151
DG P +A +P A+ G++YVAD N +R+I G T+ G G R DG
Sbjct: 604 GDGGPAINATLYQPSGLALGDDGSLYVADTGNETVRRIDPSGRITLVAGRPGTYGHRGDG 663
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
A A+ +D + +L I+D GN +IR+++
Sbjct: 664 KAATAALLDDPRGLALGPDGSLYIADSGNDVIRRVD 699
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 11/114 (9%)
Query: 81 VVKRLAGDGVQGYSDG------EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
V+ +AG G YSDG + A D P + AVD G +YVA+ + +R+I
Sbjct: 533 VITTVAGAGPD-YSDGFTADVGDATKAHLDDPSAVAVDDDGVLYVAE--GYRVRRIAQGR 589
Query: 135 VTTIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+TT+AG ++ D GPA NA+ L +L ++D GN+ +R+I+
Sbjct: 590 ITTVAGKSTESGSAGDGGPAINATLYQPSGLALG-DDGSLYVADTGNETVRRID 642
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 46/90 (51%), Gaps = 3/90 (3%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGP-AQNAS 157
+A D P+ A+ G++Y+AD N VIR++ G +TT+AG GS DG A
Sbjct: 668 AALLDDPRGLALGPDGSLYIADSGNDVIRRVDRHGIITTVAGTGSYSADDRDGALATQTR 727
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQIN 187
SN + P A+ I+ G+ +R++
Sbjct: 728 LSNPAGVLVDP-TGAIYIACSGDGTVRRVG 756
>gi|397781307|ref|YP_006545780.1| Tripartite motif-containing protein 71 [Methanoculleus bourgensis
MS2]
gi|396939809|emb|CCJ37064.1| Tripartite motif-containing protein 71 AltName: Full=Lin-41 homolog
[Methanoculleus bourgensis MS2]
Length = 1996
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 34/56 (60%), Gaps = 3/56 (5%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA--DGP 152
G+ +FDKP AVD G+IYVAD +NH I+K + G T + GSK G +GP
Sbjct: 1394 GNGQFDKPWGIAVDAAGDIYVADYNNHRIQKFDSAG-TFVTAWGSKGSGSGQFNGP 1448
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 34/56 (60%), Gaps = 3/56 (5%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA--DGP 152
G+ +FDKP AVD G+IYVAD +NH I+K + G T I GS+ G +GP
Sbjct: 1676 GNGQFDKPWGIAVDTAGDIYVADYNNHRIQKFDSAG-TFITAWGSEGSGSGQFNGP 1730
Score = 41.6 bits (96), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 45/106 (42%), Gaps = 11/106 (10%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
+ +FD+P A D+ GN+YV+D NH I+K + G T G G+ D P A
Sbjct: 1348 NGQFDRPTGIATDISGNVYVSDYYNHRIQKFDSTGAFLTTWGAYGTGNGQFDKPWGIA-- 1405
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
V + ++D+ N I++ + + GS G
Sbjct: 1406 --------VDAAGDIYVADYNNHRIQKFDSAGTFVTAWGSKGSGSG 1443
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 11/107 (10%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNAS 157
G+ +FD+ A D+ GN+YV+D NH I+K + G T G G+ D P A
Sbjct: 1629 GNGQFDRLTGIATDISGNVYVSDYYNHRIQKFDSTGAFLTTWGAYGTGNGQFDKPWGIA- 1687
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
V + ++D+ N I++ + + GS G
Sbjct: 1688 ---------VDTAGDIYVADYNNHRIQKFDSAGTFITAWGSEGSGSG 1725
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 18/32 (56%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
F P A D GN+YVAD SNH IR + G
Sbjct: 1304 FSYPTGIATDATGNVYVADYSNHCIRVFNSTG 1335
>gi|302786076|ref|XP_002974809.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
gi|300157704|gb|EFJ24329.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
Length = 1049
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 47/103 (45%), Gaps = 5/103 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRA 149
G DG A F P G +YVAD NH I+ + + V T+AG G G
Sbjct: 791 GDKDGPASQALFQHPLGILSSANGAVYVADTYNHKIKLMDLASKTVRTVAGTGVA--GYE 848
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
DG A FS L P+ +L ++D N +IR + +P+D
Sbjct: 849 DGKGVKAQFSEPAGLALGPN-GSLFVADTNNNVIRLLKPRPDD 890
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 32/48 (66%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
V+ +AG GV GY DG+ A+F +P A+ G+++VAD +N+VIR
Sbjct: 835 TVRTVAGTGVAGYEDGKGVKAQFSEPAGLALGPNGSLFVADTNNNVIR 882
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGS 143
G G +G +DG A F++P+ A D + NI YVAD NH +R++ V T+AG G
Sbjct: 607 GSGAEGLTDGSFSLASFNRPQGVAYDTEKNILYVADTENHALREVDFVKETVRTLAGTGE 666
Query: 144 KK-EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ + R Q+ ++ ++L P L I+ G I + N
Sbjct: 667 RGFDYRGGKAGQSQPLNSPWDLCLHPD-GMLYIAMAGEHQIWEFN 710
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 58/135 (42%), Gaps = 7/135 (5%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG--YSDGEPGSAR-FDKPKSFAVDMKG 115
L V D+ A + F + V+ LAG G +G Y G+ G ++ + P + G
Sbjct: 638 LYVADTENHALREVDF---VKETVRTLAGTGERGFDYRGGKAGQSQPLNSPWDLCLHPDG 694
Query: 116 NIYVADKSNHVIRKITNL-GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
+Y+A H I + GV + G + N+S++ L+ P + L
Sbjct: 695 MLYIAMAGEHQIWEFNRSNGVAKVFSGDGYERNLNGSKGSNSSYAQPSGLSLSPDMDHLY 754
Query: 175 ISDHGNQLIRQINLK 189
I+D + +R +NLK
Sbjct: 755 IADSESSSVRSVNLK 769
>gi|312200910|ref|YP_004020971.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311232246|gb|ADP85101.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 804
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 45/99 (45%), Gaps = 4/99 (4%)
Query: 94 SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA--D 150
DG P + A P AV G+IYVAD IR+IT G G S K+G
Sbjct: 637 GDGGPATKASLPSPSGLAVTADGSIYVADDYLDTIREITPDGTIRTVAGTSGKDGETGDG 696
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
GPA A S L P +L I+D N IR+I+ K
Sbjct: 697 GPAARALLSGPTGLALGPD-GSLYITDTHNAKIRRIDPK 734
>gi|290991612|ref|XP_002678429.1| predicted protein [Naegleria gruberi]
gi|284092041|gb|EFC45685.1| predicted protein [Naegleria gruberi]
Length = 992
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 69/142 (48%), Gaps = 8/142 (5%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE--PGSARFDKPKSFAVDMKG 115
DL++ DSS + + L+ +V+ R+AG G + Y+ +A +P + G
Sbjct: 547 DLLIADSSN---HVIRKVLTNGTVI-RVAGTGTRAYNGDNILAVNAHLSEPSGIHILSNG 602
Query: 116 NIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
I +DK N+ +RKI TN + TIAG G+ + PA + F L P ++
Sbjct: 603 EILFSDKYNYRVRKILTNGTIITIAGIGTYGYNGENLPALSTKFFGVTGLALSPVDGSIY 662
Query: 175 ISDHGNQLIRQINLKPEDCSKS 196
++D N IR+I P D S+S
Sbjct: 663 LADTSNHRIRKIT-DPCDVSRS 683
Score = 43.5 bits (101), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 86 AGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGG 142
AG G+ GY G+ + + P V G++ +AD SNHVIRK+ TN V +AG G
Sbjct: 515 AGTGIYGYDPGDVLAVNTKLFFPNGLDVYPNGDLLIADSSNHVIRKVLTNGTVIRVAGTG 574
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICA---LLISDHGNQLIRQI 186
++ + A NA S + HI + +L SD N +R+I
Sbjct: 575 TRAYNGDNILAVNAHLSEPSGI----HILSNGEILFSDKYNYRVRKI 617
>gi|289707042|ref|ZP_06503373.1| antioxidant, AhpC/TSA family [Micrococcus luteus SK58]
gi|289556228|gb|EFD49588.1| antioxidant, AhpC/TSA family [Micrococcus luteus SK58]
Length = 703
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 51/114 (44%), Gaps = 6/114 (5%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-----LGVTTIAG 140
AG ++G DG+ G A F +P +V G ++VAD +R + V T G
Sbjct: 399 AGTALEGLLDGDAGRAWFAQPSGLSVGADGALWVADSETSAVRWVRTGEDGRREVGTAVG 458
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
G G DG A A + +T +P ++LI+D N IR+ ED +
Sbjct: 459 AGLFDFGHVDGEADRARLQHPLGVTALPD-GSVLIADTYNGAIRRYAPAGEDAA 511
>gi|290993799|ref|XP_002679520.1| predicted protein [Naegleria gruberi]
gi|284093137|gb|EFC46776.1| predicted protein [Naegleria gruberi]
Length = 1414
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 65/139 (46%), Gaps = 21/139 (15%)
Query: 64 SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP------------GSARFDKPKSFAV 111
+SR Y + L V++++A DG+ G P A+ + P+ AV
Sbjct: 62 NSRGDMYVVDAQLQ---VIRKIAKDGITTTIAGFPQDAGYNGDNILANKAKLNYPRGVAV 118
Query: 112 DMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGP-AQNASFSNDFELTFVPH 169
+ I+++D N+ IRKI+N G ++T+AG G + G D A N F N L +
Sbjct: 119 NENDEIFISDSGNYRIRKISNSGIISTVAGTG--ENGFMDHVLAINGKFGNPSHLLYTNS 176
Query: 170 ICALLISDHGNQLIRQINL 188
L I+D N IR+++
Sbjct: 177 --TLYINDQSNNKIRKLDF 193
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 43/70 (61%), Gaps = 2/70 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S+ ++ +AG G GY D + ++ + PK V G +++AD +N+ +RKI+ G
Sbjct: 850 VSKSGIISTIAGLGAAGYIDNVLATESQLNAPKGVVVAPSGEVFIADSNNNKVRKISTSG 909
Query: 135 -VTTIAGGGS 143
++TIAGGGS
Sbjct: 910 IISTIAGGGS 919
Score = 44.3 bits (103), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 47/107 (43%), Gaps = 3/107 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
+ L G G Q Y DG P P + A + +G++YV D VIRKI G+TT
Sbjct: 32 ISLLVGGGNQ-YGDGLPADKINLYNPTAIAFNSRGDMYVVDAQLQVIRKIAKDGITTTIA 90
Query: 141 GGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + G D N + N V + ISD GN IR+I
Sbjct: 91 GFPQDAGYNGDNILANKAKLNYPRGVAVNENDEIFISDSGNYRIRKI 137
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 3/88 (3%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
+A P +++ +G +Y+AD SNH IRK++ G ++TIAG G+ G D S
Sbjct: 820 TAHLWNPIGISLNSEGEVYIADLSNHRIRKVSKSGIISTIAGLGA--AGYIDNVLATESQ 877
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
N + V + I+D N +R+I
Sbjct: 878 LNAPKGVVVAPSGEVFIADSNNNKVRKI 905
>gi|227540165|ref|ZP_03970214.1| conserved protein of hypothetical function [Sphingobacterium
spiritivorum ATCC 33300]
gi|227240026|gb|EEI90041.1| conserved protein of hypothetical function [Sphingobacterium
spiritivorum ATCC 33300]
Length = 658
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 62/137 (45%), Gaps = 15/137 (10%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS---NHVIRKIT-----NLGVT 136
L G G GY DG A+ + VD KG++Y D + + IRKIT N V+
Sbjct: 381 LIGSGGAGYVDGSLADAKTNGAWGVVVDNKGDLYFTDNNATGGNRIRKITFDASGNGTVS 440
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC--- 193
TIAG S G ADG ASF+ L + L +S+ IR+I+L
Sbjct: 441 TIAGQAS--TGNADGIGAAASFNRPSGLAIDKNNNILYVSEWIGNRIRRIDLSTNRVELL 498
Query: 194 --SKSSQSGSALGAVSV 208
S ++QSG G SV
Sbjct: 499 AGSATAQSGDVEGIGSV 515
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 50 VIDRPGSSDLIVLDSSRSA---FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
V+D G DL D++ + ++F S V +AG G +DG +A F++P
Sbjct: 406 VVDNKG--DLYFTDNNATGGNRIRKITFDASGNGTVSTIAGQASTGNADGIGAAASFNRP 463
Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFS--- 159
A+D NI YV++ + IR+I TN V +AG + + G +G A F+
Sbjct: 464 SGLAIDKNNNILYVSEWIGNRIRRIDLSTNR-VELLAGSATAQSGDVEGIGSVARFNQLG 522
Query: 160 NDFELTFVPHICALLISDHGNQL 182
N EL+ P L +S + ++
Sbjct: 523 NALELS--PDGKKLFVSGNSSRF 543
>gi|326436774|gb|EGD82344.1| NHL repeat protein [Salpingoeca sp. ATCC 50818]
Length = 1484
Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats.
Identities = 38/107 (35%), Positives = 51/107 (47%), Gaps = 18/107 (16%)
Query: 92 GYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAGGGSKKEG 147
GYS DG AR ++P+ A+ KG +Y+AD NH IR + +TT+AG GS
Sbjct: 1326 GYSGDDGLSIGARLNRPRGIALS-KGVLYIADSGNHRIRAVDMRTQVITTVAGTGSAGFS 1384
Query: 148 RADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINL 188
G NA+ VPH A L I+D GN +R N+
Sbjct: 1385 GDGGLPTNAALR-------VPHGVAVHSSGTLAIADSGNHRVRVFNI 1424
Score = 43.5 bits (101), Expect = 0.090, Method: Composition-based stats.
Identities = 34/114 (29%), Positives = 51/114 (44%), Gaps = 10/114 (8%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---- 134
V+ +AG G G+S G P +A P AV G + +AD NH +R + N+G
Sbjct: 1371 VITTVAGTGSAGFSGDGGLPTNAALRVPHGVAVHSSGTLAIADSGNHRVR-VFNIGSSTA 1429
Query: 135 --VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TI G G D A +S + +TF ++ +D N IRQ+
Sbjct: 1430 GVIETITGDGEHGYN-GDNAATASSLNFPTSVTFSARTGNVVFADTRNNRIRQV 1482
Score = 42.7 bits (99), Expect = 0.17, Method: Composition-based stats.
Identities = 29/107 (27%), Positives = 50/107 (46%), Gaps = 5/107 (4%)
Query: 85 LAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
+AG G G S G SA+ ++P+ D++GN+Y+ ++ NH + + I G G
Sbjct: 1204 VAGTGEMGASGVGGAATSAQLNRPRCAIADVEGNVYITEEGNHRVSIVDVASKKLIVGAG 1263
Query: 143 SKKEGR--ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ G G NA+ + L + +LL SD N ++ +N
Sbjct: 1264 TGNPGHRGMGGTPGNAAIHSPVGLAWADD-GSLLFSDEENHVVYMVN 1309
Score = 37.7 bits (86), Expect = 5.1, Method: Composition-based stats.
Identities = 30/122 (24%), Positives = 56/122 (45%), Gaps = 7/122 (5%)
Query: 86 AGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
AG G G+ G PG+A P A G++ +D+ NHV+ + + ++ G
Sbjct: 1262 AGTGNPGHRGMGGTPGNAAIHSPVGLAWADDGSLLFSDEENHVVYMVNRRSLIISVVAGT 1321
Query: 143 SKKEGRA--DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
+ G + DG + A + + + L I+D GN IR ++++ + + + +G
Sbjct: 1322 PRFAGYSGDDGLSIGARLNRPRGIALSKGV--LYIADSGNHRIRAVDMRTQVITTVAGTG 1379
Query: 201 SA 202
SA
Sbjct: 1380 SA 1381
>gi|242017281|ref|XP_002429120.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212513984|gb|EEB16382.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 791
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 35/68 (51%), Gaps = 2/68 (2%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVT 136
E VVK G QG+ DG A FD P A + +Y+AD +NH IR + + VT
Sbjct: 237 EGVVKYTIGGKEQGFKDGNFQEALFDTPHGLAFFNESILYIADTNNHAIRLVDLNKMIVT 296
Query: 137 TIAGGGSK 144
T+ G G +
Sbjct: 297 TVVGNGER 304
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 61/124 (49%), Gaps = 11/124 (8%)
Query: 76 LSEESVVKRLAGDGVQGYSDGE-PGSARFDKPK--SFAVDMKGNIYVADKSNHVIRKI-- 130
+ +E V +AG+G + + + P A F +P +++ D+K +++AD + IR++
Sbjct: 487 MYKEKVCLAIAGNGKEENKNNDLPNKAGFAQPCGITYSEDLKV-LFIADTESSSIRRLSL 545
Query: 131 TNLGVTTIAGGGSKKE-----GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
T VT +AGG E G DG N + LT+ +L + D N I+Q
Sbjct: 546 TKGSVTHLAGGSRFPEDLFSFGDVDGKGYNVRLQHPLGLTWNKKRKSLFVCDSYNHKIKQ 605
Query: 186 INLK 189
IN++
Sbjct: 606 INVE 609
>gi|296128041|ref|YP_003635291.1| alkyl hydroperoxide reductase [Cellulomonas flavigena DSM 20109]
gi|296019856|gb|ADG73092.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Cellulomonas flavigena DSM 20109]
Length = 639
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 51/106 (48%), Gaps = 3/106 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
V LAG +G DG P A F +P +VD G +++AD +R + + V ++
Sbjct: 353 VTHLAGTMNEGLVDGTPQEAWFAQPSGLSVDAAGRLWLADAETSALRWVDPADASVHSVV 412
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
G G G DG A A F + + +P ++L++D N +R+
Sbjct: 413 GTGLFDFGHRDGAADQALFQHPLGVAALPD-GSVLVADTYNGALRR 457
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 56/107 (52%), Gaps = 13/107 (12%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P ++ PG + L+ + + +TL+ ++ + R G G +G DG P ARF +P
Sbjct: 181 PAKAVELPGGTFLV----ADAGHHTLTEVAADGETLVRRIGSGERGLVDGGPDDARFSEP 236
Query: 107 KSFAV---DMKG----NIYVADKSNHVIR--KITNLGVTTIAGGGSK 144
+ +++G ++ VAD NH +R ++++ VTT+AG G +
Sbjct: 237 NGLGLVPDELRGRVGYDVLVADTVNHALRGVRLSDGHVTTLAGTGEQ 283
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 49/109 (44%), Gaps = 10/109 (9%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY-VADKSNHVIRKITNLGVTTIAGGGSKK 145
GDG +P + RF + AV++ G + VAD +H + ++ G T + GS +
Sbjct: 164 GDGPYVAPTPQPTTLRF---PAKAVELPGGTFLVADAGHHTLTEVAADGETLVRRIGSGE 220
Query: 146 EGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINL 188
G DG +A FS L VP +L++D N +R + L
Sbjct: 221 RGLVDGGPDDARFSEPNGLGLVPDELRGRVGYDVLVADTVNHALRGVRL 269
>gi|162448821|ref|YP_001611188.1| NHL repeat-containing protein [Sorangium cellulosum So ce56]
gi|161159403|emb|CAN90708.1| hypothetical protein with NHL repeat [Sorangium cellulosum So ce56]
Length = 1889
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 36/62 (58%), Gaps = 4/62 (6%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRA 149
G DG +ARF +P D +GN+Y+AD NH +R+ I VTT+AG G+ G A
Sbjct: 949 GNVDGVGTTARFAEPSGIVADGRGNLYIADAFNHSLRRFEIATSMVTTVAGNGTA--GSA 1006
Query: 150 DG 151
DG
Sbjct: 1007 DG 1008
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 54/103 (52%), Gaps = 9/103 (8%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKK 145
G + ++G ARF +P A D +YVAD++ +R+I T+ VTTIA GS
Sbjct: 795 GARALTNGSYAQARFIQPGDMAYDASTRMLYVADRT--AVRRIDLTSGAVTTIA--GSTV 850
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G DG A F+ +T H AL I+D G++ IR++ L
Sbjct: 851 SGTTDGVGTAARFNTISGITL--HDGALYIADRGSRTIRKLTL 891
Score = 41.2 bits (95), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 38/65 (58%), Gaps = 3/65 (4%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD--KSNHVIRKITNLGVT- 136
S+V +AG+G G +DG +AR P++ A+D G++++ + K V R LG T
Sbjct: 992 SMVTTVAGNGTAGSADGVGTAARLTSPQALAIDGSGSVFIGEGNKPGRVRRFDPALGATV 1051
Query: 137 TIAGG 141
T+AGG
Sbjct: 1052 TVAGG 1056
>gi|256393018|ref|YP_003114582.1| NHL repeat containing protein [Catenulispora acidiphila DSM 44928]
gi|256359244|gb|ACU72741.1| NHL repeat containing protein [Catenulispora acidiphila DSM 44928]
Length = 1046
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 49/89 (55%), Gaps = 2/89 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
SA+ + P+ AVD G+++VAD ++ IR++T G ++T+ + G + G A AS
Sbjct: 940 SAQLNDPQGVAVDGAGDLFVADTFDNAIREVTPDGTISTVVNSSAAPGGESSGAAPTASH 999
Query: 159 SND-FELTFVPHICALLISDHGNQLIRQI 186
N + +T P L I+D N +I Q+
Sbjct: 1000 LNTPYAVTVDPSTDLLYIADTRNSVIAQV 1028
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 38/57 (66%), Gaps = 2/57 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
L+ ++ RL G G GYS DG P A+ ++P A+D +G++Y+AD +N+VIR++
Sbjct: 849 LTPRGLLLRLIGTGRAGYSGDGRPSPLAQLNQPIGLALDAQGDLYIADSANNVIRRV 905
>gi|111219554|ref|YP_710348.1| protein serine/threonine kinase [Frankia alni ACN14a]
gi|111147086|emb|CAJ58733.1| putative Protein serine/threonine kinase [Frankia alni ACN14a]
Length = 765
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 57/114 (50%), Gaps = 4/114 (3%)
Query: 78 EESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
E+ + +AG + S G+ G +A +P A+ G++YVAD+ +R+I G
Sbjct: 583 EKGRITTVAGKATEYGSAGDGGPAVNATLYQPSGLALGDDGSLYVADRGEDTVRRIDRSG 642
Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+T +AG R DG A A+ +D + +L I+D GN +IR+++
Sbjct: 643 RITLVAGRPGTYGNRGDGRAATAALLDDPTGLALGPDGSLYIADAGNDVIRRVD 696
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 56/114 (49%), Gaps = 11/114 (9%)
Query: 81 VVKRLAGDGVQGYSDG------EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
V+ +AG G Y+DG + A D P + AVD G +YVA+ + +R++
Sbjct: 530 VITTVAGAGPD-YADGFTADVGDATKAHLDDPSAVAVDDDGVLYVAE--GYRVRRVEKGR 586
Query: 135 VTTIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+TT+AG ++ D GPA NA+ L +L ++D G +R+I+
Sbjct: 587 ITTVAGKATEYGSAGDGGPAVNATLYQPSGLALG-DDGSLYVADRGEDTVRRID 639
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 30/53 (56%), Gaps = 1/53 (1%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADG 151
+A D P A+ G++Y+AD N VIR++ G +TT+AG S G DG
Sbjct: 665 AALLDDPTGLALGPDGSLYIADAGNDVIRRVDGHGIITTVAGTASYSAGDHDG 717
>gi|288801999|ref|ZP_06407440.1| conserved hypothetical protein [Prevotella melaninogenica D18]
gi|288335434|gb|EFC73868.1| conserved hypothetical protein [Prevotella melaninogenica D18]
Length = 486
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 4/103 (3%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT---TIAGGGSKKE 146
+G+ DG +A+F P+ G +Y+AD NH IR I T LG T G +
Sbjct: 384 KGWEDGALKNAKFCYPRQMTFTKDGKLYIADSGNHCIRMIDTTLGKNARVTTPIGVPQSA 443
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G DG + A F+ + + ++D NQ+IR++++K
Sbjct: 444 GFQDGGVELAKFNWPTGVAVSADGSTVYVADSKNQVIRELSIK 486
>gi|281415417|ref|ZP_06247159.1| thiol-disulfide isomerase-like thioredoxin [Micrococcus luteus NCTC
2665]
Length = 699
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 51/114 (44%), Gaps = 6/114 (5%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-----LGVTTIAG 140
AG ++G DG+ G A F +P +V G ++VAD +R + V T G
Sbjct: 395 AGTALEGLLDGDAGRAWFAQPSGLSVGADGALWVADSETSAVRWVRTGEDGRREVGTAVG 454
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
G G DG A A + +T +P ++LI+D N IR+ ED +
Sbjct: 455 AGLFDFGHVDGEADRARLQHALGVTALPD-GSVLIADTYNGAIRRYAPAGEDAA 507
>gi|239918387|ref|YP_002957945.1| thiol-disulfide isomerase-like thioredoxin [Micrococcus luteus NCTC
2665]
gi|239839594|gb|ACS31391.1| thiol-disulfide isomerase-like thioredoxin [Micrococcus luteus NCTC
2665]
Length = 699
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 51/114 (44%), Gaps = 6/114 (5%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-----LGVTTIAG 140
AG ++G DG+ G A F +P +V G ++VAD +R + V T G
Sbjct: 395 AGTALEGLLDGDAGRAWFAQPSGLSVGADGALWVADSETSAVRWVRTGEDGRREVGTAVG 454
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
G G DG A A + +T +P ++LI+D N IR+ ED +
Sbjct: 455 AGLFDFGHVDGEADRARLQHALGVTALPD-GSVLIADTYNGAIRRYAPAGEDAA 507
>gi|410896258|ref|XP_003961616.1| PREDICTED: NHL repeat-containing protein 2-like [Takifugu rubripes]
Length = 716
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 45/79 (56%), Gaps = 8/79 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSK---K 145
G DG+ A F+ P+ A+ KG+ +YVAD NH+IRKI + V+T+AG G + K
Sbjct: 262 GRKDGDLSEASFNSPQGVAI--KGDTVYVADTENHLIRKIDLSAGKVSTLAGTGEQGTDK 319
Query: 146 EGRADGPAQNASFSNDFEL 164
+G A GP Q S D L
Sbjct: 320 DGGAPGPEQPISSPWDVTL 338
>gi|290982235|ref|XP_002673836.1| protein kinase [Naegleria gruberi]
gi|284087422|gb|EFC41092.1| protein kinase [Naegleria gruberi]
Length = 1543
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 68/160 (42%), Gaps = 26/160 (16%)
Query: 45 INPH---SVIDRP-----GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
IN H S +RP G + V DS ++S VK AG G+ G++ D
Sbjct: 588 INDHALNSTFNRPTGLSFGDGHVYVADSGNHMIRSISITTKS---VKTFAGTGIPGFNGD 644
Query: 96 GEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK-ITNLGVTTIAGGGSKKEGRADGPA 153
G+ + + P + I VAD NH IR I N V TIAG G +GPA
Sbjct: 645 GKLLLETQLNSPMDVVFTVTMGILVADTFNHRIRLLIPNFIVKTIAGSSVDGSGGDEGPA 704
Query: 154 QNASFSNDFELTFVPH-----ICALLISDHGNQLIRQINL 188
N+ F PH + A+ I+D N IR I++
Sbjct: 705 INSEL-------FYPHSISFGVTAIYIADTFNNKIRAISI 737
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 60/139 (43%), Gaps = 7/139 (5%)
Query: 48 HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK 107
HS +D G +++ D+ R+ + +P +V+ LAG G D P
Sbjct: 773 HSYVDEGGIERVLISDN-RNDRLRVYYP--GNNVIHSLAG----GIGDFGPSENAIGNCL 825
Query: 108 SFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
+V +IY+ D N++IR I N ++TIAG G + A AS + + +
Sbjct: 826 GISVAPNNDIYLTDVKNNLIRLIRNGIISTIAGNGGYGFNLDNIYATQASLAYPSGIVYS 885
Query: 168 PHICALLISDHGNQLIRQI 186
L+ SD N IR I
Sbjct: 886 ESTGNLIFSDTNNHRIRVI 904
>gi|223939922|ref|ZP_03631790.1| NHL repeat containing protein [bacterium Ellin514]
gi|223891413|gb|EEF57906.1| NHL repeat containing protein [bacterium Ellin514]
Length = 359
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 66/136 (48%), Gaps = 12/136 (8%)
Query: 60 IVLDSSRSAFY------TLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAV 111
+ LD + + F+ T+ ++ +V +AG+G G+S DG P + A+ ++P S
Sbjct: 113 VRLDQAENLFWVERLSHTVRKCDAKTGIVTTIAGNGTAGFSGDGGPATKAQMNEPHSIGF 172
Query: 112 DMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPA-QNASFSNDFELTFVP 168
D GN+Y+ D NH IRK+ + ++T G G +K DG A NA + L F P
Sbjct: 173 DKAGNLYICDVRNHRIRKVDMKSGMISTFCGTGERKP-TPDGTAIGNAPLNGPRALDFGP 231
Query: 169 HICALLISDHGNQLIR 184
+ GN + +
Sbjct: 232 DGNLWVALREGNAVYK 247
Score = 45.1 bits (105), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 70/151 (46%), Gaps = 11/151 (7%)
Query: 44 EINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSA- 101
E+N + I R L + D+ ++ + + +AG G G+S DG P +A
Sbjct: 50 ELNGPTGIVRGSDGSLYICDTENHRIRKVT----ADGNISTVAGTGEHGWSGDGGPATAA 105
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASF 158
+ ++P +D N++ ++ +H +RK T + VTTIAG G+ GPA A
Sbjct: 106 KLNEPYEVRLDQAENLFWVERLSHTVRKCDAKTGI-VTTIAGNGTAGFSGDGGPATKAQM 164
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ + F L I D N IR++++K
Sbjct: 165 NEPHSIGF-DKAGNLYICDVRNHRIRKVDMK 194
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 43/86 (50%), Gaps = 3/86 (3%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
+ AG+G +G+S DG A + P G++Y+ D NH IRK+T G ++T+
Sbjct: 28 ISNFAGNGKKGFSGDGAAAKGAELNGPTGIVRGSDGSLYICDTENHRIRKVTADGNISTV 87
Query: 139 AGGGSKKEGRADGPAQNASFSNDFEL 164
AG G GPA A + +E+
Sbjct: 88 AGTGEHGWSGDGGPATAAKLNEPYEV 113
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 43/88 (48%), Gaps = 5/88 (5%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SAR 102
+N +D +L V +A Y + E+ + +AG G G++ +G P +A
Sbjct: 221 LNGPRALDFGPDGNLWVALREGNAVYKIDM---EKGRIFHVAGTGKNGFTGNGGPAKAAT 277
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
F PK +V GN++VAD NH IR I
Sbjct: 278 FKGPKGLSVASNGNVFVADTENHAIRMI 305
>gi|168180827|ref|ZP_02615491.1| NHL repeat protein [Clostridium botulinum NCTC 2916]
gi|421838034|ref|ZP_16272034.1| NHL repeat-containing protein [Clostridium botulinum CFSAN001627]
gi|182668214|gb|EDT80193.1| NHL repeat protein [Clostridium botulinum NCTC 2916]
gi|409739668|gb|EKN40278.1| NHL repeat-containing protein [Clostridium botulinum CFSAN001627]
Length = 467
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 6/109 (5%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGG 141
++ G G G DG+ A+F P+ ++ IYVAD NH IR+ + V TIAG
Sbjct: 211 KIIGSGEYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHSIRECDLEKEIVKTIAGN 269
Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G K+ A G A N S ++ ++L + + I+ GN I + N++
Sbjct: 270 GQKEYSPMAKGDALNVSLNSPWDLEILKG--CIYIAMAGNHQIWKYNMR 316
>gi|436838118|ref|YP_007323334.1| NHL repeat-containing protein [Fibrella aestuarina BUZ 2]
gi|384069531|emb|CCH02741.1| NHL repeat-containing protein [Fibrella aestuarina BUZ 2]
Length = 375
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 54/127 (42%), Gaps = 18/127 (14%)
Query: 74 FPLSEESVVKRLAG----DGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
F ++ V RL+G DG GY DG A+F P S +D G +Y+ D NH I
Sbjct: 141 FRINSAGQVTRLSGGPDPDGGYSFGYRDGPLARAKFSNPWSIIMDKAGMLYIGDGLNHCI 200
Query: 128 RKI---TNLGVTTIAGGGSK-----KEGRADGPAQNASFSNDFELTFVPHICALLISDHG 179
RK+ TN V+T AG + K GPA D L L I D G
Sbjct: 201 RKVDLSTNT-VSTFAGKPLENTAVCKYADGKGPAAQVCTVEDLTLD---QDGNLFIPDWG 256
Query: 180 NQLIRQI 186
N +R +
Sbjct: 257 NNSVRMV 263
>gi|290977893|ref|XP_002671671.1| predicted protein [Naegleria gruberi]
gi|284085242|gb|EFC38927.1| predicted protein [Naegleria gruberi]
Length = 3330
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 56/110 (50%), Gaps = 4/110 (3%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
++ LAG+G+ +S DG + A + P V G IY++D NH IR I G ++T
Sbjct: 2358 LISTLAGNGIANFSGDGLAATEASLNSPSGVFVSEFGKIYISDSGNHRIRAILPNGIIST 2417
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ G G +G A NAS +N + + + L+ISD N IR I+
Sbjct: 2418 VGGNGIPGFSGDNGLATNASLNNPYGIV-ETYSGDLIISDSDNNRIRLID 2466
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 37/144 (25%), Positives = 64/144 (44%), Gaps = 10/144 (6%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
NP+ +++ S DLI+ DS + + ++ LAG G +G+ DG A F+
Sbjct: 2440 NPYGIVE-TYSGDLIISDSDNNRIRLID----RYGIITTLAGTGEKGFQDGFFLDALFNN 2494
Query: 106 PKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
P +Y++D IR+ + + V TI G G K + P +N
Sbjct: 2495 PSQLFY-FHSRLYISDTYGQRIREANLESKTVKTIIGTGEKGYNGDNFPT-TTQLNNPLS 2552
Query: 164 LTFVPHICALLISDHGNQLIRQIN 187
+ F+ + + I+D N+ IR +N
Sbjct: 2553 M-FITYNNIIYIADTDNKKIRMLN 2575
Score = 37.4 bits (85), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 5/66 (7%)
Query: 74 FPLSEES-VVKRLAGD-GVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+ SEES V+R+AG GY+ DG+P + +P+S V G +Y+ D +NH+IR+
Sbjct: 2025 YSASEESGTVQRIAGSLSSSGYNGDGKPAIDSLLSQPQSVFVK-DGEVYICDTNNHLIRR 2083
Query: 130 ITNLGV 135
I G+
Sbjct: 2084 IDRNGI 2089
>gi|433458786|ref|ZP_20416675.1| NHL repeat containing protein [Arthrobacter crystallopoietes
BAB-32]
gi|432192788|gb|ELK49609.1| NHL repeat containing protein [Arthrobacter crystallopoietes
BAB-32]
Length = 680
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 57/121 (47%), Gaps = 11/121 (9%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--- 131
PL+ E V LAG G++G DGE +A F + A D G+I++AD +R +T
Sbjct: 378 PLTGELAV--LAGTGLEGLQDGEANAAWFAQSSGLAQDANGSIWIADSETSALRVLTFSD 435
Query: 132 -----NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ V T G G G DG A A + +T +P ++ ++D N +R+
Sbjct: 436 VNGSERVQVQTAIGTGLFDFGFRDGGAAEARLQHPLGVTVLPD-GSVAVADTYNGAVRRY 494
Query: 187 N 187
+
Sbjct: 495 D 495
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 9/75 (12%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV---DMKG----NIYVADKSNHVIRKIT- 131
S V G G +G+ DG+ +A+F++P+ A+ D++ ++ +AD NH +R +T
Sbjct: 219 STVLNTYGSGSKGWQDGDAEAAQFNEPQGLALLPEDVRAKAGYDVVIADSVNHRLRGLTL 278
Query: 132 -NLGVTTIAGGGSKK 145
V TIAG G ++
Sbjct: 279 ATGAVVTIAGNGVQR 293
>gi|375098226|ref|ZP_09744491.1| Peroxiredoxin [Saccharomonospora marina XMU15]
gi|374658959|gb|EHR53792.1| Peroxiredoxin [Saccharomonospora marina XMU15]
Length = 673
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 50/100 (50%), Gaps = 9/100 (9%)
Query: 97 EPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQN 155
+PGS RF P V +G + VAD + H + + + T + GS G ADG
Sbjct: 228 QPGSGLRF--PAKAVVTAEGRVLVADTAQHSVAEFASDAETLVRRFGSGVRGHADGAFDV 285
Query: 156 ASFSNDFELTFVPHICA------LLISDHGNQLIRQINLK 189
A+F+ LT +P A L+++D GN L+R ++L+
Sbjct: 286 AAFAEPSGLTLLPTEIAERVGYHLVVADTGNHLLRGVDLR 325
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 31/121 (25%), Positives = 61/121 (50%), Gaps = 11/121 (9%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSNH 125
F E++V+R G GV+G++DG A F +P A + ++ VAD NH
Sbjct: 259 EFASDAETLVRRF-GSGVRGHADGAFDVAAFAEPSGLTLLPTEIAERVGYHLVVADTGNH 317
Query: 126 VIRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
++R + VTT+AG G + ++G G A + + ++ +++ + ++++ GN
Sbjct: 318 LLRGVDLRTGAVTTVAGTGEQWRDGADAGTALDVALTSPWDVRWWEAAGGVVVAMAGNHT 377
Query: 183 I 183
+
Sbjct: 378 L 378
>gi|345000781|ref|YP_004803635.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces sp. SirexAA-E]
gi|344316407|gb|AEN11095.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces sp. SirexAA-E]
Length = 608
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
+V D++R L ++ V+ G G +G +DG PG ARF +P+ A+ G I
Sbjct: 193 FLVSDTTRHRLVELD---ADGETVRGHFGTGERGLTDGGPGEARFSEPQGLALLPDGRIA 249
Query: 119 VADKSNHVIRKI 130
VAD NH IR +
Sbjct: 250 VADTVNHAIRAL 261
>gi|194877446|ref|XP_001973882.1| GG16549 [Drosophila erecta]
gi|190657069|gb|EDV54282.1| GG16549 [Drosophila erecta]
Length = 717
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 55/122 (45%), Gaps = 19/122 (15%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
VV+ G G+ DG +ARF+ P+ A + + VAD NH IR+I TN V T+
Sbjct: 257 VVQHKIGGYQPGFVDGNSTAARFNNPQGIAFLNENTLIVADTKNHAIRQISLTNGMVETL 316
Query: 139 AGGGSKKEGRADG----------PAQNASF-SNDFELTF------VPHICALLISDHGNQ 181
AG G + R G P A F + D +++F +P +LIS G
Sbjct: 317 AGTGHQGNERIGGRLGPLQPLSSPWDVAIFRTRDMDMSFHLDERNIPEKTIVLISMAGTH 376
Query: 182 LI 183
I
Sbjct: 377 QI 378
>gi|290986946|ref|XP_002676184.1| NHL repeat-containing protein [Naegleria gruberi]
gi|284089785|gb|EFC43440.1| NHL repeat-containing protein [Naegleria gruberi]
Length = 1290
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 65/136 (47%), Gaps = 9/136 (6%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAV 111
P + +DSS T+ + +S +V AG G GY +D P SA + PK A
Sbjct: 438 PNVGEYYFVDSS-----TVKYLISSSGIVITKAGSGTTGYNADNIPATSAHLNFPKGIAR 492
Query: 112 DMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
G++++ D NH IRK+ N + T+AG G+ A +A+ +N + + +
Sbjct: 493 STNGDLFITDSGNHRIRKVFLNQTIITVAGTGTSAYNGDGLLATSANINNPYGIVLTSNN 552
Query: 171 CALLISDHGNQLIRQI 186
++ S++ +R+I
Sbjct: 553 -EVIFSEYEGHRVRKI 567
>gi|195456364|ref|XP_002075106.1| GK23381 [Drosophila willistoni]
gi|194171191|gb|EDW86092.1| GK23381 [Drosophila willistoni]
Length = 721
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 56/115 (48%), Gaps = 5/115 (4%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
+ VV+ G G+ DG+ ++RF+ P+ A + + VAD NH +R+I+ G V
Sbjct: 259 DGVVEHKIGGFESGFIDGDLTTSRFNNPQGIAFLDEDTLIVADTKNHALRQISLQGGIVE 318
Query: 137 TIAGGG---SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
T+AG G S++ G GP Q+ S D + P + D N L + I L
Sbjct: 319 TLAGTGHQCSERTGGRTGPVQSMSSPWDVAIFRTPDMDMSFHLDERNVLEKTIIL 373
>gi|406944252|gb|EKD76066.1| Serine/threonine protein kinase, partial [uncultured bacterium]
Length = 212
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 43/76 (56%), Gaps = 2/76 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV--DMKGNIYVADKSNHVIRKITNL 133
+ + ++ +AG G GY DG A F +P+ A+ D +++AD +N+VIRKI +
Sbjct: 100 IGTDGILTNVAGAGDYGYRDGSSDYALFAQPQDIAIYGDTASELFIADTNNNVIRKIKDG 159
Query: 134 GVTTIAGGGSKKEGRA 149
V+T+ G S +G A
Sbjct: 160 EVSTLLSGLSSPQGVA 175
>gi|290994118|ref|XP_002679679.1| predicted protein [Naegleria gruberi]
gi|284093297|gb|EFC46935.1| predicted protein [Naegleria gruberi]
Length = 1272
Score = 47.4 bits (111), Expect = 0.007, Method: Composition-based stats.
Identities = 43/114 (37%), Positives = 58/114 (50%), Gaps = 9/114 (7%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMK-GNIYVADKSNHVIRKI---TN 132
+++ +AG G DG+ SA F + + K GNIYV DKSN IRKI TN
Sbjct: 193 NLITTIAGIRAAGTFDGDGKSALQTSFFRMTDVTISKKNGNIYVTDKSNQRIRKISSSTN 252
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
L V+TIAG GS + A NAS N L + ++ S+ GN IR++
Sbjct: 253 L-VSTIAGNGSIGFVDSMSNAINASLFNPMYLVLTDND-EIIFSEFGNNRIRKL 304
>gi|116007528|ref|NP_001036460.1| CG12547, isoform A [Drosophila melanogaster]
gi|442622271|ref|NP_001260704.1| CG12547, isoform B [Drosophila melanogaster]
gi|17945781|gb|AAL48938.1| RE33981p [Drosophila melanogaster]
gi|30923523|gb|EAA46001.1| CG12547, isoform A [Drosophila melanogaster]
gi|220948326|gb|ACL86706.1| CG12547-PA [synthetic construct]
gi|440214080|gb|AGB93239.1| CG12547, isoform B [Drosophila melanogaster]
Length = 717
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 57/127 (44%), Gaps = 19/127 (14%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNL 133
L+ VV+ G G+ DG+ ARF+ P+ A + + VAD NH IR+I TN
Sbjct: 252 LTAGGVVQHRIGGHQPGFVDGDLTVARFNNPQGIAFLNENVLIVADTKNHAIRQISLTNA 311
Query: 134 GVTTIAGGGSKKEGRADG----------PAQNASF-SNDFELTF------VPHICALLIS 176
V T+AG G + R G P A F + D +++F +P +LIS
Sbjct: 312 MVETLAGTGIQGNDRIGGRLGPLQPLSSPWDVAIFRTRDMDMSFHLDERNIPEKTIILIS 371
Query: 177 DHGNQLI 183
G I
Sbjct: 372 MAGTHQI 378
>gi|284097436|ref|ZP_06385541.1| NHL repeat containing protein [Candidatus Poribacteria sp. WGA-A3]
gi|283831050|gb|EFC35055.1| NHL repeat containing protein [Candidatus Poribacteria sp. WGA-A3]
Length = 387
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 9/108 (8%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAG 140
+R AGDG G A + P + AV G +Y+AD NH IR++ ++TIAG
Sbjct: 113 QRFAGDG------GPAVEAALNFPSAVAVAEDGTVYIADTWNHRIRRVDPGTGAISTIAG 166
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G K +GPA A+ + L L I+D N +R+++L
Sbjct: 167 TGQAKCYGDNGPAVKAALNEPVALAL-DGPDRLYIADQSNNRVRKLDL 213
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 67/155 (43%), Gaps = 14/155 (9%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVD 112
G L + D S + L V+ + G G GY+ DG PG P AVD
Sbjct: 194 GPDRLYIADQSNNRVRKLDL---TSGVMTTVVGTGESGYNGDGAPGPETALAGPSGLAVD 250
Query: 113 MKGNIYVADKSNHVIRKITNL--GVTTIAGGGSKKEGRADGPAQNAS---FSNDFELTFV 167
+GN+Y+AD + IRK V T+AGG + P +N S S + +
Sbjct: 251 QEGNLYIADTFSGRIRKWDRHTGTVETVAGGTGAFQFT---PGENESSPNLSRPYAIALH 307
Query: 168 PHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
P L I+D N LIR +L+ + S + +G A
Sbjct: 308 PD-GRLFITDSDNHLIRVWDLQKREMSLLAGNGKA 341
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 36/58 (62%), Gaps = 11/58 (18%)
Query: 96 GEPG---------SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV-TTIAGGG 142
GEPG +A ++PK+ A+D GN+Y+AD NH++RK+ G+ TTIAG G
Sbjct: 1 GEPGWGGDAGPATAAMLNEPKTVALDAAGNLYIADSENHLVRKVDARTGIMTTIAGTG 58
>gi|116621305|ref|YP_823461.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116224467|gb|ABJ83176.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 1042
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 63/163 (38%), Gaps = 48/163 (29%)
Query: 71 TLSFPLSEESVVKRLAGD-------GVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADK 122
TLS L + +LAG GV DG P SA + P++ D G IY+AD
Sbjct: 8 TLSAALCLTTAHAQLAGSYVSRTIAGVFPLGDGGPATSAILETPQAVVADSSGTIYIADA 67
Query: 123 SNHVIRKI-------------------------------------TNLG-VTTIAGGGSK 144
N VIRK+ T+ G VTT+AG G+
Sbjct: 68 GNGVIRKVSRGIISTVAGYTGYISDLKLDSSGNLYLAGGNNVFKLTSAGKVTTVAGNGTS 127
Query: 145 KEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
D GPA NA FS + + L I D GN IR++
Sbjct: 128 GTYTGDGGPAINAGFSGAYAIAL-DSTGNLYICDSGNNAIRKV 169
Score = 44.3 bits (103), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 3/65 (4%)
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
F L+ V +AG+G G G+ G +A F + A+D GN+Y+ D N+ IRK+
Sbjct: 110 FKLTSAGKVTTVAGNGTSGTYTGDGGPAINAGFSGAYAIALDSTGNLYICDSGNNAIRKV 169
Query: 131 TNLGV 135
T G+
Sbjct: 170 TPDGI 174
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 56/114 (49%), Gaps = 6/114 (5%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ +++ AGDG +G++ DG + A F +D N+YV D +N +R I ++
Sbjct: 280 IAPNGIIQAFAGDGTEGFAGDGSSATLAHFSGVTGLGMDPNNNLYVVDGNNERVRMINSV 339
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + T+AG + DG A A+ + + T ++ +D N +R+I
Sbjct: 340 GSIATVAG---RAHFSGDGGAATAATLHRPQGTVTGTDGSIYFTDTANHRVRKI 390
Score = 40.0 bits (92), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 50/107 (46%), Gaps = 9/107 (8%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKR--LAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
+++L V+D + ++ S +V R +GDG G +A +P+
Sbjct: 320 NNNLYVVDGNNERVRMINSVGSIATVAGRAHFSGDG------GAATAATLHRPQGTVTGT 373
Query: 114 KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
G+IY D +NH +RKI G +TTIAG G GPA A+ S
Sbjct: 374 DGSIYFTDTANHRVRKIGTDGKITTIAGTGDLGFSGDGGPATQATMS 420
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 40/71 (56%), Gaps = 3/71 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + + +AG G G+S DG P + A P + A+D N+YV D+ +RKIT
Sbjct: 390 IGTDGKITTIAGTGDLGFSGDGGPATQATMSFPDALAIDSTNNLYVIDQKQLRVRKITPT 449
Query: 134 G-VTTIAGGGS 143
G ++T+AG G+
Sbjct: 450 GTISTVAGNGT 460
>gi|423300918|ref|ZP_17278942.1| hypothetical protein HMPREF1057_02083 [Bacteroides finegoldii
CL09T03C10]
gi|408472253|gb|EKJ90781.1| hypothetical protein HMPREF1057_02083 [Bacteroides finegoldii
CL09T03C10]
Length = 459
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 52/98 (53%), Gaps = 2/98 (2%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
G G+ +G+ AR +P +D +GNIYVA + H I KIT GV T G+ G
Sbjct: 361 GKTGFVEGQGNDARLQEPCQMDLDEEGNIYVAVRKKHRIAKITPDGVVTSY-TGTGTSGT 419
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
DGP A F++ + F P AL +SD+ N IR+I
Sbjct: 420 TDGPLDKAQFNHPEGVQFGPD-GALYVSDYWNHKIRKI 456
>gi|390960114|ref|YP_006423871.1| gluconolactonase [Terriglobus roseus DSM 18391]
gi|390415032|gb|AFL90536.1| gluconolactonase [Terriglobus roseus DSM 18391]
Length = 1836
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 5/98 (5%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRA-D 150
Y+ G+ ++ P A+D GN+YVAD N+VIR+I GV TT+AG + A D
Sbjct: 228 YAGGDVATS-LSSPAGLAIDGAGNLYVADSGNNVIRRIDTNGVMTTVAGQVANSSPYAGD 286
Query: 151 G-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G A A+ + + + P L I+D GN +R+++
Sbjct: 287 GLLATAATLNAPWSVALDPD-GNLFIADTGNNAVRRVD 323
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 67/147 (45%), Gaps = 18/147 (12%)
Query: 46 NPHSVIDRPGSSDLIVL----DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA 101
+P V+ S+D VL D S S + +S P GDG G +A
Sbjct: 345 DPSGVLYVADSADNTVLRVNADGSFSTYVGISTPRGGS-----FDGDG------GAATAA 393
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFS 159
+ P + A D GN+Y+AD +N+ +R++ L +TT+AG G + ASFS
Sbjct: 394 HLNGPAALAFDPAGNLYIADSANNRVRRVDALSKLITTVAGNGGTVMDDTVMNSDIASFS 453
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
+ L F+ + L + D + +RQI
Sbjct: 454 GPYAL-FLDNAANLYVGDLFHNRVRQI 479
>gi|290997412|ref|XP_002681275.1| predicted protein [Naegleria gruberi]
gi|284094899|gb|EFC48531.1| predicted protein [Naegleria gruberi]
Length = 887
Score = 47.4 bits (111), Expect = 0.007, Method: Composition-based stats.
Identities = 31/96 (32%), Positives = 51/96 (53%), Gaps = 4/96 (4%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI---AGGGSKKEGRADG 151
+G+P A S ++ G++++ D +NHVIR+ LG +TI AG G+ K DG
Sbjct: 645 NGKPLDATLGYINSLLINDNGDLFMVDTTNHVIRQFNLLGTSTIATTAGVGTSKGYSGDG 704
Query: 152 PAQNASFSNDFELTFVPHI-CALLISDHGNQLIRQI 186
+ +S + + + H AL+ D GN LIR++
Sbjct: 705 KSAKSSLLSLNLNSTLLHYNGALMFFDGGNNLIRKV 740
>gi|187778661|ref|ZP_02995134.1| hypothetical protein CLOSPO_02256 [Clostridium sporogenes ATCC
15579]
gi|187772286|gb|EDU36088.1| NHL repeat protein [Clostridium sporogenes ATCC 15579]
Length = 467
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 69/153 (45%), Gaps = 22/153 (14%)
Query: 42 QLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
++ IN + +I G + +++LD + T+ G G DG+
Sbjct: 181 KISINENGLIAFSEKGRNRIVILDQNLEKIKTI--------------GSSEYGLKDGDFK 226
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEG-RADGPAQNA 156
A+F P+ ++ IYVAD NH IR+ + V TIAG G K+ A G A N
Sbjct: 227 EAQFKAPEGLRF-IENKIYVADTENHSIRECDLEKEMVKTIAGNGQKEYSPMAKGDALNV 285
Query: 157 SFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
S ++ ++L + + I+ GN I + N+K
Sbjct: 286 SLNSPWDLEILED--CIYIAMAGNHQIWKYNMK 316
>gi|410938688|ref|ZP_11370531.1| putative lipoprotein [Leptospira noguchii str. 2006001870]
gi|410786234|gb|EKR75182.1| putative lipoprotein [Leptospira noguchii str. 2006001870]
Length = 358
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 36/70 (51%)
Query: 61 VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
+LD Y S ++ + +V AG G DG +A F P +D GNI+V+
Sbjct: 36 ILDCILKECYLCSLKITNKPIVSLFAGTGAAVSVDGTTSTASFKTPFGLEIDTSGNIFVS 95
Query: 121 DKSNHVIRKI 130
D++ ++IRKI
Sbjct: 96 DQTANLIRKI 105
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 55/102 (53%), Gaps = 5/102 (4%)
Query: 95 DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEGRADGP 152
+G+ ++ F+ P +D + N+YV + NH IRKI N G V+T++GG S G DG
Sbjct: 169 NGDRLNSLFNGPFFMDIDPERNLYVGELGNHTIRKIDLNSGNVSTLSGGIS---GYLDGD 225
Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
+A F + + + +LL +D + IR+INLK S
Sbjct: 226 LASARFKSPLGIAYNHKTDSLLAADIQDHRIRKINLKTSTVS 267
>gi|290992574|ref|XP_002678909.1| predicted protein [Naegleria gruberi]
gi|284092523|gb|EFC46165.1| predicted protein [Naegleria gruberi]
Length = 1626
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 2/87 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
A + P+ VD +Y AD NH IRKI G V TIAG + + P A +
Sbjct: 122 ASANSPRGITVDSNDEVYFADSKNHRIRKILKDGTVITIAGTKTGGYNGDNIPQTTAQLN 181
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
N +++ + A+ I+D+GN IR+I
Sbjct: 182 NPYDVK-IGSDGAIYIADYGNNRIRKI 207
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 53/111 (47%), Gaps = 3/111 (2%)
Query: 78 EESVVKRLAGDGVQGYS-DGEP-GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
++ V +AG GY+ D P +A+ + P + G IY+AD N+ IRKI N +
Sbjct: 153 KDGTVITIAGTKTGGYNGDNIPQTTAQLNNPYDVKIGSDGAIYIADYGNNRIRKIANNTI 212
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
TT+AG G+ + +A S + + + ISD N IR++
Sbjct: 213 TTVAGNGTIGSDSDNAEGTSAKLSGPSGIA-ISSTGDVYISDTNNNKIRKL 262
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 53/111 (47%), Gaps = 4/111 (3%)
Query: 79 ESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
+ + +AG+G G SD G SA+ P A+ G++Y++D +N+ IRK++ G +
Sbjct: 209 NNTITTVAGNGTIGSDSDNAEGTSAKLSGPSGIAISSTGDVYISDTNNNKIRKLSTNGII 268
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
TT AG G A+ A S L L I+D N IRQ+
Sbjct: 269 TTFAGTGVSGFFGDKADAKRARLSGPRGLAITAS-GVLYIADTNNNRIRQV 318
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 39/62 (62%), Gaps = 3/62 (4%)
Query: 86 AGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGG 142
AG G++ Y+ DG + A D P A++ G +++AD++N+ +RK+ TN + TIAG G
Sbjct: 497 AGTGLKPYNGDGIQATCANLDTPYGIALNSDGELFIADQNNYRVRKVFTNGTIVTIAGNG 556
Query: 143 SK 144
+
Sbjct: 557 ER 558
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 57/115 (49%), Gaps = 4/115 (3%)
Query: 76 LSEESVVKRLAGDGVQGY-SD-GEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITN 132
+S+++++ +G+ + Y SD G +R P+ ++D G+IY AD +N + KIT+
Sbjct: 318 VSQQNIISTFSGNDDKVYCSDAGLALGSRTPSPRGISLDPTTGDIYFADSTNQRLSKITS 377
Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + ++G G + A A +N +T P L +D N IR+I
Sbjct: 378 DGEIELVSGTGEANFFGENVVATTAKLNNPSGVTVDPKTGDLYFADTKNNRIRKI 432
>gi|357391051|ref|YP_004905892.1| hypothetical protein KSE_41500 [Kitasatospora setae KM-6054]
gi|311897528|dbj|BAJ29936.1| hypothetical protein KSE_41500 [Kitasatospora setae KM-6054]
Length = 608
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 61/119 (51%), Gaps = 18/119 (15%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG-EPGSARFDKPKSFAVDMKG--- 115
+V DS A L E+V++R+ GDGV+G DG EP RF +P+ A+ G
Sbjct: 189 LVADSGHHALVELD--ADGETVLRRI-GDGVRGLVDGPEP---RFSEPQGLALVPAGLAP 242
Query: 116 ----NIYVADKSNHVIR--KITNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
++ VAD NH +R ++ + VTT+AG G + + GPA S+ +++ F
Sbjct: 243 ELGYDVVVADTVNHALRGVRLADGSVTTLAGTGKQWWQGSPTAGPALGVDLSSPWDVAF 301
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 51/109 (46%), Gaps = 4/109 (3%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT--NLGVTTI 138
V+ AG +G DG PG A F +P AV G ++VAD +R ++ V T
Sbjct: 326 VRVAAGTTNEGLVDGPPGEAWFAQPSGLAVSADGERLWVADSETSALRWVSRGTHEVRTA 385
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G G G DG A A + +T +P ++ +SD NQ +R+ +
Sbjct: 386 VGTGLFDFGHRDGAAGQALLQHPLGVTVLPD-GSVAVSDTYNQALRRYD 433
>gi|269926393|ref|YP_003323016.1| alkyl hydroperoxide reductase [Thermobaculum terrenum ATCC BAA-798]
gi|269790053|gb|ACZ42194.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Thermobaculum terrenum ATCC BAA-798]
Length = 493
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI-TNLG-VTTI 138
++R+ G+G +G DG A+F+ P+ A+ +GN +YVAD NH +R + N G V TI
Sbjct: 210 IRRIWGNGEEGLVDGSASEAKFNHPQGMAI--RGNELYVADTENHALRLLHLNEGKVETI 267
Query: 139 AGGGSK 144
AG G +
Sbjct: 268 AGTGEQ 273
>gi|307727531|ref|YP_003910744.1| NHL repeat containing protein [Burkholderia sp. CCGE1003]
gi|307588056|gb|ADN61453.1| NHL repeat containing protein [Burkholderia sp. CCGE1003]
Length = 396
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 65/124 (52%), Gaps = 10/124 (8%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
+ V +AG G GYS DG P + A P A+D GN+++A+ HV+RK+ + G
Sbjct: 80 DNGTVTIIAGTGQAGYSGDGGPATQAMLHYPTGVALDRAGNVFIAEPFAHVVRKVDRHTG 139
Query: 135 V-TTIAGGGSKK-EGRAD-GPAQNAS-FSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
V TT+AG G+ R D GPA A+ ++ND V L I+D IR+++ +
Sbjct: 140 VITTVAGPGTPDVPNRGDGGPATQATMYANDVA---VNREGDLYIADSAAGRIRRVDHQT 196
Query: 191 EDCS 194
+ S
Sbjct: 197 GNIS 200
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 33/65 (50%), Gaps = 6/65 (9%)
Query: 98 PGSARFDKP----KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADG 151
PGS D P ++ AVD GNIY A+ + V+R+I N VT IAG G G
Sbjct: 41 PGSGGGDAPAIQAEAIAVDRWGNIYTAEPTYSVVRRIDHDNGTVTIIAGTGQAGYSGDGG 100
Query: 152 PAQNA 156
PA A
Sbjct: 101 PATQA 105
>gi|344923601|ref|ZP_08777062.1| hypothetical protein COdytL_03000 [Candidatus Odyssella
thessalonicensis L13]
Length = 1165
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 28/79 (35%), Positives = 41/79 (51%), Gaps = 1/79 (1%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAG 140
V +AG G G+++G GSA+F+ P+ V G + VAD +N IR + G T
Sbjct: 100 VATVAGTGTPGHTNGPGGSAQFNMPQGITVMNDGRLLVADTNNDRIRLVDPGNGFQTSDY 159
Query: 141 GGSKKEGRADGPAQNASFS 159
G+ G DG A A+F+
Sbjct: 160 AGTGSSGYTDGAAAGATFT 178
Score = 44.7 bits (104), Expect = 0.041, Method: Composition-based stats.
Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 20/151 (13%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS-------NHVIR 128
L+ ++V AG G G DG +A+ P A D G IY+ N +R
Sbjct: 32 LTPWNIVTTFAGSGTAGLVDGTGAAAQLANPWGLAYDGGGYIYITQGQGFMNGFINSAVR 91
Query: 129 K--ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ + V T+AG G+ G +GP +A F+ +T V + LL++D N IR +
Sbjct: 92 RLDLNTRYVATVAGTGTP--GHTNGPGGSAQFNMPQGIT-VMNDGRLLVADTNNDRIRLV 148
Query: 187 N----LKPEDCSKSSQS----GSALGAVSVW 209
+ + D + + S G+A GA W
Sbjct: 149 DPGNGFQTSDYAGTGSSGYTDGAAAGATFTW 179
Score = 41.6 bits (96), Expect = 0.36, Method: Composition-based stats.
Identities = 34/102 (33%), Positives = 42/102 (41%), Gaps = 5/102 (4%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGS 143
AG G GY+DG A F P A G IYVAD + IR I + VTT AG S
Sbjct: 160 AGTGSSGYTDGAAAGATFTWPMGMATGPDGTIYVADMAR--IRSINPSTNQVTTFAGTVS 217
Query: 144 KKEGRA-DGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
A DGP F + + + + D G+ R
Sbjct: 218 TTMSDATDGPQNGPRFQYNMSVALNAAGDKVYVGDRGHCYFR 259
Score = 40.8 bits (94), Expect = 0.65, Method: Composition-based stats.
Identities = 18/39 (46%), Positives = 23/39 (58%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G +G +A FD P D G++Y+AD NHVIRKI
Sbjct: 332 GLVNGSGATALFDGPSGMVQDADGSLYIADSYNHVIRKI 370
>gi|189462806|ref|ZP_03011591.1| hypothetical protein BACCOP_03504 [Bacteroides coprocola DSM 17136]
gi|189430422|gb|EDU99406.1| NHL repeat protein [Bacteroides coprocola DSM 17136]
Length = 500
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%), Gaps = 9/64 (14%)
Query: 76 LSEESVVKRLAGDG-------VQGYSDGE-PGSARFDKPKSFAVDMKGNI-YVADKSNHV 126
L+ E VV AG G +GY DG G ARF+ P+ A D + N+ YV D NH+
Sbjct: 412 LTPEGVVTTYAGRGSANLNNEARGYVDGTLRGEARFNNPRGLAYDEQNNVFYVGDMDNHI 471
Query: 127 IRKI 130
IRKI
Sbjct: 472 IRKI 475
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 56/131 (42%), Gaps = 20/131 (15%)
Query: 86 AGDGVQGYSDGEPGSARFDKP------------KSFAVDMKGNIYVADKSNHVIRKITNL 133
G G + DG +AR + P K+ D+ + Y D NH +R +T
Sbjct: 357 GGVGQSAWVDGVGANARLNSPYQGVFVKNPEYVKANKSDVY-DFYFTDSGNHCVRVLTPE 415
Query: 134 GV-TTIAGGGS-----KKEGRADGPAQ-NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
GV TT AG GS + G DG + A F+N L + + D N +IR+I
Sbjct: 416 GVVTTYAGRGSANLNNEARGYVDGTLRGEARFNNPRGLAYDEQNNVFYVGDMDNHIIRKI 475
Query: 187 NLKPEDCSKSS 197
++ ED + S
Sbjct: 476 AMEDEDSAHES 486
>gi|449684055|ref|XP_002166330.2| PREDICTED: NHL repeat-containing protein 2-like [Hydra
magnipapillata]
Length = 193
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 62/135 (45%), Gaps = 17/135 (12%)
Query: 28 LEDGYTVTTVIDGHQLEINPHS-----VIDRPGSSDLIVLDSSRSAFYTLSFP------L 76
L+ GY T G EI ++ +I P +V D + Y F +
Sbjct: 16 LKTGYVTTLSSQGFGAEIGLNNQPVNLIIHYPEG---LVYDPDYNLLYVCEFENHNIRVV 72
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGV 135
S + +VK LAG+ +G DG +ARF P A D K +YV+D+ NH++R +T +G
Sbjct: 73 SMQGLVKTLAGE-FKGKKDGVGRTARFYYPSGLAFDRKNRMLYVSDQYNHLVRTVTAIG- 130
Query: 136 TTIAGGGSKKEGRAD 150
+ + K+E D
Sbjct: 131 SNVEYSNIKQEAVLD 145
Score = 43.5 bits (101), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 76/169 (44%), Gaps = 18/169 (10%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG-VTTIAGGGSK 144
G G + + +P + P+ D N+ YV + NH IR ++ G V T+AG +
Sbjct: 28 GFGAEIGLNNQPVNLIIHYPEGLVYDPDYNLLYVCEFENHNIRVVSMQGLVKTLAG---E 84
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI----------NLKPEDCS 194
+G+ DG + A F L F L +SD N L+R + N+K E
Sbjct: 85 FKGKKDGVGRTARFYYPSGLAFDRKNRMLYVSDQYNHLVRTVTAIGSNVEYSNIKQEAVL 144
Query: 195 KSSQSGSALGAVS--VWVLVSVLSCLVSLVIG-FVARPYIIRHTGRLDP 240
S+ + V+ + ++++ C SL++G F+ARP + + L P
Sbjct: 145 DSTDILKKVKTVADHSSLTLNIMLCSASLLVGIFIARPCLSKLIKLLRP 193
>gi|226950178|ref|YP_002805269.1| NHL repeat-containing protein [Clostridium botulinum A2 str. Kyoto]
gi|226843692|gb|ACO86358.1| NHL repeat protein [Clostridium botulinum A2 str. Kyoto]
Length = 467
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 74/171 (43%), Gaps = 22/171 (12%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEESV 81
++LE+ + ++ IN +I G + +++LD + T+
Sbjct: 163 KNIILENKCEKEEYMYPSKISINEDGLIAFSEKGKNRIVILDQNLEKIKTI--------- 213
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIA 139
G G DG+ A+F P+ ++ IYVAD NH IR+ + V TIA
Sbjct: 214 -----GSSEYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHFIRECDLEKEIVKTIA 267
Query: 140 GGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G G K+ A G A N S ++ ++L + + I+ GN I + N+K
Sbjct: 268 GNGQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMK 316
>gi|225450563|ref|XP_002277564.1| PREDICTED: NHL repeat-containing protein 2-like [Vitis vinifera]
Length = 1078
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 4/108 (3%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAG 140
++ G +G DG A F++P+ A + K N+ YVAD NH +R+I N V T+AG
Sbjct: 627 QIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAG 686
Query: 141 GGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G+K + + G ++ +++ F P + I+ G I + N
Sbjct: 687 NGTKGSDYQGGGKGATQLLNSPWDVCFEPINEIVYIAMAGQHQIWEHN 734
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 5/104 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
G DG P + G IYVAD NH I+K+ V+T+AG G K G
Sbjct: 815 GDHDGVGSEVLLQHPLGVSCGKDGQIYVADSYNHKIKKLDPATGRVSTLAGTG--KAGFK 872
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
DG A A S + V + L I+D N +IR ++LK ++
Sbjct: 873 DGRALAAQLSEPSGIVEVEN-GVLFIADTNNSVIRYLDLKKKEA 915
>gi|300771890|ref|ZP_07081761.1| NHL repeat containing protein [Sphingobacterium spiritivorum ATCC
33861]
gi|300761276|gb|EFK58101.1| NHL repeat containing protein [Sphingobacterium spiritivorum ATCC
33861]
Length = 658
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 60/133 (45%), Gaps = 15/133 (11%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS---NHVIRKIT-----NLGVT 136
L G G GY DG A+ + VD KG++Y D + + IRKIT N V+
Sbjct: 381 LIGSGGAGYVDGSLADAKTNGAWGVVVDNKGDLYFTDNNATGGNRIRKITFDASGNGTVS 440
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC--- 193
TIAG S G ADG ASF+ L + L +S+ IR+I+L
Sbjct: 441 TIAGQAS--TGNADGIGAAASFNRPSGLAIDKNNNILYVSEWIGNRIRRIDLATNRVELL 498
Query: 194 --SKSSQSGSALG 204
S ++QSG G
Sbjct: 499 AGSATAQSGDVEG 511
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 50 VIDRPGSSDLIVLDSSRSA---FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
V+D G DL D++ + ++F S V +AG G +DG +A F++P
Sbjct: 406 VVDNKG--DLYFTDNNATGGNRIRKITFDASGNGTVSTIAGQASTGNADGIGAAASFNRP 463
Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFS--- 159
A+D NI YV++ + IR+I TN V +AG + + G +G A F+
Sbjct: 464 SGLAIDKNNNILYVSEWIGNRIRRIDLATNR-VELLAGSATAQSGDVEGIGGVARFNQLG 522
Query: 160 NDFELTFVPHICALLISDHGNQL 182
N EL+ P L +S + ++
Sbjct: 523 NALELS--PDGKKLFVSGNSSRF 543
>gi|170755613|ref|YP_001782372.1| NHL repeat-containing protein [Clostridium botulinum B1 str. Okra]
gi|429244666|ref|ZP_19208103.1| NHL repeat-containing protein [Clostridium botulinum CFSAN001628]
gi|169120825|gb|ACA44661.1| NHL repeat protein [Clostridium botulinum B1 str. Okra]
gi|428758305|gb|EKX80740.1| NHL repeat-containing protein [Clostridium botulinum CFSAN001628]
Length = 467
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 74/171 (43%), Gaps = 22/171 (12%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEESV 81
++LE+ + ++ IN +I G + +++LD + T+
Sbjct: 163 ENIILENKCEKEEYMYPSKISINEDGLIAFSEKGKNRIVILDQNLEKIKTI--------- 213
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIA 139
G G DG+ A+F P+ ++ IYVAD NH IR+ + V TIA
Sbjct: 214 -----GSSKYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHFIRECDLEKEIVKTIA 267
Query: 140 GGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G G K+ A G A N S ++ ++L + + I+ GN I + N+K
Sbjct: 268 GNGQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMK 316
>gi|153938821|ref|YP_001392044.1| NHL repeat-containing protein [Clostridium botulinum F str.
Langeland]
gi|384463040|ref|YP_005675635.1| NHL repeat protein [Clostridium botulinum F str. 230613]
gi|152934717|gb|ABS40215.1| NHL repeat protein [Clostridium botulinum F str. Langeland]
gi|295320057|gb|ADG00435.1| NHL repeat protein [Clostridium botulinum F str. 230613]
Length = 467
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 74/171 (43%), Gaps = 22/171 (12%)
Query: 24 SGLLLEDGYTVTTVIDGHQLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEESV 81
++LE+ + ++ IN +I G + +++LD + T+
Sbjct: 163 ENIILENKCEKEEYMYPSKISINEDGLIAFSEKGKNRIVILDQNLEKIKTI--------- 213
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIA 139
G G DG+ A+F P+ ++ IYVAD NH IR+ + V TIA
Sbjct: 214 -----GSSEYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHFIRECDLEKEIVKTIA 267
Query: 140 GGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G G K+ A G A N S ++ ++L + + I+ GN I + N+K
Sbjct: 268 GNGQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMK 316
>gi|238062013|ref|ZP_04606722.1| LOW QUALITY PROTEIN: alkyl hydroperoxide reductase/thiol specific
antioxidant/Mal allergen [Micromonospora sp. ATCC 39149]
gi|237883824|gb|EEP72652.1| LOW QUALITY PROTEIN: alkyl hydroperoxide reductase/thiol specific
antioxidant/Mal allergen [Micromonospora sp. ATCC 39149]
Length = 582
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 71/139 (51%), Gaps = 21/139 (15%)
Query: 46 NPHSVIDRPGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
P + + PG + +L+V DS+R + L+ E++V+R+ G GV+G +DG G
Sbjct: 138 EPETALRFPGKAIALDGGNLLVSDSARHSLVELAP--DGETLVRRI-GSGVRGRADGPAG 194
Query: 100 SARFDKPKSFA------VDMKG-NIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRAD 150
+A F +P+ ++ G ++ VAD NH++R ++ V T+AG G + D
Sbjct: 195 AAAFAEPQGLCRLPEHVAEVAGYDVVVADTVNHLLRGVRLATGEVVTVAGTGRQWRSTVD 254
Query: 151 GPAQNA---SFSNDFELTF 166
A ++ S+ ++L +
Sbjct: 255 DHAHDSLSVDLSSPWDLAW 273
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 7/98 (7%)
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNA 156
EP +A K+ A+D GN+ V+D + H + ++ G T + GS GRADGPA A
Sbjct: 138 EPETALRFPGKAIALD-GGNLLVSDSARHSLVELAPDGETLVRRIGSGVRGRADGPAGAA 196
Query: 157 SFSNDFELTFVPHICA------LLISDHGNQLIRQINL 188
+F+ L +P A ++++D N L+R + L
Sbjct: 197 AFAEPQGLCRLPEHVAEVAGYDVVVADTVNHLLRGVRL 234
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 46/103 (44%), Gaps = 2/103 (1%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNLGVTTIAGGGSK 144
AG V+ DG A +P AV G ++VAD +R + N + T G G
Sbjct: 302 AGTTVEALRDGPLAEAWLAQPSGLAVSADGARLWVADSETSAVRYVENGVLGTAVGQGLF 361
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G DGPA A + + +P ++LI+D N +R+ +
Sbjct: 362 DFGHVDGPADRALLQHPLGVCALPD-GSVLIADTYNGAVRRFD 403
>gi|290996648|ref|XP_002680894.1| predicted protein [Naegleria gruberi]
gi|284094516|gb|EFC48150.1| predicted protein [Naegleria gruberi]
Length = 966
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 41/72 (56%), Gaps = 5/72 (6%)
Query: 78 EESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
E + AG+G G+S G+ G +A+ + P+ AV G +Y+AD N +RK++ G
Sbjct: 429 EIYIAVTFAGNGTSGFS-GDGGLAINAQLNSPRCVAVSGSGEVYIADSMNSRVRKVSTNG 487
Query: 135 -VTTIAGGGSKK 145
+TTIAG G K
Sbjct: 488 IITTIAGNGDKN 499
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 57/125 (45%), Gaps = 6/125 (4%)
Query: 68 AFYTLSFPLSEESVVKRLAG-DGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSN 124
+F L + ++ +AG G GY+ S+ + +SFA+ G+ +AD
Sbjct: 203 SFNCLLRKIGANGIISTIAGVYGFCGYNSDNVIATSSYLSRFRSFAIASNGDFIIADYET 262
Query: 125 HVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
H IR+I +N + TIAG G+ G A NA S F +T + S G +
Sbjct: 263 HRIRRIFKSNSTIITIAGSGTAGYSGDGGLAINALISYPFSITTTSKG-DIFFSSDGECV 321
Query: 183 IRQIN 187
+R+I+
Sbjct: 322 VRKID 326
>gi|145487484|ref|XP_001429747.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124396841|emb|CAK62349.1| unnamed protein product [Paramecium tetraurelia]
Length = 450
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 41/73 (56%), Gaps = 2/73 (2%)
Query: 78 EESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
++ + K +AG G G+ DG G F+ P +D+ GNI+V D +NH IR IT G V
Sbjct: 190 QKKITKTIAGICGQSGFKDGLLGMNLFNTPDQMGIDVLGNIFVNDFNNHFIRMITLDGYV 249
Query: 136 TTIAGGGSKKEGR 148
T+ G +++ R
Sbjct: 250 NTLISGSCRQDVR 262
>gi|428206267|ref|YP_007090620.1| NHL repeat containing protein [Chroococcidiopsis thermalis PCC
7203]
gi|428008188|gb|AFY86751.1| NHL repeat containing protein [Chroococcidiopsis thermalis PCC
7203]
Length = 502
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 57/109 (52%), Gaps = 9/109 (8%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNL---GVTTIAGGG 142
G G G +DG+ A+F P+ +D + + Y+AD NH IR+I NL V TIAG G
Sbjct: 212 GTGKPGLTDGDFTQAQFFAPQGMVLDDENQLLYLADTENHAIRRI-NLPLQRVETIAGTG 270
Query: 143 SKKEG--RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+ G Q + ++ ++LT + + L I+ GN I Q++L+
Sbjct: 271 EQSRNIRPYGGIGQKIALNSPWDLTMLKNY--LFIAMAGNHQIWQMDLE 317
>gi|290976432|ref|XP_002670944.1| predicted protein [Naegleria gruberi]
gi|284084508|gb|EFC38200.1| predicted protein [Naegleria gruberi]
Length = 1514
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 68/137 (49%), Gaps = 10/137 (7%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMK 114
+S+L + D++ Y + LS +++ +AG+G G+ D + A+ AVD
Sbjct: 222 NSELYIADANN---YRIRKILSNGTIIT-IAGNGGYGFIDNVLATNAKLAFVNGIAVDTN 277
Query: 115 GNIYVADK----SNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
G +Y+ + SNH IRKI TN + T +G GS+ + A +A F + ++
Sbjct: 278 GEVYITESENGFSNHRIRKILTNGTIITFSGTGSRGYYGDNVEAADAKFYTPYFVSVRQT 337
Query: 170 ICALLISDHGNQLIRQI 186
+LISD GN +R +
Sbjct: 338 TGEVLISDTGNNFLRLV 354
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 51/114 (44%), Gaps = 3/114 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S+ ++ +AG G G++ + P + A D++ + V+D NHVIRKI
Sbjct: 69 ISKNGMIDSIAGTGKFGFNGDNLFATDTQLSNPFAVAFDLETQLIVSDSGNHVIRKILRN 128
Query: 134 GVTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G I G + DG A+ A +N + L I+D N IR+I
Sbjct: 129 GTMVIIAGTGQSGYNGDGIDAKIAKLNNPSGIVVDSKTGELFIADTANNRIRKI 182
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 55/100 (55%), Gaps = 7/100 (7%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DG-EPGSARF 103
NP ++ + +L + D++ + + L+ +++ +AG G GY+ DG + SA+
Sbjct: 156 NPSGIVVDSKTGELFIADTANNRIRKI---LTNGTIIT-IAGTGETGYNGDGIDATSAQI 211
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGG 142
A+D+ +Y+AD +N+ IRKI +N + TIAG G
Sbjct: 212 RGVYGIALDVNSELYIADANNYRIRKILSNGTIITIAGNG 251
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 5/106 (4%)
Query: 85 LAGDGVQGYS-DG-EPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKI-TNLGVTTIAG 140
+AG G GY+ DG + A+ + P VD K G +++AD +N+ IRKI TN + TIAG
Sbjct: 134 IAGTGQSGYNGDGIDAKIAKLNNPSGIVVDSKTGELFIADTANNRIRKILTNGTIITIAG 193
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G A +A + + + L I+D N IR+I
Sbjct: 194 TGETGYNGDGIDATSAQIRGVYGIALDVNS-ELYIADANNYRIRKI 238
Score = 40.4 bits (93), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 41/82 (50%), Gaps = 2/82 (2%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
PK G+IYVA+ H I +I+ G + +IAG G + A + SN F +
Sbjct: 45 PKGTYFSANGDIYVAETQGHRISRISKNGMIDSIAGTGKFGFNGDNLFATDTQLSNPFAV 104
Query: 165 TFVPHICALLISDHGNQLIRQI 186
F L++SD GN +IR+I
Sbjct: 105 AFDLET-QLIVSDSGNHVIRKI 125
>gi|290987760|ref|XP_002676590.1| predicted protein [Naegleria gruberi]
gi|284090193|gb|EFC43846.1| predicted protein [Naegleria gruberi]
Length = 836
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 43/73 (58%), Gaps = 6/73 (8%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMK-GNIYVADKSNHVIRKI-T 131
+ +V+ +AG G QG+S G+ G +A+ +KP S D GNIY+AD NH IR I
Sbjct: 501 TSNEMVEVIAGIGFQGFS-GDGGLAINAQLNKPHSVHFDKNSGNIYIADSGNHRIRMIFP 559
Query: 132 NLGVTTIAGGGSK 144
N +TTI G G+
Sbjct: 560 NGTITTIVGSGTN 572
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 41/93 (44%), Gaps = 2/93 (2%)
Query: 96 GEPGSARFDKPKS-FAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPA 153
G P S P F +D IY+AD NH I K T N V IAG G + G A
Sbjct: 465 GPPSSIILKNPTGIFCLDSFSTIYIADTDNHRILKYTSNEMVEVIAGIGFQGFSGDGGLA 524
Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
NA + + F + + I+D GN IR I
Sbjct: 525 INAQLNKPHSVHFDKNSGNIYIADSGNHRIRMI 557
>gi|290970020|ref|XP_002668012.1| predicted protein [Naegleria gruberi]
gi|284081028|gb|EFC35268.1| predicted protein [Naegleria gruberi]
Length = 243
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 52/112 (46%), Gaps = 4/112 (3%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
E + +AG+G GYS +G +A+ +P S V +Y+AD NH IRKI G
Sbjct: 115 ENGRIVTIAGNGQPGYSGDNGPATTAQLHRPYSVFVSANNEVYIADTFNHSIRKIDESGN 174
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G G A NA N F+ + I++ N R+I
Sbjct: 175 IETIAGNEQPGYGGDGGYATNAQL-NHPSGVFISTNYEIYITETNNHTTRKI 225
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 50/104 (48%), Gaps = 3/104 (2%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
+AG+G G +G S++ P+S V I +AD NH IRKI N + TIAG G
Sbjct: 67 IAGNGQTGCGGDNGSATSSQLYYPQSVFVSTNNEICIADTFNHRIRKIENGRIVTIAGNG 126
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+GPA A + + FV + I+D N IR+I
Sbjct: 127 QPGYSGDNGPATTAQLHRPYSV-FVSANNEVYIADTFNHSIRKI 169
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 4/105 (3%)
Query: 85 LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG+G QGY +G SA+ + P V +++ADK+N+ IRKI G + IAG
Sbjct: 11 IAGNGKQGYGGDNGLATSAQLNYPSGVFVSTNNEVFIADKNNNRIRKIVKNGNIVLIAGN 70
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G G +G A ++ + FV + I+D N IR+I
Sbjct: 71 GQTGCGGDNGSATSSQLYYP-QSVFVSTNNEICIADTFNHRIRKI 114
>gi|290996951|ref|XP_002681045.1| predicted protein [Naegleria gruberi]
gi|284094668|gb|EFC48301.1| predicted protein [Naegleria gruberi]
Length = 263
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 7/90 (7%)
Query: 78 EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAV--DMKGN---IYVADKSNHVIRKI 130
+ V+ + G+G G++ D + A ++P+S V + KG +Y+AD SNH IRK+
Sbjct: 156 QNGTVETICGNGDNGFNGDDIKAVDASLNEPQSIYVTRNTKGTDHILYIADTSNHRIRKV 215
Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
N ++TI G G K PA+ AS +N
Sbjct: 216 ENGIISTICGNGESKYNGDLIPAKEASLNN 245
Score = 37.0 bits (84), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 38/77 (49%), Gaps = 10/77 (12%)
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP-------H 169
IY+ D +NH IRKI N V TI G G D A +AS N+ + +V H
Sbjct: 142 IYICDSNNHKIRKIQNGTVETICGNGDNGFNGDDIKAVDASL-NEPQSIYVTRNTKGTDH 200
Query: 170 ICALLISDHGNQLIRQI 186
I L I+D N IR++
Sbjct: 201 I--LYIADTSNHRIRKV 215
>gi|269837841|ref|YP_003320069.1| alkyl hydroperoxide reductase [Sphaerobacter thermophilus DSM
20745]
gi|269787104|gb|ACZ39247.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Sphaerobacter thermophilus DSM 20745]
Length = 494
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 56/128 (43%), Gaps = 7/128 (5%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGN 116
L V DS SA T++FP +E V+ L G G+ + D G R P + A G
Sbjct: 356 LYVADSETSAVRTVTFPPGDE--VRTLVGTGLFDFGDVDGIGDEVRLQHPLAVACG-DGV 412
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
+YVAD NH I+++ GS + G ADG A F L+ L I+
Sbjct: 413 LYVADSYNHKIKRLDPATRRCETWLGSGEPGDADGTGTAARFHEPGGLSLAG--SRLYIA 470
Query: 177 DHGNQLIR 184
D N IR
Sbjct: 471 DTNNHAIR 478
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 64/158 (40%), Gaps = 12/158 (7%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ + LIV DS S+ V+ G G G DG A F P
Sbjct: 180 PGKVLADAATDRLIVADSGHHRLIVARLDGSDAWVI----GSGEAGLRDGPGAIAAFHDP 235
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG--SKKEGRADGPAQNASFSNDF 162
+ A+ +YVAD NHV+R++ V TIAG G RA G A+ + +
Sbjct: 236 QGLAL-YGDTLYVADTRNHVVRRVDLAARTVETIAGTGHIGWSYARA-GAARAIDLRSPW 293
Query: 163 ELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
+L H L I+ G I ++L + + SG
Sbjct: 294 DLVL--HDGVLYIAMAGMHQIWALDLDEAELRPYAGSG 329
>gi|423216520|ref|ZP_17203043.1| hypothetical protein HMPREF1074_04575 [Bacteroides xylanisolvens
CL03T12C04]
gi|392690730|gb|EIY83986.1| hypothetical protein HMPREF1074_04575 [Bacteroides xylanisolvens
CL03T12C04]
Length = 423
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 70/162 (43%), Gaps = 31/162 (19%)
Query: 41 HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG---E 97
H +E +P +LIV + R F F ++ E + +AG +G SDG
Sbjct: 275 HAMEFDPEG--------NLIVSSNDRGQF----FMVTPEKEIICIAGS-EKGDSDGTSGN 321
Query: 98 PGSARFDKPKSFAVDMKGNIYVAD------KSNHVIRKIT-------NLGVTTIAGGGSK 144
P SA+F + FA+D +G IY D S I++IT + V T+ GS
Sbjct: 322 PRSAKFYQLYGFAIDSEGTIYTVDGNDGNVGSGQKIKRITRGKKGYEDGTVVTLV--GSA 379
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
DG A F N +++ AL +SD N IR+I
Sbjct: 380 GGAIVDGSVDEAVFGNPYDIILDEANRALYVSDRSNNAIRKI 421
>gi|290969880|ref|XP_002667977.1| predicted protein [Naegleria gruberi]
gi|284080923|gb|EFC35233.1| predicted protein [Naegleria gruberi]
Length = 366
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 11/109 (10%)
Query: 89 GVQGYSDGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG--GSKK 145
G G ++G P +ARF+ P A+ G ++VADK+NH+IR+I G + G G++
Sbjct: 125 GTSGSTNGNTPTAARFNNPYGMALAADGTMFVADKANHIIRRINPDGSVELYAGTVGTQC 184
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALL-------ISDHGNQLIRQIN 187
PA S+ +TF ++ + D+GN IR+I+
Sbjct: 185 PSATASPAC-GDGSDRLSVTFTNPNAVIVDSNNNVYVFDYGNFRIRKID 232
>gi|290994368|ref|XP_002679804.1| predicted protein [Naegleria gruberi]
gi|284093422|gb|EFC47060.1| predicted protein [Naegleria gruberi]
Length = 1359
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 71/161 (44%), Gaps = 23/161 (14%)
Query: 38 IDGHQLEINPHSVIDRPGS-----SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
I GH I +S + P S + V DS+ ++ +V +AG G G
Sbjct: 530 IKGHIDGIALNSTFNHPSGIHAEYSAIYVADSNNHVIRKITL---STGIVSTVAGSGEPG 586
Query: 93 YS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
Y+ DG+ P +F+ P + + + VAD +NH IR IT++ V T+AG G +
Sbjct: 587 YNGDGKLPLETQFNNPMG-VISAQMGLIVADTNNHRIRVITDVTVKTLAGTGIDGSSGDE 645
Query: 151 GPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQI 186
GPA + F PH + + ++D N IR I
Sbjct: 646 GPAIRSQL-------FYPHSVSYGYSDIYVADTFNNKIRAI 679
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 55/110 (50%), Gaps = 4/110 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSAR--FDKPKSFAV-DMKGNIYVADKSNHVIRKITNLG-VT 136
+V +AG+G G+ + S + P A + G+I +D++NH IR + G ++
Sbjct: 795 IVHTIAGNGGYGFYADDINSLQTSLAYPSGIAYSKITGDIIFSDRNNHRIRIVYKNGTIS 854
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T+AG G K PA+NA ++ L +P ++ SD N IR+I
Sbjct: 855 TLAGTGIKGFSGDGLPAKNAQLNSPTFLKLIPPYEDIIFSDSNNGRIRRI 904
>gi|357417484|ref|YP_004930504.1| NHL repeat containing protein [Pseudoxanthomonas spadix BD-a59]
gi|355335062|gb|AER56463.1| NHL repeat containing protein [Pseudoxanthomonas spadix BD-a59]
Length = 474
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 45/105 (42%), Gaps = 3/105 (2%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGS 143
AG G G DG A F +P A ++ +YVAD IR + G T+ G G
Sbjct: 318 AGSGQLGQRDGAGILASFAQPSGLA-QVQQALYVADTLGSSIRGVQLRGDLTQTLVGQGM 376
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ G ADG Q A L P L I+D GN IR + L
Sbjct: 377 WEFGDADGQRQQARLQAPEALALDPDAPLLWIADTGNGCIRSLRL 421
Score = 41.2 bits (95), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 4/76 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVTTIA 139
V R G+G DGEP A F +P ++ ++ +YVAD NH++R+I NL G
Sbjct: 200 VLRRFGNGTADALDGEPEIASFCRPNGLSL-LREELYVADTGNHLLRRI-NLRSGAVQTV 257
Query: 140 GGGSKKEGRADGPAQN 155
G A+GP Q+
Sbjct: 258 LGNRLTRTPAEGPIQS 273
>gi|257069643|ref|YP_003155898.1| NHL repeat protein [Brachybacterium faecium DSM 4810]
gi|256560461|gb|ACU86308.1| NHL repeat protein [Brachybacterium faecium DSM 4810]
Length = 697
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 3/120 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
LAG +G DG ++ + +P G I VAD +R + ++ VTT+ G G
Sbjct: 352 LAGTTQEGLVDGPAVTSWWAQPSGLDELPDGRIVVADSETSAVRVLDPRSMQVTTLVGEG 411
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
G DGPA A + +T +P + +SD N IR + P + +++ G A
Sbjct: 412 LFDFGHVDGPASTARLQHPLAVTALPD-GRIAVSDTYNGAIRLVEEAPAEGERATTGGIA 470
>gi|196009344|ref|XP_002114537.1| hypothetical protein TRIADDRAFT_11855 [Trichoplax adhaerens]
gi|190582599|gb|EDV22671.1| hypothetical protein TRIADDRAFT_11855 [Trichoplax adhaerens]
Length = 507
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 55/111 (49%), Gaps = 10/111 (9%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEG 147
+G+ DG ARF+ P+ G I+VAD +NH IRKI VTTIAG G++
Sbjct: 214 EGFVDGVFQKARFNNPQGITCSRNGKTIFVADTNNHAIRKIDLEYCEVTTIAGTGNQGND 273
Query: 148 RADGPA-QNASFSNDFELTFVPHI----CALLISDHGNQLIRQINLKPEDC 193
G A + + ++ ++LT P L I+ G+ I + L EDC
Sbjct: 274 LVGGGAGKMQALNSPWDLTLGPSCEDQEVVLYIAMAGSHQIWALAL--EDC 322
>gi|427430403|ref|ZP_18920257.1| hypothetical protein C882_1420 [Caenispirillum salinarum AK4]
gi|425878863|gb|EKV27574.1| hypothetical protein C882_1420 [Caenispirillum salinarum AK4]
Length = 498
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 58/118 (49%), Gaps = 9/118 (7%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLG 134
S+ V+ R G G+ G DG+ A F P+ A + I+VAD NH++R+I +
Sbjct: 204 SDGRVLARY-GTGLPGTDDGQAERAAFTSPQGLAC-TEDAIFVADTGNHLVRRIDLCSGA 261
Query: 135 VTTIAG---GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
VTT+AG G R PA + ++ +++ P L I++ G I +N++
Sbjct: 262 VTTVAGLRFRGLPVSMRQ--PAAETALASPWDIEITPDGGTLFIANAGTHQILAMNMR 317
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 58/141 (41%), Gaps = 12/141 (8%)
Query: 54 PGSSDLIVLDSSRSAFYTL---SFPLSEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKS 108
P +S L DS SA + + L + + V+ L G G+ G+ DG +AR P
Sbjct: 352 PDASALYFADSETSAVRKVILDGWRLGQTARVETLVGRGLFDFGHDDGPMTAARMQHPLG 411
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR--ADGPAQNASFSNDFELTF 166
AV G + VAD NH IR I T + K GR G A L
Sbjct: 412 VAVMPDGRVAVADSYNHAIRLIDEAAGTV----ETLKTGRLHCTGAACRRPLWEPAGLWP 467
Query: 167 VPHICALLISDHGNQLIRQIN 187
P LL+SD N I +++
Sbjct: 468 TPK-GRLLVSDTNNHRIVEVD 487
>gi|296089788|emb|CBI39607.3| unnamed protein product [Vitis vinifera]
Length = 1002
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 4/108 (3%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAG 140
++ G +G DG A F++P+ A + K N+ YVAD NH +R+I N V T+AG
Sbjct: 551 QIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAG 610
Query: 141 GGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G+K + + G ++ +++ F P + I+ G I + N
Sbjct: 611 NGTKGSDYQGGGKGATQLLNSPWDVCFEPINEIVYIAMAGQHQIWEHN 658
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 5/104 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
G DG P + G IYVAD NH I+K+ V+T+AG G K G
Sbjct: 739 GDHDGVGSEVLLQHPLGVSCGKDGQIYVADSYNHKIKKLDPATGRVSTLAGTG--KAGFK 796
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
DG A A S + V + L I+D N +IR ++LK ++
Sbjct: 797 DGRALAAQLSEPSGIVEVEN-GVLFIADTNNSVIRYLDLKKKEA 839
>gi|254447301|ref|ZP_05060768.1| NHL repeat containing protein [gamma proteobacterium HTCC5015]
gi|198263440|gb|EDY87718.1| NHL repeat containing protein [gamma proteobacterium HTCC5015]
Length = 2515
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 40/72 (55%), Gaps = 3/72 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
++ E +++ AG +GYS G A + P+ AV G +++AD NH IRK+
Sbjct: 1213 ITPEGEIEKYAGKETRGYSGDGGAATEAELNLPRGMAVSSGGTLFIADWGNHRIRKVAPN 1272
Query: 134 G-VTTIAGGGSK 144
G +TT+AG G +
Sbjct: 1273 GIITTVAGNGEE 1284
>gi|413923862|gb|AFW63794.1| hypothetical protein ZEAMMB73_706161, partial [Zea mays]
Length = 139
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 1/50 (2%)
Query: 29 EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSE 78
E GYTV TV DG +L I P+SV + S +L+V+DS S Y ++ PLS
Sbjct: 75 EGGYTVETVFDGSKLGIEPYSV-EVTQSGELLVMDSMNSNIYRMALPLSR 123
>gi|299145011|ref|ZP_07038079.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
gi|298515502|gb|EFI39383.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
Length = 459
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 53/95 (55%), Gaps = 2/95 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G+++G+ A+ ++P +D +GNIYVA + H I KIT G+ T G+ G DG
Sbjct: 364 GFTEGQGNEAKLEEPCQMDLDEEGNIYVAVRKKHRIAKITPDGMLTNY-TGTGTSGTTDG 422
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
P A F++ + F P AL ISD+ N IR+I
Sbjct: 423 PLDKAQFNHPEGVQFGPD-GALYISDYWNHKIRKI 456
>gi|428308867|ref|YP_007119844.1| thiol-disulfide isomerase-like thioredoxin [Microcoleus sp. PCC
7113]
gi|428250479|gb|AFZ16438.1| thiol-disulfide isomerase-like thioredoxin [Microcoleus sp. PCC
7113]
Length = 515
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 57/113 (50%), Gaps = 7/113 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
V+++ G G G +DG A+F P+ +D + +YVAD NH +R++ N V TI
Sbjct: 213 VQQVIGTGKPGLTDGSFEEAQFFAPQGMTLDSENQLLYVADTENHCLRQVDLKNQQVKTI 272
Query: 139 AGGGSKKEG--RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
AG G + G A ++ +++ V H LLI+ G+ I ++ L+
Sbjct: 273 AGTGEQSHHIRPHSGKALETRLNSPWDVEKVGH--CLLIAMAGSHQIWEMQLE 323
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTI 138
+ + G G G+ DG+ ARF +P + + ++YVAD +NH IR I L VTT+
Sbjct: 440 QTMIGQGTAGHQDGQSIKARFSEPSGLSA-IAAHLYVADTNNHAIRCIALDTLEVTTL 496
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 31/129 (24%), Positives = 54/129 (41%), Gaps = 7/129 (5%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK---ITNLG 134
E ++ AG G + DG P A F +P D + +Y+AD IR + +L
Sbjct: 323 ETGWLRTYAGTGAEACLDGTPDQAAFAQPSGLTTDGR-ELYIADSEVSSIRAVGLVDHLP 381
Query: 135 VTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
V T+ G G G DG + + + + + L ++D N I++++ + C
Sbjct: 382 VRTVCGSGELFGFGDVDGEDADVRLQHALGVEYTQNY--LWVADTYNHKIKRVDPRTGTC 439
Query: 194 SKSSQSGSA 202
G+A
Sbjct: 440 QTMIGQGTA 448
>gi|320103300|ref|YP_004178891.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
gi|319750582|gb|ADV62342.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
Length = 698
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 59/125 (47%), Gaps = 8/125 (6%)
Query: 83 KRL--AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTI 138
KRL G G G++DG +A F++ + + + +YVAD NH IR +T V T+
Sbjct: 270 KRLDVIGSGATGFADGSFETASFNRQQGIRL-VGETLYVADTENHAIRAANLTTRTVATV 328
Query: 139 AGGGSKKEGRA---DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
AG G + R GPA+ ++ ++L +P L+I+ G I + +
Sbjct: 329 AGTGKQTYHRGWGVRGPAKTTGLNSPWDLVKIPDEDRLIIAMAGPHQIWDYDFSKGEIGL 388
Query: 196 SSQSG 200
S +G
Sbjct: 389 WSGTG 393
>gi|326672463|ref|XP_003199672.1| PREDICTED: NHL repeat-containing protein 2-like [Danio rerio]
Length = 719
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 8/79 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAG---GGSKK 145
G DG A+F P+ + KG+ +YVAD NH++RKI T V+T+AG G+ K
Sbjct: 262 GKRDGNLSEAQFSSPQGLFI--KGDTVYVADTENHLVRKINLTEGKVSTLAGIGVQGTDK 319
Query: 146 EGRADGPAQNASFSNDFEL 164
EG A GP Q S D L
Sbjct: 320 EGGALGPQQPISSPWDVTL 338
>gi|326672461|ref|XP_003199671.1| PREDICTED: NHL repeat-containing protein 2-like [Danio rerio]
Length = 719
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 8/79 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAG---GGSKK 145
G DG A+F P+ + KG+ +YVAD NH++RKI T V+T+AG G+ K
Sbjct: 262 GKRDGNLSEAQFSSPQGLFI--KGDTVYVADTENHLVRKINLTEGKVSTLAGIGVQGTDK 319
Query: 146 EGRADGPAQNASFSNDFEL 164
EG A GP Q S D L
Sbjct: 320 EGGALGPQQPISSPWDVTL 338
>gi|220914451|ref|YP_002489760.1| NHL repeat containing protein [Arthrobacter chlorophenolicus A6]
gi|219861329|gb|ACL41671.1| NHL repeat containing protein [Arthrobacter chlorophenolicus A6]
Length = 674
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 52/111 (46%), Gaps = 6/111 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGVT 136
V +AG+G++G DG + F + A D GNI+VAD +R + +L V
Sbjct: 378 VAIVAGNGLEGLLDGPAHESWFAQSSGLAEDADGNIWVADSETSALRTLVIDDTGSLTVK 437
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
T G G G DG A A + +T +P ++ I+D N +R+ +
Sbjct: 438 TAVGKGLFDFGFRDGTAAEARLQHPLGVTVLPD-GSVAIADTYNGAVRRFD 487
>gi|290986797|ref|XP_002676110.1| predicted protein [Naegleria gruberi]
gi|284089710|gb|EFC43366.1| predicted protein [Naegleria gruberi]
Length = 1218
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 42/68 (61%), Gaps = 4/68 (5%)
Query: 85 LAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
+ G GV+GY+ + A+ +KP S +VD G+IY++D S +RKI+N ++TIAG G
Sbjct: 675 ICGTGVEGYNGDDILATDAQLNKPTSVSVDQNGDIYISDNSR--LRKISNGVISTIAGNG 732
Query: 143 SKKEGRAD 150
+ + D
Sbjct: 733 NTTDIEND 740
Score = 43.9 bits (102), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 73/162 (45%), Gaps = 21/162 (12%)
Query: 39 DGHQLEINPHSVIDR-------PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GV 90
DG+ + +SVI R P +S++I+ DS S +S ++ +AG GV
Sbjct: 475 DGYNAK---YSVISRATGIDFLPSTSEMIIADSLNSCIR----KISTLGIITTIAGKPGV 527
Query: 91 QGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKE 146
GYS GE A P V G IY + + H ++KI+ N +TTI G G
Sbjct: 528 AGYS-GENVRAIDSMIGNPVRVLVSKMGEIYFSTNT-HRVQKISLNGNITTICGTGVDDY 585
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G A A+ SN + F L ++ +GN IR +NL
Sbjct: 586 SGDGGNAIEAAISNPNGMAFSSDEELLYVACNGNNAIRVVNL 627
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 47/99 (47%), Gaps = 1/99 (1%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
G G++DG+ ++ +P + + ++ AD++NH IRKI+N V TI G G G
Sbjct: 137 GSSGFNDGQAKLSKVSRPFGIGL-INSDVIFADRNNHRIRKISNGAVVTICGNGINNFGG 195
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ PA A + L + D N IR+I+
Sbjct: 196 DNRPALTARINTPTGLFVNSTNGDIYFCDSLNHRIRKID 234
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 56/111 (50%), Gaps = 7/111 (6%)
Query: 80 SVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT 136
+++ +AG+G GYS G+ G +A + P G +Y+ D +N+VIR + + G
Sbjct: 238 AIISTIAGNGDVGYS-GDGGLATNAALNFPTGVVSTPDGTLYIID-NNNVIRMVNSSGYI 295
Query: 137 TIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ GG D GPA NA F+ ++ V + ++D GN IR+I
Sbjct: 296 STIGGSMDSGNYGDGGPAINAKFAYPTGIS-VSKNGDVYVTDTGNLKIRKI 345
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 9/102 (8%)
Query: 93 YSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRAD 150
Y DG P +A+F P +V G++YV D N IRKI+ G+ G + G
Sbjct: 307 YGDGGPAINAKFAYPTGISVSKNGDVYVTDTGNLKIRKISKQTGIIETVIGNTNVYGGDG 366
Query: 151 GPAQNASFSNDFE------LTFVPHICALLISDHGNQLIRQI 186
PA A+ + +T++ H C + D NQ++ I
Sbjct: 367 LPASKAALGYISQVVSIDGITYISHDCRIRKVDK-NQIVSTI 407
>gi|195164496|ref|XP_002023083.1| GL21149 [Drosophila persimilis]
gi|194105168|gb|EDW27211.1| GL21149 [Drosophila persimilis]
Length = 595
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 58/127 (45%), Gaps = 19/127 (14%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NL 133
+S + VV+ G G+ DG+ +RF+ P+ A + + VAD NH +RKI+ N
Sbjct: 252 VSADGVVEHKIGGLQAGFVDGKLALSRFNSPQGVAFLNESTLIVADTENHALRKISLENE 311
Query: 134 GVTTIAGGGSKKEGRADG----------PAQNASF-SNDFELTF------VPHICALLIS 176
V T+AG G + R G P A F + D +++F VP +LIS
Sbjct: 312 LVETLAGTGHQGNERIGGRVGPLQALSSPWDLAVFRTRDMDMSFHLDERNVPEKTIILIS 371
Query: 177 DHGNQLI 183
G I
Sbjct: 372 MAGTHQI 378
>gi|241584846|ref|XP_002403871.1| NHL repeat-containing protein [Ixodes scapularis]
gi|215500274|gb|EEC09768.1| NHL repeat-containing protein [Ixodes scapularis]
Length = 621
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 57/124 (45%), Gaps = 16/124 (12%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKK---E 146
G+ DG ARF+ P+ +YVAD NH +R++ V T+AG G + E
Sbjct: 280 GWRDGALNDARFNGPQGVVWRDPHFVYVADTGNHTVREVDLEQAQVRTVAGTGQRGSDLE 339
Query: 147 GRADGPAQNASFSNDF----ELTFVP-----HICALLISDHGNQLIRQINLKPEDCSKSS 197
G GP Q S D ++ F+ + AL + D +QL +++L C + +
Sbjct: 340 GGRQGPQQPLSSPWDLCLVEDILFIAMAGSHQLWALALRD--SQLFGKLHLGAGTCLRIA 397
Query: 198 QSGS 201
SGS
Sbjct: 398 GSGS 401
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 59/144 (40%), Gaps = 24/144 (16%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-------GYSDGEPGSARFDKPKSFAV 111
L + DS SA TLS VK L G + G +DG R P
Sbjct: 429 LCIADSESSAIRTLSL---RTGAVKNLVGGALNPTDLFCFGDADGSALDVRLQHPLGVCW 485
Query: 112 DM-KGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
K +YVAD NH IRK + TT+AG G G A G SFS++ +
Sbjct: 486 SSDKQLLYVADSYNHKIRKVDVQKRLCTTLAGTGVA--GDATG-----SFSDEVQFDEPG 538
Query: 169 HICA----LLISDHGNQLIRQINL 188
+C L ++D N ++ ++L
Sbjct: 539 GLCVVGSRLYVADTNNHCVKLVHL 562
>gi|427706390|ref|YP_007048767.1| alkyl hydroperoxide reductase [Nostoc sp. PCC 7107]
gi|427358895|gb|AFY41617.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Nostoc sp. PCC 7107]
Length = 505
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 55/112 (49%), Gaps = 7/112 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTI 138
V + G G G +DG +A+F P+ D + I Y+AD NH +R++ V TI
Sbjct: 206 VLHIIGTGKSGLTDGSFSTAQFSAPQGMVFDSENQIFYIADTENHALRRVDLKQQIVETI 265
Query: 139 AGGGSKKEG--RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
AG G + G A + ++ ++L V +I L I+ G+ I Q+NL
Sbjct: 266 AGTGEQSRNIHPHSGVALETALNSPWDLVKVGNI--LFIAMAGSHQIWQMNL 315
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
+ + GDG G DG+ ++RF +P + + +Y++D +NH IR++ VTT+
Sbjct: 433 QTILGDGFAGLQDGQGKNSRFFEPSGLSA-INSYLYISDTNNHAIRRVDLNTFEVTTL 489
>gi|390369862|ref|XP_798415.2| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
purpuratus]
Length = 322
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 11/102 (10%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V P + L V D+ + ++ + VV+ G G++DG ARF P
Sbjct: 173 PGKVTTNPEGTLLAVSDTGHNRVIIVAL----DGVVQHCIGGPETGFNDGLYQEARFHSP 228
Query: 107 K--SFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
+ +A D+ IYVAD NH IRKI VTTIAG G +
Sbjct: 229 QGLCWAQDV---IYVADTENHAIRKIDLKEKRVTTIAGTGEQ 267
>gi|134103740|ref|YP_001109401.1| NHL repeat-containing protein [Saccharopolyspora erythraea NRRL
2338]
gi|291007734|ref|ZP_06565707.1| NHL repeat-containing protein [Saccharopolyspora erythraea NRRL
2338]
gi|133916363|emb|CAM06476.1| NHL repeat containing protein [Saccharopolyspora erythraea NRRL
2338]
Length = 619
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 69/135 (51%), Gaps = 13/135 (9%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV------- 111
L+V DS+ + + F E+ V+R+ G G +G +DG A F +P A+
Sbjct: 192 LLVSDSAHHSL--VEFTADGETPVRRI-GSGERGRADGAADEASFAEPAGLALLPEDVAA 248
Query: 112 DMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVP 168
++ ++ VAD NH++R ++ + V T+AG G + ++G GPA ++ +++ +
Sbjct: 249 EVGYDVVVADTVNHLLRGVQLADGEVVTVAGTGEQWRDGSDAGPALETPLTSPWDVAWWE 308
Query: 169 HICALLISDHGNQLI 183
++I+ GN +
Sbjct: 309 PAGGVVIAMAGNHTL 323
Score = 44.3 bits (103), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 54/108 (50%), Gaps = 8/108 (7%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
GDG ++ E + RF K +D G + V+D ++H + + T G T + GS +
Sbjct: 164 GDGPYVPAETENTTLRF-PAKVLPLD-GGTLLVSDSAHHSLVEFTADGETPVRRIGSGER 221
Query: 147 GRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINL 188
GRADG A ASF+ L +P A ++++D N L+R + L
Sbjct: 222 GRADGAADEASFAEPAGLALLPEDVAAEVGYDVVVADTVNHLLRGVQL 269
Score = 40.8 bits (94), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 52/117 (44%), Gaps = 6/117 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTT 137
V+R AG V+G DG+ A F + A D +++AD +R + G V T
Sbjct: 333 VRRFAGTTVEGLRDGDADEAFFAQTSGLA-DGGDRLWLADSETSALRWVEAGGDGFAVRT 391
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
G G G ADG A A F + + +P ++ + D N IR+ + ++ S
Sbjct: 392 AVGAGLFDFGHADGAADQALFQHPLGVAVLPD-GSVAVCDTYNGAIRRYDPASDEVS 447
>gi|147818279|emb|CAN64724.1| hypothetical protein VITISV_026725 [Vitis vinifera]
Length = 423
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 3/64 (4%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAG 140
++ G +G DG A F++P+ A + K N+ YVAD NH +R+I N V T+AG
Sbjct: 54 QIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAG 113
Query: 141 GGSK 144
G+K
Sbjct: 114 NGTK 117
Score = 42.0 bits (97), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 5/104 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
G DG P + G IYVAD NH I+K+ V+T+AG G K G
Sbjct: 160 GDHDGVGSEVLLQHPLGVSCGKDGQIYVADSYNHKIKKLDPATGRVSTLAGTG--KAGFK 217
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
DG A A S + V + L I+D N +IR ++LK ++
Sbjct: 218 DGRALAAQLSEPSGIVEVEN-GVLFIADTNNSVIRYLDLKKKEA 260
>gi|183222153|ref|YP_001840149.1| hypothetical protein LEPBI_I2796 [Leptospira biflexa serovar Patoc
strain 'Patoc 1 (Paris)']
gi|189912214|ref|YP_001963769.1| pentraxin-like protein [Leptospira biflexa serovar Patoc strain
'Patoc 1 (Ames)']
gi|167776890|gb|ABZ95191.1| Pentraxin-related protein [Leptospira biflexa serovar Patoc strain
'Patoc 1 (Ames)']
gi|167780575|gb|ABZ98873.1| Hypothetical protein LEPBI_I2796 [Leptospira biflexa serovar Patoc
strain 'Patoc 1 (Paris)']
Length = 1237
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 58/129 (44%), Gaps = 10/129 (7%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNL 133
L+ + V +AGDGVQ +DG SA+F+ P D N+++++ H IR I+
Sbjct: 476 LTGTNAVDTIAGDGVQASTDGFGTSAQFNDPHGITTDGT-NLFISEWLGHRIRHLDISTN 534
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
VTT+ G G AD N F F + ISD N IR L+P +
Sbjct: 535 KVTTLVG---SSPGYADNATGNGLF--QFPGFITSDGQKVYISDQANHSIRF--LEPAEI 587
Query: 194 SKSSQSGSA 202
S GSA
Sbjct: 588 LHYSFDGSA 596
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 49/106 (46%), Gaps = 10/106 (9%)
Query: 36 TVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGY 93
T + G+ + N +S +LI + R+A TL SFP + S + G V
Sbjct: 206 TCVTGYIVSGNLYSTTGANLGPNLI---NRRTAVKTLAGSFPTNTPSGAGAVFGGAVP-- 260
Query: 94 SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTI 138
+ARF+ P A D IYV D +N VIRKI L G TTI
Sbjct: 261 -SATASAARFNSPAMLATDSN-FIYVVDSTNSVIRKIDKLTGTTTI 304
>gi|428317077|ref|YP_007114959.1| NHL repeat containing protein [Oscillatoria nigro-viridis PCC 7112]
gi|428240757|gb|AFZ06543.1| NHL repeat containing protein [Oscillatoria nigro-viridis PCC 7112]
Length = 546
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 47/100 (47%), Gaps = 7/100 (7%)
Query: 48 HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK 107
+SVI S L + DS + + E V + G G G +DG+ A F P+
Sbjct: 219 NSVIQNLVGSCLFIADSGHNRIVVSTL----EGEVLHVIGSGKPGLTDGDFEEAEFFAPQ 274
Query: 108 SFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGSK 144
A D + I YVAD NH +RKI T V T+AG G +
Sbjct: 275 GMAFDAESQILYVADTENHALRKIDFTTQRVETVAGTGEQ 314
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 62/135 (45%), Gaps = 8/135 (5%)
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKG 115
S+L V DS S+ + + E+ V+ + G G + G+ D + + + V+
Sbjct: 395 SELFVADSEISSIRAVG--IDEDPKVRTVCGSGELFGFGDKDGRADQVRLQHCLGVEYAQ 452
Query: 116 N-IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICAL 173
N ++VAD NH I++I + G T G G D A F L+ + PH L
Sbjct: 453 NYLWVADTYNHKIKRIDHRGGTCRTMIGDGTAGLVDAKGLKARFFEPSGLSAIGPH---L 509
Query: 174 LISDHGNQLIRQINL 188
++D N +IR++ L
Sbjct: 510 FVADTNNHVIRRVEL 524
>gi|381163732|ref|ZP_09872962.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
azurea NA-128]
gi|379255637|gb|EHY89563.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
azurea NA-128]
Length = 633
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 47/102 (46%), Gaps = 9/102 (8%)
Query: 97 EPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPA 153
E G+AR D P V +G + VAD +NH I + + T + G G DG
Sbjct: 182 EDGAARIDGLRFPSKAVVTAEGRVLVADTANHSIAEFASDAETLLRRFGDGHRGAVDGAF 241
Query: 154 QNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLK 189
A F+ LT +P A L+++D N L+R ++L+
Sbjct: 242 DVARFAEPSGLTLLPTHVADQVGYHLIVADTANHLLRGVDLR 283
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 60/121 (49%), Gaps = 11/121 (9%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV------DMKG-NIYVADKSNH 125
F E++++R GDG +G DG ARF +P + D G ++ VAD +NH
Sbjct: 217 EFASDAETLLRRF-GDGHRGAVDGAFDVARFAEPSGLTLLPTHVADQVGYHLIVADTANH 275
Query: 126 VIRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
++R + V T+AG G++ ++G GPA ++ ++ + ++++ GN
Sbjct: 276 LLRGVDLRTGSVRTVAGTGAQWRDGDDSGPATEVDLTSPWDARWWDAAGGVVVAMAGNHT 335
Query: 183 I 183
+
Sbjct: 336 L 336
>gi|315500945|ref|YP_004079832.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ mal
allergen [Micromonospora sp. L5]
gi|315407564|gb|ADU05681.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Micromonospora sp. L5]
Length = 610
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 67/126 (53%), Gaps = 18/126 (14%)
Query: 46 NPHSVIDRPGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
P +V+ PG + +L+V DS+R + L+ E+VV+R+ G G +G +DG
Sbjct: 166 EPDTVLRFPGKALPLDGGNLLVSDSARHSLVELA--ADGETVVRRI-GTGSRGRTDGPAS 222
Query: 100 SARFDKPKSFAV------DMKG-NIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRAD 150
+A F +P+ + ++ G ++ VAD NH++R ++++ V T+AG G + D
Sbjct: 223 AATFAEPQGLCLLPPHVAEVAGYDVVVADTVNHLLRGVRLSSGEVVTVAGTGRQWRSTVD 282
Query: 151 GPAQNA 156
A +A
Sbjct: 283 DHAHDA 288
Score = 42.7 bits (99), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 6/80 (7%)
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICA- 172
GN+ V+D + H + ++ G T + G+ GR DGPA A+F+ L + PH+
Sbjct: 183 GNLLVSDSARHSLVELAADGETVVRRIGTGSRGRTDGPASAATFAEPQGLCLLPPHVAEV 242
Query: 173 ----LLISDHGNQLIRQINL 188
++++D N L+R + L
Sbjct: 243 AGYDVVVADTVNHLLRGVRL 262
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 2/103 (1%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSK 144
AG V+ DG +P +V G+ ++VAD IR +++ + T G G
Sbjct: 330 AGTTVEALRDGPLPDVWMAQPSGLSVSADGSRLWVADSETSAIRWVSDGEMHTAVGQGLF 389
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G DGPA A + + +P ++LI+D N +R+ +
Sbjct: 390 DFGHVDGPADQALLQHPLGVCALPD-GSVLIADTYNGAVRRFD 431
>gi|285018581|ref|YP_003376292.1| nhl repeat protein [Xanthomonas albilineans GPE PC73]
gi|283473799|emb|CBA16302.1| hypothetical nhl repeat protein [Xanthomonas albilineans GPE PC73]
Length = 473
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 35/65 (53%), Gaps = 3/65 (4%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVTTIAGG 141
R G G + DGE G A F +P+ A+ +G +YVAD NH +R+I L V T+ G
Sbjct: 201 RQFGTGTADFIDGELGQAAFHRPQGLALQ-RGVLYVADTGNHALRRIHLLTGQVDTLCGN 259
Query: 142 GSKKE 146
G E
Sbjct: 260 GRAGE 264
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 59/139 (42%), Gaps = 11/139 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIA 139
++R AG G DG A F +P S AV + +YV D +R + G V T+
Sbjct: 311 LRRCAGAGTLELRDGSAHLAAFAQPCSLAV-VNQVMYVCDALGSSVRSMQLRGDLVQTLL 369
Query: 140 GG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS- 197
GG G G DGP A + P L I+D GN +R + L + S +
Sbjct: 370 GGQGVWDFGHEDGPRSRARLQFPQAIALSPDALHLWIADSGNGSLRSLRLGGGELSTVAL 429
Query: 198 ------QSGSALGAVSVWV 210
+G A+ A +VW+
Sbjct: 430 PRRLHGPAGLAVSAGAVWI 448
>gi|255542920|ref|XP_002512523.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
gi|223548484|gb|EEF49975.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
Length = 1016
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVT 136
+ + ++ G +G DG A F++P+ A + K N+ YVAD NH +R+I N V
Sbjct: 561 NFIVQIGSTGEEGLRDGPFDEATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNEMVR 620
Query: 137 TIAGGGSK 144
T+AG G+K
Sbjct: 621 TLAGNGTK 628
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 42/81 (51%), Gaps = 5/81 (6%)
Query: 115 GNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
G IYVAD NH I+K+ V+TIAG G K G DG A A S + +
Sbjct: 776 GQIYVADSYNHKIKKLDPATKRVSTIAGTG--KAGFKDGKALAAQLSEPSGIIEAEN-GR 832
Query: 173 LLISDHGNQLIRQINLKPEDC 193
L+I+D N +IR ++L E+
Sbjct: 833 LIIADTNNSIIRYLDLNKEEA 853
>gi|423214448|ref|ZP_17200976.1| hypothetical protein HMPREF1074_02508 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692863|gb|EIY86099.1| hypothetical protein HMPREF1074_02508 [Bacteroides xylanisolvens
CL03T12C04]
Length = 464
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 53/95 (55%), Gaps = 2/95 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G+++G+ A+ ++P +D +GNIYVA + H I KIT G+ T G+ G DG
Sbjct: 369 GFTEGQGNEAKLEEPCQMDLDEEGNIYVAVRRKHRIAKITPDGMLTNY-TGTGTSGTTDG 427
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
P A F++ + F P AL ISD+ N IR+I
Sbjct: 428 PLDKAQFNHPEGVQFGPD-GALYISDYWNHKIRKI 461
>gi|408792280|ref|ZP_11203890.1| concanavalin A-like lectin/glucanases family protein [Leptospira
meyeri serovar Hardjo str. Went 5]
gi|408463690|gb|EKJ87415.1| concanavalin A-like lectin/glucanases family protein [Leptospira
meyeri serovar Hardjo str. Went 5]
Length = 1234
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 62/129 (48%), Gaps = 10/129 (7%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNL 133
L+ + V +AGDGVQ +DG +A+F+ P D N+++++ S H IR +T
Sbjct: 475 LTGTNAVDTIAGDGVQASTDGIGTAAQFNDPHGLTTDGT-NLFISEWSGHKIRHLNLTTN 533
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
VTT+ G G +D N +F + + I+D GN +R I +P +
Sbjct: 534 KVTTLVG---SVSGYSDNTGGNGLL--NFPGFLLSDGLKVYIADTGNHSLRSI--EPSEV 586
Query: 194 SKSSQSGSA 202
+ + G+A
Sbjct: 587 LRYTFDGNA 595
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 53/117 (45%), Gaps = 16/117 (13%)
Query: 97 EPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRA----- 149
P +ARF+ PK D GN IYVAD N VIRKI + G TTI GG+ G
Sbjct: 262 SPTAARFNGPKMITTD--GNFIYVADTMNAVIRKIDKSSGTTTILAGGNTGGGTVCPGTV 319
Query: 150 -----DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
DG A F+ LT + L + + IR++NL S + SGS
Sbjct: 320 TTNCQDGVGTAAQFNAITGLTTDGN--NLFVLEASGNRIRKVNLATGVVSTFAGSGS 374
>gi|418461883|ref|ZP_13032943.1| Peroxiredoxin [Saccharomonospora azurea SZMC 14600]
gi|359738010|gb|EHK86922.1| Peroxiredoxin [Saccharomonospora azurea SZMC 14600]
Length = 633
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 47/102 (46%), Gaps = 9/102 (8%)
Query: 97 EPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPA 153
E G+AR D P V +G + VAD +NH I + + T + G G DG
Sbjct: 182 EDGAARIDGLRFPSKAVVTAEGRVLVADTANHSIAEFASDAETLLRRFGDGHRGAVDGAF 241
Query: 154 QNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLK 189
A F+ LT +P A L+++D N L+R ++L+
Sbjct: 242 DVARFAEPSGLTLLPTHVADQVGYHLIVADTANHLLRGVDLR 283
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 60/121 (49%), Gaps = 11/121 (9%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV------DMKG-NIYVADKSNH 125
F E++++R GDG +G DG ARF +P + D G ++ VAD +NH
Sbjct: 217 EFASDAETLLRRF-GDGHRGAVDGAFDVARFAEPSGLTLLPTHVADQVGYHLIVADTANH 275
Query: 126 VIRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
++R + V T+AG G++ ++G GPA ++ ++ + ++++ GN
Sbjct: 276 LLRGVDLRTGSVRTVAGTGAQWRDGDDSGPATEVDLTSPWDARWWDAAGGVVVAMAGNHT 335
Query: 183 I 183
+
Sbjct: 336 L 336
>gi|395218326|ref|ZP_10401983.1| alkyl hydroperoxide reductase [Pontibacter sp. BAB1700]
gi|394454578|gb|EJF09204.1| alkyl hydroperoxide reductase [Pontibacter sp. BAB1700]
Length = 474
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 63/127 (49%), Gaps = 4/127 (3%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNL 133
LS++ V + G G +G+++G A F++P A+ +YVAD NH IRK+ +
Sbjct: 197 LSQQGQVLEVIGSGERGFNNGGYADATFNEPHGLAL-HGSTLYVADAKNHAIRKVDLQHR 255
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
V+T AG G + D + N + + ++ I++ GN I +++L+ E
Sbjct: 256 MVSTAAGTGELEYYFHDDKRREPVNPNS-PWDLLIYGRSMFIANAGNHQILRMDLETEQV 314
Query: 194 SKSSQSG 200
+ + SG
Sbjct: 315 YRFAGSG 321
>gi|334116916|ref|ZP_08491008.1| NHL repeat containing protein [Microcoleus vaginatus FGP-2]
gi|333461736|gb|EGK90341.1| NHL repeat containing protein [Microcoleus vaginatus FGP-2]
Length = 546
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 47/100 (47%), Gaps = 7/100 (7%)
Query: 48 HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK 107
+SVI S L + DS + + E V + G G G +DG+ A F P+
Sbjct: 219 NSVIQNLVGSCLFIADSGHNRIVVSTL----EGEVLHVIGSGQPGLTDGDFEEAEFFAPQ 274
Query: 108 SFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGSK 144
A D + I YVAD NH +RKI T V T+AG G +
Sbjct: 275 GMAFDAESQILYVADTENHALRKIDFTTQRVETVAGTGEQ 314
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 60/135 (44%), Gaps = 8/135 (5%)
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKG 115
S+L V DS S+ + + E+ V+ + G G + G+ D + V+
Sbjct: 395 SELFVADSEISSIRAVG--IDEDPKVRTVCGSGELFGFGDKDGRGDEVRLQHCLGVEYAQ 452
Query: 116 N-IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICAL 173
N ++VAD NH I++I + G T G G D A F L+ + PH L
Sbjct: 453 NYLWVADTYNHKIKRIDHRGGTCRTMIGEGTAGLVDAKGLKARFFEPSGLSAIGPH---L 509
Query: 174 LISDHGNQLIRQINL 188
++D N +IR++ L
Sbjct: 510 FVADTNNHVIRRVEL 524
>gi|256556932|gb|ACU83555.1| thiol-disulfide isomerase/thioredoxin [uncultured bacterium
HF130_AEPn_1]
Length = 497
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 7/113 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR--KITNLGVTTI 138
VK + G G +G DG+ ARF +P A D K + +Y+AD NH I+ + + V T+
Sbjct: 221 VKMVIGSGKEGNKDGDIKVARFRRPHGLAFDEKNDLLYIADTDNHSIKSLDLKSKKVLTL 280
Query: 139 AGGGSK--KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
+G G + K + A+ ++ ++L LLI+ G + +++LK
Sbjct: 281 SGNGERGFKRKAENAKAKGHPMASPWDLQLKGE--KLLIAMAGTHQLWEMDLK 331
>gi|302760605|ref|XP_002963725.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
gi|300168993|gb|EFJ35596.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
Length = 1052
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 32/48 (66%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
V+ +AG GV GY DG+ A+F +P A+ G+++VAD +N++IR
Sbjct: 838 TVRTVAGTGVAGYEDGKSVKAQFSEPAGLALGPNGSLFVADTNNNLIR 885
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGS 143
G G +G +DG A F++P+ A D + NI YVAD NH +R++ V T+AG G
Sbjct: 610 GSGSEGLTDGSFSLASFNRPQGVAYDTEKNILYVADTENHALREVDFVKETVRTLAGTGE 669
Query: 144 KK-EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ + R Q+ ++ ++L P L I+ G I + N
Sbjct: 670 RGFDYRGGKAGQSQPLNSPWDLCLHPD-GMLYIAMAGEHQIWEFN 713
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 46/101 (45%), Gaps = 7/101 (6%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRA 149
G DG A F P G IYVAD NH I+ + + V T+AG G G
Sbjct: 794 GDKDGPASQALFQHPLGILSSANGAIYVADSYNHKIKLMDLASKTVRTVAGTGVA--GYE 851
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
DG + A FS L P+ +L ++D N LIR LKP
Sbjct: 852 DGKSVKAQFSEPAGLALGPN-GSLFVADTNNNLIRL--LKP 889
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 58/135 (42%), Gaps = 7/135 (5%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG--YSDGEPGSAR-FDKPKSFAVDMKG 115
L V D+ A + F + V+ LAG G +G Y G+ G ++ + P + G
Sbjct: 641 LYVADTENHALREVDF---VKETVRTLAGTGERGFDYRGGKAGQSQPLNSPWDLCLHPDG 697
Query: 116 NIYVADKSNHVIRKITNL-GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
+Y+A H I + GV + G + N+S++ L+ P + L
Sbjct: 698 MLYIAMAGEHQIWEFNRSNGVAKVFSGDGYERNLNGSKGSNSSYAQPSGLSLSPDMDHLY 757
Query: 175 ISDHGNQLIRQINLK 189
I+D + +R +NLK
Sbjct: 758 IADSESSSVRSVNLK 772
>gi|308799191|ref|XP_003074376.1| NHL repeat (ISS) [Ostreococcus tauri]
gi|116000547|emb|CAL50227.1| NHL repeat (ISS) [Ostreococcus tauri]
Length = 1783
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 57/193 (29%), Positives = 85/193 (44%), Gaps = 32/193 (16%)
Query: 34 VTTVIDGHQLEINP-HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
VTTV+ L P V+D D+ VL S+ +S +S +V G
Sbjct: 285 VTTVV--ASLPFAPIELVVDESNGGDMYVLGQSQHGVMKIS--VSTLAVTTIAGSQTTSG 340
Query: 93 YSDGEPG-SARFDKPKSFAVD-MKGNIYVADKSNHVIRKI--TNLGVTTIAGGGS----- 143
+ D G SARF P+ A+D + +YVAD NH +R I + VTT+ G GS
Sbjct: 341 FVDHNTGTSARFTLPRGLALDSLNSKLYVADTGNHAVRMIDLSTGVVTTVLGDGSPALNA 400
Query: 144 ---KKEGRADGPAQ-----NASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
K+G PA+ +++ D L+ LL+SD G +R++ L
Sbjct: 401 STLNKDGVLSTPARFNDPVGIAYNYDSALS----SGVLLVSDAGTHQLRKLIL------N 450
Query: 196 SSQSGSALGAVSV 208
S +G+A V+V
Sbjct: 451 DSTAGNAATVVTV 463
Score = 42.0 bits (97), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 8/68 (11%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGS--------ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+S V AG+G +G +DG+ + A F P S A+ G++ VAD N +R I
Sbjct: 221 KSEVTNFAGNGTEGSTDGKTSANAELGETQALFKGPTSVALLASGDVLVADTGNKAVRNI 280
Query: 131 TNLGVTTI 138
+N VTT+
Sbjct: 281 SNGDVTTV 288
>gi|158318845|ref|YP_001511353.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158114250|gb|ABW16447.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 770
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 64/155 (41%), Gaps = 10/155 (6%)
Query: 33 TVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
T+TTV L P +V PG LI S + L + +AG G
Sbjct: 567 TITTVPTPAALR-RPRAVTTGPGGVLLIADTDSHRVWR-----LGPGETARVIAGTAEPG 620
Query: 93 YS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRA 149
YS DG P + A +P+S AVD G + +AD IR++ + G + T+AG A
Sbjct: 621 YSGDGGPATRAAIGRPQSLAVDGAGRLLIADPDQRRIRRVDHAGRIGTMAGTAYGGRPAA 680
Query: 150 DG-PAQNASFSNDFELTFVPHICALLISDHGNQLI 183
DG PA L P L G++++
Sbjct: 681 DGAPATGTDVGTPTGLAVGPDGTVYLADPAGDRVL 715
>gi|452910311|ref|ZP_21958992.1| hypothetical protein C884_01866 [Kocuria palustris PEL]
gi|452834558|gb|EME37358.1| hypothetical protein C884_01866 [Kocuria palustris PEL]
Length = 632
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 44/73 (60%), Gaps = 9/73 (12%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-------NIYVADKSNHVIR--KITN 132
V+ + GDG +G +DG P A F++P+ A+ +G ++ VAD NH +R ++++
Sbjct: 207 VRSVIGDGTRGPADGGPEEAHFNEPRGLALLPQGVREQVGYDVIVADTVNHRLRGVRLSD 266
Query: 133 LGVTTIAGGGSKK 145
VTT+AG G ++
Sbjct: 267 GEVTTLAGSGVQR 279
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 59/147 (40%), Gaps = 9/147 (6%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
+P + + L+V S +T E +V AG G +G DG+P + F +
Sbjct: 307 SPWDTVWSTAADTLVVAMSGTHQIFTFDPRTGELAVA---AGTGDEGLVDGDPSQSWFAQ 363
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNL-----GVTTIAGGGSKKEGRADGPAQNASFSN 160
G I+ AD + +R I GV T AG G G DG + A +
Sbjct: 364 TSGLVEAPDGTIWAADSESSALRSIAVADGALSGVATAAGLGLYDFGFRDGDSTQARLQH 423
Query: 161 DFELTFVPHICALLISDHGNQLIRQIN 187
+ +P ++ ++D N IR+ +
Sbjct: 424 PLGVAVLPD-GSIAVADTYNGAIRRWD 449
>gi|290990712|ref|XP_002677980.1| predicted protein [Naegleria gruberi]
gi|284091590|gb|EFC45236.1| predicted protein [Naegleria gruberi]
Length = 1555
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 28/113 (24%), Positives = 54/113 (47%), Gaps = 4/113 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPG--SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTT 137
+++ +AG+G+QGY+ +A+ + P V+ ++ +D N+ +R+IT + G+
Sbjct: 548 IIQTIAGNGIQGYNGDNINALTAQLNSPTYLQVNYYHELFFSDTFNNRLRQITLSTGIIQ 607
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
G K DGPA + + + +D GN IR ++L+P
Sbjct: 608 TIAGSGVKGFNGDGPALSTHLDTPMGFSLNSDW-IIYFADSGNNRIRYVDLQP 659
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 68/149 (45%), Gaps = 21/149 (14%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEES-------------VVKRLAGDGVQGYSDGEPGSA--- 101
D +VL+S + F T S+ +S VV +AG G G+ DG+ G A
Sbjct: 456 DQLVLNSPKGIFATTSYVYISDSGNHRIIKVHRTSGVVIGVAGCGTSGF-DGDGGLATLA 514
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
+ + P +D +YVAD +NH IR IT+ + TIAG G + + A A +
Sbjct: 515 KLNSPTGIYLD-NNVLYVADTNNHRIRAVNITSGIIQTIAGNGIQGYNGDNINALTAQLN 573
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINL 188
+ L V + L SD N +RQI L
Sbjct: 574 SPTYLQ-VNYYHELFFSDTFNNRLRQITL 601
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 38/76 (50%), Gaps = 6/76 (7%)
Query: 81 VVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLG 134
+++ +AG GV+G++ DG S D P F+++ IY AD N+ IR + N
Sbjct: 605 IIQTIAGSGVKGFNGDGPALSTHLDTPMGFSLNSDWIIYFADSGNNRIRYVDLQPLFNDP 664
Query: 135 VTTIAGGGSKKEGRAD 150
VT GS +G D
Sbjct: 665 VTVTTHSGSGAKGYTD 680
Score = 37.4 bits (85), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 41/72 (56%), Gaps = 5/72 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSD-GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++++ +++ +AG+G G+S +P ++ + P S V KG ++D N+ IRKI+ +
Sbjct: 886 INKKGLLETVAGNGFTGFSQTSDPKTSSLNSPSS-VVSYKGQTIISDSKNNFIRKISTIA 944
Query: 135 ---VTTIAGGGS 143
IAG GS
Sbjct: 945 SHPFDVIAGDGS 956
>gi|380511998|ref|ZP_09855405.1| nhl repeat protein [Xanthomonas sacchari NCPPB 4393]
Length = 473
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVTTIA 139
V R G G + DG G A F +P+ A++ +G +YVAD NH +R+I L V T+
Sbjct: 199 VLRQFGIGTADFIDGAIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLLTGQVDTLC 257
Query: 140 GGGSKK---EGRADGP 152
G G EGR + P
Sbjct: 258 GNGRAGEPVEGRVEQP 273
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 61/148 (41%), Gaps = 12/148 (8%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S+ L S+ R AG G DG A F +P S A ++ +YV D +R +
Sbjct: 303 SYDLGSRSLRWR-AGAGALELRDGSGHLAAFAQPCSLAA-VQQVLYVCDALGSAVRSMQL 360
Query: 133 LG--VTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G V T+ GG G G DGP A + P L I+D GN +R + L
Sbjct: 361 RGDLVQTLLGGQGPWDFGNEDGPRSRARLQFPQAIALSPDAPLLWIADSGNGSLRSLRLG 420
Query: 190 PEDCSKS-------SQSGSALGAVSVWV 210
+ S + G A+ A +VW+
Sbjct: 421 GGELSTAVLPRRLHGPGGLAVSAGTVWI 448
>gi|152968196|ref|YP_001363980.1| NHL repeat-containing protein [Kineococcus radiotolerans SRS30216]
gi|151362713|gb|ABS05716.1| putative NHL repeat protein [Kineococcus radiotolerans SRS30216]
Length = 584
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 55/111 (49%), Gaps = 12/111 (10%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAV---DMKG----NIYVADKSNHVIRKITNL 133
V R G G +G +DG S F P+ AV D+ ++ VAD NH++R + +
Sbjct: 187 VGDRPIGSGARGRTDGPAPS--FAAPQGVAVLPPDLAAALGYDLVVADTGNHLLRGVRSA 244
Query: 134 G--VTTIAGGGSKKEGRA-DGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
TTIAG G+ GR+ GPA + S+ +++ + P LL++ G
Sbjct: 245 DGRTTTIAGTGAPARGRSGGGPALATALSSPWDVAWWPAFGELLVAVAGTH 295
>gi|126179662|ref|YP_001047627.1| NHL repeat-containing protein [Methanoculleus marisnigri JR1]
gi|125862456|gb|ABN57645.1| NHL repeat containing protein [Methanoculleus marisnigri JR1]
Length = 487
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
E V AGDG +GY DG G AR ++P VD G Y+AD NH++R
Sbjct: 422 ETGWVLTRAGDGDRGYRDGLSGEARLNEPGGL-VDFGGLWYIADTGNHIVR 471
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 18/107 (16%)
Query: 105 KPKSFAVDMKG-NIYVADKSNHVIRKITNLGVTTIAGG--------GSKKEGRADGPAQN 155
+P A D G ++++D +H I +AGG GS G ADGP
Sbjct: 173 RPGKVAADNAGMRLFISDTGHHRI---------VVAGGDGKILETIGSGAAGNADGPFDE 223
Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
A+F L F L ++D GN IR+++ + ++ +G A
Sbjct: 224 AAFYLPEGLAFDEEAGILYVADTGNHTIRRVSFTERRVATNAGTGLA 270
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 53/121 (43%), Gaps = 12/121 (9%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGG- 142
G G G +DG A F P+ A D + I YVAD NH IR++ T V T AG G
Sbjct: 210 GSGAAGNADGPFDEAAFYLPEGLAFDEEAGILYVADTGNHTIRRVSFTERRVATNAGTGL 269
Query: 143 ---SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQS 199
S EG GP + + +L + L I+ G I +++L + + S
Sbjct: 270 AAPSFGEG---GPGTGTALNAPRDLALLG--GHLYIAMAGANQIWRMDLATYEVEPYAGS 324
Query: 200 G 200
G
Sbjct: 325 G 325
>gi|195119117|ref|XP_002004078.1| GI18254 [Drosophila mojavensis]
gi|193914653|gb|EDW13520.1| GI18254 [Drosophila mojavensis]
Length = 734
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 38/73 (52%), Gaps = 2/73 (2%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTI 138
VVK G G+ DG +RF+ P+ A K + VAD NH +R+I+ N V T+
Sbjct: 255 VVKYKIGGLEAGFVDGSLNMSRFNNPQGVAFLDKNTLIVADTDNHALRQISLNNGVVETL 314
Query: 139 AGGGSKKEGRADG 151
AG GS+ R G
Sbjct: 315 AGTGSQGNERTGG 327
>gi|167041535|gb|ABZ06284.1| putative NHL repeat protein [uncultured marine microorganism
HF4000_008B14]
Length = 426
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 67/145 (46%), Gaps = 10/145 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ S+ L + DS+ + + + ++ G G G DG +A FD P
Sbjct: 111 PGKVVADADSNRLFIADSNHNRIIVTTL----DGDALKVVGSGEPGMVDGSFAAASFDHP 166
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGR-ADGPAQNASFSNDFE 163
+ + + +YVAD NH IRK+ V TIAG G++ R GP + ++ ++
Sbjct: 167 QGMVL-VGDLLYVADAENHAIRKVDLAAEQVETIAGTGTQGSMREGTGPGLHTELNSPWD 225
Query: 164 LTFVPHICALLISDHGNQLIRQINL 188
L F H +L I+ G + ++L
Sbjct: 226 LAF--HEGSLYIAMAGLHQLWSMDL 248
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 58/133 (43%), Gaps = 7/133 (5%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGN 116
L DS SA S L +V+ + G G+ + D G + R P A G
Sbjct: 287 LYFADSETSA--VRSADLDPHGMVRTIIGLGLFEFGDVDGADHNIRLQHPIGVA-HHDGT 343
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
IY+AD NH I+K+ + ++ G+ + G DGP A FS L+ + I+
Sbjct: 344 IYLADTYNHKIKKVLPVMQSSFTMLGNGQPGDRDGPGNQAQFSEPSGLSI--DGGKIYIA 401
Query: 177 DHGNQLIRQINLK 189
D N IR +L+
Sbjct: 402 DTNNHAIRVADLE 414
>gi|423072738|ref|ZP_17061487.1| fibronectin type III domain protein [Desulfitobacterium hafniense
DP7]
gi|361856502|gb|EHL08400.1| fibronectin type III domain protein [Desulfitobacterium hafniense
DP7]
Length = 1926
Score = 45.4 bits (106), Expect = 0.024, Method: Composition-based stats.
Identities = 26/77 (33%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
Query: 92 GYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
GYS DG P +A + P + A D GN+Y++D N IRK+ G+ T G
Sbjct: 285 GYSGDGGPATAAQLKTPNALAFDGAGNMYISDSGNRRIRKVDPQGIITTIAGKETNGDPT 344
Query: 150 DGPAQNASFSNDFELTF 166
GPA+++ L F
Sbjct: 345 PGPAEDSKIPGTHWLGF 361
>gi|219667746|ref|YP_002458181.1| hypothetical protein Dhaf_1699 [Desulfitobacterium hafniense DCB-2]
gi|219538006|gb|ACL19745.1| protein of unknown function DUF1533 [Desulfitobacterium hafniense
DCB-2]
Length = 1916
Score = 45.4 bits (106), Expect = 0.024, Method: Composition-based stats.
Identities = 26/77 (33%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
Query: 92 GYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
GYS DG P +A + P + A D GN+Y++D N IRK+ G+ T G
Sbjct: 275 GYSGDGGPATAAQLKTPNALAFDGAGNMYISDSGNRRIRKVDPQGIITTIAGKETNGDPT 334
Query: 150 DGPAQNASFSNDFELTF 166
GPA+++ L F
Sbjct: 335 PGPAEDSKIPGTHWLGF 351
>gi|145592666|ref|YP_001156963.1| alkyl hydroperoxide reductase [Salinispora tropica CNB-440]
gi|145302003|gb|ABP52585.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Salinispora tropica CNB-440]
Length = 612
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 49/103 (47%), Gaps = 2/103 (1%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSK 144
AG V+ DG A +P +V G+ ++VAD +R + N +TT G G
Sbjct: 332 AGSTVEARKDGPLAEAWLAQPSGLSVSADGSRLWVADSETSAVRYVENGILTTAVGQGLF 391
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ G DGPA A + + +P ++LI+D N +R+ +
Sbjct: 392 EFGHVDGPAAQALLQHPLGVCALPD-GSVLIADTYNGAVRRYD 433
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 69/139 (49%), Gaps = 21/139 (15%)
Query: 46 NPHSVIDRPGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
P + + PG S +L+V DS+R + L+ E++++R+ G G +G +DG
Sbjct: 168 EPETTLRFPGKSVVLDNGNLLVSDSARHSLAELAP--DGETLLRRI-GTGARGRADGTAT 224
Query: 100 SARFDKPKSFAV------DMKG-NIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRAD 150
+A F +P+ + + G ++ VAD NH++R ++ V T+AG G + D
Sbjct: 225 AATFAEPQGLCLLPAQVSRLVGYDLVVADTVNHLLRGVRLATGEVVTVAGTGRQWRSTVD 284
Query: 151 GPAQNA---SFSNDFELTF 166
A +A S+ ++L +
Sbjct: 285 DHAHDALSVDLSSPWDLAW 303
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 45/88 (51%), Gaps = 7/88 (7%)
Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
KS +D GN+ V+D + H + ++ G T + G+ GRADG A A+F+ L
Sbjct: 178 KSVVLD-NGNLLVSDSARHSLAELAPDGETLLRRIGTGARGRADGTATAATFAEPQGLCL 236
Query: 167 VPHICA------LLISDHGNQLIRQINL 188
+P + L+++D N L+R + L
Sbjct: 237 LPAQVSRLVGYDLVVADTVNHLLRGVRL 264
>gi|428184600|gb|EKX53455.1| hypothetical protein GUITHDRAFT_101156 [Guillardia theta CCMP2712]
Length = 367
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
VV +AG GV GY +G + + + P A+ IYV+D+ NH IR++ + T G
Sbjct: 12 VVCSIAGTGVPGYYEGNRMTCKMNCPSGIAMGQGDCIYVSDRQNHCIRRLPGIEEQTAPG 71
Query: 141 ---------GGSKKEGRADGPAQNASFS 159
G +K G DG + A F+
Sbjct: 72 KEDDVSTFAGVPRKSGMEDGHNRRALFN 99
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 43/86 (50%), Gaps = 2/86 (2%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P VD + +YV++ H IR++ G + I G S+K G+ DG N S + ++
Sbjct: 226 PVGICVDEQNCVYVSEWGGHRIRRVQPDGSSVIWVGTSQK-GKLDGKGINGSLFHPAGMS 284
Query: 166 FVPHICALLISDHGNQLIRQINLKPE 191
V + ++D GN +R+IN E
Sbjct: 285 -VDTKGIVYVADFGNHCVRRINTTGE 309
>gi|160942444|ref|ZP_02089750.1| hypothetical protein CLOBOL_07327 [Clostridium bolteae ATCC
BAA-613]
gi|158434643|gb|EDP12410.1| hypothetical protein CLOBOL_07327 [Clostridium bolteae ATCC
BAA-613]
Length = 329
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 40/73 (54%), Gaps = 3/73 (4%)
Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
+G +YVAD N+ I K + V AG G ++G ADGP A FS +T AL
Sbjct: 212 EGALYVADTGNNRIVKAMDGAVVWSAGTG--EDGFADGPVSQAMFSGPQRIT-AAEDGAL 268
Query: 174 LISDHGNQLIRQI 186
+SD GN ++R+I
Sbjct: 269 YVSDTGNSVVRKI 281
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 49/109 (44%), Gaps = 19/109 (17%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
AG G G++DG A F P+ G +YV+D N V+RKI V+T+
Sbjct: 237 AGTGEDGFADGPVSQAMFSGPQRITAAEDGALYVSDTGNSVVRKIWGDNVSTL------- 289
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDH---GNQLIRQINLKPE 191
PA +A+ D +LT +++ DH + +R++ + P
Sbjct: 290 ------PAADAA---DPDLTIASPEGMMVMGDHIYLCDSFLRKVFVLPR 329
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 46/97 (47%), Gaps = 4/97 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
GY DG G +P + G ++D+SNHV+R I+ G T AG G K G D
Sbjct: 99 GYRDGSLGEMLIGEPWDMIPYLDGYA-LSDRSNHVVRYISPEGAMTAAGTG--KAGYEDN 155
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
A A F+ L P L I+D N ++R ++L
Sbjct: 156 LASKALFAAPTGLAAGPD-GILYIADTDNDVVRSLSL 191
>gi|221633398|ref|YP_002522623.1| hypothetical protein trd_1418 [Thermomicrobium roseum DSM 5159]
gi|221156519|gb|ACM05646.1| conserved hypothetical protein [Thermomicrobium roseum DSM 5159]
Length = 443
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 65/148 (43%), Gaps = 9/148 (6%)
Query: 21 LVSSGLLLEDGYTVTTVIDGHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEE 79
L +G+ + + + +V++ Q + P + P L+V D+ +
Sbjct: 94 LARTGVTTSEPFALLSVLERPQGPLAFPGKLAVDPSRDRLVVSDTGHHRLVIARL----D 149
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTT 137
V + GDG G DG ARF +P+ A+ + +VAD+ NH IR+I V T
Sbjct: 150 GTVTAVIGDGRPGLVDGTFAEARFREPQGIAL-VGETCFVADRGNHAIRRIDLAAGTVET 208
Query: 138 IAGGGSKKEGR-ADGPAQNASFSNDFEL 164
+AG G +G + GPA + + L
Sbjct: 209 LAGTGRLGQGMLSAGPAHQVDLRSPWAL 236
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 41/92 (44%), Gaps = 14/92 (15%)
Query: 52 DRPGSSDLIVLDSSRSAFYTLSFPLSEESV---VKRL----------AGDGVQGYSDGEP 98
DR G+ D ++L +T L ++ +KRL G G G+ DG P
Sbjct: 333 DRDGTGDTVLLQHPLDVAWTGEELLVADTYNHKIKRLDPVARRCSSWLGTGQPGHEDGPP 392
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
ARF +P A +YVAD +NH +R I
Sbjct: 393 ERARFWEPSGLATTFD-RVYVADTNNHAVRVI 423
>gi|290971625|ref|XP_002668588.1| predicted protein [Naegleria gruberi]
gi|284082057|gb|EFC35844.1| predicted protein [Naegleria gruberi]
Length = 731
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 46/90 (51%), Gaps = 1/90 (1%)
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNA 156
P + + P G++Y+A + N+ I K +N ++T+AG G GPA+NA
Sbjct: 142 NPNNDGINGPNGIMYTPNGDLYIASQGNNKIMKYSNSQLSTVAGNGYAGYSGDGGPARNA 201
Query: 157 SFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ ++T V + I+D GN+ +R++
Sbjct: 202 KLNYPADVT-VSSTGEVYIADWGNRAVRKV 230
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 36/72 (50%), Gaps = 4/72 (5%)
Query: 79 ESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-- 134
S + +AG+G GYS DG P +A+ + P V G +Y+AD N +RK+ G
Sbjct: 177 NSQLSTVAGNGYAGYSGDGGPARNAKLNYPADVTVSSTGEVYIADWGNRAVRKVFRNGTI 236
Query: 135 VTTIAGGGSKKE 146
+T I G E
Sbjct: 237 ITLIVSGNGLSE 248
>gi|451341293|ref|ZP_21911747.1| NHL repeat containing protein [Amycolatopsis azurea DSM 43854]
gi|449415822|gb|EMD21660.1| NHL repeat containing protein [Amycolatopsis azurea DSM 43854]
Length = 602
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 48/114 (42%), Gaps = 6/114 (5%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP 152
Y E A P +G I VAD NH I + + G T I GS G DGP
Sbjct: 157 YVPAEEQRAELRFPSKAVTTAEGRILVADTGNHSIVEFASDGETIIRRFGSGSRGAQDGP 216
Query: 153 AQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSG 200
ASF+ +T +P+ A +++D +R I+L + S + +G
Sbjct: 217 FDLASFTEPSGITLLPYEVAERAGYHAVVADTAGHRLRGIDLITGEVSTVAGTG 270
>gi|159035770|ref|YP_001535023.1| alkyl hydroperoxide reductase [Salinispora arenicola CNS-205]
gi|157914605|gb|ABV96032.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Salinispora arenicola CNS-205]
Length = 612
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 8/93 (8%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
RF P V GN+ V+D + H I ++ G + G+ GRADGPA A+F+
Sbjct: 174 RF--PGKAVVLGNGNLLVSDSARHSIAELAPDGERVVRRIGTGARGRADGPATAATFAEP 231
Query: 162 FELTFVPHICA------LLISDHGNQLIRQINL 188
L +P A L+++D N L+R + L
Sbjct: 232 QGLCLLPAHVARLVDYDLVVADTVNHLLRGVRL 264
Score = 43.9 bits (102), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 63/124 (50%), Gaps = 15/124 (12%)
Query: 55 GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF----- 109
G+ +L+V DS+R + L+ E VV+R+ G G +G +DG +A F +P+
Sbjct: 183 GNGNLLVSDSARHSIAELAP--DGERVVRRI-GTGARGRADGPATAATFAEPQGLCLLPA 239
Query: 110 --AVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRADGPAQNA---SFSNDF 162
A + ++ VAD NH++R ++ V T+AG G + D A +A S+ +
Sbjct: 240 HVARLVDYDLVVADTVNHLLRGVRLATGEVVTVAGTGRQWRSTVDDHAHDALSVDLSSPW 299
Query: 163 ELTF 166
+L +
Sbjct: 300 DLAW 303
Score = 42.0 bits (97), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 48/103 (46%), Gaps = 2/103 (1%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSK 144
AG V+ DG A +P +V G+ ++VAD IR + + + T G G
Sbjct: 332 AGSTVEALKDGPLAEAWLAQPSGLSVSADGSRLWVADSETSAIRYVQDGVLNTAVGQGLF 391
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ G DGPA A + + +P ++LI+D N +R+ +
Sbjct: 392 EFGHVDGPAAQALLQHPLGVCALPD-GSVLIADTYNGAVRRYD 433
>gi|254383375|ref|ZP_04998727.1| NHL repeat containing protein [Streptomyces sp. Mg1]
gi|194342272|gb|EDX23238.1| NHL repeat containing protein [Streptomyces sp. Mg1]
Length = 607
Score = 45.1 bits (105), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 57/115 (49%), Gaps = 12/115 (10%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S + +V DS+R L+ ESVV+R+ G G +G+ G F +P+ A+ G
Sbjct: 191 SGNFLVSDSTRHQLVELA--ADAESVVRRI-GSGERGF-----GPDAFSEPQGLALLPDG 242
Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
++ VAD NH +R+ V T+AG G + + GPA S+ +++ +
Sbjct: 243 SVVVADTVNHALRRFDPATGSVETVAGTGRQWWQGSPTSGPALEVDLSSPWDVAW 297
>gi|449670570|ref|XP_002158620.2| PREDICTED: NHL repeat-containing protein 2-like [Hydra
magnipapillata]
Length = 629
Score = 45.1 bits (105), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 43/82 (52%), Gaps = 7/82 (8%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTI 138
VVKR+ G G G+ DGE ++F P+ + ++ I+VAD NH IR+I N TI
Sbjct: 244 VVKRI-GTGFPGFKDGEAHVSQFSSPQGLSW-IENVIFVADTENHSIREINVANWITKTI 301
Query: 139 AGGGSK---KEGRADGPAQNAS 157
G G + KEG G Q S
Sbjct: 302 VGNGKQGKDKEGGKLGIEQEIS 323
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 13/102 (12%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRA 149
G DG +A+ P A + G++YVAD NH I+ ++NL TT+AG G G
Sbjct: 449 GDVDGIGLNAKLQHPLDVAA-VNGHLYVADSYNHKIKVVNMSNLSCTTLAGNGVV--GIK 505
Query: 150 DGPAQNASFSNDFELTFVPHIC---ALLISDHGNQLIRQINL 188
+G +A F+ + IC + I+D N LIR +NL
Sbjct: 506 NGDFLSAEFNEPGGI-----ICLDDTIYIADTNNHLIRVLNL 542
Score = 38.5 bits (88), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 29/44 (65%), Gaps = 1/44 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
LAG+GV G +G+ SA F++P + + IY+AD +NH+IR
Sbjct: 496 LAGNGVVGIKNGDFLSAEFNEPGGI-ICLDDTIYIADTNNHLIR 538
>gi|354615998|ref|ZP_09033698.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Saccharomonospora paurometabolica YIM 90007]
gi|353219649|gb|EHB84187.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Saccharomonospora paurometabolica YIM 90007]
Length = 727
Score = 45.1 bits (105), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 44/90 (48%), Gaps = 6/90 (6%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P +G + VAD ++H + + + T + G + G ADG A+F+ LT
Sbjct: 241 PSKAVATAEGRVLVADTAHHSVVEFASDAETVVRRFGDGERGAADGAFDVATFAEPAGLT 300
Query: 166 FVPHICA------LLISDHGNQLIRQINLK 189
+P A LL++D GN L+R I+L+
Sbjct: 301 LLPPEVAERVGYHLLVADTGNHLLRGIDLR 330
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 62/122 (50%), Gaps = 11/122 (9%)
Query: 72 LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSN 124
+ F E+VV+R GDG +G +DG A F +P A + ++ VAD N
Sbjct: 263 VEFASDAETVVRRF-GDGERGAADGAFDVATFAEPAGLTLLPPEVAERVGYHLLVADTGN 321
Query: 125 HVIRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
H++R I V T+AG G++ ++G GPA ++ ++ +++ + ++++ GN
Sbjct: 322 HLLRGIDLRTGSVRTVAGTGAQWRDGADSGPATGSALTSPWDVRWWEPAGGVVVAMAGNH 381
Query: 182 LI 183
+
Sbjct: 382 TL 383
>gi|290978007|ref|XP_002671728.1| predicted protein [Naegleria gruberi]
gi|284085299|gb|EFC38984.1| predicted protein [Naegleria gruberi]
Length = 2520
Score = 45.1 bits (105), Expect = 0.030, Method: Composition-based stats.
Identities = 33/109 (30%), Positives = 53/109 (48%), Gaps = 15/109 (13%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---------TNLGVTT 137
G G GY+DG +A ++ P A ++Y+AD+ NHV+R + NL +
Sbjct: 228 GVGSAGYADGSAFNALYNAPTGVAYGPNKDLYIADRGNHVVRVVKFSYGTTGTQNLATIS 287
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ G +K G +D N ++ D +++ HI I D GN IR+I
Sbjct: 288 LFSGVAKTPG-SDLVKMNNTY--DVDVSSEGHI---FILDTGNLRIRKI 330
Score = 37.0 bits (84), Expect = 7.8, Method: Composition-based stats.
Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 11/75 (14%)
Query: 77 SEESVVKRL----------AGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNH 125
S+ +VKRL AG G+ G + A + P V +G + +AD NH
Sbjct: 55 SKNHLVKRLDLTTLKVTIVAGTGIAGNDTNTTATGAMLNYPGGICVTSEGKLLIADTLNH 114
Query: 126 VIRKITNLGVTTIAG 140
VIR+++N + TI G
Sbjct: 115 VIREVSNGQIKTIIG 129
>gi|290991506|ref|XP_002678376.1| NHL repeat domain-containing protein [Naegleria gruberi]
gi|284091988|gb|EFC45632.1| NHL repeat domain-containing protein [Naegleria gruberi]
Length = 985
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 63/140 (45%), Gaps = 12/140 (8%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV-QGYSDG-EPGSARFDKPKSFAVDM 113
+ D+ ++DS Y + + +V+ G+G+ + Y + G+A F+ P F +
Sbjct: 341 TGDIYIIDSQN---YRIRKITASTNVISTFLGNGLSRNYGENISAGAASFNSPVGFTFSL 397
Query: 114 ---KGNIYVADKSNHVIRKITNL----GVTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
IYVAD NH IR + + T+AG G + PA ++SF+ L F
Sbjct: 398 AYPNNVIYVADTGNHRIRSFSQFYSGRNIVTVAGLPRPGLGGENLPALSSSFNFPSNLVF 457
Query: 167 VPHICALLISDHGNQLIRQI 186
P LLI D N IR +
Sbjct: 458 HPISGDLLIVDTQNHRIRSM 477
Score = 44.3 bits (103), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 7/95 (7%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE---PGSARF 103
P +++ P S DL+++D+ ++S S VK +AG G+ GY +GE +
Sbjct: 452 PSNLVFHPISGDLLIVDTQNHRIRSMS---KSTSFVKTIAGTGIAGY-NGEGMLSNMTKM 507
Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
+ P AV G I AD N+++R I G+ +I
Sbjct: 508 NSPSGIAVLSTGEIIFADTFNNLVRMINLQGIVSI 542
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 53/109 (48%), Gaps = 15/109 (13%)
Query: 89 GVQGYSDGEPG------SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
G+QG S G+ G SA + P+S A + G+++++D N+ I+K+ N + TIAG
Sbjct: 253 GIQGQS-GDSGEGYLATSAFLNGPQSVAFNSNGDMFISDSKNNKIKKVFFNNGTIITIAG 311
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVP---HICALLISDHGNQLIRQI 186
GS G NA F+ TF+ + I D N IR+I
Sbjct: 312 TGS---ANFTGDGSNAKFATLNLPTFISVSNSTGDIYIIDSQNYRIRKI 357
>gi|162457162|ref|YP_001619529.1| hypothetical protein sce8877 [Sorangium cellulosum So ce56]
gi|161167744|emb|CAN99049.1| Hypothetical protein sce8877 [Sorangium cellulosum So ce56]
Length = 127
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 52/106 (49%), Gaps = 3/106 (2%)
Query: 85 LAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
+AG G +G DG P A+ D P+ A+ G + +A++ + +R+ T G + T+AG G
Sbjct: 1 MAGTGNEGDRGDGGPAVAKLDVPRRLALGPDGRLRIAEQGGNRVRRSTPDGTIATVAGTG 60
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+ GPA A+ ++ F + P A SD + QI L
Sbjct: 61 APGYSGDGGPATLATSTSPFSIALGPR-WATPASDPASNQTEQIEL 105
>gi|380016202|ref|XP_003692077.1| PREDICTED: membrane-associated protein Hem-like [Apis florea]
Length = 1439
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 62/136 (45%), Gaps = 12/136 (8%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIA 139
++ + G + DG+ +ARF+ P+ + + +YVAD NH IRKI + VTT+
Sbjct: 256 IQHVIGGSNPDFRDGDFENARFNAPQGVCI-LDTFVYVADNENHAIRKIDLIKKIVTTVV 314
Query: 140 GGGSKKEGRADGP-AQNASFSNDFELTFVPH------ICALLISDHGNQLIRQINLKPED 192
G G + G +N S+ ++L H I LLI+ G I + LK
Sbjct: 315 GTGIQGHDYVGGKIGKNQILSSPWDLAIYEHEYNKNSIPILLIAMAGTHQIWALFLKDTI 374
Query: 193 CSKSSQ--SGSALGAV 206
K + +G+ + V
Sbjct: 375 WWKKREYKAGTCIAIV 390
>gi|269796827|ref|YP_003316282.1| thiol-disulfide isomerase-like thioredoxin [Sanguibacter keddieii
DSM 10542]
gi|269099012|gb|ACZ23448.1| thiol-disulfide isomerase-like thioredoxin [Sanguibacter keddieii
DSM 10542]
Length = 634
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 61/145 (42%), Gaps = 10/145 (6%)
Query: 67 SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP--GSARFDKPKSFAVDMKGNIYVADKSN 124
S L L EE K G Y EP G+ RF P + GN+ VAD +
Sbjct: 144 STLEILVRELVEEHSAKGTLHRGSGPYVPAEPTPGTLRF--PAAVIALESGNLLVADAGH 201
Query: 125 HVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDH 178
H + ++ G T + GS + G +DG Q+A+FS L VP +L++D
Sbjct: 202 HSLAELAADGETLVRRIGSGERGLSDGGRQDATFSEPNGLCLVPATLRDQLGYDVLVADT 261
Query: 179 GNQLIRQINLKPEDCSKSSQSGSAL 203
N +R + L + + +G L
Sbjct: 262 VNHALRGVRLSDGHVTTVAGTGDQL 286
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 70/147 (47%), Gaps = 24/147 (16%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV---- 111
S +L+V D+ + L+ E++V+R+ G G +G SDG A F +P +
Sbjct: 191 SGNLLVADAGHHSLAELAA--DGETLVRRI-GSGERGLSDGGRQDATFSEPNGLCLVPAT 247
Query: 112 --DMKG-NIYVADKSNHVIR--KITNLGVTTIAGGGSK-KEGRADG-----------PAQ 154
D G ++ VAD NH +R ++++ VTT+AG G + G A+ PA
Sbjct: 248 LRDQLGYDVLVADTVNHALRGVRLSDGHVTTVAGTGDQLMVGGAENVVPETTAPDATPAL 307
Query: 155 NASFSNDFELTFVPHICALLISDHGNQ 181
++ +++ + + A L++ GN
Sbjct: 308 RHRLTSPWDVVWSERLGAFLVAMAGNH 334
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/112 (23%), Positives = 54/112 (48%), Gaps = 6/112 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGV 135
VV+++ G +G DG A F +P +V G++++AD +R++ + +
Sbjct: 345 VVEQVGGTQNEGLLDGPLAQAWFAQPSGLSVGPDGSVWLADAETSALRRVDVADDGSATI 404
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
T++ G G G DG A A + + +P +++++D N +R+ +
Sbjct: 405 TSLVGQGLFDFGHRDGAAAQALLQHPLGVAALPD-GSVVVTDTYNGALRRYD 455
>gi|395225280|ref|ZP_10403807.1| NHL repeat protein, partial [Thiovulum sp. ES]
gi|394446527|gb|EJF07348.1| NHL repeat protein, partial [Thiovulum sp. ES]
Length = 4349
Score = 45.1 bits (105), Expect = 0.032, Method: Composition-based stats.
Identities = 26/59 (44%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRA 149
G DG + RF P+ A D +GN++VAD +N I+KIT G VTT A G ++ G A
Sbjct: 795 GDEDGNLSTVRFYNPRGLAFDEEGNLFVADSNNSKIKKITPDGDVTTFASGVNEPYGLA 853
>gi|424792718|ref|ZP_18218917.1| hypothetical protein XTG29_01651 [Xanthomonas translucens pv.
graminis ART-Xtg29]
gi|422796954|gb|EKU25368.1| hypothetical protein XTG29_01651 [Xanthomonas translucens pv.
graminis ART-Xtg29]
Length = 473
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 41/76 (53%), Gaps = 4/76 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIA 139
V R G G + DG G A F +P+ A++ +G +YVAD NH +R+I L V T+
Sbjct: 199 VLRQFGLGTADFIDGGIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLLSGHVDTLC 257
Query: 140 GGGSKKEGRADGPAQN 155
G G E +GP Q+
Sbjct: 258 GNGRACE-PVEGPVQH 272
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 62/148 (41%), Gaps = 12/148 (8%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S+ L S+ R AG G DG A F +P S A ++ +YV D +R +
Sbjct: 303 SYELGNRSLRWR-AGAGALELRDGSGHLAAFAQPCSLAA-VQQALYVCDALGSAVRALQL 360
Query: 133 LG--VTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G V T+ GG G G DGP A + P L I+D GN +R + L
Sbjct: 361 RGDLVQTLLGGQGPWDFGNEDGPRSRARLQFPQAIALSPDAPLLWIADSGNGSLRSLRLG 420
Query: 190 PEDCSKS-------SQSGSALGAVSVWV 210
D S + +G A+ A +VW+
Sbjct: 421 GGDLSTTPLPRRLHGPAGLAVSAGTVWI 448
>gi|219851969|ref|YP_002466401.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546228|gb|ACL16678.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 676
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 41/89 (46%), Gaps = 9/89 (10%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
G +F++P AVD GNIYV D +N I+K G T GGS G DG N
Sbjct: 40 GDGQFNQPSGVAVDSDGNIYVVDTNNFRIQKFNATGGFTTQWGGS---GPGDGQFNNP-- 94
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
E V + + I+D N I++ N
Sbjct: 95 ----EGVAVDNNGNVYIADRDNNRIQKFN 119
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 25/37 (67%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
PG +F+ P+ AVD GN+Y+AD+ N+ I+K + G
Sbjct: 86 PGDGQFNNPEGVAVDNNGNVYIADRDNNRIQKFNSSG 122
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 46/106 (43%), Gaps = 9/106 (8%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
G +F++P A+D GN+YV DK N+ I+K + G + G EG DG S
Sbjct: 134 GDGQFNQPSGVALDSAGNVYVTDKQNNRIQKFNSSGGFLMKWG---SEGSGDGQVHWPS- 189
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
V + ++ + D N I++ N +K G+ G
Sbjct: 190 -----GVAVDNTGSVYVVDSYNHRIQKFNATGGFITKWGSEGTGDG 230
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G +F++P AVD GN+YVAD N+ I+K
Sbjct: 569 GDGQFNRPSGVAVDSAGNVYVADSGNNRIQKF 600
>gi|402571882|ref|YP_006621225.1| hypothetical protein Desmer_1347 [Desulfosporosinus meridiei DSM
13257]
gi|402253079|gb|AFQ43354.1| conserved repeat protein [Desulfosporosinus meridiei DSM 13257]
Length = 2025
Score = 45.1 bits (105), Expect = 0.034, Method: Composition-based stats.
Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 11/107 (10%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNAS 157
G FD P++ AVD NIY+ D + H I+K++++G + G + G+ + P A
Sbjct: 688 GQWFFDVPRAAAVDSSSNIYIVDSNRHQIKKLSSMGEILATWGSYGNENGQMNVPGHIAL 747
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
S D + ++D GN I++ + E K SGS G
Sbjct: 748 DSED----------NVYVADTGNNRIQKFSSTGEYLMKFGSSGSGEG 784
Score = 43.1 bits (100), Expect = 0.13, Method: Composition-based stats.
Identities = 28/73 (38%), Positives = 37/73 (50%), Gaps = 6/73 (8%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEG 147
G G GE F++P+ AVD +GNIYVAD +NH I+K G T G + G
Sbjct: 966 GSNGSDQGE-----FNRPQGIAVDSEGNIYVADSNNHRIQKFNAAGAFMTTWGSFGTELG 1020
Query: 148 RADGPAQNASFSN 160
+ + P A SN
Sbjct: 1021 QFNSPKGIAVDSN 1033
Score = 38.9 bits (89), Expect = 2.3, Method: Composition-based stats.
Identities = 19/48 (39%), Positives = 26/48 (54%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
G+ + + G +F PKS VD GNIYVAD +N I+K + G
Sbjct: 770 GEYLMKFGSSGSGEGQFRNPKSVTVDSAGNIYVADTTNKRIQKFDSDG 817
Score = 38.5 bits (88), Expect = 3.0, Method: Composition-based stats.
Identities = 26/61 (42%), Positives = 35/61 (57%), Gaps = 5/61 (8%)
Query: 96 GEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK--KEGRADG 151
G PG+ ++F P AVD GN+YVAD S + IRK T G T +A G++ G+ D
Sbjct: 1156 GSPGAGDSQFLFPHGIAVDSAGNVYVADTSANWIRKYTPEG-TLLAKWGTRGNSAGQFDN 1214
Query: 152 P 152
P
Sbjct: 1215 P 1215
Score = 38.1 bits (87), Expect = 4.2, Method: Composition-based stats.
Identities = 25/60 (41%), Positives = 33/60 (55%), Gaps = 6/60 (10%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGS 143
LA G +G S G+ FD P A D GN+YVAD +N+ I+K+ VT + GGS
Sbjct: 1199 LAKWGTRGNSAGQ-----FDNPYGIAADSAGNVYVADTNNNRIQKLAPAYTVTFDSNGGS 1253
Score = 38.1 bits (87), Expect = 4.4, Method: Composition-based stats.
Identities = 25/79 (31%), Positives = 38/79 (48%), Gaps = 3/79 (3%)
Query: 77 SEESVVKRLA--GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
SE + +++ A GD ++ + G F P A D +GNIYV+D +N+ I K + G
Sbjct: 304 SENNRIQKFASSGDFLKKWGRGGSADGEFSLPFGIARDSEGNIYVSDTTNNRIEKFDSTG 363
Query: 135 V-TTIAGGGSKKEGRADGP 152
T G K + D P
Sbjct: 364 AFVTTWGSFGKDNHQLDMP 382
Score = 37.0 bits (84), Expect = 7.9, Method: Composition-based stats.
Identities = 15/34 (44%), Positives = 22/34 (64%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+F+ PK AVD GN+YVAD N ++K ++G
Sbjct: 1020 GQFNSPKGIAVDSNGNVYVADVENDRVQKFDSMG 1053
>gi|218190788|gb|EEC73215.1| hypothetical protein OsI_07299 [Oryza sativa Indica Group]
Length = 1103
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 55/110 (50%), Gaps = 9/110 (8%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAGGGS 143
G +G DG +A F++P+ A + K NI YVAD NH +R+I + V T+AG G+
Sbjct: 622 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINFVSETVKTLAGNGT 681
Query: 144 K-KEGRADGPAQNAS-----FSNDFELTFVPHICALLISDHGNQLIRQIN 187
K + R G N + ++ +++ + P L I+ G I + N
Sbjct: 682 KGSDYRGGGQGTNQACFFMVLNSPWDVCYDPSKETLYIAMAGQHQIWKHN 731
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 71/184 (38%), Gaps = 35/184 (19%)
Query: 30 DGYTVTTVIDGHQLEIN-----------PHSVIDRPGSSDLIVLDSSRSAFYTLSF---- 74
DG T DG++ +N P + P +LIV DS S+ ++
Sbjct: 734 DGVTEVLSGDGYERNLNGSSSTRTSFAQPSGISLAPELQELIVADSESSSIRVVNLKSGG 793
Query: 75 --------PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
P+ E++ + G DG P IYVAD NH
Sbjct: 794 SRSLAGGDPMIPENLFRF-------GDHDGTGSDVLLQHPLGVVYASDNQIYVADSYNHK 846
Query: 127 IRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
I+++ VTTIAG G + G DGPA +A S L V LL++D N IR
Sbjct: 847 IKRLDPVTRKVTTIAGTG--RAGYKDGPALSAQLSEPAGLVEVGD-GRLLVADTNNSTIR 903
Query: 185 QINL 188
I L
Sbjct: 904 YIVL 907
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 63/141 (44%), Gaps = 13/141 (9%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG--YSDGEPGSAR------FDKPKSFA 110
L V D+ A ++F +SE VK LAG+G +G Y G G+ + + P
Sbjct: 653 LYVADTENHALREINF-VSE--TVKTLAGNGTKGSDYRGGGQGTNQACFFMVLNSPWDVC 709
Query: 111 VD-MKGNIYVADKSNHVIRKITNL-GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
D K +Y+A H I K L GVT + G + + SF+ ++ P
Sbjct: 710 YDPSKETLYIAMAGQHQIWKHNTLDGVTEVLSGDGYERNLNGSSSTRTSFAQPSGISLAP 769
Query: 169 HICALLISDHGNQLIRQINLK 189
+ L+++D + IR +NLK
Sbjct: 770 ELQELIVADSESSSIRVVNLK 790
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 42/96 (43%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P A+D+ N SNH +TNL I GS +EG DG ASF+ L
Sbjct: 586 PGKLAIDVLNNRLFISDSNHNRIVVTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQGLA 645
Query: 166 FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
+ L ++D N +R+IN E + +G+
Sbjct: 646 YNSKKNILYVADTENHALREINFVSETVKTLAGNGT 681
>gi|149176433|ref|ZP_01855047.1| serine/threonine protein kinase [Planctomyces maris DSM 8797]
gi|148844785|gb|EDL59134.1| serine/threonine protein kinase [Planctomyces maris DSM 8797]
Length = 368
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 4/68 (5%)
Query: 82 VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTT 137
+ +AG G +G+S DG P SA F +P S A+D N+Y+ D NH IR++ T V+T
Sbjct: 138 ISTVAGTGEKGFSGDGGPATSATFSRPHSIALDNDNNLYICDIGNHRIRRVDLTTGTVST 197
Query: 138 IAGGGSKK 145
+G G +K
Sbjct: 198 FSGTGERK 205
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 59/124 (47%), Gaps = 5/124 (4%)
Query: 81 VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVT 136
VK LAG G G++ DG P A+ +P + G +YV + +HVIR+I ++
Sbjct: 23 TVKTLAGTGKVGHTGDGGPAVKAQVGEPYGLTLGPDGALYVCEIKSHVIRRIDEKTGKIS 82
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKS 196
T+AG G K G A A + +E+ F + + N ++R+++ K S
Sbjct: 83 TVAGSGKKGYSGDSGLALAAELNEPYEVRF-DKAGNMYFVEMINNIVRRVDAKTGKISTV 141
Query: 197 SQSG 200
+ +G
Sbjct: 142 AGTG 145
Score = 38.9 bits (89), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 1/63 (1%)
Query: 70 YTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+T+ ++ ++ GDG +G DG+P R +P V GN+Y+ D + +R
Sbjct: 301 HTIRVIRKKDGTIETAVGDGKKGDGPDGDPAHCRMARPHGVYVGPNGNVYIGDSETYRVR 360
Query: 129 KIT 131
K+T
Sbjct: 361 KLT 363
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 56/130 (43%), Gaps = 20/130 (15%)
Query: 39 DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL----------AGD 88
DG ++E P ++ P + D S + + Y L E + V R+ AG
Sbjct: 209 DGAKVEGTP---LNGPRALDFFTDGSGKGSLY---LALREGNKVYRIDLDDQTLHHIAGT 262
Query: 89 GVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
G +GY+ G A PK +V G++Y+AD +H IR I + + T G G K
Sbjct: 263 GKKGYTGHGGPAKKATLSGPKGISVAPNGDVYLADTESHTIRVIRKKDGTIETAVGDGKK 322
Query: 145 KEGRADGPAQ 154
+G PA
Sbjct: 323 GDGPDGDPAH 332
>gi|145340877|ref|XP_001415544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575767|gb|ABO93836.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1675
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 68/159 (42%), Gaps = 21/159 (13%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGD-----GVQGYS----DGEPGSARFDKPKSFA 110
+V D S A Y L + S + +G G QG S + P SARF+ P+ A
Sbjct: 186 MVFDESDGAIYFLGSSQNHVSKIAYGSGTVSTYAGSQGTSGLVDNVTPSSARFNAPEGLA 245
Query: 111 VDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGS--------KKEGRADGPAQ-NASF 158
+D I YVAD NH +R I + VTT+ G G+ +G PA+ N
Sbjct: 246 LDGANRILYVADTGNHAVRAIDLSTGAVTTVLGDGTISASGAVLDSDGILSTPARFNYPA 305
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS 197
+ L LL++D G IR+I L S ++
Sbjct: 306 GIAYNLDSSLSSGVLLVTDRGTHQIRKIILTDATASNAA 344
>gi|224125014|ref|XP_002319481.1| predicted protein [Populus trichocarpa]
gi|222857857|gb|EEE95404.1| predicted protein [Populus trichocarpa]
Length = 1065
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVT 136
+ + ++ G +G DG A F++P+ A + K NI YVAD NH +R++ + V
Sbjct: 608 NFIAQIGSSGEEGLRDGSFDDATFNRPQGLAYNSKKNILYVADTENHALREVDFVSEKVR 667
Query: 137 TIAGGGSK 144
T+AG G+K
Sbjct: 668 TLAGNGTK 675
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 40/76 (52%), Gaps = 5/76 (6%)
Query: 115 GNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
G IY+AD NH I+K + VTTIAG G K G DG A A S L +
Sbjct: 824 GLIYIADSYNHKIKKLDLATKRVTTIAGTG--KAGFKDGKALTAQLSEPAGLIEAEN-GR 880
Query: 173 LLISDHGNQLIRQINL 188
L+I+D N +IR ++L
Sbjct: 881 LIIADTNNSVIRYLDL 896
>gi|156406833|ref|XP_001641249.1| predicted protein [Nematostella vectensis]
gi|156228387|gb|EDO49186.1| predicted protein [Nematostella vectensis]
Length = 708
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 46/89 (51%), Gaps = 9/89 (10%)
Query: 77 SEESVVKRLAG--DGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-- 131
S E VV + G DG G+ DG A+F P+ A+ + IYVAD NH IR+I
Sbjct: 229 STEGVVHEVIGGEDGFSAGFQDGTFKEAKFHAPQGVAM-LGEVIYVADTENHAIREINLD 287
Query: 132 NLGVTTIAGGGSK---KEGRADGPAQNAS 157
+ VTT+AG G + KEG G Q S
Sbjct: 288 SKKVTTVAGTGRQGADKEGGRQGREQTIS 316
Score = 37.4 bits (85), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 31/122 (25%), Positives = 51/122 (41%), Gaps = 11/122 (9%)
Query: 84 RLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKGNI---YVADKSNHVIRKI-TNLGVTTI 138
R AG G + ++ P A F +P AV + + +VAD + +R + G T
Sbjct: 372 RFAGSGNEENRNNSYPHKAAFAQPSGLAVAISNTLRCLFVADSESSSVRCVDIETGATKA 431
Query: 139 AGGGSKKE------GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
G + G DG +A F + + + P LL++D N I+ + K +
Sbjct: 432 VVGADRDPMNLFAFGDVDGKGVDAKFQHPLGVAWNPQEDHLLVADSYNHKIKTVEPKQKT 491
Query: 193 CS 194
CS
Sbjct: 492 CS 493
>gi|312198573|ref|YP_004018634.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311229909|gb|ADP82764.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 797
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 57/116 (49%), Gaps = 18/116 (15%)
Query: 87 GDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAG 140
G G G+S G+ GSA R D P AV G ++VAD N +IR++ ++G V T+AG
Sbjct: 449 GTGEAGFS-GDTGSASRSRIDTPTGLAVGPDGALFVADSGNGLIRRV-DVGADLVETVAG 506
Query: 141 GGSKKEGRADGPAQNASFSND-FELTF-------VPHICALLISDHGNQLIRQINL 188
G ++ R DG S D EL V LLI+D GN + +I L
Sbjct: 507 GAARA--RPDGAEPRRWTSLDPAELVLRGPRGLTVDAAGRLLIADTGNHRVLRIGL 560
>gi|290995061|ref|XP_002680150.1| predicted protein [Naegleria gruberi]
gi|284093769|gb|EFC47406.1| predicted protein [Naegleria gruberi]
Length = 880
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 59/120 (49%), Gaps = 29/120 (24%)
Query: 86 AGDGVQGYS-DGEPG-SARFDKPKSFAVD-MKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
AG G G+S DG +A+ P+ +VD KG++Y+AD N+ IRK++N ++TIAG G
Sbjct: 410 AGTGSGGFSGDGVLAINAKLYFPRGVSVDPFKGDVYIADSYNNRIRKVSNGFISTIAGTG 469
Query: 143 SKKEGRADGPAQNASFSNDFELTF-----VPHICA-------LLISDHGNQLIRQINLKP 190
S A F+ D EL P+ A + I D N IR N+KP
Sbjct: 470 S------------AGFTGDGELAIAAKLDTPYSVAYSNTSGLVYILDTNNARIR--NIKP 515
>gi|440733650|ref|ZP_20913345.1| nhl repeat protein [Xanthomonas translucens DAR61454]
gi|440359672|gb|ELP96967.1| nhl repeat protein [Xanthomonas translucens DAR61454]
Length = 473
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 62/148 (41%), Gaps = 12/148 (8%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S+ L S+ R AG G DG A F +P S A ++ +YV D +R +
Sbjct: 303 SYELGNRSLRWR-AGAGALELRDGSGHLAAFAQPCSLAA-VQQALYVCDALGSAVRSLQL 360
Query: 133 LG--VTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G V T+ GG G G DGP A + P L I+D GN +R + L
Sbjct: 361 RGDLVQTLLGGQGPWDFGNEDGPRSRARLQFPQAIALSPDAPLLWIADSGNGSLRSLRLG 420
Query: 190 PEDCSKS-------SQSGSALGAVSVWV 210
D S + +G A+ A +VW+
Sbjct: 421 GGDLSTTPLPRRLHGPAGLAVSAGTVWI 448
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 4/74 (5%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGG 141
R G G + DG G A F +P+ A++ +G +YVAD NH +R+I L V T+ G
Sbjct: 201 RQFGLGTADFIDGGIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLLSGHVDTLCGN 259
Query: 142 GSKKEGRADGPAQN 155
G E +GP Q+
Sbjct: 260 GRAGE-PVEGPVQH 272
>gi|222624800|gb|EEE58932.1| hypothetical protein OsJ_10596 [Oryza sativa Japonica Group]
Length = 1017
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 55/110 (50%), Gaps = 9/110 (8%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAGGGS 143
G +G DG +A F++P+ A + K NI YVAD NH +R+I + V T+AG G+
Sbjct: 536 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINFVSETVKTLAGNGT 595
Query: 144 K-KEGRADGPAQNAS-----FSNDFELTFVPHICALLISDHGNQLIRQIN 187
K + R G N + ++ +++ + P L I+ G I + N
Sbjct: 596 KGSDYRGGGQGTNQACFFMVLNSPWDVCYDPSKETLYIAMAGQHQIWKHN 645
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 71/184 (38%), Gaps = 35/184 (19%)
Query: 30 DGYTVTTVIDGHQLEIN-----------PHSVIDRPGSSDLIVLDSSRSAFYTLSF---- 74
DG T DG++ +N P + P +LIV DS S+ ++
Sbjct: 648 DGVTEVLSGDGYERNLNGSSSTRTSFAQPSGISLAPELQELIVADSESSSIRVVNLKSGG 707
Query: 75 --------PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
P+ E++ + G DG P IYVAD NH
Sbjct: 708 SRSLAGGDPMIPENLFRF-------GDHDGTGSDVLLQHPLGVVYASDNQIYVADSYNHK 760
Query: 127 IRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
I+++ VTTIAG G + G DGPA +A S L V LL++D N IR
Sbjct: 761 IKRLDPVTRKVTTIAGTG--RAGYKDGPALSAQLSEPAGLVEVGD-GRLLVADTNNSTIR 817
Query: 185 QINL 188
I L
Sbjct: 818 YIVL 821
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 63/141 (44%), Gaps = 13/141 (9%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG--YSDGEPGSAR------FDKPKSFA 110
L V D+ A ++F +SE VK LAG+G +G Y G G+ + + P
Sbjct: 567 LYVADTENHALREINF-VSE--TVKTLAGNGTKGSDYRGGGQGTNQACFFMVLNSPWDVC 623
Query: 111 VD-MKGNIYVADKSNHVIRKITNL-GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
D K +Y+A H I K L GVT + G + + SF+ ++ P
Sbjct: 624 YDPSKETLYIAMAGQHQIWKHNTLDGVTEVLSGDGYERNLNGSSSTRTSFAQPSGISLAP 683
Query: 169 HICALLISDHGNQLIRQINLK 189
+ L+++D + IR +NLK
Sbjct: 684 ELQELIVADSESSSIRVVNLK 704
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 42/96 (43%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P A+D+ N SNH +TNL I GS +EG DG ASF+ L
Sbjct: 500 PGKLAIDVLNNRLFISDSNHNRIVVTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQGLA 559
Query: 166 FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
+ L ++D N +R+IN E + +G+
Sbjct: 560 YNSKKNILYVADTENHALREINFVSETVKTLAGNGT 595
>gi|406909052|gb|EKD49392.1| lipoprotein, partial [uncultured bacterium]
Length = 552
Score = 44.7 bits (104), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 49/100 (49%), Gaps = 6/100 (6%)
Query: 95 DGEPGS-----ARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
DGE G+ A F+KP + V+ G YV D N+++R+I T G++ G
Sbjct: 340 DGETGANKSGCAYFNKPSAIVVNRAGTKAYVTDSGNNMLREINLSTGKTRKLAGTRSAGF 399
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+G A+F+ +L L ++D GN IR+INL
Sbjct: 400 KNGFGPLATFNGPADLDLDAAGANLYVADKGNHAIRRINL 439
Score = 42.4 bits (98), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 46/179 (25%), Positives = 78/179 (43%), Gaps = 33/179 (18%)
Query: 47 PHSVIDRPGSSDLIV-LDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
PHS++ P L +++ + T+S V+ +AG + Y +G+ S RF
Sbjct: 229 PHSIVWNPTKPILYAAINNQLIEYNTVS------GKVRFMAGHPMDNYVEGQGSSVRFSN 282
Query: 106 PKSFAVDMKGN-IYVADKSNHVIRKITNLGVTT--IAGGG----------SKKEGRADGP 152
P + GN +Y+AD+ NH IRK++ T I G G + +EG GP
Sbjct: 283 P-DIDISPDGNWLYIADQYNHRIRKMSTKTKKTEYITGAGEVNFYSPNYSAYQEG---GP 338
Query: 153 AQNASFSNDFELTFVPHICALLI---------SDHGNQLIRQINLKPEDCSKSSQSGSA 202
+ +N + A+++ +D GN ++R+INL K + + SA
Sbjct: 339 CDGETGANKSGCAYFNKPSAIVVNRAGTKAYVTDSGNNMLREINLSTGKTRKLAGTRSA 397
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 70/192 (36%), Gaps = 61/192 (31%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
V++R G+ V DS + ++ + ++LAG G+ +G A F+ P
Sbjct: 360 VVNRAGTK-AYVTDSGNNMLREINLSTGK---TRKLAGTRSAGFKNGFGPLATFNGPADL 415
Query: 110 AVDMKG-NIYVADKSNHVIRKIT--NLGVTTIAG-------------------------- 140
+D G N+YVADK NH IR+I L V+T+ G
Sbjct: 416 DLDAAGANLYVADKGNHAIRRINLATLNVSTVVGKGKPGYQEGSKKTALLQMPTGIEVRG 475
Query: 141 ---------------------------GGSKKEGRADGPAQNASFSNDFELTFVPHICA- 172
GS K GR GP+ + F+ L P +
Sbjct: 476 NNMIFWSESGSQRIRMYNTTENQTRLFAGSGKRGRLTGPSTRSQFNEPKGLAINPKDGSK 535
Query: 173 LLISDHGNQLIR 184
L I+D+ N I+
Sbjct: 536 LYIADYYNDSIK 547
>gi|398337816|ref|ZP_10522521.1| hypothetical protein LkmesMB_21054 [Leptospira kmetyi serovar
Malaysia str. Bejo-Iso9]
Length = 359
Score = 44.7 bits (104), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 1/100 (1%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
G G +G+ A F+ P D + N+Y+ + NH IRK NL +++ G
Sbjct: 164 GTAGLQNGDNSGALFNGPFFMDFDRERNLYLGELGNHDIRKF-NLNFQSVSTLSGSSLGY 222
Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
DG A F + L + +LL+ D + IR+INL
Sbjct: 223 QDGSVSTALFKSPIGLVYDRKKNSLLVVDIQDHRIRKINL 262
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 33/69 (47%)
Query: 62 LDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
LD Y ++ VV AG G DG+ +A F P +D GNIYV++
Sbjct: 37 LDCILKECYLCRMKVTGNPVVTLFAGTGTASSVDGDISTATFQNPFGIELDSFGNIYVSE 96
Query: 122 KSNHVIRKI 130
++ + IRKI
Sbjct: 97 QNVNKIRKI 105
>gi|157112324|ref|XP_001657496.1| hypothetical protein AaeL_AAEL000965 [Aedes aegypti]
gi|108883769|gb|EAT47994.1| AAEL000965-PA [Aedes aegypti]
Length = 812
Score = 44.7 bits (104), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
++ R+ G G+ DG+ ARF+ P+ A ++VAD NH IRKI + VTT+
Sbjct: 345 ILYRIGGKRA-GFVDGDFRKARFNAPQGVAFQNDTTLFVADNENHAIRKIDLKSQQVTTV 403
Query: 139 AGGGSKKEGRADG 151
G G + R G
Sbjct: 404 VGNGQQGCDRIGG 416
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 59/129 (45%), Gaps = 12/129 (9%)
Query: 85 LAGDG-VQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRK--ITNLGVTTIAG 140
+AG+G + ++ P +A F +P A++ IY+AD + +RK +++ V +AG
Sbjct: 489 IAGNGREENRNNSYPNNAAFSQPSGLAINKDAKEIYLADSESSSVRKMSLSDGKVLAVAG 548
Query: 141 GGSK-----KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE---D 192
G G DG +A F + + + + ++D N I++IN
Sbjct: 549 GDRNPLDLFSFGDIDGKQYSAKFQHPLGVAYNSKDNCIYVADTYNHKIKKINASTNVATT 608
Query: 193 CSKSSQSGS 201
CS +SGS
Sbjct: 609 CSFQDESGS 617
>gi|433676068|ref|ZP_20508221.1| RING finger protein nhl-1 [Xanthomonas translucens pv. translucens
DSM 18974]
gi|430818813|emb|CCP38486.1| RING finger protein nhl-1 [Xanthomonas translucens pv. translucens
DSM 18974]
Length = 473
Score = 44.7 bits (104), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 4/74 (5%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGG 141
R G G + DG G A F +P+ A++ +G +YVAD NH +R+I L V T+ G
Sbjct: 201 RQFGLGTADFIDGGIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLLSGHVDTLCGN 259
Query: 142 GSKKEGRADGPAQN 155
G E +GP Q+
Sbjct: 260 GRAGE-PVEGPVQH 272
Score = 44.3 bits (103), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 62/148 (41%), Gaps = 12/148 (8%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
S+ L S+ R AG G DG A F +P S A ++ +YV D +R +
Sbjct: 303 SYELGNRSLRWR-AGAGALELRDGSGHLAAFAQPCSVAA-VQQALYVCDALGSAVRSLQL 360
Query: 133 LG--VTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G V T+ GG G G DGP A + P L I+D GN +R + L
Sbjct: 361 RGDLVQTLLGGQGPWDFGNEDGPRSRARLQFPQAIALSPDAPLLWIADSGNGSLRSLRLG 420
Query: 190 PEDCSKS-------SQSGSALGAVSVWV 210
D S + +G A+ A +VW+
Sbjct: 421 GGDLSTTPLPRRLHGPAGLAVSAGTVWI 448
>gi|171913874|ref|ZP_02929344.1| NHL repeat containing protein [Verrucomicrobium spinosum DSM 4136]
Length = 358
Score = 44.7 bits (104), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 47/106 (44%), Gaps = 2/106 (1%)
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQ 154
+P A F+ + G + D NH +R++ + VTTIAG G G GPA
Sbjct: 93 DPQKALFNGMHDIQITNDGKAILGDSFNHRVRQLDLKSGTVTTIAGTGKAAFGGDGGPAT 152
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
ASF+ T P ++D GN R+I+L S + SG
Sbjct: 153 AASFNITMTATLSPDRQRYYVADIGNHRTREIDLATGRISTVAGSG 198
>gi|290975165|ref|XP_002670314.1| predicted protein [Naegleria gruberi]
gi|284083871|gb|EFC37570.1| predicted protein [Naegleria gruberi]
Length = 1504
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
++++ V +AG G+ GYS G+P + P V G+IY++D NH IR+I N
Sbjct: 471 ITQDGFVVNVAGTGISGYSLIGDPLLSYLSNPNGIVVLKNGDIYISDTDNHCIRQIKNGT 530
Query: 135 VTTIAG 140
++ G
Sbjct: 531 LSNFVG 536
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 40/72 (55%), Gaps = 5/72 (6%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDM--KGNIYVADKSNHVIRKIT 131
+S E +V + G G G++ + + A + P + AV + + Y D N +IRK++
Sbjct: 789 ISIEGIVSTIVGKGF-GFNGNKVRNEDALLNLPSNMAVSSLNENDFYFCDTGNRLIRKVS 847
Query: 132 NLGVTTIAGGGS 143
N VTTIAGGG+
Sbjct: 848 NGYVTTIAGGGN 859
>gi|427731644|ref|YP_007077881.1| thiol-disulfide isomerase-like thioredoxin [Nostoc sp. PCC 7524]
gi|427367563|gb|AFY50284.1| thiol-disulfide isomerase-like thioredoxin [Nostoc sp. PCC 7524]
Length = 505
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 3/58 (5%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
+ + GDG G DG+ SARF +P + M N+Y+ D +NH IR++ L VTT+
Sbjct: 433 QTILGDGNAGLQDGQGKSARFFEPSGLSA-MDSNLYICDTNNHAIRRVDLNTLEVTTL 489
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 35/67 (52%), Gaps = 5/67 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG---VTT 137
V L G G G++DG A+F P+ D I YVAD NH +R+I NL V T
Sbjct: 206 VLHLIGTGKSGFTDGAFDEAQFSAPQGMTFDAANQILYVADTDNHAVRQI-NLKRQIVET 264
Query: 138 IAGGGSK 144
IAG G +
Sbjct: 265 IAGTGEQ 271
>gi|184200016|ref|YP_001854223.1| hypothetical protein KRH_03700 [Kocuria rhizophila DC2201]
gi|183580246|dbj|BAG28717.1| hypothetical protein [Kocuria rhizophila DC2201]
Length = 690
Score = 44.7 bits (104), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 50/88 (56%), Gaps = 14/88 (15%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV-------DMKGNIYVADKSNHVIRK 129
S+ + V+R G G +G+SDG ARF++P+ AV ++ ++ VAD NH +R
Sbjct: 205 SDLTTVRRAIGSGARGWSDGPRDVARFNEPQGIAVLPGRLRSEVGYDLVVADTVNHRLRG 264
Query: 130 ITNLG---VTTIAGGGSKK---EGRADG 151
+ +LG V T+AG G ++ + RA G
Sbjct: 265 V-DLGSGSVRTLAGNGVQRLLDDARARG 291
Score = 38.1 bits (87), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 26/46 (56%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LAG G +G DG + F +P A D G+++VAD + +R++
Sbjct: 345 LAGTGDEGLRDGSAHESWFAQPSGLAQDAHGDVWVADSESSALRRV 390
>gi|158318814|ref|YP_001511322.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158114219|gb|ABW16416.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 732
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 37/59 (62%), Gaps = 3/59 (5%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
+AG G GYS DG P A +P++ AVD +G + +AD+ + IR++ G +TTIAG
Sbjct: 581 VAGSGTPGYSGDGGPAVHASLTRPQAVAVDAQGRLLIADQEHRRIRRVDTTGRITTIAG 639
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 53/107 (49%), Gaps = 5/107 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
++R++ DG PG +P+S VD G IY+AD NH I ++ G +AG
Sbjct: 527 LRRVSPDGRIETMPALPG---LRQPRSVTVDPHGVIYLADTGNHRIWRLDPGDGARVVAG 583
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G+ GPA +AS + + V LLI+D ++ IR+++
Sbjct: 584 SGTPGYSGDGGPAVHASLTRP-QAVAVDAQGRLLIADQEHRRIRRVD 629
>gi|158315875|ref|YP_001508383.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158111280|gb|ABW13477.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 756
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 48/90 (53%), Gaps = 10/90 (11%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
L + V +AG G QG SDG+ G AR + AVD G +Y+ + SN IR+I
Sbjct: 463 LGLDGKVSVVAGTGAQG-SDGDGGPAVQARLRGLDAVAVDGAGTVYLGEDSNGKIRRIGR 521
Query: 133 LGV-TTIAGGGSK--KEGRADGPAQNASFS 159
GV TT+ GGG++ +EG PA S S
Sbjct: 522 DGVITTVVGGGTRLAREGE---PATEVSLS 548
>gi|296221249|ref|XP_002756652.1| PREDICTED: NHL repeat-containing protein 2 [Callithrix jacchus]
Length = 726
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNTIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAQGEQQ--PISSPWDVVF 343
>gi|219851088|ref|YP_002465520.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219545347|gb|ACL15797.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 831
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 54/118 (45%), Gaps = 9/118 (7%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
+ +F P+ A+D GN+YVAD+SNH + K T++G A G +G DG
Sbjct: 173 NGQFSYPEGIAIDGTGNVYVADESNHRVEKFTSIGTFLTAWG---TKGSEDGQFAYP--- 226
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSC 217
+ V + + ++D GN +++ + SG+ G S V ++V S
Sbjct: 227 ---DGVAVDSLGNVYVADSGNNRVQKFTSSGAFITDWGSSGTGSGQFSSPVDIAVDSS 281
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 54/118 (45%), Gaps = 9/118 (7%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
+ +F P+ A+D GN+YVAD+SNH + K T++G A G +G DG
Sbjct: 473 NGQFSYPEGIAIDGTGNVYVADESNHRVEKFTSIGTFLTAWG---TKGSEDGQFAYP--- 526
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSC 217
+ V + + ++D GN +++ + SG+ G S V ++V S
Sbjct: 527 ---DGVAVDSLGNVYVADSGNNRVQKFTSSGAFITDWGSSGTGSGQFSSPVDIAVDSS 581
Score = 43.5 bits (101), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 27/48 (56%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
GS +F P A+D GN+YV D+ NH ++K T+ G+ G S E
Sbjct: 125 GSGQFSSPYGVAIDSAGNVYVVDRGNHRVQKFTSSGIFVAKWGSSGSE 172
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 27/48 (56%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
GS +F P A+D GN+YV D+ NH ++K T+ G+ G S E
Sbjct: 425 GSGQFSSPYGVAIDGAGNVYVVDRGNHRVQKFTSSGIFVAKWGSSGSE 472
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 51/108 (47%), Gaps = 9/108 (8%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
+ +F P AVD GN+YVAD+ N+ ++K T+ G T +A G+ G + FS
Sbjct: 79 NGQFRTPTGIAVDSAGNVYVADRDNNRVQKFTSTG-TYLAKWGTSGTG-------SGQFS 130
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVS 207
+ + + + + + D GN +++ +K SGS G S
Sbjct: 131 SPYGVA-IDSAGNVYVVDRGNHRVQKFTSSGIFVAKWGSSGSENGQFS 177
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 51/108 (47%), Gaps = 9/108 (8%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
+ +F P AVD GN+YVAD+ N+ ++K T+ G T +A G+ G + FS
Sbjct: 379 NGQFRTPTGIAVDSAGNVYVADRDNNRVQKFTSTG-TYLAKWGTSGTG-------SGQFS 430
Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVS 207
+ + + + + + D GN +++ +K SGS G S
Sbjct: 431 SPYGVA-IDGAGNVYVVDRGNHRVQKFTSSGIFVAKWGSSGSENGQFS 477
>gi|257057817|ref|YP_003135649.1| Peroxiredoxin [Saccharomonospora viridis DSM 43017]
gi|256587689|gb|ACU98822.1| Peroxiredoxin [Saccharomonospora viridis DSM 43017]
Length = 644
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 45/94 (47%), Gaps = 8/94 (8%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
RF P V +G I VAD +NH + + + T + G + G DG A F+
Sbjct: 194 RF--PAKAVVTAEGRILVADTANHSVAEFASDAETLLRRFGDGRRGAVDGAFDVARFAEP 251
Query: 162 FELTFVP-HICA-----LLISDHGNQLIRQINLK 189
LT +P HI LL++D N L+R ++L+
Sbjct: 252 SGLTLLPTHIADEVGYHLLVADTANHLLRGVDLR 285
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 30/121 (24%), Positives = 59/121 (48%), Gaps = 11/121 (9%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSNH 125
F E++++R GDG +G DG ARF +P A ++ ++ VAD +NH
Sbjct: 219 EFASDAETLLRRF-GDGRRGAVDGAFDVARFAEPSGLTLLPTHIADEVGYHLLVADTANH 277
Query: 126 VIRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
++R + V T+AG G + ++G G A ++ +++ + ++++ GN
Sbjct: 278 LLRGVDLRTGAVRTVAGTGRQWRDGDDSGAALEVDLTSPWDVRWWDVAGGVVVAMAGNHT 337
Query: 183 I 183
+
Sbjct: 338 L 338
>gi|383829668|ref|ZP_09984757.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
xinjiangensis XJ-54]
gi|383462321|gb|EID54411.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
xinjiangensis XJ-54]
Length = 681
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 43/90 (47%), Gaps = 6/90 (6%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P V +G + VAD +NH I + + T + G + G DG A F+ LT
Sbjct: 227 PSRAVVTAEGRVLVADAANHSIAEFASDAETLLRRFGDGERGAVDGAFDIARFAEPSGLT 286
Query: 166 FVP-HICA-----LLISDHGNQLIRQINLK 189
+P HI LL++D N L+R I+L+
Sbjct: 287 LLPTHIAEQVGYHLLVADTANHLLRAIDLR 316
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 60/120 (50%), Gaps = 11/120 (9%)
Query: 74 FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSNHV 126
F E++++R GDG +G DG ARF +P A + ++ VAD +NH+
Sbjct: 251 FASDAETLLRRF-GDGERGAVDGAFDIARFAEPSGLTLLPTHIAEQVGYHLLVADTANHL 309
Query: 127 IRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
+R I V T+AG G + ++G G A + ++ +++ + + ++++ GN +
Sbjct: 310 LRAIDLRTGAVRTVAGTGRQWRDGDDSGAALDVDLTSPWDVRWWDTVGGVVVAMAGNHTL 369
Score = 37.0 bits (84), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 54/136 (39%), Gaps = 20/136 (14%)
Query: 70 YTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+TLS E+ V R AG V+G DG A F + AVD +++ D +R
Sbjct: 367 HTLSLFDPVEATVSRFAGTTVEGLRDGAATEAFFAQTSGLAVDGD-RLWLVDAETSALRY 425
Query: 130 ITNLGVTTIAGGGSKKE------------------GRADGPAQNASFSNDFELTFVPHIC 171
+ +G T AG K G DG A A + +T +P
Sbjct: 426 VERVGDTGDAGATDKTAESGGFVVRTAVGQDLFTFGHRDGAADTALLQHPLGVTVLPD-G 484
Query: 172 ALLISDHGNQLIRQIN 187
++ ++D N +R+ +
Sbjct: 485 SIGVADTYNGAVRRFD 500
>gi|378548897|ref|ZP_09824113.1| hypothetical protein CCH26_02380 [Citricoccus sp. CH26A]
Length = 708
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 12/111 (10%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----------- 134
AG G++G DG+P A F + A G I+VAD + +R+I+
Sbjct: 396 AGTGLEGLEDGDPTVAWFAQSSGLAESADGTIWVADSESSALRRISPADPAAADGALSRR 455
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
V++ G G G DG A A F + + +P ++L++D N IR+
Sbjct: 456 VSSAVGTGLFDFGFRDGAADQARFQHPLGVAALPD-GSVLVADTYNGAIRR 505
>gi|290969729|ref|XP_002667939.1| predicted protein [Naegleria gruberi]
gi|284080810|gb|EFC35195.1| predicted protein [Naegleria gruberi]
Length = 152
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 42/82 (51%), Gaps = 2/82 (2%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
P+ V +Y+AD NH +RKI G + TIAG G + D PA NA S+ +
Sbjct: 41 PEGVFVAPNDEVYMADIQNHRVRKILKDGTIVTIAGTGEEGFSGDDDPAINAQLSHPCGV 100
Query: 165 TFVPHICALLISDHGNQLIRQI 186
FV + ISD GN IR+I
Sbjct: 101 -FVSLTNEVYISDFGNYRIRKI 121
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 3/85 (3%)
Query: 78 EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
++ + +AG G +G+S D +P +A+ P V + +Y++D N+ IRKI N
Sbjct: 67 KDGTIVTIAGTGEEGFSGDDDPAINAQLSHPCGVFVSLTNEVYISDFGNYRIRKILRNGN 126
Query: 135 VTTIAGGGSKKEGRADGPAQNASFS 159
+ TI G G K GPA NA +
Sbjct: 127 IVTIVGTGVKGYSGDGGPAINAQYQ 151
>gi|383120609|ref|ZP_09941337.1| hypothetical protein BSIG_2379 [Bacteroides sp. 1_1_6]
gi|382985045|gb|EES68418.2| hypothetical protein BSIG_2379 [Bacteroides sp. 1_1_6]
Length = 483
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 40/89 (44%), Gaps = 4/89 (4%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARF 103
NPH + L + D + + PL V + G++GY DG P A F
Sbjct: 394 NPHQICFTE-DGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMKGYKDGGPDIALF 452
Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
+ P AV G I YVAD N VIRK++
Sbjct: 453 NHPFGVAVSADGQIVYVADTGNKVIRKLS 481
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGVTTIAGGGSKKE 146
G+ DG +A+FD P G +Y+AD N+ IR I + + T G +
Sbjct: 381 GWEDGLLRNAKFDNPHQICFTEDGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMK 440
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G DG A F++ F + + ++D GN++IR+++++
Sbjct: 441 GYKDGGPDIALFNHPFGVAVSADGQIVYVADTGNKVIRKLSIQ 483
>gi|375103275|ref|ZP_09749538.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
cyanea NA-134]
gi|374664007|gb|EHR63885.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
cyanea NA-134]
Length = 638
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 53/114 (46%), Gaps = 13/114 (11%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
V R DG +G+ DG RF P V +G + VAD +NH + + + T +
Sbjct: 177 VARADTDG-RGHLDG----LRF--PSKAVVTAEGRVLVADTANHSVAEFASDAETLLRRF 229
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLK 189
G + G DG A F+ LT +P A LL++D N L+R ++L+
Sbjct: 230 GDGRRGAVDGAFDVARFAEPSGLTLLPTHVADEVGYHLLVADTANHLLRGVDLR 283
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 31/121 (25%), Positives = 60/121 (49%), Gaps = 11/121 (9%)
Query: 73 SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSNH 125
F E++++R GDG +G DG ARF +P A ++ ++ VAD +NH
Sbjct: 217 EFASDAETLLRRF-GDGRRGAVDGAFDVARFAEPSGLTLLPTHVADEVGYHLLVADTANH 275
Query: 126 VIRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
++R + V T+AG G + ++G GPA ++ +++ + ++++ GN
Sbjct: 276 LLRGVDLRTGAVRTVAGTGRQWRDGDDSGPALGVDLTSPWDVRWWDAANGVVVAMAGNHT 335
Query: 183 I 183
+
Sbjct: 336 L 336
>gi|326431538|gb|EGD77108.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
Length = 1455
Score = 44.7 bits (104), Expect = 0.045, Method: Composition-based stats.
Identities = 32/116 (27%), Positives = 52/116 (44%), Gaps = 4/116 (3%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVT 136
V+ G G G+ G +A+ P A D +G +Y+AD NHV+ + +L +
Sbjct: 1222 VLSVFVGTGEAGHRGMGGSSRAAQIHSPHGLAFDSRGTLYIADTENHVVYGVDRRSLVID 1281
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
+AG +K DG ++ N V L I+D GN IR ++L ++
Sbjct: 1282 VVAGTPFRKGYHGDGRPATTAWLNAPTGIAVRGAGDLYIADTGNNRIRYVDLDSDN 1337
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 3/81 (3%)
Query: 80 SVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
S + +AG +G + G +A+ D+P AVD GN+Y+++K NH + + + GV
Sbjct: 1164 STITTIAGTKTRGSAGVGGLATAAQLDRPTCVAVDTIGNVYISEKGNHRVSVVDASTGVL 1223
Query: 137 TIAGGGSKKEGRADGPAQNAS 157
++ G + R G + A+
Sbjct: 1224 SVFVGTGEAGHRGMGGSSRAA 1244
>gi|149689662|ref|XP_001495693.1| PREDICTED: NHL repeat-containing protein 2 [Equus caballus]
Length = 726
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 44/80 (55%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGTFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A+G Q S+ +++ F
Sbjct: 326 GGAEGEQQ--PISSPWDVVF 343
>gi|432922359|ref|XP_004080313.1| PREDICTED: NHL repeat-containing protein 2-like [Oryzias latipes]
Length = 720
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 35/84 (41%), Positives = 43/84 (51%), Gaps = 8/84 (9%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLG--VTTIAGG-- 141
G G DG A F+ P+ A KG+ +YVAD NH+IRKI G V+TIAG
Sbjct: 257 GGPASGRKDGSLSEASFNSPQGVA--FKGDAVYVADTENHLIRKIDLSGGRVSTIAGAGV 314
Query: 142 -GSKKEGRADGPAQNASFSNDFEL 164
G+ +EG A G Q S D L
Sbjct: 315 QGTDREGGAVGVQQPISSPWDVTL 338
>gi|395226993|ref|ZP_10405365.1| hypothetical protein ThvES_00021150, partial [Thiovulum sp. ES]
gi|394444658|gb|EJF05823.1| hypothetical protein ThvES_00021150, partial [Thiovulum sp. ES]
Length = 240
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 30/50 (60%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
V+ G G+ GY DG +++ +KP+ D KGN+ V D N +IR+I+
Sbjct: 168 VQNFVGSGIAGYQDGVGFNSQLNKPRGMTFDEKGNLLVVDSYNEMIREIS 217
>gi|219853189|ref|YP_002467621.1| PKD domain-containing protein [Methanosphaerula palustris E1-9c]
gi|219547448|gb|ACL17898.1| PKD domain containing protein [Methanosphaerula palustris E1-9c]
Length = 930
Score = 44.7 bits (104), Expect = 0.045, Method: Composition-based stats.
Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 3/65 (4%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
G+ + + G+ +F+ P AVD GNIYV D NH ++K T+ G GG E
Sbjct: 76 GEFITQWGSEGSGNGQFENPGGIAVDSAGNIYVTDTWNHRVQKFTSTGEFITKWGG---E 132
Query: 147 GRADG 151
G DG
Sbjct: 133 GSGDG 137
Score = 42.0 bits (97), Expect = 0.24, Method: Composition-based stats.
Identities = 37/110 (33%), Positives = 53/110 (48%), Gaps = 18/110 (16%)
Query: 44 EIN-PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA 101
E+N PH V +D G+ +I ++R +T S + +A G +G DG+
Sbjct: 326 ELNHPHGVALDDAGNVYIIDTWNNRVQIFT--------STGEFIAKWGSKGSEDGQ---- 373
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
F P S AVD GN+YV D N+ ++K T+ G GG EG DG
Sbjct: 374 -FIYPYSIAVDSAGNVYVVDTGNNRVQKFTSTGTFITQWGG---EGFGDG 419
Score = 38.9 bits (89), Expect = 2.1, Method: Composition-based stats.
Identities = 30/101 (29%), Positives = 48/101 (47%), Gaps = 16/101 (15%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
+DR G+ ++ ++R +T S + + + G G + DG+ F+ P A
Sbjct: 476 VDRAGNVYVVDTGNNRVQIFT-----STGTFIAQWGGSGSR---DGQ-----FNYPGGIA 522
Query: 111 VDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
VD GN+YV D+SN+ +K T+ G G EG DG
Sbjct: 523 VDSAGNVYVVDESNNRFQKFTSTGEFITKWG---SEGLGDG 560
Score = 37.4 bits (85), Expect = 6.2, Method: Composition-based stats.
Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 11/76 (14%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+ ++ V+D S + F F + E + K G +G DGE F P+ AVD G
Sbjct: 526 AGNVYVVDESNNRFQ--KFTSTGEFITKW----GSEGLGDGE-----FTYPRDVAVDSGG 574
Query: 116 NIYVADKSNHVIRKIT 131
N+Y+ D+SN I+K +
Sbjct: 575 NVYIVDESNSRIQKFS 590
Score = 37.4 bits (85), Expect = 6.6, Method: Composition-based stats.
Identities = 24/74 (32%), Positives = 36/74 (48%), Gaps = 11/74 (14%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA-- 139
+ + GDG G F+ P AVD GN+YV D N+ ++ T+ G T IA
Sbjct: 455 ITKWGGDG--------SGVGEFNYPYGIAVDRAGNVYVVDTGNNRVQIFTSTG-TFIAQW 505
Query: 140 GGGSKKEGRADGPA 153
GG ++G+ + P
Sbjct: 506 GGSGSRDGQFNYPG 519
>gi|308178347|ref|YP_003917753.1| NHL repeat-containing protein [Arthrobacter arilaitensis Re117]
gi|307745810|emb|CBT76782.1| NHL repeat-containing protein [Arthrobacter arilaitensis Re117]
Length = 635
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 88/204 (43%), Gaps = 46/204 (22%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK-------PKS 108
+ + +V DS L+ S+ S V R G GV+GY+DG+ +A+F++ P
Sbjct: 190 NGNFLVGDSGHHRLVELA---SDLSTVVRTIGAGVKGYADGDAQTAQFNELQGLTALPAE 246
Query: 109 FAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKEGRADGPAQ----------- 154
A ++ ++ VAD NH +R + NL V T+AG G ++ ++ Q
Sbjct: 247 LAAEVGYDVIVADTVNHRLRSV-NLETGVVGTLAGNGVQRLLDSENARQEVQRTELGTDA 305
Query: 155 -NASFSNDFELTFVPHICALLISDHGNQLIRQINLK-------------------PEDCS 194
N S S+ +++ + P ++I+ G I N + P++
Sbjct: 306 TNISLSSPWDVLYHPS-GQVIIAMAGTHQIFAFNPRTSAVSVFAGTGLEGLADGAPDEAW 364
Query: 195 KSSQSGSALGAVSVWVLVSVLSCL 218
+ SG AL +VW+ S S L
Sbjct: 365 FAQSSGLALDGQNVWIADSETSAL 388
Score = 37.0 bits (84), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 49/109 (44%), Gaps = 8/109 (7%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
G+G + G RF P GN V D +H + ++ + T + G+ +
Sbjct: 165 GNGPYVAPEAREGDLRF--PGKAIALANGNFLVGDSGHHRLVELASDLSTVVRTIGAGVK 222
Query: 147 GRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLK 189
G ADG AQ A F+ LT +P A ++++D N +R +NL+
Sbjct: 223 GYADGDAQTAQFNELQGLTALPAELAAEVGYDVIVADTVNHRLRSVNLE 271
>gi|423072880|ref|ZP_17061629.1| repeat protein [Desulfitobacterium hafniense DP7]
gi|361856375|gb|EHL08282.1| repeat protein [Desulfitobacterium hafniense DP7]
Length = 2017
Score = 44.3 bits (103), Expect = 0.049, Method: Composition-based stats.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 9/102 (8%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
AG + G+ +F P+S AVD GNIYV+D N+ I+K + G KK
Sbjct: 254 AGSFIMGWGGAGIDDGQFATPRSIAVDSAGNIYVSDSMNNRIQKFASSGTFL------KK 307
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
GR G + + F + + V H + +SD GN I + +
Sbjct: 308 WGR--GGSADGEFYQPYGVA-VDHEGTIYVSDTGNNRIEKFD 346
Score = 42.4 bits (98), Expect = 0.19, Method: Composition-based stats.
Identities = 22/49 (44%), Positives = 27/49 (55%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
G +F PKS AVD GNIYVAD +N I+K + G G S+ G
Sbjct: 772 GDGQFRNPKSVAVDNDGNIYVADTTNKRIQKFDSDGTLITKWGDSQDNG 820
Score = 42.4 bits (98), Expect = 0.20, Method: Composition-based stats.
Identities = 29/73 (39%), Positives = 36/73 (49%), Gaps = 6/73 (8%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEG 147
G G +GE F++P AVD GNIYVAD +NH I+K GV T G G
Sbjct: 956 GSNGSDEGE-----FNRPYGIAVDSDGNIYVADSNNHRIQKFNAAGVFITTWGSYGTGLG 1010
Query: 148 RADGPAQNASFSN 160
+ + P A SN
Sbjct: 1011 QFNSPKGIAVDSN 1023
Score = 41.6 bits (96), Expect = 0.33, Method: Composition-based stats.
Identities = 33/82 (40%), Positives = 43/82 (52%), Gaps = 7/82 (8%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK-ITNLGVTTIAGGGS 143
LA G +G S G+ FD P AVD GNIYVAD +N+ ++K VT + GGS
Sbjct: 1189 LAKWGTRGNSAGQ-----FDNPSGIAVDNDGNIYVADTNNNRVQKSAQTYTVTFDSNGGS 1243
Query: 144 KKEGRADGPAQNASFSNDFELT 165
E + A+NA+ S E T
Sbjct: 1244 AVEAIVN-IAKNATVSLPIEPT 1264
Score = 39.7 bits (91), Expect = 1.4, Method: Composition-based stats.
Identities = 25/86 (29%), Positives = 43/86 (50%), Gaps = 11/86 (12%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
FD P + AVD GNIY+ D + H I+K+++ G + G EG+ + P A S++
Sbjct: 682 FDVPAAAAVDSNGNIYIVDSNRHHIKKLSSTGEILATWGSYGNAEGQLNVPGHIAIDSDN 741
Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
+ ++D GN I++ +
Sbjct: 742 ----------NVYVADTGNNRIQKFS 757
Score = 38.1 bits (87), Expect = 4.1, Method: Composition-based stats.
Identities = 21/56 (37%), Positives = 30/56 (53%), Gaps = 1/56 (1%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS-KKEGRADGPA 153
G +F+ PK AVD GN+YVAD N ++K ++G A G + EG P+
Sbjct: 1008 GLGQFNSPKGIAVDSNGNVYVADIENDRVQKFDSMGGNPEAFGSTGTGEGEFKRPS 1063
Score = 37.4 bits (85), Expect = 7.6, Method: Composition-based stats.
Identities = 23/64 (35%), Positives = 31/64 (48%), Gaps = 5/64 (7%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
G G +DGE F +P AVD +G IYV+D N+ I K GV + G E +
Sbjct: 309 GRGGSADGE-----FYQPYGVAVDHEGTIYVSDTGNNRIEKFDAAGVFVTSWGNGSDEDQ 363
Query: 149 ADGP 152
+ P
Sbjct: 364 LNMP 367
>gi|290997640|ref|XP_002681389.1| hypothetical protein NAEGRDRAFT_78414 [Naegleria gruberi]
gi|284095013|gb|EFC48645.1| hypothetical protein NAEGRDRAFT_78414 [Naegleria gruberi]
Length = 630
Score = 44.3 bits (103), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 53/96 (55%), Gaps = 14/96 (14%)
Query: 91 QGYSDGEPG---SARFDKPKSFAVDMK-GNIYVADKSNHVIRKITNLG-VTTIAG---GG 142
QGYS G+ G +A + P A D+K G++Y++D N +IRK+ G +TTIAG GG
Sbjct: 107 QGYS-GDGGLAVNAALNWPYDVACDLKTGDVYISDNENQLIRKVFKNGTITTIAGSRSGG 165
Query: 143 SKKEGR-ADGPAQN----ASFSNDFELTFVPHICAL 173
+ ++G A N S S++ L F +IC +
Sbjct: 166 TTQDGSLATQTVINRPMMMSLSSNGNLYFTSNICTI 201
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 24/89 (26%), Positives = 49/89 (55%), Gaps = 2/89 (2%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA--DGPAQNAS 157
+A + P ++ +G I++ ++++HV+RKI G + G ++G + G A NA+
Sbjct: 61 NAMLNNPFGICLNDEGEIFIVERASHVVRKILTNGTIIVFAGIVNQQGYSGDGGLAVNAA 120
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQI 186
+ +++ + ISD+ NQLIR++
Sbjct: 121 LNWPYDVACDLKTGDVYISDNENQLIRKV 149
>gi|290996232|ref|XP_002680686.1| predicted protein [Naegleria gruberi]
gi|284094308|gb|EFC47942.1| predicted protein [Naegleria gruberi]
Length = 655
Score = 44.3 bits (103), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 65/114 (57%), Gaps = 11/114 (9%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAV-DMKGNIYVADKSNHVIRKI---TNLG 134
+V RL G G+ G++ DG + + + PK ++ + G+IY AD SN++IR + TN+
Sbjct: 77 IVTRLIGTGIGGFNGDGLLANLTQVNSPKGVSISQISGDIYFADSSNNLIRVMNAATNI- 135
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
V+TIAG GS G ++G A ++ + E + LL +D N IR+++L
Sbjct: 136 VSTIAGTGS--FGFSEGNATSSKLNKPTEPVYWNG--QLLFADSLNYRIRKVDL 185
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 51/100 (51%), Gaps = 7/100 (7%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S D+ DSS + ++ + ++V +AG G G+S+G S++ +KP V G
Sbjct: 113 SGDIYFADSSNNLIRVMN---AATNIVSTIAGTGSFGFSEGNATSSKLNKPTE-PVYWNG 168
Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPA 153
+ AD N+ IRK+ +N ++T+ G G E DG A
Sbjct: 169 QLLFADSLNYRIRKVDLSNGMISTLTGTG-ISEFNGDGLA 207
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 48/101 (47%), Gaps = 6/101 (5%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
DL DS +F + S +SV+ L G Y DG + P+ +V G++
Sbjct: 225 DLYFADSD--SFRIRKYIQSTKSVIT-LVG-STNNYRDGLVNQSLLSYPEGVSVASNGDV 280
Query: 118 YVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNA 156
+D SNHVIRK I+ V+TIAG GS + A N+
Sbjct: 281 IFSDNSNHVIRKISISKGMVSTIAGTGSAGAAIDNVDATNS 321
>gi|290999615|ref|XP_002682375.1| serine/threonine-protein kinase [Naegleria gruberi]
gi|284096002|gb|EFC49631.1| serine/threonine-protein kinase [Naegleria gruberi]
Length = 1357
Score = 44.3 bits (103), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 117 IYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALL 174
+Y++D NH+IRK++ G +TT+ G G+ DG + +N L+ P+ L
Sbjct: 29 LYISDTFNHMIRKVSKNGTLTTLVGVGAAGSNDDDGLLGIDTQLNNPNGLSISPNGELLF 88
Query: 175 ISDHGNQLIRQINLKPEDCS--KSSQSGSALGAVSVWVLVSVLS 216
I+D N+LIR +N+K C+ K S L ++ V VS+ S
Sbjct: 89 IADTDNKLIRVLNMKTMICNVFKMFTSSGVLDTLTKPVGVSITS 132
>gi|452951305|gb|EME56755.1| NHL repeat-containing protein [Amycolatopsis decaplanina DSM 44594]
Length = 602
Score = 44.3 bits (103), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 48/105 (45%), Gaps = 8/105 (7%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
RF P +G I VAD NH I + + G T I GS + G DGP ASF+
Sbjct: 168 RF--PSKAVTTAEGRILVADTGNHSIVEFASDGETIIRRFGSGERGAQDGPFDLASFTEP 225
Query: 162 FELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSG 200
+T +P+ A +++D +R I+L + S + +G
Sbjct: 226 SGITLLPYEVAERAGYHAVVADTAGHRLRGIDLITGEVSTVAGTG 270
>gi|451981204|ref|ZP_21929574.1| NHL repeat containing protein [Nitrospina gracilis 3/211]
gi|451761567|emb|CCQ90828.1| NHL repeat containing protein [Nitrospina gracilis 3/211]
Length = 488
Score = 44.3 bits (103), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 65/128 (50%), Gaps = 10/128 (7%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKS--FAVDMKGNIYVADKSNHVIRKI--TNLG 134
E ++ G G G +DG A F +P+ F D+ ++VAD NH++RKI +
Sbjct: 206 EGTIRHRIGSGAVGKADGGFEEASFYRPQGLCFQDDV---VWVADTENHLLRKIDLNSKQ 262
Query: 135 VTTIAGGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
VTT+AG G + R PA+ + S+ ++++ H AL ++ G I + ++ +
Sbjct: 263 VTTVAGTGDQGGFLRDTHPAKETALSSPWDVSL--HEGALYFANAGTHQIGRYHIAADTV 320
Query: 194 SKSSQSGS 201
+ + +G+
Sbjct: 321 EQFAGTGA 328
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 48/110 (43%), Gaps = 6/110 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTI 138
V++ AG G + DG A F +P V G +++AD IR I G V T
Sbjct: 320 VEQFAGTGAEALQDGPRLQAPFAQPSGLTVG-DGKLFLADSETSAIRSIELGGQGKVETY 378
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G G G DG + A + + +V A+ I+D N IR ++L
Sbjct: 379 VGTGLFDFGDRDGVGKEAVLQHPLGVHYVE--GAVFIADSYNHKIRVLDL 426
>gi|312372811|gb|EFR20689.1| hypothetical protein AND_19679 [Anopheles darlingi]
Length = 745
Score = 44.3 bits (103), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 35/73 (47%), Gaps = 2/73 (2%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
V+ G G+ DG ARF+ P+ A +YVAD NH IR+I V+T+
Sbjct: 258 TVRYTIGGKSSGFVDGGFREARFNAPQGVAFRGGDELYVADNENHAIRRIDLRTRTVSTV 317
Query: 139 AGGGSKKEGRADG 151
AG G + R G
Sbjct: 318 AGNGVQGNDRTGG 330
>gi|310657686|ref|YP_003935407.1| exported protein of unknown function [[Clostridium] sticklandii]
gi|308824464|emb|CBH20502.1| exported protein of unknown function [[Clostridium] sticklandii]
Length = 780
Score = 44.3 bits (103), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 48/102 (47%), Gaps = 13/102 (12%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
GY DGE A F + D + ++Y+AD N+ IRK+ N V+T+AG G+ E
Sbjct: 566 GYKDGESQEALFQSIRGITFDDRTKSLYIADTYNNRIRKVQNGIVSTVAGNGATGEPEFG 625
Query: 151 GPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQIN 187
A N S PH ++ISD N ++ +++
Sbjct: 626 EKATNTYLSR-------PHDVIIKNDEIIISDSWNNIVSKVD 660
>gi|312198349|ref|YP_004018410.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311229685|gb|ADP82540.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 789
Score = 44.3 bits (103), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 49/94 (52%), Gaps = 5/94 (5%)
Query: 74 FPLSEESVVKRLAGDGVQGYS-DGEPGSAR-FDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
F + V LAG G++G S DG P +A + P A+ KG++YV D + V R T
Sbjct: 485 FRRGPDGKVSVLAGTGIRGSSGDGGPATAAMLNGPTGLALGPKGDLYVLD-GDRVRRIAT 543
Query: 132 NLGVTTIAGGGSKKEGRADG--PAQNASFSNDFE 163
+ +TT+AGGGS G PA +AS FE
Sbjct: 544 SGKITTVAGGGSDSGPSPTGPVPATSASLGFLFE 577
>gi|256380948|ref|YP_003104608.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Actinosynnema mirum DSM 43827]
gi|255925251|gb|ACU40762.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Actinosynnema mirum DSM 43827]
Length = 612
Score = 44.3 bits (103), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 67/135 (49%), Gaps = 15/135 (11%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG------EPGSARFDKPKSFAVD 112
L+V DS+ + L+ E+V++R+ G G +G +DG EP P A
Sbjct: 195 LLVSDSAHHSLVELA--ADGETVLRRV-GTGERGRADGLEPTFSEPAGIAL-LPPEVAAT 250
Query: 113 MKGNIYVADKSNHVIRKITNLG---VTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVP 168
+ VAD NH++R + NL VTT+AG G + ++G DGPA ++ +++ +
Sbjct: 251 TGYHAVVADTVNHLLRGL-NLDTGEVTTVAGTGEQWRDGATDGPADEIPLTSPWDVVWWE 309
Query: 169 HICALLISDHGNQLI 183
A+ I+ GN +
Sbjct: 310 PANAVAIALAGNHTL 324
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 51/107 (47%), Gaps = 3/107 (2%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
++RLAG V+G +DG A F + A D + +++ D +R + + V T G
Sbjct: 334 LERLAGTTVEGLNDGPADQAFFAQTSGLASDGE-RLWLVDSETSALRWLDPDRTVHTAVG 392
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G G DGPA A + +T +P ++ I+D N +R+ +
Sbjct: 393 KGLFDFGHRDGPADQALLQHPLGVTALPD-GSVAIADTYNGAVRRYD 438
>gi|406947273|gb|EKD78225.1| lipoprotein [uncultured bacterium]
Length = 883
Score = 44.3 bits (103), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 51/94 (54%), Gaps = 6/94 (6%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV--DM 113
+ D+ VLD+ + + + ++R+AG G GY + +A+F +P+ AV D
Sbjct: 50 ADDIYVLDTINNRIE----KVKSDGSLERVAGSGEYGYKNDSALNAQFAEPQDIAVFGDN 105
Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
I++AD +N+VIRK+ N V+T G S +G
Sbjct: 106 AEQIFIADTNNNVIRKLENGVVSTFLTGLSGPKG 139
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 3/52 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKIT 131
VV L+G+G GY +G +A + PK F D + G +YVAD N VIR I+
Sbjct: 833 VVMTLSGNGNVGYVNGTDTAAEWHNPKGF--DFRAGKLYVADTMNDVIRTIS 882
>gi|158521169|ref|YP_001529039.1| YD repeat-containing protein [Desulfococcus oleovorans Hxd3]
gi|158509995|gb|ABW66962.1| YD repeat protein [Desulfococcus oleovorans Hxd3]
Length = 2961
Score = 44.3 bits (103), Expect = 0.054, Method: Composition-based stats.
Identities = 24/67 (35%), Positives = 43/67 (64%), Gaps = 3/67 (4%)
Query: 79 ESVVKRLAGDGVQGYS-DGEPGSAR-FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
+ ++ +AG+G++G DG A + P+S VD+KGN+Y +++ ++ +RKI G +
Sbjct: 1769 DGIISTVAGNGLEGNGPDGISALATPLNDPESIVVDVKGNLYFSERDHYKVRKIDPSGII 1828
Query: 136 TTIAGGG 142
TT+AG G
Sbjct: 1829 TTVAGNG 1835
>gi|431895427|gb|ELK04943.1| NHL repeat-containing protein 2 [Pteropus alecto]
Length = 726
Score = 44.3 bits (103), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 39/75 (52%), Gaps = 6/75 (8%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+TIAG GS KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEMVSTIAGIGIQGSDKE 325
Query: 147 GRADGPAQNASFSND 161
G A G Q S D
Sbjct: 326 GGAKGEQQPISSPWD 340
>gi|269957976|ref|YP_003327765.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Xylanimonas cellulosilytica DSM 15894]
gi|269306657|gb|ACZ32207.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Xylanimonas cellulosilytica DSM 15894]
Length = 688
Score = 44.3 bits (103), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 56/107 (52%), Gaps = 13/107 (12%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P I PG + L+V D+ + L+ E++V+R+ G G +G DG P ARF +P
Sbjct: 178 PAKAITLPGGT-LLVADAGHHSLAELA--PDGETLVRRI-GSGQRGLLDGGPDEARFAEP 233
Query: 107 KSFAV---DMKG----NIYVADKSNHVIR--KITNLGVTTIAGGGSK 144
+ D++G ++ VAD NH +R ++ + V T+AG G +
Sbjct: 234 NGLCLVPEDLRGRLGYDVVVADTVNHALRGVRLADGLVVTLAGTGEQ 280
>gi|344274719|ref|XP_003409162.1| PREDICTED: NHL repeat-containing protein 2 [Loxodonta africana]
Length = 726
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 40/78 (51%), Gaps = 6/78 (7%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGTFSESTFNSPQGVAI-MNNVIYVADTENHLIRKIDLEAEIVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFEL 164
G A G Q S D L
Sbjct: 326 GGAKGEQQPISSPWDVVL 343
>gi|116751477|ref|YP_848164.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
gi|116700541|gb|ABK19729.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
Length = 1763
Score = 44.3 bits (103), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 74/167 (44%), Gaps = 27/167 (16%)
Query: 41 HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS 100
+ E+N + + DL + DS + ++ +AG+G GYS G+ G+
Sbjct: 468 NHTEVNWPTGVAVDAKGDLYIADSGNCRVRKVD----SYGIITTVAGNGSCGYS-GDGGA 522
Query: 101 A---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGG----SKKEGRA--- 149
A + P AVD + N+Y++D +H IRK+ GV TT+AG G ++G
Sbjct: 523 AVEAQLGYPAFVAVDSRENLYISDTESHRIRKVGPDGVITTVAGSGLCHLKVEDGYGGYQ 582
Query: 150 ----------DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
GPA A +N + L I+D GN+ +R++
Sbjct: 583 EYDAPCFDGDQGPADQAKLNNPSGVA-TDASGNLYIADTGNRRLRKV 628
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 36/65 (55%), Gaps = 6/65 (9%)
Query: 88 DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKE 146
DG QG +D A+ + P A D GN+Y+AD N +RK+ G ++T+AG G
Sbjct: 590 DGDQGPAD----QAKLNNPSGVATDASGNLYIADTGNRRLRKVAPSGTISTVAGDGELGS 645
Query: 147 GRADG 151
G ADG
Sbjct: 646 G-ADG 649
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 66/149 (44%), Gaps = 7/149 (4%)
Query: 42 QLEINPHSVI-DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG-YSDGEPG 99
QL ++P V+ D G +L V D S + V +AG +G DG P
Sbjct: 654 QLHLSPVGVVADSRG--NLFVSDLSHGEGNGRIRKIDPTGFVTTVAGHDWEGTLGDGGPA 711
Query: 100 S-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNAS 157
+ A P+ AVD GN+Y AD + +R + +G +TT+AG G G A A
Sbjct: 712 TQAYIGSPRGLAVDRGGNLYFADPWINRVRAVNTMGIITTVAGSDDGTFGEDGGLALRAR 771
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQI 186
+ ++ P L ++ NQ +R++
Sbjct: 772 LAFPMDVAVDPR-GDLYATEQWNQRVRKV 799
>gi|354612773|ref|ZP_09030715.1| NHL repeat containing protein [Saccharomonospora paurometabolica
YIM 90007]
gi|353222911|gb|EHB87206.1| NHL repeat containing protein [Saccharomonospora paurometabolica
YIM 90007]
Length = 436
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 53/103 (51%), Gaps = 13/103 (12%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
+AG G G S DG P A P + A+D GN+Y+ D + +RK+ T+ ++T+AG
Sbjct: 159 VAGTGQNGDSGDGGPAVDAELRHPTTVALDRHGNVYLTDPESRRVRKVSATDHTISTVAG 218
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
G + + GPA +A EL F P+ A + HGN +
Sbjct: 219 TGREGDSGDGGPATDA------ELRF-PNCVA--VDGHGNVFL 252
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 59/115 (51%), Gaps = 5/115 (4%)
Query: 77 SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TN 132
+ + + +AG G +G S DG P + A P AVD GN+++ D +H +RK+ T+
Sbjct: 208 ATDHTISTVAGTGREGDSGDGGPATDAELRFPNCVAVDGHGNVFLTDPRSHRVRKVSATD 267
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+TT+AG G + +G GPA +A L V L D GN +R+++
Sbjct: 268 HTITTVAGTGREGDGGDGGPATDAEVRYPNSLA-VDGAGNLYFGDTGNHRVRRVS 321
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 36/80 (45%), Gaps = 11/80 (13%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT--TIAGGGSKKEGRADGPAQNASFSN 160
F +P A+D +GN YVAD +R++T G + +AG G + GPA +A
Sbjct: 122 FGRPWGLALDTQGNFYVADADGCQVRRVTADGASFAAVAGTGQNGDSGDGGPAVDAELR- 180
Query: 161 DFELTFVPHICALLISDHGN 180
H + + HGN
Sbjct: 181 --------HPTTVALDRHGN 192
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 38/83 (45%), Gaps = 4/83 (4%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
P AVD GN+Y+AD +RK+ T+ + TI G G +G GPA
Sbjct: 353 PVGLAVDGAGNLYIADPDTCRVRKVSATDHTIDTIVGNGHAGDGDERGPATGLPLDPGGV 412
Query: 164 LTFVPHICALLISDHGNQLIRQI 186
+T L ++D G IR++
Sbjct: 413 VTDGRE--NLFVADFGQYRIRRV 433
>gi|298383938|ref|ZP_06993499.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|298263542|gb|EFI06405.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 483
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGVTTIAGGGSKKE 146
G+ DG +A+FD P G +Y+AD N+ IR I + + T G +
Sbjct: 381 GWEDGLLRNAKFDNPHQICFTEDGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMK 440
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G DG A F++ F + + I+D GN++IR+++++
Sbjct: 441 GYKDGGPDIALFNHPFGVAVSADGQIVYIADTGNKVIRKLSIQ 483
Score = 44.3 bits (103), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 40/89 (44%), Gaps = 4/89 (4%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARF 103
NPH + L + D + + PL V + G++GY DG P A F
Sbjct: 394 NPHQICFTE-DGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMKGYKDGGPDIALF 452
Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
+ P AV G I Y+AD N VIRK++
Sbjct: 453 NHPFGVAVSADGQIVYIADTGNKVIRKLS 481
>gi|167534306|ref|XP_001748831.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772793|gb|EDQ86441.1| predicted protein [Monosiga brevicollis MX1]
Length = 5844
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 52/112 (46%), Gaps = 3/112 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRAD 150
+ DG A F +P G+ VAD +NH IR I+ +L G + G A+
Sbjct: 825 AHRDGRLSDACFHRPSGICQLEDGSFLVADSANHCIRSISRSLSTVKTFAGQPGQAGGAE 884
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDCSKSSQSGS 201
G A F+ +L + +LISD N IR + L K ++ + ++ +GS
Sbjct: 885 GAVSKAQFNQPSDLLVLDEK-RILISDEANNSIRLLQLQKDKNWTVTTVAGS 935
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 47/106 (44%), Gaps = 11/106 (10%)
Query: 57 SDLIVLDSSR--------SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK- 107
SDL+VLD R ++ L + V +AG G++DG GS + ++P+
Sbjct: 896 SDLLVLDEKRILISDEANNSIRLLQLQKDKNWTVTTVAGSRTAGFADGRRGSTQLNQPRG 955
Query: 108 -SFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADG 151
F + + +AD NH IR + T+ + G + GR G
Sbjct: 956 MCFLNQRRTVVAIADAGNHCIRALNTSDWSLKLVAGVPGRAGRRYG 1001
>gi|184201467|ref|YP_001855674.1| hypothetical protein KRH_18210 [Kocuria rhizophila DC2201]
gi|183581697|dbj|BAG30168.1| hypothetical protein [Kocuria rhizophila DC2201]
Length = 246
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 43/93 (46%), Gaps = 7/93 (7%)
Query: 55 GSSDLIVLDSSRSAFYTLSF----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
G L +D S+ L+ P + VV + G GY DG AR D P+
Sbjct: 111 GDGRLWFVDRDSSSLRCLTIDTDRPDGDPHVVTVVGRHGA-GYQDGPATEARLDHPEDLQ 169
Query: 111 VDMKGNIYVADKSNHVIR--KITNLGVTTIAGG 141
+ G++ VAD NH +R +T+ VTTI GG
Sbjct: 170 MLYDGSVVVADTGNHALRHVDVTDGEVTTICGG 202
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 28/116 (24%), Positives = 52/116 (44%), Gaps = 11/116 (9%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--------N 132
V++ AG G G DG A+F S G ++ D+ + +R +T +
Sbjct: 79 VLQEWAGTGEPGRRDGRVAQAKFAATVSITEGGDGRLWFVDRDSSSLRCLTIDTDRPDGD 138
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
V T+ G G DGPA A + +L + + +++++D GN +R +++
Sbjct: 139 PHVVTVV--GRHGAGYQDGPATEARLDHPEDLQML-YDGSVVVADTGNHALRHVDV 191
>gi|219852941|ref|YP_002467373.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219547200|gb|ACL17650.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 579
Score = 44.3 bits (103), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 55/120 (45%), Gaps = 13/120 (10%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
LA G +G DG+ FD P AVD + N+YV D N+ ++K T+ G T +A G++
Sbjct: 176 LAKWGTEGGGDGQ-----FDYPTGIAVDSENNVYVVDSYNNRVQKFTSNG-TFLAKWGAR 229
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
G DG F++ E V + ++D GN I + +K GS G
Sbjct: 230 --GSGDG-----EFADFPEEIAVDSTGNVFVTDTGNNRIEKFTSNGTFLAKWGGRGSGDG 282
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 82 VKRLAGDGV--QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
V+R G+ + G G +F P AVD GN+YVAD N+ ++K ++ G T +A
Sbjct: 119 VQRFTSTGIFLARWGTGGTGDGQFKSPSGVAVDSAGNVYVADMYNYRVQKFSSAG-TLLA 177
Query: 140 GGGSKKEGRADG 151
G+ EG DG
Sbjct: 178 KWGT--EGGGDG 187
>gi|397779898|ref|YP_006544371.1| NHL repeat-containing protein 2 [Methanoculleus bourgensis MS2]
gi|396938400|emb|CCJ35655.1| NHL repeat-containing protein 2 [Methanoculleus bourgensis MS2]
Length = 483
Score = 44.3 bits (103), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 60/131 (45%), Gaps = 12/131 (9%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKIT--NL 133
S + + + G G G +DG A F P+ A D + G +YVAD NH IR+++ +L
Sbjct: 196 SRDGAILGVIGTGAPGNADGSSSEAAFYLPEGLAFDEEAGVLYVADAGNHTIRQVSWPDL 255
Query: 134 GVTTIAGGG----SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
V T+AG G S EG GP + + +L + L I+ G + +++L
Sbjct: 256 RVETVAGTGLEAPSPGEG---GPGTGVALNAPRDLALMGDY--LYIAMAGANQVWRMDLA 310
Query: 190 PEDCSKSSQSG 200
+ SG
Sbjct: 311 THTLEPYAGSG 321
>gi|380693174|ref|ZP_09858033.1| hypothetical protein BfaeM_04235 [Bacteroides faecis MAJ27]
Length = 469
Score = 44.3 bits (103), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 51/117 (43%), Gaps = 22/117 (18%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKG--------------NIYVADKSNHVIRKITNLGVT 136
+ + +G GSARF+KP+ F V +K + Y D+ NH I K+ GV
Sbjct: 351 KSFKEGTGGSARFNKPQ-FGVFVKNEKYGEGIGPDKDQYDFYFCDRENHCIWKLDPHGVA 409
Query: 137 TIAGGGSKKE------GRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
++A G S + G DG P A F+ L + P I D N+ IR +
Sbjct: 410 SLAAGRSNENADGKIWGYVDGNPLHEARFNQPAGLAYDPDTDMFYIGDIDNKGIRYM 466
>gi|348688202|gb|EGZ28016.1| hypothetical protein PHYSODRAFT_321722 [Phytophthora sojae]
Length = 490
Score = 44.3 bits (103), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 80/205 (39%), Gaps = 42/205 (20%)
Query: 11 AITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEIN---PHSVIDRPGSSDLIVLDSSRS 67
A+T +F+ SG + G+ DG LE N P V + S +IV D+
Sbjct: 124 AVTDDVEMFAFAGSG---KKGWR-----DGLALEANFNSPSGVCEYADGS-IIVADTGNH 174
Query: 68 AFYTLSFPLSEESVVKRLAGD---------GVQG----------YSDGEPGSARFDKPKS 108
+ + + VVK +AG GVQG Y DGE + F P +
Sbjct: 175 CIRQIRRGANGKLVVKTIAGAYASFETKRVGVQGAGIKDQRTAGYRDGE--RSLFRSPST 232
Query: 109 FAVDMKGNIYVADKSNHVIRKI-------TNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
G + VAD N+ IR + + V T+ G + G ADG + A F
Sbjct: 233 VLAGPSGELLVADTMNNCIRGLLPPPDGTSPWRVKTVCG--QTRPGHADGNCEVALFDQP 290
Query: 162 FELTFVPHICALLISDHGNQLIRQI 186
L + ++D GN IRQ+
Sbjct: 291 LSLCWGEDSNTFFVADRGNACIRQV 315
>gi|290976772|ref|XP_002671113.1| predicted protein [Naegleria gruberi]
gi|284084679|gb|EFC38369.1| predicted protein [Naegleria gruberi]
Length = 520
Score = 44.3 bits (103), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 53/109 (48%), Gaps = 6/109 (5%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
+ +AG G GYS G+ G +A PKS A+ G I+ D N VIR+IT G +TT
Sbjct: 255 ITTIAGTGAYGYS-GDGGLAINAVIRYPKSIAIGKHGEIFFTDSDNQVIRRITPDGIITT 313
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
IAG G+ G A +A S + V + D+ N +R++
Sbjct: 314 IAGTGNFGYSGDGGLATSADISKPTGIA-VDSNGTIYFCDNNNNRVRKL 361
Score = 37.0 bits (84), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 55/123 (44%), Gaps = 5/123 (4%)
Query: 69 FYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNH 125
FY + + V +AG G G S G G +A P V+ K IY+ + H
Sbjct: 70 FYGRVRKVDKNGYVTTIAGFGTSGDSSGNGGLATNAMLGYPAGITVNDKDEIYLVEYGTH 129
Query: 126 VIRKI-TNLGVTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
++RKI N + AG G +G A NA F++ + ++ P + ISD N I
Sbjct: 130 MVRKILVNGTIQEFAGILNGGGFGGDNGLATNAIFNHPYGVSVNPLTGDVYISDFSNNRI 189
Query: 184 RQI 186
R++
Sbjct: 190 RKV 192
>gi|195387367|ref|XP_002052367.1| GJ17513 [Drosophila virilis]
gi|194148824|gb|EDW64522.1| GJ17513 [Drosophila virilis]
Length = 736
Score = 44.3 bits (103), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 2/73 (2%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTI 138
VV+ G G+ DG +ARF+ P+ A K + VAD NH +R+I+ + V T+
Sbjct: 255 VVEYKVGGLEAGFVDGNLNTARFNNPQGVAFLDKDTLIVADTDNHALRQISLKSGAVETL 314
Query: 139 AGGGSKKEGRADG 151
AG G + R G
Sbjct: 315 AGTGCQGNDRTGG 327
>gi|219852041|ref|YP_002466473.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546300|gb|ACL16750.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 930
Score = 44.3 bits (103), Expect = 0.062, Method: Composition-based stats.
Identities = 29/99 (29%), Positives = 47/99 (47%), Gaps = 8/99 (8%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
G ++ + G +F P AVD GN+YVAD N+ I+K T++G+ G S
Sbjct: 76 GTFIKKWGSSGSGDGQFSSPSGVAVDSAGNVYVADTGNNRIQKFTSMGIFIKQWGSS--- 132
Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
G FS+ F + V + + ++D GN I++
Sbjct: 133 ----GSGNGQFFSSPFGVA-VDNAGNVYVADTGNNRIQK 166
>gi|383115566|ref|ZP_09936322.1| hypothetical protein BSGG_2564 [Bacteroides sp. D2]
gi|313695029|gb|EFS31864.1| hypothetical protein BSGG_2564 [Bacteroides sp. D2]
Length = 459
Score = 43.9 bits (102), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 37/95 (38%), Positives = 51/95 (53%), Gaps = 2/95 (2%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G+ +G+ AR ++P +D +GNIYVA + H I KIT GV T G+ G DG
Sbjct: 364 GFVEGQGNDARLEEPCQMDLDEEGNIYVAVRKKHRIAKITPDGVVTSY-TGTGTSGTTDG 422
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
P A F++ L F AL +SD+ N IR+I
Sbjct: 423 PLDKAQFNHPEGLQFGLD-GALYVSDYWNHKIRKI 456
>gi|219852087|ref|YP_002466519.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546346|gb|ACL16796.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 668
Score = 43.9 bits (102), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 62/135 (45%), Gaps = 16/135 (11%)
Query: 51 IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
+D+PGS + +DS R Y + + V G +Q S G G +FD P A
Sbjct: 139 LDQPGS---VAVDS-RGQIYVADWGNNRVQVFNSTGGYLMQWGSSGS-GDGQFDGPNGIA 193
Query: 111 VDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE-GRADGPAQNASFSNDFELTFVPH 169
+D GN+YV D N+ I++ + G + G S E G+ + P A SND
Sbjct: 194 IDSTGNVYVTDAYNNRIQEFNSTGGYLMQWGSSGSEAGQFEIPQGIAMDSND-------- 245
Query: 170 ICALLISDHGNQLIR 184
+ ++D GN++ +
Sbjct: 246 --NVYVADSGNRVQK 258
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 14/37 (37%), Positives = 23/37 (62%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
PG+ +F P A+D GN+Y+ D N+ ++K T+ G
Sbjct: 462 PGAGQFSGPNGIALDSTGNVYITDWGNNRVQKFTSTG 498
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 46/107 (42%), Gaps = 11/107 (10%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAGGGSKKEGRADGPAQNAS 157
G +F+ P AVD G +YVAD +N+ I+K + G T G +EG+ D P A
Sbjct: 88 GDGQFNIPYDIAVDSVGYVYVADMNNNRIQKFNSTGGYLTQWGTKGSEEGQLDQPGSVA- 146
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
V + ++D GN ++ N + SGS G
Sbjct: 147 ---------VDSRGQIYVADWGNNRVQVFNSTGGYLMQWGSSGSGDG 184
>gi|298387118|ref|ZP_06996672.1| hypothetical protein HMPREF9007_03883 [Bacteroides sp. 1_1_14]
gi|298260268|gb|EFI03138.1| hypothetical protein HMPREF9007_03883 [Bacteroides sp. 1_1_14]
Length = 469
Score = 43.9 bits (102), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 51/117 (43%), Gaps = 22/117 (18%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKG--------------NIYVADKSNHVIRKITNLGVT 136
+ + +G GSARF+KP+ F V +K + Y D+ NH I K+ GV
Sbjct: 351 KSFKEGTGGSARFNKPQ-FGVFVKNEKYGEGIGPDNDQYDFYFCDRENHCIWKLDPHGVA 409
Query: 137 TIAGGGSKKE------GRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
++A G S + G DG P A F+ L + P I D N+ IR +
Sbjct: 410 SLAAGRSNENADGKIWGYVDGNPLHEARFNQPAGLAYDPDTDMFYIGDIDNKGIRYM 466
>gi|119569872|gb|EAW49487.1| NHL repeat containing 2, isoform CRA_a [Homo sapiens]
Length = 683
Score = 43.9 bits (102), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+TIAG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTIAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343
>gi|290979059|ref|XP_002672252.1| predicted protein [Naegleria gruberi]
gi|284085827|gb|EFC39508.1| predicted protein [Naegleria gruberi]
Length = 1239
Score = 43.9 bits (102), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 53/108 (49%), Gaps = 10/108 (9%)
Query: 39 DGHQLEINPHS-----VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
DG+ L N + V++R G +L ++ LSF + + +AG+G GY
Sbjct: 42 DGNALVTNLKNPRGLFVVERNGG-ELYFVEEENHIIRKLSFVTNN---ITTIAGNGTAGY 97
Query: 94 SDGEPGSARFDKPKSFAV-DMKGNIYVADKSNHVIRKITNLGVTTIAG 140
SDG + + P S V + G IY+AD N +R++ N + TIAG
Sbjct: 98 SDGVATLSMLNYPHSVHVSNTTGEIYIADTFNCKLRRVFNGQMETIAG 145
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 11/105 (10%)
Query: 55 GSS-DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKS-FAVD 112
GSS D I + + + F S LS+ ++V D G DG P+ F V+
Sbjct: 3 GSSLDTIYVSTGKQIFNFTSDLLSKVTIVAGTFVDSWNG--DGNALVTNLKNPRGLFVVE 60
Query: 113 MKG-NIYVADKSNHVIRK---ITNLGVTTIAGGGSKKEGRADGPA 153
G +Y ++ NH+IRK +TN +TTIAG G+ G +DG A
Sbjct: 61 RNGGELYFVEEENHIIRKLSFVTN-NITTIAGNGTA--GYSDGVA 102
>gi|383124929|ref|ZP_09945589.1| hypothetical protein BSIG_1326 [Bacteroides sp. 1_1_6]
gi|251840920|gb|EES69001.1| hypothetical protein BSIG_1326 [Bacteroides sp. 1_1_6]
Length = 469
Score = 43.9 bits (102), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 51/117 (43%), Gaps = 22/117 (18%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKG--------------NIYVADKSNHVIRKITNLGVT 136
+ + +G GSARF+KP+ F V +K + Y D+ NH I K+ GV
Sbjct: 351 KSFKEGTGGSARFNKPQ-FGVFVKNEKYGEGIGPDNDQYDFYFCDRENHCIWKLDPHGVA 409
Query: 137 TIAGGGSKKE------GRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
++A G S + G DG P A F+ L + P I D N+ IR +
Sbjct: 410 SLAAGRSNENADGKIWGYVDGNPLHEARFNQPAGLAYDPDTDMFYIGDIDNKGIRYM 466
>gi|29348756|ref|NP_812259.1| hypothetical protein BT_3347 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340662|gb|AAO78453.1| Quinoprotein amine dehydrogenase [Bacteroides thetaiotaomicron
VPI-5482]
Length = 469
Score = 43.9 bits (102), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 51/117 (43%), Gaps = 22/117 (18%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKG--------------NIYVADKSNHVIRKITNLGVT 136
+ + +G GSARF+KP+ F V +K + Y D+ NH I K+ GV
Sbjct: 351 KSFKEGTGGSARFNKPQ-FGVFVKNEKYGEGIGPDNDQYDFYFCDRENHCIWKLDPHGVA 409
Query: 137 TIAGGGSKKE------GRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
++A G S + G DG P A F+ L + P I D N+ IR +
Sbjct: 410 SLAAGRSNENADGKIWGYVDGNPLHEARFNQPAGLAYDPDTDMFYIGDIDNKGIRYM 466
>gi|261855185|ref|YP_003262468.1| redoxin [Halothiobacillus neapolitanus c2]
gi|261835654|gb|ACX95421.1| Redoxin domain protein [Halothiobacillus neapolitanus c2]
Length = 498
Score = 43.9 bits (102), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 36/60 (60%), Gaps = 3/60 (5%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
GDG G++DG +A F++P+ A + ++YVAD N IR+I + V+T+ G G +
Sbjct: 234 GDGKAGFADGSFATAEFNRPQGIAW-LGNDLYVADTDNQRIRRIDFASQTVSTVVGNGQR 292
>gi|386384782|ref|ZP_10070129.1| redoxin domain-containing protein [Streptomyces tsukubaensis
NRRL18488]
gi|385667750|gb|EIF91146.1| redoxin domain-containing protein [Streptomyces tsukubaensis
NRRL18488]
Length = 591
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 18/115 (15%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +L+V DS+R L E+VV+R+ G + F++P+ A+ G
Sbjct: 185 SGNLLVSDSTRHQLVELE--PDGETVVRRIGG------------ADEFNEPQGLALLPDG 230
Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
+ VAD NH IR ++ V T+AG G + + A+GPA S+ ++L +
Sbjct: 231 RVAVADTVNHRIRAYDPSSGAVETLAGTGKQWWQGSPAEGPALTVDLSSPWDLAW 285
Score = 42.7 bits (99), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 50/112 (44%), Gaps = 3/112 (2%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLG-V 135
E V+R AG +G DG A F +P AV G +++AD +R I G V
Sbjct: 306 EAGTVRRAAGTTNEGLVDGPADEAWFAQPSGLAVAADGERLWIADSETSALRWIGRDGSV 365
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ G G G DG A A F + +T +P ++ + D N +R+ +
Sbjct: 366 HSAVGTGLFDFGHRDGDAAQALFQHPLGVTALPD-GSVAVCDTYNHALRRYD 416
>gi|412985326|emb|CCO20351.1| predicted protein [Bathycoccus prasinos]
Length = 897
Score = 43.9 bits (102), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 60/123 (48%), Gaps = 18/123 (14%)
Query: 81 VVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRK--ITNLGVT 136
VV+ +AG G+ GY D G+ ARF++P + G+ ++V+D N+ IRK + + VT
Sbjct: 220 VVRTVAGSGLTGYHDDLVGTKARFNQPMDVVMHPDGHTLFVSDAMNNCIRKVDVQSGEVT 279
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI-----------CALL-ISDHGNQLIR 184
T+ G K G DG NA F ++ + C L ++D GN +R
Sbjct: 280 TLT--GDKLPGMIDGIKSNARFLQPMGMSCSTNTVLDGGGKSSNKCTFLAVADAGNHAVR 337
Query: 185 QIN 187
+N
Sbjct: 338 LVN 340
Score = 43.5 bits (101), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 7/101 (6%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI-TNLG-VTTIAGGGSKKEGRA 149
+ DG ARF+KP G +I++AD NH IR+I G VTT+AG G+ G A
Sbjct: 118 FHDGPSLEARFNKPFGVVASPDGRDIFIADTFNHRIRRIDVEFGTVTTLAGTGA--SGSA 175
Query: 150 DGPAQ--NASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
DG A+ +A+F + P + ++D N IR I L
Sbjct: 176 DGLAKRGDATFKFPSGVAVSPDGGFVYVADTKNHKIRVIIL 216
>gi|428304139|ref|YP_007140964.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
gi|428245674|gb|AFZ11454.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
Length = 537
Score = 43.9 bits (102), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 7/101 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P VI S+ L + D+ S ++V+ G G + DG+ A+F +P
Sbjct: 206 PSKVIACQQSNSLFIADTGHHRLVIASLNGETQAVI----GTGSPAWVDGDLEIAQFCEP 261
Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
D + IYVAD NH++RKI V+TIAG G++
Sbjct: 262 MGMVFDYEQQVIYVADTVNHLLRKIDLKTRQVSTIAGNGTQ 302
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 63/141 (44%), Gaps = 10/141 (7%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDM-KG 115
+L + DS S+ ++ L + +V+ + G G + G+ D + + G
Sbjct: 384 ELFIADSESSSIRAVT--LGDFPIVRTICGSGQLFGFGDVDGIGENVRLQHCLGITYGAG 441
Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
++V D NH ++++ T TI G G K G DG + FS L F + L
Sbjct: 442 YLWVTDTYNHKLKRVNPTTGECQTICGSG--KAGLQDGFGTDVYFSEPSGLAFACNY--L 497
Query: 174 LISDHGNQLIRQINLKPEDCS 194
I+D N IR+INL ++ +
Sbjct: 498 YIADSNNHAIRRINLNSQEVT 518
>gi|427719807|ref|YP_007067801.1| NHL repeat containing protein [Calothrix sp. PCC 7507]
gi|427352243|gb|AFY34967.1| NHL repeat containing protein [Calothrix sp. PCC 7507]
Length = 505
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 3/58 (5%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
GDG G DGE ++RF +P +V + ++YVAD +NHVIR++ VTT+ G
Sbjct: 437 GDGSAGLQDGEGKNSRFFEPSGLSV-INSHLYVADTNNHVIRRVDLNTFEVTTLKFPG 493
Score = 43.5 bits (101), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 61/139 (43%), Gaps = 11/139 (7%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+S L + DS SF + + + G G G+ DG+ A+F P+ A D +
Sbjct: 184 ASGLFIADSGHHRLVRSSF----DGEILHVIGTGKPGFIDGDFSEAQFFAPQGMAFDEEN 239
Query: 116 NI-YVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE--LTFVPHI 170
I Y+AD NH +R++ + V TIAG G K+ R P A + V
Sbjct: 240 QILYIADTENHALRRVDLRHQLVKTIAGTG--KQSRNIHPHSGAGLETELNSPWDLVKVG 297
Query: 171 CALLISDHGNQLIRQINLK 189
L I+ G I Q++L+
Sbjct: 298 NNLFIAMAGPHQIWQMDLE 316
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 32/129 (24%), Positives = 53/129 (41%), Gaps = 7/129 (5%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---G 134
E ++ AG G + DG + F +P D K +Y+AD IR + +
Sbjct: 316 ETGIINTYAGTGAEACVDGSLTESAFAQPSGITTDGK-ELYIADSEVSSIRSVGIVEPRQ 374
Query: 135 VTTIAG-GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
V T+ G GG G DG +N + + + + L ++D N I+ ++ +C
Sbjct: 375 VKTVCGSGGLFSFGDVDGQGENVRLQHCLGVEYAQNY--LWVADTYNHKIKLVSPSTRNC 432
Query: 194 SKSSQSGSA 202
GSA
Sbjct: 433 QTVLGDGSA 441
>gi|119569873|gb|EAW49488.1| NHL repeat containing 2, isoform CRA_b [Homo sapiens]
Length = 726
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+TIAG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTIAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343
>gi|336322170|ref|YP_004602138.1| alkyl hydroperoxide reductase [[Cellvibrio] gilvus ATCC 13127]
gi|336105751|gb|AEI13570.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [[Cellvibrio] gilvus ATCC 13127]
Length = 638
Score = 43.9 bits (102), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 30/116 (25%), Positives = 53/116 (45%), Gaps = 8/116 (6%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---- 133
+E ++ +AG +G DG A F +P AVD +G I++AD +R + +
Sbjct: 342 KEPSLRHVAGTMNEGLLDGPLADAWFAQPSGLAVDDEGGIWLADSETSALRVVEPVHGSA 401
Query: 134 ---GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
V T+ G G + G DG A A + + +P ++ ++D N +R+
Sbjct: 402 GDGSVRTVVGAGLFEFGHRDGAADEARLQHPLGVAVLPD-GSVAVADTYNGAVRRF 456
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 72/159 (45%), Gaps = 30/159 (18%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV------ 111
+L+V D+ + L+ +++V+R+ G G +G DG P ARF +P +
Sbjct: 187 NLLVADAGHHSLAELA--PDGQTLVRRI-GSGERGLVDGGPDDARFSEPNGLCLVPDELR 243
Query: 112 -DMKGNIYVADKSNHVIR--KITNLGVTTIAG-----------------GGSKKEGRADG 151
+ ++ VAD NH +R ++ + V+T+AG GG + GRA
Sbjct: 244 PWVGYDVLVADTVNHALRGVRLADGHVSTVAGTGEQFMVGAADNVLPGWGGDDRSGRAYA 303
Query: 152 PAQNA-SFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
A A S+ +++T+ A +++ GN + + K
Sbjct: 304 GAPLAVRLSSPWDVTWSAAWGAFVVAMAGNHTLWAFDPK 342
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 47/110 (42%), Gaps = 8/110 (7%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNAS 157
PG+ RF P GN+ VAD +H + ++ G T + GS + G DG +A
Sbjct: 171 PGTLRF--PAKAVALPDGNLLVADAGHHSLAELAPDGQTLVRRIGSGERGLVDGGPDDAR 228
Query: 158 FSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSGS 201
FS L VP +L++D N +R + L S + +G
Sbjct: 229 FSEPNGLCLVPDELRPWVGYDVLVADTVNHALRGVRLADGHVSTVAGTGE 278
>gi|291235470|ref|XP_002737667.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 638
Score = 43.9 bits (102), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQ 154
G RF P +D KGN+ VADK+NH I+ +T G G +G++DG Q
Sbjct: 511 GDCRFKSPYGITIDSKGNVIVADKNNHCIQVLTGEGEFLFKFGS---QGKSDGQLQ 563
>gi|29349707|ref|NP_813210.1| hypothetical protein BT_4299 [Bacteroides thetaiotaomicron
VPI-5482]
gi|298383965|ref|ZP_06993526.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|383120644|ref|ZP_09941370.1| hypothetical protein BSIG_2353 [Bacteroides sp. 1_1_6]
gi|29341617|gb|AAO79404.1| putative cell surface protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251840310|gb|EES68392.1| hypothetical protein BSIG_2353 [Bacteroides sp. 1_1_6]
gi|298263569|gb|EFI06432.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 495
Score = 43.9 bits (102), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 3/76 (3%)
Query: 59 LIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
+ + DS S + + P +V + G +GY DG P A+F P AV+ G
Sbjct: 418 MYIADSGNSCIRVIDTTMPKERAAVTTPIGLPGAEGYKDGGPEIAKFHFPCGVAVNSDGT 477
Query: 117 I-YVADKSNHVIRKIT 131
I YVAD N VIRK++
Sbjct: 478 IVYVADTQNKVIRKLS 493
Score = 37.0 bits (84), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 51/119 (42%), Gaps = 8/119 (6%)
Query: 79 ESVVKRLAGDGV---QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-- 133
ES + +GV +G+ DG +A+F P+ G +Y+AD N IR I
Sbjct: 377 ESYAGKAIYEGVMNGKGWEDGLLKNAKFRHPRQICFTDDGKMYIADSGNSCIRVIDTTMP 436
Query: 134 ---GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
T G EG DG + A F + + ++D N++IR+++++
Sbjct: 437 KERAAVTTPIGLPGAEGYKDGGPEIAKFHFPCGVAVNSDGTIVYVADTQNKVIRKLSIE 495
>gi|290976601|ref|XP_002671028.1| predicted protein [Naegleria gruberi]
gi|284084593|gb|EFC38284.1| predicted protein [Naegleria gruberi]
Length = 925
Score = 43.9 bits (102), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 55/115 (47%), Gaps = 3/115 (2%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S++ +AG GV GYS DG+ + A+ +V G + +AD SNH IRKI
Sbjct: 512 ISQDGSSTVIAGTGVAGYSGDGQLATQAQLFNVSGVSVLPNGEVVIADTSNHRIRKINID 571
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G + TI G G + A N+ + ++ P + I D N IR+IN
Sbjct: 572 GIIQTICGTGVAGYSGENILATNSQLNLPIDVEVSPISSEIFIVDTNNHRIRKIN 626
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 8/109 (7%)
Query: 38 IDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
++ Q+ I+P V +DR + ++ V D S + + + + AG+G+ GYS D
Sbjct: 50 VNAGQVSISPTKVKVDRK-TGNVYVSDRINSRIRKIDINTGKATTI---AGNGIIGYSGD 105
Query: 96 GEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGG 142
G+ + A + P + +IY AD NH IRKI N + TIAG G
Sbjct: 106 GQLATQANINNPGKVELSSTNDIYFADTENHRIRKIFANGTIITIAGCG 154
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 45/91 (49%), Gaps = 3/91 (3%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP-AQNAS 157
GS D+P A+ G IYVAD H + +I+ G +T+ G DG A A
Sbjct: 482 GSKTSDRPHGIAI-FNGMIYVADPKTHRVIRISQDGSSTVIAGTGVAGYSGDGQLATQAQ 540
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINL 188
N ++ +P+ ++I+D N IR+IN+
Sbjct: 541 LFNVSGVSVLPN-GEVVIADTSNHRIRKINI 570
>gi|168702582|ref|ZP_02734859.1| NHL repeat [Gemmata obscuriglobus UQM 2246]
Length = 599
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 51/98 (52%), Gaps = 10/98 (10%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
V D PG L + DS+ ++ V+ G G G +DG A+FD P+
Sbjct: 269 VADEPGRR-LFIADSTHHRVVVTDLAGNKLHVI----GTGAPGKTDGPFDKAQFDDPQGM 323
Query: 110 AVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
AV +G+ ++VAD+ NH++R+I V TIAG G++
Sbjct: 324 AV--RGDTLFVADRKNHLVREIDLKAKTVATIAGNGTQ 359
>gi|407792655|ref|ZP_11139692.1| NHL repeat containing protein [Idiomarina xiamenensis 10-D-4]
gi|407217768|gb|EKE87600.1| NHL repeat containing protein [Idiomarina xiamenensis 10-D-4]
Length = 510
Score = 43.9 bits (102), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 5/89 (5%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
V+R+ G G ++DG +A F+ P+ A + +YVAD NH +R I ++ VTT+
Sbjct: 230 TVQRI-GTGNAAFNDGSASAASFNSPQGLAF-AEQTLYVADTGNHALRAIDLSSWQVTTV 287
Query: 139 AGGGS-KKEGRADGPAQNASFSNDFELTF 166
AG G + A G A + + +++T
Sbjct: 288 AGTGQLGRNYNASGKATAVALRSPWDITM 316
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 58/136 (42%), Gaps = 22/136 (16%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
LAG G + DG A F++P AV + +YVAD IR I T V T+ G G
Sbjct: 344 LAGSGREALLDGTAKQAAFNQPSGLAV-VGDTLYVADAEASAIRAIDLTTQKVRTVVGQG 402
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICA--------LLISDHGNQLIRQINLKPEDCS 194
+ G DG DF + H A L+++D N +R ++L + S
Sbjct: 403 LFEFGSKDG---------DFARALLQHPLAVAALTPDQLVVADTYNHQLRLLDLGTQQVS 453
Query: 195 KSSQSGSAL--GAVSV 208
S S L G VS+
Sbjct: 454 TLHLSDSLLEPGGVSI 469
>gi|284044371|ref|YP_003394711.1| NHL repeat containing protein [Conexibacter woesei DSM 14684]
gi|283948592|gb|ADB51336.1| NHL repeat containing protein [Conexibacter woesei DSM 14684]
Length = 623
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 52/104 (50%), Gaps = 6/104 (5%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
AG+GV + R ++P+ + G +AD N+ IR++ G +TT+AG G+
Sbjct: 117 FAGEGVPATD----ATVRLNQPRGVSPTGDGGFLIADTLNNRIRRVDAAGLITTVAGTGA 172
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
GPA A+ + + +P + LI+D GN IR+++
Sbjct: 173 AAFTGDGGPATAAALNAPQGVAALPD-GSFLIADTGNHRIRRVD 215
>gi|290974456|ref|XP_002669961.1| predicted protein [Naegleria gruberi]
gi|284083515|gb|EFC37217.1| predicted protein [Naegleria gruberi]
Length = 1106
Score = 43.5 bits (101), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 58/120 (48%), Gaps = 2/120 (1%)
Query: 76 LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+S + +V +AG + GY+ +P + + P S + G+++ D+ N+VIRKI+N
Sbjct: 806 VSSDGIVSTIAGVSQIHGYNLYDPQESILNGPTSLSSAKNGDLFFIDQKNYVIRKISNGI 865
Query: 135 VTTIAGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
VTTI G +++ +G + S L P AL+ SD I NL E+
Sbjct: 866 VTTIIGTPNERCDMEEGKRGRETCLSMPQSLAVSPGGDALVFSDLSTIGIFHYNLNDENA 925
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 50/100 (50%), Gaps = 10/100 (10%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSARFD 104
NP V++R G + + RS + +E V + G+ ++ DG+ G+
Sbjct: 528 NPRDVMERNG----YLYYAERSIICKIDLVSNELEV---MVGNRQTSFNGDGKKGNETSI 580
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGG 142
P+S V +G+IY AD SNH IRK T V+TIAG G
Sbjct: 581 GPESIYVTEEGDIYFADISNHRIRKWTRKTDLVSTIAGTG 620
>gi|290971759|ref|XP_002668647.1| predicted protein [Naegleria gruberi]
gi|284082132|gb|EFC35903.1| predicted protein [Naegleria gruberi]
Length = 355
Score = 43.5 bits (101), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 56/114 (49%), Gaps = 6/114 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFD--KPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ ++ +AGDG++GY DG+ A K K IY+AD ++ IRKI
Sbjct: 130 IDRNGIITTIAGDGIEGY-DGDNQLATLSSLKHPCSVFQYKDEIYIADSFSNRIRKIDRD 188
Query: 134 GVT-TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G+ TIAG G + A + S +N L FV H + SD NQ+IR+I
Sbjct: 189 GIIHTIAGNGIEGYNGDGRNATDTSLNNPNSL-FV-HNDEIYFSDLDNQVIRKI 240
Score = 37.4 bits (85), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + ++ +AG+G++GY+ DG + + P S V IY +D N VIRKI
Sbjct: 185 IDRDGIIHTIAGNGIEGYNGDGRNATDTSLNNPNSLFV-HNDEIYFSDLDNQVIRKILPN 243
Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + IAG KK D + + + FV + +D+ N +R+I
Sbjct: 244 GIIKNIAGIPGKKGYNGDYKLATRTTLSWPKGIFVDSKNQVYFADYYNHCVRKI 297
Score = 37.0 bits (84), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 41/94 (43%), Gaps = 13/94 (13%)
Query: 81 VVKRLAG-DGVQGYSDGEPGSAR--FDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
++K +AG G +GY+ + R PK VD K +Y AD NH +RKI N +
Sbjct: 245 IIKNIAGIPGKKGYNGDYKLATRTTLSWPKGIFVDSKNQVYFADYYNHCVRKIFKNGNMK 304
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
IAG G D P F+ PHI
Sbjct: 305 IIAGIGDVFGYSGDVP---------FDFQQYPHI 329
>gi|441145773|ref|ZP_20963891.1| hypothetical protein SRIM_07703 [Streptomyces rimosus subsp.
rimosus ATCC 10970]
gi|440620877|gb|ELQ83900.1| hypothetical protein SRIM_07703 [Streptomyces rimosus subsp.
rimosus ATCC 10970]
Length = 615
Score = 43.5 bits (101), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 62/124 (50%), Gaps = 13/124 (10%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + +V D++R L+ E+V++R+ G G +G+ G+ F +P
Sbjct: 185 PGKALRLPGGT-FLVSDTTRHQLVELA--ADGETVLRRI-GSGERGF-----GADAFSEP 235
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGRA-DGPAQNASFSNDF 162
+ A+ G + VAD NH IR V T+AG G + +G A GPA+ S+ +
Sbjct: 236 QGLALLPDGKVAVADTVNHAIRTFDPETGEVATVAGTGKQWWQGSATSGPAREVDLSSPW 295
Query: 163 ELTF 166
++ +
Sbjct: 296 DVAW 299
>gi|302535478|ref|ZP_07287820.1| NHL repeat containing protein [Streptomyces sp. C]
gi|302444373|gb|EFL16189.1| NHL repeat containing protein [Streptomyces sp. C]
Length = 612
Score = 43.5 bits (101), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 12/115 (10%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
S +L+V DS+R L E+VV+R+ G G +G+ G F +P+ A+ G
Sbjct: 198 SGNLLVSDSTRHQLVELE--ADGETVVRRI-GSGERGF-----GPDAFSEPQGLALLPDG 249
Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
+ VAD NH +R + V T+AG G + + GPA S+ +++ +
Sbjct: 250 RVVVADTVNHALRVLDPETGAVETVAGTGRQWWQGSPTSGPALEVDLSSPWDVAW 304
>gi|295394954|ref|ZP_06805167.1| NHL repeat containing protein [Brevibacterium mcbrellneri ATCC
49030]
gi|294972287|gb|EFG48149.1| NHL repeat containing protein [Brevibacterium mcbrellneri ATCC
49030]
Length = 655
Score = 43.5 bits (101), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 53/120 (44%), Gaps = 23/120 (19%)
Query: 87 GDGVQGYSDGEPGSARFDKP-------KSFAVDMKGNIYVADKSNHVIRKIT--NLGVTT 137
G G +G +DG+ +A+F +P A + ++ AD NH +R I VTT
Sbjct: 231 GSGERGNADGDFATAQFSEPGGVAQLPPEIAQQVGYHLVAADTVNHTLRGINLETQTVTT 290
Query: 138 IAGGGSKKE--------------GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
+AG G + GR DGPA S+ +++ + P ++I+ GN +
Sbjct: 291 VAGTGEQYMVGASDNVPDQPGFFGRYDGPATAVKLSSPWDVVYAPKTSNVVIAMAGNHTM 350
Score = 40.8 bits (94), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 36/150 (24%), Positives = 67/150 (44%), Gaps = 12/150 (8%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD- 104
+P V+ P +S++++ + +T P++ V ++G +G D + FD
Sbjct: 327 SPWDVVYAPKTSNVVIAMAGNHTMWTFD-PVA--GTVAHISGSLNEGLRDTSAAATPFDL 383
Query: 105 -----KPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNAS 157
+P + GN++VAD +RK+ + VT+ G G G DGPA++A
Sbjct: 384 DEWFAQPSGVRLASDGNVWVADSETSALRKLDPSTGAVTSYVGVGLFDFGFQDGPAESAR 443
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ +P + I+D N IR+ +
Sbjct: 444 LQHPLGALELPD-GTVAIADTYNGAIRRFD 472
>gi|326434843|gb|EGD80413.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
Length = 1449
Score = 43.5 bits (101), Expect = 0.089, Method: Composition-based stats.
Identities = 32/112 (28%), Positives = 48/112 (42%), Gaps = 4/112 (3%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-- 136
V+ G G G+ G +A+ P A D +G +Y+AD NHV+ + V
Sbjct: 1216 VLSVFVGTGEAGHRGMGGSSRAAQIHSPHGLAFDSRGTLYIADTENHVVYGVDRRSVVID 1275
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+AG +K DG ++ N V L I+D GN IR ++L
Sbjct: 1276 VVAGTPFRKGYHGDGRPATTAWLNAPTGIAVRGAGDLYIADKGNNRIRYVDL 1327
Score = 39.7 bits (91), Expect = 1.5, Method: Composition-based stats.
Identities = 43/155 (27%), Positives = 71/155 (45%), Gaps = 14/155 (9%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSA-- 101
+N + I G+ DL + D + + L ++ L G G G DG +A
Sbjct: 1298 LNAPTGIAVRGAGDLYIADKGNNRIRYVD--LDSYKIIDTLVGTGRYGGDVDGSTTTAAL 1355
Query: 102 --RFDKPKSFAVDMKGNIYV-ADKSNHVIRKITNLG----VTTIAGGGSKKEGRADGPAQ 154
D P+ AV+ KG + V AD + HV+R ++ G VT +AG GS+ DG
Sbjct: 1356 ETNLDHPEGVAVNAKGTMLVIADTNKHVLRSVSLDGGDSPVTVLAGTGSRG-FNGDGNEP 1414
Query: 155 NAS-FSNDFELTFVPHICALLISDHGNQLIRQINL 188
NA+ + + + A++ D N+ IR+++L
Sbjct: 1415 NATNLHSPVSVAYDIGERAIVFVDQQNRRIRRVHL 1449
Score = 39.3 bits (90), Expect = 1.7, Method: Composition-based stats.
Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 3/81 (3%)
Query: 80 SVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
S + +AG +G + G +A+ D+P AVD GN+Y+++K NH + + + GV
Sbjct: 1158 STITTIAGTKTRGSTGVGGLATAAQLDRPTCVAVDTMGNVYISEKGNHRVSVVDASTGVL 1217
Query: 137 TIAGGGSKKEGRADGPAQNAS 157
++ G + R G + A+
Sbjct: 1218 SVFVGTGEAGHRGMGGSSRAA 1238
>gi|345882347|ref|ZP_08833836.1| hypothetical protein HMPREF0666_00012 [Prevotella sp. C561]
gi|345045612|gb|EGW49528.1| hypothetical protein HMPREF0666_00012 [Prevotella sp. C561]
Length = 496
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 49/104 (47%), Gaps = 6/104 (5%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-----TNLGVTTIAGGGSKK 145
+G+ DG +A+F P+ G +Y+AD N IR I N VTT G +
Sbjct: 394 KGWEDGALKNAKFCFPRQMTFTKDGKLYIADSGNQCIRMIDTTQGKNARVTTPI-GVPQS 452
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G DG + A F+ + + ++D NQ+IR++++K
Sbjct: 453 AGFQDGGVELAKFNWPTGVAVSADGSTVYVADSKNQVIRELSIK 496
>gi|390956314|ref|YP_006420071.1| hypothetical protein Terro_0387 [Terriglobus roseus DSM 18391]
gi|390411232|gb|AFL86736.1| hypothetical protein Terro_0387 [Terriglobus roseus DSM 18391]
Length = 1224
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 44/89 (49%), Gaps = 6/89 (6%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASF 158
A P +VD G++Y D SNHV+RK+T +TTIAG GS A++ F
Sbjct: 262 ANIGWPSGMSVDANGDLYFVDISNHVVRKVTKSTGLITTIAGAGSNFPYNTVQLAKDTEF 321
Query: 159 SNDFELTFVPHICALLISDHG---NQLIR 184
+ +L +P LL+ D G Q+ R
Sbjct: 322 ISPEDLLILPS-GDLLVDDSGVGAGQIFR 349
>gi|380694052|ref|ZP_09858911.1| hypothetical protein BfaeM_08738 [Bacteroides faecis MAJ27]
Length = 495
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 3/76 (3%)
Query: 59 LIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
+ + DS S + + P +V + G +GY DG P A+F P AV+ G
Sbjct: 418 MYIADSGNSCIRVVDTTMPKERAAVTTPIGLPGAEGYKDGGPEIAKFHFPCGVAVNSDGT 477
Query: 117 I-YVADKSNHVIRKIT 131
I YVAD N VIRK++
Sbjct: 478 IVYVADTQNKVIRKLS 493
>gi|334335815|ref|YP_004540967.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ Mal
allergen [Isoptericola variabilis 225]
gi|334106183|gb|AEG43073.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Isoptericola variabilis 225]
Length = 668
Score = 43.5 bits (101), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 47/186 (25%), Positives = 73/186 (39%), Gaps = 35/186 (18%)
Query: 48 HSVIDRPGSSDLIVLDS-SRSAFYTLSFPLSEESVVKRLAGDG----------------- 89
H V+D P +L+ S + A+ TL E VV ++AG+G
Sbjct: 112 HPVLDDP---ELVTWGSYTARAWPTLVVVDPEGYVVAQMAGEGHASAIAALVRELVGEHE 168
Query: 90 VQG--------YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
+G Y EP S P GN+ VAD +H + ++ G T +
Sbjct: 169 AKGTLHRGDAPYVPPEPTSGTLRFPAKAVALPNGNLLVADAGHHSLAELAPDGETLVRRI 228
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSK 195
GS G DG A A FS L VP + +L++D N +R + L+ + +
Sbjct: 229 GSGARGLVDGAADVARFSEPNGLCLVPEVLRATLGYDVLVADTVNHALRGVRLETGEVTT 288
Query: 196 SSQSGS 201
+ +G
Sbjct: 289 VAGTGE 294
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 40/91 (43%), Gaps = 9/91 (9%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV----DMKGNIYVADKSNHVIRKI 130
P E V+ LAG +G DG G+A F +P A D G++++AD +R +
Sbjct: 358 PGPESGTVRLLAGTMNEGLEDGPGGTAWFAQPSGLAPVLGSDADGSLWLADAETSALRLV 417
Query: 131 T-----NLGVTTIAGGGSKKEGRADGPAQNA 156
T + V T G G G DG A A
Sbjct: 418 TPAADGQVQVRTAVGQGLFDFGHRDGAADQA 448
>gi|326923951|ref|XP_003208196.1| PREDICTED: NHL repeat-containing protein 2-like [Meleagris
gallopavo]
Length = 727
Score = 43.5 bits (101), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 36/79 (45%), Positives = 42/79 (53%), Gaps = 8/79 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT-NLG-VTTIAG---GGSKK 145
G DG A F+ P+ A+ K N IYVAD NH+IRKI +L VTT+AG G K
Sbjct: 262 GRKDGRFSEAAFNSPQGVAI--KNNVIYVADTENHLIRKIDLDLEIVTTVAGIGIQGVDK 319
Query: 146 EGRADGPAQNASFSNDFEL 164
EG A G Q S D L
Sbjct: 320 EGGAKGEEQPISSPWDVVL 338
>gi|390559793|ref|ZP_10244078.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
gi|390173637|emb|CCF83377.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
Length = 442
Score = 43.5 bits (101), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 50/100 (50%), Gaps = 9/100 (9%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P V+ + L + DS + +E ++ G G G +DG+ +A+F+ P
Sbjct: 128 PGKVLADAAGNRLFIADSGHNRIVVTRLDGAEPWMI----GSGQPGLADGDAATAQFNHP 183
Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGS 143
+ A + G+I Y+AD NHVIR++ + V TIAG G
Sbjct: 184 QGMA--LAGDILYLADTRNHVIRRVDLASRQVETIAGTGE 221
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 61/145 (42%), Gaps = 12/145 (8%)
Query: 49 SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKP 106
S ID G+ L V DS SA T P +V+ + G G+ + D G R P
Sbjct: 295 SGIDTDGTR-LYVADSETSAIRTADLP--PRDLVQTIVGIGLFEFGDVDGTGNQVRLQHP 351
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
AV G +YVAD NH I+++ T T G G+ G DG + A F +
Sbjct: 352 LGLAVG-DGVVYVADSYNHKIKRLYPTERRCETWLGDGTP--GDRDGVREAARFHEPGGV 408
Query: 165 TFVPHICALLISDHGNQLIRQINLK 189
+ L I+D N IR L+
Sbjct: 409 SLAGD--RLYIADTNNHAIRVAELE 431
>gi|124009915|ref|ZP_01694581.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
gi|123984066|gb|EAY24439.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
Length = 768
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 50/92 (54%), Gaps = 4/92 (4%)
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQ 154
+ GSA + P +D+ GN+Y+A+ S +V+RK+ ++ + TIAGGG+ G G A
Sbjct: 274 DAGSATINGPTGIHIDVSGNLYIAEFSGNVVRKVNASDNKINTIAGGGASDPGNG-GLAT 332
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+A S F L + + I + N IR++
Sbjct: 333 SAVVSLPFGL-VIDAAGDMYIPEFTNGKIRKV 363
>gi|297687408|ref|XP_002821208.1| PREDICTED: NHL repeat-containing protein 2 [Pongo abelii]
Length = 726
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDIVF 343
>gi|373253064|ref|ZP_09541182.1| thiol-disulfide isomerase-like thioredoxin [Nesterenkonia sp. F]
Length = 659
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 50/110 (45%), Gaps = 7/110 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG------V 135
++ LAG G++G +DG A F + A D G++++AD +R + V
Sbjct: 360 LRVLAGTGLEGLTDGSADEAWFAQTSGLAADAAGDVWIADSETSSLRVLRTAEDPKAPRV 419
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
TI G G G DG A + + +P ++L++D N IR+
Sbjct: 420 ETIVGEGMFDFGFRDGDPAQARLQHPLGVAALPD-GSVLVADTYNGAIRR 468
>gi|449435944|ref|XP_004135754.1| PREDICTED: NHL repeat-containing protein 2-like [Cucumis sativus]
Length = 1086
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 6/77 (7%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAG 140
++ G G DG A F++P+ A + K N+ YVAD NH +R++ + V T+AG
Sbjct: 639 QIGSTGEDGLRDGNFDDATFNRPQGLAYNAKKNLLYVADTENHALREVDFVKERVRTLAG 698
Query: 141 GGSK---KEGRADGPAQ 154
GSK +G +G +Q
Sbjct: 699 DGSKGSDYQGGKEGTSQ 715
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 41/83 (49%), Gaps = 6/83 (7%)
Query: 115 GNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
G IYVAD NH ++ + + VTTIAG G K G DG A A S +T
Sbjct: 850 GQIYVADSYNHKVKMLDPVSKKVTTIAGTG--KAGFKDGTALEAQLSEPSGITEAGG--R 905
Query: 173 LLISDHGNQLIRQINLKPEDCSK 195
L I+D N +IR + L + S+
Sbjct: 906 LFIADTNNNVIRYLYLNNREQSQ 928
>gi|158293254|ref|XP_314572.4| AGAP010610-PA [Anopheles gambiae str. PEST]
gi|157016870|gb|EAA09865.4| AGAP010610-PA [Anopheles gambiae str. PEST]
Length = 730
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 64/156 (41%), Gaps = 20/156 (12%)
Query: 2 ASHFVYSSLAITLLSLLFSLVSSGLLLED----GYTVTTVIDGHQLEINPHSVIDRPGSS 57
A F I SL +LVSSG + + G +V G + G+
Sbjct: 201 AIRFYREKGEIKRHSLPINLVSSGAIASNMKYPGKVACSVPTGAGGSEEQLFAVSDSGNH 260
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
++++DS+ + + + G G+ DG ARF+ P+ A +
Sbjct: 261 RVLIVDSAGTVLHKV--------------GGKQSGFVDGNFTKARFNAPQGVAFQGTDVV 306
Query: 118 YVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADG 151
+VAD NH +R+I V+TIAG G++ R G
Sbjct: 307 FVADNENHAVRRIDLKARLVSTIAGNGTQGNDRTGG 342
>gi|57529823|ref|NP_001006504.1| NHL repeat-containing protein 2 [Gallus gallus]
gi|82081010|sp|Q5ZI67.1|NHLC2_CHICK RecName: Full=NHL repeat-containing protein 2
gi|53136494|emb|CAG32576.1| hypothetical protein RCJMB04_29n5 [Gallus gallus]
Length = 727
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 45/82 (54%), Gaps = 12/82 (14%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLG---VTTIAG---GGSK 144
G DG A F+ P+ A+ K N IYVAD NH+IRKI +L VTT+AG G
Sbjct: 262 GRKDGRFSEAAFNSPQGVAI--KNNVIYVADTENHLIRKI-DLELEIVTTVAGIGIQGVD 318
Query: 145 KEGRADGPAQNASFSNDFELTF 166
KEG A G Q S+ +++ F
Sbjct: 319 KEGGAKGEEQ--PISSPWDVVF 338
>gi|345324001|ref|XP_001513492.2| PREDICTED: NHL repeat-containing protein 2-like [Ornithorhynchus
anatinus]
Length = 725
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 47/89 (52%), Gaps = 20/89 (22%)
Query: 81 VVKRLAGDGVQGYSDGEPGSAR---------FDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
VVK+ +G YS G P S R F+ P+ A+ K NI YVAD NH+IRKI
Sbjct: 250 VVKK---NGQIQYSIGGPNSGRKDGTFSESAFNSPQGVAI--KNNIIYVADTENHLIRKI 304
Query: 131 --TNLGVTTIAG---GGSKKEGRADGPAQ 154
V+T+AG G+ KEG A G AQ
Sbjct: 305 DLEVEQVSTVAGVGVQGTDKEGGAKGEAQ 333
>gi|149279484|ref|ZP_01885614.1| hypothetical protein PBAL39_16831 [Pedobacter sp. BAL39]
gi|149229777|gb|EDM35166.1| hypothetical protein PBAL39_16831 [Pedobacter sp. BAL39]
Length = 742
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 46/106 (43%), Gaps = 1/106 (0%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
R AG+G G +DG SA PK F D KGN+Y +D + RK+ + T
Sbjct: 638 RWAGNGNFGNADGPIKSAMLVSPKGFVGDGKGNMYFSDGFSQNTRKVDLIQNTISTVSAQ 697
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G DG A F + + + I D GN IR++ LK
Sbjct: 698 TTRGFKDGGLNEAQFMQRDAMA-MDKDGNIYIVDMGNNAIRKLFLK 742
>gi|73667964|ref|YP_303979.1| hypothetical protein Mbar_A0416 [Methanosarcina barkeri str.
Fusaro]
gi|72395126|gb|AAZ69399.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 346
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 47/112 (41%), Gaps = 21/112 (18%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
GS +F P AVD GN+YV D NH I+K + G G + + R
Sbjct: 46 GSGQFACPTGVAVDSSGNVYVTDTGNHRIQKFNSTGGYLTQWGSNGTDNRQ--------- 96
Query: 159 SNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
F+P+ A + ++D GN+ I++ N ++ SG+ G
Sbjct: 97 ------FFLPYGVAVDSSGNVYVADKGNKCIQKFNSNGGHLTQWGSSGNGNG 142
Score = 40.8 bits (94), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 25/36 (69%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
G+ +F+ P S AVD GN+YVADK N I+K ++G
Sbjct: 234 GNGQFEFPLSIAVDSSGNVYVADKYNQRIQKFNSIG 269
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 11/90 (12%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
G+ +F+ P+ AVD GN+YVAD N+ I+K + G + GS G N F
Sbjct: 187 GNGQFNDPEGVAVDSSGNVYVADSGNNRIQKFNSTG-GYLTQWGSYGSG-------NGQF 238
Query: 159 SNDFELTF-VPHICALLISDHGNQLIRQIN 187
+F L+ V + ++D NQ I++ N
Sbjct: 239 --EFPLSIAVDSSGNVYVADKYNQRIQKFN 266
>gi|42476013|ref|NP_940916.2| NHL repeat-containing protein 2 [Homo sapiens]
gi|74762548|sp|Q8NBF2.1|NHLC2_HUMAN RecName: Full=NHL repeat-containing protein 2
gi|21748831|dbj|BAC03493.1| unnamed protein product [Homo sapiens]
Length = 726
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343
>gi|403259474|ref|XP_003922237.1| PREDICTED: NHL repeat-containing protein 2 [Saimiri boliviensis
boliviensis]
Length = 726
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343
>gi|114632912|ref|XP_508046.2| PREDICTED: NHL repeat-containing protein 2 [Pan troglodytes]
gi|410221294|gb|JAA07866.1| NHL repeat containing 2 [Pan troglodytes]
gi|410252040|gb|JAA13987.1| NHL repeat containing 2 [Pan troglodytes]
gi|410252042|gb|JAA13988.1| NHL repeat containing 2 [Pan troglodytes]
gi|410252044|gb|JAA13989.1| NHL repeat containing 2 [Pan troglodytes]
gi|410252046|gb|JAA13990.1| NHL repeat containing 2 [Pan troglodytes]
gi|410292446|gb|JAA24823.1| NHL repeat containing 2 [Pan troglodytes]
gi|410341451|gb|JAA39672.1| NHL repeat containing 2 [Pan troglodytes]
gi|410341453|gb|JAA39673.1| NHL repeat containing 2 [Pan troglodytes]
Length = 726
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343
>gi|397510538|ref|XP_003825652.1| PREDICTED: NHL repeat-containing protein 2 [Pan paniscus]
Length = 726
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343
>gi|449524308|ref|XP_004169165.1| PREDICTED: NHL repeat-containing protein 2-like, partial [Cucumis
sativus]
Length = 892
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 6/77 (7%)
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAG 140
++ G G DG A F++P+ A + K N+ YVAD NH +R++ + V T+AG
Sbjct: 639 QIGSTGEDGLRDGNFDDATFNRPQGLAYNAKKNLLYVADTENHALREVDFVKERVRTLAG 698
Query: 141 GGSK---KEGRADGPAQ 154
GSK +G +G +Q
Sbjct: 699 DGSKGSDYQGGKEGTSQ 715
>gi|224052833|ref|XP_002198291.1| PREDICTED: NHL repeat-containing protein 2 [Taeniopygia guttata]
Length = 727
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 44/81 (54%), Gaps = 10/81 (12%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSK---K 145
G DG A F+ P+ A+ K N IYVAD NH+IRKI VTT+AG G + K
Sbjct: 262 GRRDGRFSEAAFNSPQGIAI--KNNVIYVADTENHLIRKIDLELQMVTTVAGIGVQGVDK 319
Query: 146 EGRADGPAQNASFSNDFELTF 166
EG A G Q S+ +++ F
Sbjct: 320 EGGAKGEEQ--PISSPWDVVF 338
>gi|197121735|ref|YP_002133686.1| hypothetical protein AnaeK_1325 [Anaeromyxobacter sp. K]
gi|196171584|gb|ACG72557.1| YD repeat protein [Anaeromyxobacter sp. K]
Length = 2350
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 58/115 (50%), Gaps = 9/115 (7%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
+ +++R AG + G+ G ARF K P +V G++Y+AD N IR++ G
Sbjct: 1230 DGIIERFAG--LSSGFGGDGGPARFAKLRGPNGVSVGPDGSVYIADTYNARIRRVDPSGI 1287
Query: 135 VTTIAGGG-SKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
+ +IAG G ++ + D GPA A + ++ P L I+D N IR++
Sbjct: 1288 IESIAGSGFAQPQFWGDGGPALAARLNGTWQAVVAPD-GDLFIADSFNARIRRVG 1341
>gi|298705700|emb|CBJ28938.1| NHL repeat-containing protein [Ectocarpus siliculosus]
Length = 631
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 42/92 (45%), Gaps = 5/92 (5%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV--TTIAGGGSKKEGRADGPAQNAS 157
SA PK AVD NIYVA+ IR I+ G+ T + G + EG GPA A
Sbjct: 535 SAVLAYPKGIAVDADDNIYVAEFQGEKIRNISTSGIISTCVGTGSTAYEGDG-GPASEAG 593
Query: 158 FSNDFELTFVPHICA--LLISDHGNQLIRQIN 187
+ F+P + I+D G+ +R +
Sbjct: 594 LDLPNAIAFMPSTPGSRMAIADFGHNAVRVVT 625
>gi|290983130|ref|XP_002674282.1| predicted protein [Naegleria gruberi]
gi|284087871|gb|EFC41538.1| predicted protein [Naegleria gruberi]
Length = 1572
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 42/80 (52%), Gaps = 1/80 (1%)
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
G++Y+AD S IRKI N G+ ++ G +K DG A + S + F + +
Sbjct: 74 GSMYIADASMAAIRKIDNNGIISLVAGNYRKTIPGDGSALSVSVISPFSMVLTQDETIMY 133
Query: 175 ISDHGNQLIRQINLKPEDCS 194
I+ GN IR++++K + S
Sbjct: 134 IA-QGNGCIRKLDIKNGNIS 152
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 52/123 (42%), Gaps = 11/123 (8%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGN-I 117
++L SRS+ L + +AG GV GY DG+ F + S AVD N I
Sbjct: 590 LILSHSRSSIRMLK---KSSGFITNIAGTFGVSGYKDGKTSLFSF-RLNSLAVDRNDNSI 645
Query: 118 YVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISD 177
Y++D N IRK+ T G G + + + S D +TFV + +
Sbjct: 646 YISDNGNLAIRKL-----TPYCDDGWILSGNSTDCSPDISIKFDNTITFVDSTVLIAVKY 700
Query: 178 HGN 180
H N
Sbjct: 701 HSN 703
>gi|391330742|ref|XP_003739813.1| PREDICTED: NHL repeat-containing protein 2-like [Metaseiulus
occidentalis]
Length = 685
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 6/88 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
V+ + G G G+ DG G RF P+ G I+VAD NH IR++ + V+T+
Sbjct: 258 VQLIVGTGEAGFLDGAYGICRFSSPQGVCF-YDGGIFVADAGNHAIRRVDFSTKCVSTVV 316
Query: 140 GGGSKK---EGRADGPAQNASFSNDFEL 164
G G + G DG Q S D L
Sbjct: 317 GTGKQGVDLVGNLDGNVQEISTPWDVVL 344
>gi|444916089|ref|ZP_21236213.1| hypothetical protein D187_08495 [Cystobacter fuscus DSM 2262]
gi|444712768|gb|ELW53683.1| hypothetical protein D187_08495 [Cystobacter fuscus DSM 2262]
Length = 2336
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 61/118 (51%), Gaps = 10/118 (8%)
Query: 76 LSEESVVKRLAGD-GVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ E ++ LAG +G+ +G +A + P AV G++YV+D +NH +R+I +
Sbjct: 1111 VTPEGIIHTLAGKPNARGFCGDNGLASAACLNGPWDVAVGKAGDVYVSDSANHRVRRIGS 1170
Query: 133 LG-VTTIAGGGSKK--EGRA---DGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
G +TT+AG G EG + GPAQ A S L L I+DH +++ R
Sbjct: 1171 NGRITTVAGTGDDGSLEGISIGDGGPAQQALLSAPKGLALDSE-GNLYIADHFSRVRR 1227
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 62/135 (45%), Gaps = 14/135 (10%)
Query: 82 VKRLAGDG----VQGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+ +AG G ++G S G+ G A+ PK A+D +GN+Y+AD + V R N
Sbjct: 1174 ITTVAGTGDDGSLEGISIGDGGPAQQALLSAPKGLALDSEGNLYIADHFSRVRRVDANGI 1233
Query: 135 VTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN-----L 188
+TT AG +G PA F + L P + +SD N +R+++ L
Sbjct: 1234 ITTYAGQLEASGFSGNGTPALQGKFDSPTGLAVGPD-GSCYVSDEWNHSVRRVSYPGAFL 1292
Query: 189 KPEDCSKSSQSGSAL 203
+ + SQ GS L
Sbjct: 1293 RENEAWVPSQDGSEL 1307
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 9/92 (9%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAGGGSKKEGRA----DGPAQ 154
SAR P+ A+ +GN+Y+AD N +R +T G+ T+AG K R +G A
Sbjct: 1081 SARLSYPRGIALGKEGNLYIADFDNDRVRYVTPEGIIHTLAG---KPNARGFCGDNGLAS 1137
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A + +++ V + +SD N +R+I
Sbjct: 1138 AACLNGPWDVA-VGKAGDVYVSDSANHRVRRI 1168
>gi|358458536|ref|ZP_09168745.1| NHL repeat containing protein [Frankia sp. CN3]
gi|357078265|gb|EHI87715.1| NHL repeat containing protein [Frankia sp. CN3]
Length = 674
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 50/94 (53%), Gaps = 6/94 (6%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRAD---GPAQN 155
A + P+ AV G +Y+AD +N+ +RKI ++ +TT+AG G+ +D G A
Sbjct: 461 AGLNSPQGIAVTSDGTVYIADTANNRVRKIDPSSGTITTVAGTGASTGSVSDDDGGLATQ 520
Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
A S ++ P AL I D G+ IR+++ +
Sbjct: 521 ADLSAPADVAVGPG-GALYIVDTGHDRIRKVDAQ 553
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 61/138 (44%), Gaps = 6/138 (4%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + VV AG GV +S DG P A+ P AV G +Y+ D N IRKI
Sbjct: 374 IDKAGVVSDFAGTGVSTFSGDGGPAIRAQVGFPNGVAVTDDGTVYIDDNHNDRIRKIDPS 433
Query: 134 G-VTTIAG---GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
G + TIAG G D A + N + V + I+D N +R+I+
Sbjct: 434 GTIRTIAGIGTGDGHGTFSGDNSAATKAGLNSPQGIAVTSDGTVYIADTANNRVRKIDPS 493
Query: 190 PEDCSKSSQSGSALGAVS 207
+ + +G++ G+VS
Sbjct: 494 SGTITTVAGTGASTGSVS 511
>gi|219852663|ref|YP_002467095.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546922|gb|ACL17372.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 522
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
L GD V S G+++F++P S AVD G++YVAD N+ I+K T+ G
Sbjct: 216 LTGDLVATRSISNSGASQFNRPTSVAVDTGGSVYVADTGNNRIQKFTSSG 265
Score = 42.4 bits (98), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 9/105 (8%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNAS 157
G+ +F+ P+ AVD GN+YVAD N+ I K T+ G + T G + G+ + P A
Sbjct: 134 GTNQFNSPEGIAVDGAGNVYVADTGNNRIEKFTSSGDIVTSWGSYGSEVGQFNRPTSVAV 193
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
+ I + ++D GN I++ L + + S S S
Sbjct: 194 DNT--------GIGYIYVADTGNNRIQKFTLTGDLVATRSISNSG 230
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 65/159 (40%), Gaps = 30/159 (18%)
Query: 51 IDRPGSSDLIVLDSSRSAFYT-LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
+D G+ + +++ R +F+T S P S+ + +G G +F P
Sbjct: 50 VDGTGNVYVADMNNDRISFFTKASLPQMPSSIGRIGSGHG------------QFFYPHGV 97
Query: 110 AVDMKGNIYVADKSNHVIRKIT-NLGVTT---IAGGGSKKEGRADGPAQNASFSNDFELT 165
AVD GN+YVAD NH I+K T N T I G G+ + +G A
Sbjct: 98 AVDSTGNVYVADTGNHQIQKFTVNGNFNTQWGIKGSGTNQFNSPEGIA------------ 145
Query: 166 FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
V + ++D GN I + + + GS +G
Sbjct: 146 -VDGAGNVYVADTGNNRIEKFTSSGDIVTSWGSYGSEVG 183
>gi|290976426|ref|XP_002670941.1| predicted protein [Naegleria gruberi]
gi|284084505|gb|EFC38197.1| predicted protein [Naegleria gruberi]
Length = 872
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 76/160 (47%), Gaps = 13/160 (8%)
Query: 34 VTTVIDGHQLEINPHSV--IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-V 90
VT+ IDG + +P + + ++ D +R + T++ ++K +AG+G +
Sbjct: 284 VTSSIDGERALDSPLTAFGVSTSNEGEVYFADYARVSVVTIN------GLIKHVAGNGKL 337
Query: 91 QGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGR 148
Q DG + ++ KP+ AV G ++++D + IRK++ GV TIAGG G
Sbjct: 338 QYLGDGNLATQSQLFKPRGVAVSTSGELFISDSAQDTIRKVSTEGVINTIAGGTFPLGGY 397
Query: 149 -ADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
DG N +N L F L +SD N +R+I
Sbjct: 398 IQDGVDPLNTFLNNPSALEFSSSSGELFLSDSHNFRVRKI 437
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 55/112 (49%), Gaps = 4/112 (3%)
Query: 79 ESVVKRLAGDGVQGYSDGE--PGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITNLG- 134
E +++ +AG G+ G++ SA+ + P V + G I + D +NH IRKI + G
Sbjct: 538 EGIIQTIAGTGIPGFNGDNILATSAQLNSPTDIEVSPVSGEIIIVDSNNHRIRKIDSFGN 597
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
++TIAG G + S + L+ + I+D+ N+ IR++
Sbjct: 598 ISTIAGTGIASLSSHGKLSIYGSLNTPTTLSISAFSGHVYIADYENRRIRKL 649
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 53/109 (48%), Gaps = 3/109 (2%)
Query: 82 VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
V +AG+G GYS DG + A+ V G + +AD +NH IRK+ G + TI
Sbjct: 485 VTLIAGNGTAGYSGDGSLATQAQLFNVSGVTVLPNGEVLIADTNNHRIRKVNLEGIIQTI 544
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
AG G + A +A ++ ++ P ++I D N IR+I+
Sbjct: 545 AGTGIPGFNGDNILATSAQLNSPTDIEVSPVSGEIIIVDSNNHRIRKID 593
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 3/75 (4%)
Query: 117 IYVAD-KSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
+YVAD +SN VIR N G VT IAG G+ A A N +T +P+ +L
Sbjct: 465 VYVADSQSNRVIRVDKNDGSVTLIAGNGTAGYSGDGSLATQAQLFNVSGVTVLPN-GEVL 523
Query: 175 ISDHGNQLIRQINLK 189
I+D N IR++NL+
Sbjct: 524 IADTNNHRIRKVNLE 538
>gi|290979154|ref|XP_002672299.1| predicted protein [Naegleria gruberi]
gi|284085875|gb|EFC39555.1| predicted protein [Naegleria gruberi]
Length = 430
Score = 42.7 bits (99), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 64/119 (53%), Gaps = 12/119 (10%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
++ +++ +AG+ YS G+ G +A F P A+ M G +++ D +N+VIRK+++
Sbjct: 23 INTNNLITTIAGNNQIAYS-GDGGLAIAASFRIPYGVALGMNGELFITDTNNNVIRKVSS 81
Query: 133 -LG-VTTIAG---GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+G +TTIAG GG +G G A A S+ + + + + D GN +R++
Sbjct: 82 TIGIITTIAGSVTGGYSGDG---GLAIAAKLSSPYGIVVDQLNGTIYVCDFGNSRVRKL 137
>gi|219851898|ref|YP_002466330.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546157|gb|ACL16607.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 387
Score = 42.7 bits (99), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 39/84 (46%), Gaps = 9/84 (10%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
FDKP AVD GN+YVAD N+ ++K T+ G GS G A F
Sbjct: 233 FDKPSGVAVDSAGNVYVADWGNNWVQKFTSTGGFITGWNGSGLAGGA--------FQYPV 284
Query: 163 ELTFVPHICALLISDHGNQLIRQI 186
+ V + ++D+GN I++
Sbjct: 285 SIA-VDSTGNVYVADYGNNRIQKF 307
Score = 41.6 bits (96), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 28/47 (59%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
F+KP AVD GN+YVAD +N+ I+K T+ G + GS G A
Sbjct: 139 FNKPGGVAVDSVGNVYVADSANNQIQKFTSTGGFITSWNGSASAGGA 185
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 50/115 (43%), Gaps = 11/115 (9%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
F+ P AVD GN+YVAD N I+K T+ G T G + G D P+ A
Sbjct: 186 FEYPGGVAVDSAGNVYVADTFNGQIQKFTSTGEFITRWNGSASAGGVFDKPSGVA----- 240
Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLS 216
V + ++D GN +++ + + SG A GA V ++V S
Sbjct: 241 -----VDSAGNVYVADWGNNWVQKFTSTGGFITGWNGSGLAGGAFQYPVSIAVDS 290
>gi|355783113|gb|EHH65034.1| hypothetical protein EGM_18374, partial [Macaca fascicularis]
Length = 723
Score = 42.7 bits (99), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 264 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAERVSTVAGIGIQGTDKE 322
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 323 GGAKGEQQ--PISSPWDVVF 340
>gi|383418201|gb|AFH32314.1| NHL repeat-containing protein 2 [Macaca mulatta]
Length = 726
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAERVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343
>gi|302864652|ref|YP_003833289.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Micromonospora aurantiaca ATCC 27029]
gi|302567511|gb|ADL43713.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Micromonospora aurantiaca ATCC 27029]
Length = 611
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 6/80 (7%)
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICA- 172
GN+ V+D + H + ++ G T + G+ GR DGPA A+F+ L + PH+
Sbjct: 183 GNLLVSDSARHSLVELAADGETVVRRIGAGSRGRTDGPASAATFAEPQGLCLLPPHVAEV 242
Query: 173 ----LLISDHGNQLIRQINL 188
++++D N L+R + L
Sbjct: 243 AGYDVVVADTVNHLLRGVRL 262
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 2/103 (1%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSK 144
AG V+ DG +P +V G+ ++VAD IR +++ + T G G
Sbjct: 330 AGTTVEALRDGPLPDVWMAQPSGLSVSADGSRLWVADSETSAIRWVSDGEMHTAVGQGLF 389
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G DGPA A + + +P ++LI+D N +R+ +
Sbjct: 390 DFGHVDGPADQALLQHPLGVCALPD-GSVLIADTYNGAVRRFD 431
>gi|380812570|gb|AFE78159.1| NHL repeat-containing protein 2 [Macaca mulatta]
Length = 726
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAERVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343
>gi|330464985|ref|YP_004402728.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ mal
allergen [Verrucosispora maris AB-18-032]
gi|328807956|gb|AEB42128.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ mal
allergen [Verrucosispora maris AB-18-032]
Length = 611
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 44/93 (47%), Gaps = 8/93 (8%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
RF P V G + V+D + H + ++ G T + G+ GR DG A A+FS
Sbjct: 172 RF--PGKLVVGPDGTLLVSDSARHRVVELAADGETLLRAIGTGSRGRVDGTADTATFSEP 229
Query: 162 FELTFV-PHICA-----LLISDHGNQLIRQINL 188
L + PH+ L+++D N L+R + L
Sbjct: 230 QGLCLLPPHVAEVAGYDLVVADTVNHLLRGVRL 262
Score = 40.8 bits (94), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 49/110 (44%), Gaps = 2/110 (1%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSK 144
AG V+ DG +P AV G ++VAD + IR + + + T G G
Sbjct: 330 AGTTVEALRDGPLAETWLAQPSGLAVSADGRRLWVADSESSAIRYVEDDVMGTAVGQGLF 389
Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
G DGPA A + + +P ++LI+D N +R+ + ++ S
Sbjct: 390 DFGHVDGPAAQALLQHPLGVCALPD-GSVLIADTYNGAVRRYDPATDEVS 438
Score = 37.7 bits (86), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 64/140 (45%), Gaps = 23/140 (16%)
Query: 46 NPHSVIDRPGS------SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
P + + PG L+V DS+R L+ ++ + R G G +G DG
Sbjct: 166 EPQTTLRFPGKLVVGPDGTLLVSDSARHRVVELA---ADGETLLRAIGTGSRGRVDGTAD 222
Query: 100 SARFDKPKSFAV------DMKG-NIYVADKSNHVIRKITNLG---VTTIAGGGSKKEGRA 149
+A F +P+ + ++ G ++ VAD NH++R + LG V T+AG G +
Sbjct: 223 TATFSEPQGLCLLPPHVAEVAGYDLVVADTVNHLLRGV-RLGSGEVVTVAGTGRQWRSTV 281
Query: 150 DGPAQNA---SFSNDFELTF 166
D A +A S+ ++L +
Sbjct: 282 DDHAHDALSVDLSSPWDLAW 301
>gi|109090607|ref|XP_001091193.1| PREDICTED: NHL repeat-containing protein 2 [Macaca mulatta]
Length = 726
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAERVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343
>gi|395646873|ref|ZP_10434733.1| PKD domain containing protein [Methanofollis liminatans DSM 4140]
gi|395443613|gb|EJG08370.1| PKD domain containing protein [Methanofollis liminatans DSM 4140]
Length = 2489
Score = 42.7 bits (99), Expect = 0.15, Method: Composition-based stats.
Identities = 19/44 (43%), Positives = 26/44 (59%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
+G + + PG + F PK AVD GN+YVAD +NH I+K
Sbjct: 1671 SGTFITTWGSEPPGDSEFSYPKGVAVDDAGNVYVADTNNHRIQK 1714
Score = 41.2 bits (95), Expect = 0.49, Method: Composition-based stats.
Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 3/65 (4%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
G + + G G FD+P VD GN++V+D N+ I+K T+ G G +
Sbjct: 1151 GTFITKWGSGGTGDGEFDRPHGIVVDADGNVFVSDAGNNNIQKFTSTGTFITKWGTA--- 1207
Query: 147 GRADG 151
G DG
Sbjct: 1208 GSGDG 1212
Score = 40.8 bits (94), Expect = 0.57, Method: Composition-based stats.
Identities = 32/111 (28%), Positives = 47/111 (42%), Gaps = 11/111 (9%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNAS 157
G +FD P+ +D G+IYVAD +N I+K + G T G G +G D P
Sbjct: 1116 GDGQFDFPQGITIDADGSIYVADNANQRIQKFDSNGTFITKWGSGGTGDGEFDRPHG--- 1172
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSV 208
V + +SD GN I++ +K +GS G +V
Sbjct: 1173 -------IVVDADGNVFVSDAGNNNIQKFTSTGTFITKWGTAGSGDGQFNV 1216
Score = 40.4 bits (93), Expect = 0.75, Method: Composition-based stats.
Identities = 25/72 (34%), Positives = 34/72 (47%), Gaps = 5/72 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
V++ GDG + G GS +F+ P AVD NIYV D NH ++K + G
Sbjct: 117 VQKFDGDGTYIATWGTKGSGEGQFESPGGIAVDNASNIYVTDTVNHRVQKFDSTGTFVTQ 176
Query: 140 GGGSKKEGRADG 151
G +G DG
Sbjct: 177 WG---NQGAGDG 185
Score = 39.7 bits (91), Expect = 1.5, Method: Composition-based stats.
Identities = 17/36 (47%), Positives = 25/36 (69%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
G A+F +P+ AVD G++YVAD N+ I+K +N G
Sbjct: 42 GDAQFIRPEGIAVDPGGDVYVADPGNNRIQKFSNTG 77
Score = 37.7 bits (86), Expect = 4.6, Method: Composition-based stats.
Identities = 26/57 (45%), Positives = 29/57 (50%), Gaps = 5/57 (8%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
QG DG+ G P S AVD GN+YV D N I+K N G T IA GS G
Sbjct: 1539 QGSGDGQFGLG----PDSVAVDRAGNLYVTDPINSRIQKFDNSG-TFIAAWGSYGAG 1590
Score = 37.7 bits (86), Expect = 5.0, Method: Composition-based stats.
Identities = 37/119 (31%), Positives = 54/119 (45%), Gaps = 19/119 (15%)
Query: 89 GVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS--KK 145
G GYS DGE F+ P AVD N+YV D N I+K + G T IA GS
Sbjct: 1063 GSSGYSEDGE-----FNMPHGIAVDSDSNVYVTDTWNSRIQKFDSTG-TFIAKWGSYGTG 1116
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
+G+ D P Q + D ++ ++D+ NQ I++ + +K G+ G
Sbjct: 1117 DGQFDFP-QGITIDAD---------GSIYVADNANQRIQKFDSNGTFITKWGSGGTGDG 1165
Score = 37.7 bits (86), Expect = 5.7, Method: Composition-based stats.
Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 6/65 (9%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEG 147
G G DG+ F+ P+ AVD +GN++VAD NH I+ TN T G EG
Sbjct: 1205 GTAGSGDGQ-----FNVPRGIAVDSRGNVFVADSLNHRIQIFDTNGTFLTEFGSYGTGEG 1259
Query: 148 RADGP 152
+ P
Sbjct: 1260 EFNEP 1264
Score = 37.0 bits (84), Expect = 8.9, Method: Composition-based stats.
Identities = 22/69 (31%), Positives = 38/69 (55%), Gaps = 3/69 (4%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK-- 144
GD + + G+ +F P+ AVD +G +YV +++N+ ++K G T IA G+K
Sbjct: 77 GDFITKWGAPGTGAGQFSYPRGVAVDGEGYVYVVEQTNNRVQKFDGDG-TYIATWGTKGS 135
Query: 145 KEGRADGPA 153
EG+ + P
Sbjct: 136 GEGQFESPG 144
>gi|327277470|ref|XP_003223487.1| PREDICTED: NHL repeat-containing protein 2-like [Anolis
carolinensis]
Length = 724
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 71/159 (44%), Gaps = 35/159 (22%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
VI G ++V+ + YT+ P S G DG A F+ P+
Sbjct: 235 VIADTGHHRILVVKKTGQILYTVGGPDS--------------GRKDGTYSEATFNSPQGV 280
Query: 110 AVDMKGNI-YVADKSNHVIRKITNLG---VTTIAG---GGSKKEGRADGPAQNASFSNDF 162
A+ K NI YVAD NH+IRK+ +L V+T+AG G+ KEG A G Q S+ +
Sbjct: 281 AI--KNNIIYVADTENHLIRKV-DLQLEMVSTVAGVGIQGTDKEGGARGDEQ--PISSPW 335
Query: 163 ELTFVPHICA------LLISDHGNQLIRQINLKPEDCSK 195
++ F + A L I+ G I QI DC K
Sbjct: 336 DVAFGTSVSATHEDDVLWIAMAG---IHQIWALMLDCGK 371
>gi|290997205|ref|XP_002681172.1| predicted protein [Naegleria gruberi]
gi|284094795|gb|EFC48428.1| predicted protein [Naegleria gruberi]
Length = 2212
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 42/130 (32%), Positives = 58/130 (44%), Gaps = 24/130 (18%)
Query: 85 LAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
LAGDG QG YS + +P+ V G+IY+ D N+VIRKI + G ++T G G
Sbjct: 480 LAGDGYQGIYS-------QLSQPRGVCVGPTGDIYIVDAGNYVIRKIDSNGIISTFIGDG 532
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDH------------GNQLIRQINLKP 190
G DG A A +T + + LLISD NQ IR+ K
Sbjct: 533 V--SGYRDGDALTARIGFASAITCLSN-GDLLISDSISDGMLFFGNYLNNQRIRKFTAKT 589
Query: 191 EDCSKSSQSG 200
+ + +G
Sbjct: 590 NQVTTVAGTG 599
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 58/124 (46%), Gaps = 3/124 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGS 143
LAG G S+ A+ P VD + IYV+D + IRKI N + TIAG G
Sbjct: 151 LAGGGSNLSSNISALDAKLGAPFGLLVDEANDVIYVSDTYKYTIRKIVNGTIYTIAGTGV 210
Query: 144 KKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
+ + +G A +AS + LTF P L+ + + +R+I + + S + A
Sbjct: 211 EDKNTPEGSLAASASIAKVLTLTFHPLTRELIYLEETGR-VRRITNEGRIFTLYSGTNGA 269
Query: 203 LGAV 206
+G V
Sbjct: 270 IGMV 273
Score = 41.2 bits (95), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 56/113 (49%), Gaps = 6/113 (5%)
Query: 82 VKRLAGDGVQGYS-DGEPGS-ARFDKPKS-FAVDMKGNIYVADKSNHVIRKITNLG--VT 136
V +AG GV+ YS DG P A + P + + G+I D N+ IRKI+N +T
Sbjct: 592 VTTVAGTGVRSYSGDGGPAIIAPLNGPTGVYYNETNGDIIFCDSENYRIRKISNSTGLIT 651
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
TIAG G DG A A+ F ++ P + SD + ++R+I+ K
Sbjct: 652 TIAGTGVYSY-NGDGLAALATNIAPFGISVHPITDEIYFSDMNSFMVRKIDSK 703
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 50/120 (41%), Gaps = 14/120 (11%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK------------SNHVIR 128
++ GDGV GY DG+ +AR + G++ ++D +N IR
Sbjct: 524 IISTFIGDGVSGYRDGDALTARIGFASAITCLSNGDLLISDSISDGMLFFGNYLNNQRIR 583
Query: 129 KIT--NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
K T VTT+AG G + GPA A + + + ++ D N IR+I
Sbjct: 584 KFTAKTNQVTTVAGTGVRSYSGDGGPAIIAPLNGPTGVYYNETNGDIIFCDSENYRIRKI 643
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 50/119 (42%), Gaps = 9/119 (7%)
Query: 76 LSEESVVKRLAGDGVQGY--------SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
+S ++ R+AG GY + F SF G + +AD +N+VI
Sbjct: 923 ISGTGILTRVAGKKECGYVIDSMLLNQTTFSTNTTFGYSISFTYMSNGEMLIADTNNNVI 982
Query: 128 RKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
RK+ G +TI G D A+ A +N L+ + ++ SD GN IR +
Sbjct: 983 RKVDLNGYSTIIAGNGTAGFNGDSDAKQAYLNNPQGLSVLSD-GRIIFSDSGNDRIRML 1040
>gi|108707786|gb|ABF95581.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
protein, expressed [Oryza sativa Japonica Group]
Length = 1064
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 66/167 (39%), Gaps = 17/167 (10%)
Query: 30 DGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSS------RSAFYTLSFPLSEESVVK 83
DGY + L IN H V+ S I L+S + Y L L ++
Sbjct: 735 DGYERNLNGSSYPLCINEH-VVYVSFYSHGIALESHFYCECWNCSLYPLLCYLGPTWFLQ 793
Query: 84 RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGG 141
L GD DG P IYVAD NH I+++ VTTIAG
Sbjct: 794 YLFGD-----HDGTGSDVLLQHPLGVVYASDNQIYVADSYNHKIKRLDPVTRKVTTIAGT 848
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G + G DGPA +A S L V LL++D N IR I L
Sbjct: 849 G--RAGYKDGPALSAQLSEPAGLVEVGD-GRLLVADTNNSTIRYIVL 892
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
G +G DG +A F++P+ A + K NI YVAD NH +R +T
Sbjct: 610 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALRTLT 655
>gi|332212861|ref|XP_003255539.1| PREDICTED: NHL repeat-containing protein 2 [Nomascus leucogenys]
Length = 726
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAGKVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343
>gi|144899324|emb|CAM76188.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense
MSR-1]
Length = 477
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 54/116 (46%), Gaps = 11/116 (9%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV-DMKGNIYVADKSNHVIRKI--TNLG 134
+ +++ LAG G + DGE G + +P A+ D G++Y AD +R++ +
Sbjct: 312 DTGMIRMLAGSGGENLGDGEAGQSLLAQPSGLALSDDGGSLYFADAETSALRRVCLSTGR 371
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA---LLISDHGNQLIRQIN 187
V T+ G G G +GP A + ++ CA L I+D N IR ++
Sbjct: 372 VDTLVGQGLFDFGHKNGPLAQARLQHPLGVS-----CANGRLFIADSYNHAIRVVD 422
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 33/65 (50%), Gaps = 9/65 (13%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
G G+ G DG A F+ P+ A+D + IYVAD NH+IR+I GG +
Sbjct: 210 GSGLAGRDDGI--DASFNGPEGVAID-RAFIYVADTRNHLIRRIDR------QSGGDRHH 260
Query: 147 GRADG 151
RA G
Sbjct: 261 RRAQG 265
>gi|124003658|ref|ZP_01688507.1| NHL repeat [Microscilla marina ATCC 23134]
gi|123991227|gb|EAY30679.1| NHL repeat [Microscilla marina ATCC 23134]
Length = 508
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 55/116 (47%), Gaps = 5/116 (4%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGV 135
+ ++VKR AG G + +DG + F++P S V +YVAD IR I V
Sbjct: 347 QTNLVKRFAGSGREALTDGSFRKSAFNQP-SGLVKNGNTLYVADSEASAIRAIDLAKGEV 405
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
+T G G + G DG A A + +TF + L I+D N I+ +LK +
Sbjct: 406 STPLGKGLFEFGDVDGKANKARLQHAVGVTFRAN--KLYIADTYNGKIKTFDLKTQ 459
Score = 40.0 bits (92), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 57/124 (45%), Gaps = 20/124 (16%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGG-- 141
G G QG DG+ A F +P A +KG+ +YVAD N+VIR++ V TIAG
Sbjct: 245 GSGTQGLKDGDFDKATFYEPHGLA--LKGDFLYVADTKNNVIRRVDLKRKKVKTIAGDGR 302
Query: 142 -----GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKS 196
G ++ G A P D E+ + I++ GN I ++L+ +
Sbjct: 303 LDYYFGKERWGVAVNPNSPWDLWIDGEVMY--------IANAGNHQILTMDLQTNLVKRF 354
Query: 197 SQSG 200
+ SG
Sbjct: 355 AGSG 358
>gi|440794391|gb|ELR15552.1| haloacid dehalogenase family hydrolase family protein [Acanthamoeba
castellanii str. Neff]
Length = 731
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/130 (23%), Positives = 58/130 (44%), Gaps = 6/130 (4%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPKSFA---VDMKGNIYVADKSNHVIRKITNLG-- 134
+ ++ + G G+ DG A+F++ + A D +YVAD NH +R + L
Sbjct: 256 AFLEEIGTRGSLGFKDGSYAEAKFNRLQGVAYHKTDEHERLYVADAENHALRVVDLLAKT 315
Query: 135 VTTIAGGGSKKEGRADGPA-QNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
VTT+AG G++ G + + S +++ P L ++ G I +++L
Sbjct: 316 VTTLAGDGTQGNDFVGGKSGREQQLSTPWDVALSPDGQQLFVAMAGTHQIWEVSLDSSAV 375
Query: 194 SKSSQSGSAL 203
+ S G +
Sbjct: 376 TNYSGDGQEM 385
>gi|148654423|ref|YP_001274628.1| NHL repeat-containing protein [Roseiflexus sp. RS-1]
gi|148566533|gb|ABQ88678.1| NHL repeat containing protein [Roseiflexus sp. RS-1]
Length = 1146
Score = 42.7 bits (99), Expect = 0.17, Method: Composition-based stats.
Identities = 47/187 (25%), Positives = 75/187 (40%), Gaps = 36/187 (19%)
Query: 11 AITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHS--VIDRPGSSDLIVLDSSRSA 68
A+ + LV++ + E G ++ + I+P ++ + LIV D S +
Sbjct: 818 AVAATGSIIRLVATDIFGEFGNAPGQLVQPRGIAIDPQGNIIVSDSANHRLIVFDPSGTP 877
Query: 69 FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
T+ G G DG+ F +P+ AVD +GNIYVAD N I
Sbjct: 878 IRTI----------------GSFGSGDGQ-----FYEPRGVAVDAQGNIYVADTWNARIV 916
Query: 129 KITNLGVTTIAGG------GSKKEGRADGPAQNASFSNDFELTFVPHICA------LLIS 176
K+ G + G G + G Q+A+ + +L F P A + I+
Sbjct: 917 KLDPQGTFLASWGVGRDDFGDGRRASPTGGTQDANLARPLDL-FGPRGVAVDADGNVYIA 975
Query: 177 DHGNQLI 183
D GN+ I
Sbjct: 976 DTGNKRI 982
>gi|219851408|ref|YP_002465840.1| carbohydrate binding family 6 [Methanosphaerula palustris E1-9c]
gi|219545667|gb|ACL16117.1| Carbohydrate binding family 6 [Methanosphaerula palustris E1-9c]
Length = 627
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 27/48 (56%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
G +F+ P+ AVD GNIYVAD N ++K T+ G A G + E
Sbjct: 44 GDGQFNYPQGIAVDGAGNIYVADNGNKQVQKFTSSGTFVTAWGTAGPE 91
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 29/66 (43%), Gaps = 3/66 (4%)
Query: 86 AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
+G V + P RF +P AVD +GN+YV D ++K T G G
Sbjct: 78 SGTFVTAWGTAGPEDDRFSEPSGIAVDSEGNVYVTDTGKFRVQKFTPTGTLITEWG---T 134
Query: 146 EGRADG 151
EG DG
Sbjct: 135 EGTGDG 140
>gi|421872066|ref|ZP_16303685.1| NHL repeat family protein [Brevibacillus laterosporus GI-9]
gi|372458678|emb|CCF13234.1| NHL repeat family protein [Brevibacillus laterosporus GI-9]
Length = 633
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 36/73 (49%), Gaps = 16/73 (21%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG------- 141
G +G DG+ F++P A+D K NIYV D++NH I+K N G G
Sbjct: 487 GQKGTKDGQ-----FNQPMQLAIDSKDNIYVVDRNNHRIQKFDNSGRFLAKWGTNGGAGA 541
Query: 142 ----GSKKEGRAD 150
G+ KEG+ D
Sbjct: 542 TDPLGNWKEGKGD 554
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 45/104 (43%), Gaps = 11/104 (10%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
S F P AVD KGNI+VAD NH I+K + T G K+G+ + P Q A
Sbjct: 446 SGFFQFPAGVAVDSKGNIFVADTLNHRIQKFNPFFIYMTEWGQKGTKDGQFNQPMQLAID 505
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
S D + + D N I++ + +K +G A
Sbjct: 506 SKD----------NIYVVDRNNHRIQKFDNSGRFLAKWGTNGGA 539
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 40/86 (46%), Gaps = 11/86 (12%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
F P + A D+ GNIYVAD NH I K+ G V GG K G+ + P A
Sbjct: 62 FRTPVAMAKDLTGNIYVADMGNHRIVKMNKSGKVLETFGGLGDKPGKFNMPFGVA----- 116
Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
+ +L++D GN I++ +
Sbjct: 117 -----IDKEGNILVADTGNYRIQKFD 137
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+F P+ A+D G +YVAD NH I+K TN GV
Sbjct: 400 GQFIVPRQVAIDAYGFVYVADSVNHRIQKFTNTGV 434
>gi|326781271|ref|ZP_08240536.1| Ricin B lectin [Streptomyces griseus XylebKG-1]
gi|326661604|gb|EGE46450.1| Ricin B lectin [Streptomyces griseus XylebKG-1]
Length = 508
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 2/87 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFS 159
A+ +KP AVD G +YVA+ H +R++ GV TT+ G G+ GPA A +
Sbjct: 263 AQLNKPYGIAVDSTGTLYVAEYGGHRVRRVGADGVITTVVGTGAAGSAGDGGPAAEAQLN 322
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
F LT V + AL I+DH N +R++
Sbjct: 323 CPFGLT-VDSVDALYIADHLNHRVRKV 348
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 2/88 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFS 159
A+ + P+ AVD G +Y+AD + +R++ GV T+AG G GPA A +
Sbjct: 95 AQLNCPRQIAVDGDGALYIADAGGNRVRRVGADGVIVTVAGTGVAGSAGDGGPAAKAQLN 154
Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
F + V L +++ G +R++
Sbjct: 155 KPFGVA-VDGEGVLYVAEFGGHRVRRVG 181
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 30/55 (54%), Gaps = 2/55 (3%)
Query: 82 VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+ +AG G G+ DG SA+ P AV G +YV+D SNH +R+IT G
Sbjct: 18 ISTIAGSGTAGFGGDDGAAVSAQLKHPYEMAVSGTGILYVSDYSNHRVRRITTDG 72
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 37/62 (59%), Gaps = 2/62 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + V+ +AG GV G + DG P + A+ +KP AVD +G +YVA+ H +R++
Sbjct: 124 VGADGVIVTVAGTGVAGSAGDGGPAAKAQLNKPFGVAVDGEGVLYVAEFGGHRVRRVGAD 183
Query: 134 GV 135
GV
Sbjct: 184 GV 185
>gi|299142776|ref|ZP_07035905.1| NHL repeat-containing protein [Prevotella oris C735]
gi|298575805|gb|EFI47682.1| NHL repeat-containing protein [Prevotella oris C735]
Length = 448
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 12/105 (11%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
V++ AG G DG SA F P A D+ GN+YV + IRKI T+ VTT+A
Sbjct: 339 VEQFAGFNGVGSQDGPRMSASFKNPTGMAFDVDGNMYVTESMGFTIRKIGHTDGMVTTVA 398
Query: 140 GGGSKKE-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
G +K + + +G +F+ ++++ + D GN I
Sbjct: 399 GIYTKSDNNKVEGLPLETTFNYPYDIS---------VDDEGNFYI 434
>gi|281423977|ref|ZP_06254890.1| NHL repeat-containing domain protein [Prevotella oris F0302]
gi|281401902|gb|EFB32733.1| NHL repeat-containing domain protein [Prevotella oris F0302]
Length = 469
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 12/105 (11%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
V++ AG G DG SA F P A D+ GN+YV + IRKI T+ VTT+A
Sbjct: 360 VEQFAGFNGVGSQDGPRMSASFKNPTGMAFDVDGNMYVTESMGFTIRKIGHTDGMVTTVA 419
Query: 140 GGGSKKE-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
G +K + + +G +F+ ++++ + D GN I
Sbjct: 420 GIYTKSDNNKVEGLPLETTFNYPYDIS---------VDDEGNFYI 455
>gi|290973472|ref|XP_002669472.1| basal body protein NBP-1 [Naegleria gruberi]
gi|284083020|gb|EFC36728.1| basal body protein NBP-1 [Naegleria gruberi]
Length = 2352
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 85 LAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGG 142
+AG G +G++ D A+ D P ++ +Y+ D N+ +RKI + GV TIAG G
Sbjct: 76 IAGTGYEGFNGDILAKQAKLDNPGG-VLEYNNELYIMDYGNNRVRKINSEGVLVTIAGTG 134
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
++ +G A +AS + + + +P + I+++ IR+I
Sbjct: 135 TRSSAGDNGAATSASLNGPWGI-HIPSNGDIYITEYVGNKIRKI 177
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 52/106 (49%), Gaps = 12/106 (11%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGS 143
+GDG G SA+ P S V+ G I++AD N IRKI TN +TTIAG GS
Sbjct: 195 FSGDG------GNAVSAKLKSPWSVFVNAIGEIFIADTDNDRIRKIATNGIITTIAGSGS 248
Query: 144 KKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
+DG A AS + F+ L I++ IR+++L
Sbjct: 249 ST---SDGVLATTASLKKPTSV-FISPANELFIAEADGGRIRKVDL 290
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 39/79 (49%), Gaps = 19/79 (24%)
Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSK--------KEGRADGPAQNASFSNDFELTFVP 168
+Y+++ S H ++K + LG+ IAG G + K+ + D P ++N+
Sbjct: 55 VYMSEYSGHRVKKKSYLGLVVIAGTGYEGFNGDILAKQAKLDNPGGVLEYNNE------- 107
Query: 169 HICALLISDHGNQLIRQIN 187
L I D+GN +R+IN
Sbjct: 108 ----LYIMDYGNNRVRKIN 122
>gi|149177401|ref|ZP_01856005.1| hypothetical protein PM8797T_18906 [Planctomyces maris DSM 8797]
gi|148843734|gb|EDL58093.1| hypothetical protein PM8797T_18906 [Planctomyces maris DSM 8797]
Length = 683
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 61/131 (46%), Gaps = 11/131 (8%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P ++ P L + DS+ + S + + + G G G DG+ +A FD P
Sbjct: 223 PGKLLADPAQQRLFISDSNHNRIVIASL----DGKLIDVIGSGQIGNKDGDYKTASFDHP 278
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGR-ADGPAQNASFSNDFE 163
+ A+ + +YVAD NH IR + V+T+AG G + R A G + A+ ++ +
Sbjct: 279 QGMAL-VGNTLYVADTENHSIRTVDLDKKQVSTLAGTGEQARFRSAGGKLKEAALNSPWA 337
Query: 164 LTFVP---HIC 171
L + +IC
Sbjct: 338 LAAIDDVLYIC 348
>gi|428162955|gb|EKX32055.1| hypothetical protein GUITHDRAFT_58597, partial [Guillardia theta
CCMP2712]
Length = 186
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 53/105 (50%), Gaps = 5/105 (4%)
Query: 85 LAGDGVQGYSDGEPG-SARFDKPKSFAVDMKG-NIYVADKSNHVIRKI--TNLGVTTIAG 140
LAGDG G D G +A+F P AVD G ++ VAD +H IR I ++ V+T+ G
Sbjct: 83 LAGDGTNGLVDNVNGAAAKFYNPTGVAVDFAGQSLLVADSRDHTIRIIDLSSTAVSTLVG 142
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
G + P +A+ S F ++ L+SD GN IR+
Sbjct: 143 TGVAGGEDSIDP-NSATLSLPFGVSVTRDGRWGLVSDQGNSGIRR 186
>gi|290988149|ref|XP_002676784.1| predicted protein [Naegleria gruberi]
gi|284090388|gb|EFC44040.1| predicted protein [Naegleria gruberi]
Length = 2167
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 45/101 (44%), Gaps = 17/101 (16%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT---TIAGGGSKKEG-- 147
Y DG A+F+ P A IY+ADK NHVIR I +G++ ++ G +K G
Sbjct: 231 YIDGNTTIAKFNSPTGVAYGPSKEIYIADKMNHVIRMIEFVGMSGTVSVISGIAKSSGND 290
Query: 148 --RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
R + P A S + I D GN IR++
Sbjct: 291 TLRLNSPVDVAVNSQK----------EVFILDSGNNRIRKL 321
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 1/65 (1%)
Query: 77 SEESVVKRLAGDGVQGY-SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
S + +AG G G ++ SA+ P AV + G + +AD SNH IR+++N +
Sbjct: 69 SNSGAITIVAGTGAFGSDTNTNATSAKLYSPSGLAVTIDGKLLIADTSNHAIREVSNGII 128
Query: 136 TTIAG 140
TIAG
Sbjct: 129 RTIAG 133
>gi|242041143|ref|XP_002467966.1| hypothetical protein SORBIDRAFT_01g037250 [Sorghum bicolor]
gi|241921820|gb|EER94964.1| hypothetical protein SORBIDRAFT_01g037250 [Sorghum bicolor]
Length = 360
Score = 42.4 bits (98), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 42/99 (42%), Gaps = 5/99 (5%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
G DG P A +Y+AD NH I+K+ VTTIAG G + G
Sbjct: 92 GDYDGTGSDVLLQHPLGVAYASDNQVYIADSYNHKIKKLDPVTRKVTTIAGTG--RAGYK 149
Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
DGP A S L V LL++D N IR I L
Sbjct: 150 DGPGLAAQLSEPAGLVEVGD-GRLLVADTNNNTIRYITL 187
>gi|157376750|ref|YP_001475350.1| hypothetical protein Ssed_3618 [Shewanella sediminis HAW-EB3]
gi|157319124|gb|ABV38222.1| hypothetical protein Ssed_3618 [Shewanella sediminis HAW-EB3]
Length = 362
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 44/98 (44%), Gaps = 6/98 (6%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G+ + A +P D GNI++ADK H IRKI G+ + G ++ D
Sbjct: 62 GFENHSAKKAELSRPHQAIADSDGNIFIADKDAHAIRKIDLNGMISTYVGTNESGFNGDS 121
Query: 152 PAQ---NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ SF N LT AL I D GN +IR++
Sbjct: 122 LKRTLTQLSFPNGIWLTASE---ALFILDSGNDMIRRV 156
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 7/97 (7%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
G G++ +DG FD+ + AV+ KG ++V D+ +++ K++ G I G K+
Sbjct: 201 GRGIRVLADG------FDELANLAVNAKGELFVTDRDANLVYKVSANGDKEIVAGTGKRS 254
Query: 147 GRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
G DG A + + + + F P+ L + G Q+
Sbjct: 255 GCGDGCLATDTALNQVRGIWFHPNGGYFLATHKGGQI 291
>gi|290997291|ref|XP_002681215.1| predicted protein [Naegleria gruberi]
gi|284094838|gb|EFC48471.1| predicted protein [Naegleria gruberi]
Length = 1196
Score = 42.4 bits (98), Expect = 0.19, Method: Composition-based stats.
Identities = 44/146 (30%), Positives = 65/146 (44%), Gaps = 21/146 (14%)
Query: 61 VLDSSRSAFYTLSFPLSE--------ESVVKRLAGDG----VQGYSDGEPGSARFDK--- 105
++ + SAF +S +SE + +K++ +G + G S G G K
Sbjct: 128 IISTYGSAFSPVSLTMSETDDVLVGTDYSIKQIFKNGTNVRIAGASYGYGGDGSLAKDCK 187
Query: 106 ---PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
P+ + G+IY AD NH IRKI G VTTIAG G+ G A NA
Sbjct: 188 ITSPRGITAAI-GSIYFADAFNHRIRKIDKDGIVTTIAGTGTLGFSGDGGLAVNAQLYTP 246
Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
L +P + I+D N IR+++
Sbjct: 247 AGLDVLP-TGVVFIADVNNHRIRRVS 271
Score = 42.0 bits (97), Expect = 0.25, Method: Composition-based stats.
Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 14/104 (13%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+ ++ +V +AG G G+S G+ G +A+ P V G +++AD +NH IR+++
Sbjct: 214 IDKDGIVTTIAGTGTLGFS-GDGGLAVNAQLYTPAGLDVLPTGVVFIADVNNHRIRRVSK 272
Query: 133 LG-VTTIAGGGSKKE---------GRADGPAQNASFSNDFELTF 166
G +TT+AG G+ GR GP+ + + E+ F
Sbjct: 273 DGIITTVAGTGTSGTSGDGSLATLGRIYGPSSISICPTNGEIIF 316
>gi|417404183|gb|JAA48864.1| Putative nhl repeat-containing protein 2 [Desmodus rotundus]
Length = 725
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 39/75 (52%), Gaps = 6/75 (8%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSND 161
G A G Q S D
Sbjct: 326 GGAKGEEQPISSPWD 340
>gi|406832294|ref|ZP_11091888.1| NHL repeat containing protein [Schlesneria paludicola DSM 18645]
Length = 707
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 4/84 (4%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
G G G+ DG A+FD P+ + + +YVAD NH++R I V+T AG G +
Sbjct: 268 GSGAIGHKDGGYAVAQFDHPQGMTL-VGDTLYVADTENHLLRAIDLKQKHVSTFAGVGEQ 326
Query: 145 KEGRADGPA-QNASFSNDFELTFV 167
RA G + + ++ ++LT V
Sbjct: 327 ARTRAPGGTLRETALNSPWDLTVV 350
>gi|37519956|ref|NP_923333.1| hypothetical protein gll0387 [Gloeobacter violaceus PCC 7421]
gi|35210948|dbj|BAC88328.1| gll0387 [Gloeobacter violaceus PCC 7421]
Length = 544
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 41/82 (50%), Gaps = 5/82 (6%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI--TNLGVTTIAGGGS 143
G G+ G+ DG A F P+ A+ G ++V D NH++RKI V+T AG G
Sbjct: 251 GSGIPGWRDGNFEEAEFWAPQGLALSADGRTLFVCDTENHLLRKIDLVRRQVSTFAGTGE 310
Query: 144 KK--EGRADGPAQNASFSNDFE 163
+ G+ +GP ++ ++
Sbjct: 311 QSLGYGQVEGPGLEIPLNSPWD 332
>gi|20092638|ref|NP_618713.1| cell surface protein [Methanosarcina acetivorans C2A]
gi|19917919|gb|AAM07193.1| cell surface protein [Methanosarcina acetivorans C2A]
Length = 526
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+ + G G+ +F P AVD GNIYVAD +NH ++K+ + GV
Sbjct: 84 IDKWGSGGSGNGQFYTPNGVAVDSMGNIYVADYNNHRVQKLDSTGV 129
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 68/161 (42%), Gaps = 25/161 (15%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSAR--FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
S + + +L DG G G++R F+ P+ AVD GNIYV D N I K G
Sbjct: 163 SRSNRIVKLNKDGNYLTQWGSKGASRNQFNDPEGIAVDSSGNIYVVDSGNSRIMKFDGTG 222
Query: 135 VT-TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN------ 187
T G +++G+ P A + A+ ++D GN+ I++ +
Sbjct: 223 TYLTEWGTPGQEDGQFRSPHGIA----------IDSSGAIYVTDTGNRRIQKFDSTGSYV 272
Query: 188 ---LKPEDCSKSSQSGSAL---GAVSVWVLVSVLSCLVSLV 222
+ PE+ Q+ + + +V+V+ S C+ V
Sbjct: 273 TKWVSPENGDGKFQNPVGIVVDSSNNVYVVDSFYHCVFQFV 313
>gi|300791003|ref|YP_003771294.1| NHL repeat-containing protein [Amycolatopsis mediterranei U32]
gi|384154546|ref|YP_005537362.1| NHL repeat-containing protein [Amycolatopsis mediterranei S699]
gi|399542881|ref|YP_006555543.1| NHL repeat-containing protein [Amycolatopsis mediterranei S699]
gi|299800517|gb|ADJ50892.1| NHL repeat-containing protein [Amycolatopsis mediterranei U32]
gi|340532700|gb|AEK47905.1| NHL repeat-containing protein [Amycolatopsis mediterranei S699]
gi|398323651|gb|AFO82598.1| NHL repeat-containing protein [Amycolatopsis mediterranei S699]
Length = 605
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 50/111 (45%), Gaps = 8/111 (7%)
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNA 156
+PG RF P +G + VAD +H I + + G T I GS G DGP A
Sbjct: 163 QPGELRF--PSKAVATAEGRVLVADTGHHAIVEFASDGETVIRRFGSGARGSQDGPFDIA 220
Query: 157 SFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSGS 201
+F+ + +P+ A +++D +R ++L + + + +G+
Sbjct: 221 TFTEPSGIALLPYDIAERVGYHAVVADTAGHRLRGLDLITGEVTTVAGTGA 271
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 49/111 (44%), Gaps = 6/111 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTT 137
++R AG V+G DG+ G A F + AVD +++ D +R I G V T
Sbjct: 322 IRRFAGTTVEGLRDGDVGEAFFAQTSGLAVDGD-KLWLVDAETSALRWIEPAGESFTVQT 380
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
G G DGPA A + L +P + I+D N +R+ ++
Sbjct: 381 AIGVDLFSFGHTDGPADQALLQHPLGLAVLPG-DKIAIADTYNGAVRRFDV 430
>gi|290972811|ref|XP_002669144.1| predicted protein [Naegleria gruberi]
gi|284082687|gb|EFC36400.1| predicted protein [Naegleria gruberi]
Length = 1362
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 46/97 (47%), Gaps = 4/97 (4%)
Query: 86 AGDGVQGYSDGEP--GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
AG G+ GYS S F+ P A + + +++AD +NH IRK+ AG G+
Sbjct: 429 AGTGLVGYSGNYTLRNSTLFNSPNKLAFNSRNELFIADSNNHRIRKVVGGMTFDYAGNGN 488
Query: 144 KKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHG 179
D G A NA +N L +P+ LLI+ G
Sbjct: 489 PTGSDNDGGSALNAVLANPTSLFVLPND-DLLIASTG 524
Score = 40.8 bits (94), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 55/111 (49%), Gaps = 6/111 (5%)
Query: 91 QGYS-DGEPG-SARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLG--VTTIAGGGSKK 145
+GYS DG +A+ + P S VD G+++ D NH IRKI+ V+T+AG G +
Sbjct: 107 RGYSGDGSLAVNAKINGPTSVIVDSSNGDVFFTDDWNHCIRKISASSGIVSTVAGTGVQG 166
Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKS 196
G A + S + F + D+GN +IR++N+ +K+
Sbjct: 167 FDGNTGNALSMRLSIPRSIRFTSAT-TMTFLDYGNCIIRELNISSSQITKT 216
>gi|426253136|ref|XP_004020256.1| PREDICTED: NHL repeat-containing protein 2 [Ovis aries]
Length = 726
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESSFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGDEQ--PISSPWDVVF 343
Score = 38.1 bits (87), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 52/104 (50%), Gaps = 5/104 (4%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
G++ Y D P S P AVD N + +AD +H I + G T + GG G
Sbjct: 210 GIKLYKDSLPPSPLL-FPGKIAVDHVSNRLVIADTGHHRILVVWKNGQTQHSIGGPNP-G 267
Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
R DG +SF++ + + +I + ++D N LIR+I+L+ E
Sbjct: 268 RKDGIFSESSFNSPQGVAIMNNI--IYVADTENHLIRKIDLEAE 309
>gi|139949007|ref|NP_001077192.1| NHL repeat-containing protein 2 [Bos taurus]
gi|166233893|sp|A4IF69.1|NHLC2_BOVIN RecName: Full=NHL repeat-containing protein 2
gi|134025213|gb|AAI34431.1| NHLRC2 protein [Bos taurus]
gi|296472617|tpg|DAA14732.1| TPA: NHL repeat-containing protein 2 [Bos taurus]
Length = 726
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESSFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGDEQ--PISSPWDVVF 343
>gi|149278854|ref|ZP_01884989.1| hypothetical protein PBAL39_06746 [Pedobacter sp. BAL39]
gi|149230473|gb|EDM35857.1| hypothetical protein PBAL39_06746 [Pedobacter sp. BAL39]
Length = 460
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 49/111 (44%), Gaps = 14/111 (12%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKG----------NIYVADKSNHVIRKITNLG-VTT 137
G GY+DG SA + P V +K + Y DK NH IR +T G V+T
Sbjct: 347 GTAGYADGIGTSALLNWPVQ-GVFVKNPNYEGEEDPYDFYFCDKENHAIRTLTPKGRVST 405
Query: 138 IAG-GGSKKEGRADGPAQ-NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG G + G ADG + A F + + I D GN LIR+I
Sbjct: 406 YAGRGNNGTSGYADGDLRTEARFKQPQAIVYDEERKCFYIGDTGNNLIRKI 456
>gi|302036773|ref|YP_003797095.1| hypothetical protein NIDE1420 [Candidatus Nitrospira defluvii]
gi|190343188|gb|ACE75576.1| putative NHL repeat protein [Candidatus Nitrospira defluvii]
gi|300604837|emb|CBK41169.1| protein of unknown function, contains NHL repeats [Candidatus
Nitrospira defluvii]
Length = 989
Score = 42.4 bits (98), Expect = 0.23, Method: Composition-based stats.
Identities = 22/46 (47%), Positives = 24/46 (52%), Gaps = 5/46 (10%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
G +G DGE F P AVD GNIYVAD NH I+K G
Sbjct: 160 GQEGVEDGE-----FSSPTGIAVDKDGNIYVADTDNHSIQKFDKDG 200
Score = 39.3 bits (90), Expect = 1.8, Method: Composition-based stats.
Identities = 36/130 (27%), Positives = 55/130 (42%), Gaps = 16/130 (12%)
Query: 82 VKRLAGDGVQGYSDGEPGSA-------RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
V+R DG + G+ G A +FD P D GNIYV D +N I+K G
Sbjct: 241 VQRFDTDGNPMKAWGKFGFAWRGADMNKFDAPWGVTTDQDGNIYVTDTNNARIQKFKGDG 300
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
+ G ++G DG F + FV + ++D GN I++ + + +
Sbjct: 301 TPLLKWG---RDGSFDGA---FFFPRGVAVDFVGNT---YVADEGNNRIQKFDTRGSFLT 351
Query: 195 KSSQSGSALG 204
K + GS G
Sbjct: 352 KWGREGSGPG 361
Score = 38.1 bits (87), Expect = 3.7, Method: Composition-based stats.
Identities = 33/108 (30%), Positives = 49/108 (45%), Gaps = 11/108 (10%)
Query: 82 VKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
+++ GDG G GS F P+ AVD GN YVAD+ N+ I+K G
Sbjct: 293 IQKFKGDGTPLLKWGRDGSFDGAFFFPRGVAVDFVGNTYVADEGNNRIQKFDTRGSFLTK 352
Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G +EG GP Q F + +T + + + D GN I++ +
Sbjct: 353 WG---REG--SGPGQ---FKAPWGVT-CDALGNVYVVDQGNHRIQKFD 391
Score = 37.4 bits (85), Expect = 6.0, Method: Composition-based stats.
Identities = 30/95 (31%), Positives = 39/95 (41%), Gaps = 13/95 (13%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA-GGGSKKEGRADGPAQNAS 157
G +F P D GN+YV D+ NH I+K G A G K EG+ + P+ A
Sbjct: 359 GPGQFKAPWGVTCDALGNVYVVDQGNHRIQKFDGNGTFLCAWGNRGKTEGQLNFPSGVA- 417
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
V A+ + D GN R I P D
Sbjct: 418 ---------VDKEGAVYVVDSGNH--RVIKYVPTD 441
>gi|380694022|ref|ZP_09858881.1| hypothetical protein BfaeM_08588 [Bacteroides faecis MAJ27]
Length = 490
Score = 42.0 bits (97), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 42/89 (47%), Gaps = 4/89 (4%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLS--EESVVKRLAGDGVQGYSDGEPGSARF 103
NPH + L + D + + LS + +V + G++GY DG P A F
Sbjct: 401 NPHQICFTE-DGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGMKGYKDGGPDIAMF 459
Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
+ P AV G I YVAD N VIRK++
Sbjct: 460 NHPFGVAVSADGQIVYVADTGNKVIRKLS 488
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 9/106 (8%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV------TTIAGGGS 143
+G+ DG +A+F+ P G +Y+AD N+ IR I T L + T I G
Sbjct: 387 KGWEDGLLRNAKFNNPHQICFTEDGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGM 446
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
K G DG A F++ F + + ++D GN++IR+++++
Sbjct: 447 K--GYKDGGPDIAMFNHPFGVAVSADGQIVYVADTGNKVIRKLSIE 490
>gi|288921011|ref|ZP_06415303.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288347586|gb|EFC81871.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 801
Score = 42.0 bits (97), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 40/89 (44%), Gaps = 3/89 (3%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA--DGPAQNASF 158
A F P A+ G +YVAD S H +R+I+ G T+ G G + G A +A
Sbjct: 643 ATFTSPSGLALADDGTLYVADSSAHTVRRISTDGKITLVAGTPGAYGYSGDGGAATSAKL 702
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
N L P L I+D N +IR +
Sbjct: 703 YNPHSLAVGPE-GELYIADTYNDVIRVVT 730
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 56/121 (46%), Gaps = 14/121 (11%)
Query: 76 LSEESVVKRLAG-DGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+S + + +AG G GYS G SA+ P S AV +G +Y+AD N VIR +T
Sbjct: 672 ISTDGKITLVAGTPGAYGYSGDGGAATSAKLYNPHSLAVGPEGELYIADTYNDVIRVVTT 731
Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQI 186
G + T AG ADG + + + E ++ + A+ +S+ +R+I
Sbjct: 732 DGKINTFAGTDDVDANTADG-----ALATETEFSYPSGLAIDASGAVYVSEPDRGTVRRI 786
Query: 187 N 187
Sbjct: 787 G 787
>gi|29349674|ref|NP_813177.1| hypothetical protein BT_4266 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341584|gb|AAO79371.1| conserved hypothetical protein with NHL repeat [Bacteroides
thetaiotaomicron VPI-5482]
Length = 490
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 9/106 (8%)
Query: 91 QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV------TTIAGGGS 143
+G+ DG +A+F+ P G +Y+AD N+ IR I T L + T I G
Sbjct: 387 KGWEDGLLRNAKFNNPHQICFTEDGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGM 446
Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
K G DG A F++ F + + ++D GN++IR+++++
Sbjct: 447 K--GYKDGGPDIAMFNHPFGVAVSADGQIVYVADTGNKVIRKLSIE 490
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 42/89 (47%), Gaps = 4/89 (4%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLS--EESVVKRLAGDGVQGYSDGEPGSARF 103
NPH + L + D + + LS + +V + G++GY DG P A F
Sbjct: 401 NPHQICFTE-DGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGMKGYKDGGPDIAMF 459
Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
+ P AV G I YVAD N VIRK++
Sbjct: 460 NHPFGVAVSADGQIVYVADTGNKVIRKLS 488
>gi|347757619|ref|YP_004865181.1| NHL repeat family protein [Micavibrio aeruginosavorus ARL-13]
gi|347590137|gb|AEP09179.1| NHL repeat family protein [Micavibrio aeruginosavorus ARL-13]
Length = 495
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 51/116 (43%), Gaps = 27/116 (23%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFP----------------------LSEESVVKRLAGDGVQ 91
P +D+ L +R+ TLSFP L VK G G
Sbjct: 170 PPFTDISTLVPARTKTSTLSFPARIEDAGDFIFVADSGHNRILVLDRTGAVKMTIGSGEL 229
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---GVTTIAGGGSK 144
G+ DG A+F+ P+ + + N+Y+AD NH++R + NL V T++G G +
Sbjct: 230 GHQDGSFVMAKFNHPRGMVL-IGRNLYIADTDNHMLR-VANLDSGSVMTLSGNGKR 283
>gi|430743486|ref|YP_007202615.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
gi|430015206|gb|AGA26920.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
Length = 307
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 54/124 (43%), Gaps = 14/124 (11%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
K L G GY+ GE F KP++ A+D + IYVAD NH I+ +G G
Sbjct: 169 KWLRQWGGHGYALGE-----FLKPRALAIDDQDRIYVADSCNHRIQVFDTMGKLLFHWG- 222
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
R P Q + ++L P AL + ++GN +++ +L+ +G
Sbjct: 223 ----TRGGAPGQ---MNYPYDLAIGPD-HALYVCEYGNHRVQKFSLEGRSLGVWGSAGRG 274
Query: 203 LGAV 206
G +
Sbjct: 275 PGQL 278
>gi|440911916|gb|ELR61535.1| NHL repeat-containing protein 2, partial [Bos grunniens mutus]
Length = 760
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 283 GRKDGIFSESSFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDKE 341
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 342 GGAKGDEQ--PISSPWDVVF 359
>gi|339008739|ref|ZP_08641312.1| cell surface protein [Brevibacillus laterosporus LMG 15441]
gi|338774539|gb|EGP34069.1| cell surface protein [Brevibacillus laterosporus LMG 15441]
Length = 633
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 13/68 (19%)
Query: 96 GEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-----------TNLGVTTIAGGG 142
G+ G+ +F++P A+D K NIYV D++NH I+K TN GV G
Sbjct: 487 GQKGTKEGQFNQPMQLAIDSKDNIYVVDRNNHRIQKFDNSGRFLAKWGTNGGVGATDQLG 546
Query: 143 SKKEGRAD 150
+ KEG+ D
Sbjct: 547 NWKEGKGD 554
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 32/63 (50%), Gaps = 1/63 (1%)
Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
S F P AVD KGNI+VAD NH I+K + T G KEG+ + P Q A
Sbjct: 446 SGFFQFPAGVAVDSKGNIFVADTLNHRIQKFNPFFIYMTEWGQKGTKEGQFNQPMQLAID 505
Query: 159 SND 161
S D
Sbjct: 506 SKD 508
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 40/86 (46%), Gaps = 11/86 (12%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
F P + A D+ GNIYVAD NH I K+ G V GG K G+ + P A
Sbjct: 62 FRTPVAMAKDLTGNIYVADMGNHRIVKMNKSGKVLETFGGLGDKPGKFNMPFGVA----- 116
Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
+ +L++D GN I++ +
Sbjct: 117 -----IDKEGNILVADTGNYRIQKFD 137
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
+F P+ A+D G +YVAD NH I+K TN GV
Sbjct: 400 GQFIVPRQVAIDAYGFVYVADSVNHRIQKFTNTGV 434
>gi|182440597|ref|YP_001828316.1| hypothetical protein SGR_6804 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|178469113|dbj|BAG23633.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC
13350]
Length = 508
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 2/88 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFS 159
A+ + P+ AVD G +Y+AD + +R++ GV T+AG G GPA A +
Sbjct: 95 AQLNCPRQIAVDGDGALYIADAGGNRVRRVGADGVMVTVAGTGVAGSAGDGGPAAKAQLN 154
Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
F + V L +++ G +R++
Sbjct: 155 KPFGVA-VDGEGVLYVAEFGGHRVRRVG 181
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 2/87 (2%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFS 159
A+ +KP AVD G +YVA+ H +R++ GV +T+AG G+ GPA A +
Sbjct: 263 AQLNKPYGIAVDSTGTLYVAEYGGHRVRRVGADGVISTVAGTGAAGSAGDGGPAAEAQLN 322
Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
F LT V + AL I+DH N +R++
Sbjct: 323 CPFGLT-VDSVDALYIADHLNHRVRKV 348
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 33/62 (53%), Gaps = 5/62 (8%)
Query: 78 EESVVKRL---AGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+E V R+ AG G G+ DG SA+ P AV G +YV+D SNH +R+IT
Sbjct: 11 DEGFVPRISAIAGSGTAGFGGDDGAAVSAQLKHPYEMAVSGTGILYVSDYSNHRVRRITT 70
Query: 133 LG 134
G
Sbjct: 71 DG 72
Score = 38.1 bits (87), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 37/62 (59%), Gaps = 2/62 (3%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+ + V+ +AG GV G + DG P + A+ +KP AVD +G +YVA+ H +R++
Sbjct: 124 VGADGVMVTVAGTGVAGSAGDGGPAAKAQLNKPFGVAVDGEGVLYVAEFGGHRVRRVGAD 183
Query: 134 GV 135
GV
Sbjct: 184 GV 185
>gi|425444428|ref|ZP_18824479.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
gi|389735830|emb|CCI00727.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9443]
Length = 358
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
G +FD P AVD GNIYVAD+ NH ++ + GV A GG
Sbjct: 245 GDGQFDDPVGIAVDSGGNIYVADRDNHRVQVFNSGGVFQSAFGG 288
>gi|348578653|ref|XP_003475097.1| PREDICTED: NHL repeat-containing protein 2-like [Cavia porcellus]
Length = 726
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 39/71 (54%), Gaps = 6/71 (8%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
G DG + F+ P+ A+ M I+VAD NH+IRKI + V+T+AG G+ KE
Sbjct: 267 GRKDGLFSESAFNSPQGVAM-MDNIIFVADTENHLIRKIDLIAEKVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNAS 157
G A G Q S
Sbjct: 326 GGAKGEEQPIS 336
>gi|390341274|ref|XP_003725419.1| PREDICTED: tripartite motif-containing protein 71-like
[Strongylocentrotus purpuratus]
Length = 778
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 15/110 (13%)
Query: 82 VKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTT 137
++ DG + G PG+ +FD+P AV+ +GNI +ADK NH + KI+ +
Sbjct: 540 IQIFNADGSFSHKFGSPGTRNGQFDRPAGVAVNGEGNIIIADKDNHRCQVFKISGQFLFK 599
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
GSK N F+ +++ V +L+SD N I+Q N
Sbjct: 600 FGEKGSK----------NGQFNYPWDVA-VNSEGKILVSDTRNHRIQQFN 638
>gi|433610083|ref|YP_007042452.1| Alkyl hydroperoxide reductase/ thiol specific antioxidant/ Mal
allergen [Saccharothrix espanaensis DSM 44229]
gi|407887936|emb|CCH35579.1| Alkyl hydroperoxide reductase/ thiol specific antioxidant/ Mal
allergen [Saccharothrix espanaensis DSM 44229]
Length = 630
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 67/142 (47%), Gaps = 15/142 (10%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG------EPGSARFDKPKSF 109
S ++V DS+ L E+ ++R+ G G +G DG EP P
Sbjct: 209 SGTILVSDSAHHRLVELD--TDGETALRRI-GTGERGRRDGLNPSFSEPAGLAL-LPAEV 264
Query: 110 AVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSK-KEGRADGPAQNASFSNDFELT 165
A + ++ VAD NH++R + NL VTT+AG G + + G DGPA + ++ +++
Sbjct: 265 AARVGYHVVVADTVNHLLRGL-NLDTGEVTTVAGTGEQWRGGETDGPALDIPLTSPWDVA 323
Query: 166 FVPHICALLISDHGNQLIRQIN 187
+ ++I+ GN + N
Sbjct: 324 WWEPAGGVVIALAGNHTLGLFN 345
Score = 40.4 bits (93), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 53/121 (43%), Gaps = 10/121 (8%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
GDG Y P P + G I V+D ++H + ++ G T + G+ +
Sbjct: 184 GDGP--YVPPAPADTELRFPAKALLTPSGTILVSDSAHHRLVELDTDGETALRRIGTGER 241
Query: 147 GRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSG 200
GR DG N SFS L +P A ++++D N L+R +NL + + + +G
Sbjct: 242 GRRDG--LNPSFSEPAGLALLPAEVAARVGYHVVVADTVNHLLRGLNLDTGEVTTVAGTG 299
Query: 201 S 201
Sbjct: 300 E 300
>gi|374312921|ref|YP_005059351.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
gi|358754931|gb|AEU38321.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
Length = 654
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 47/83 (56%), Gaps = 7/83 (8%)
Query: 54 PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVD 112
P ++ I +D+S +A+ T S V +L+G G V +G G A D+P+S A+D
Sbjct: 374 PTAAGNIAIDTSGNAWVT-----SGNFSVTKLSGTGTVLSVVNGSSGGA-LDEPQSLAID 427
Query: 113 MKGNIYVADKSNHVIRKITNLGV 135
GN++V D N+ + +I++ GV
Sbjct: 428 GSGNVWVNDFENNSVTEISSTGV 450
>gi|290995072|ref|XP_002680155.1| predicted protein [Naegleria gruberi]
gi|284093775|gb|EFC47411.1| predicted protein [Naegleria gruberi]
Length = 1951
Score = 42.0 bits (97), Expect = 0.29, Method: Composition-based stats.
Identities = 32/107 (29%), Positives = 50/107 (46%), Gaps = 8/107 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
+ + GDG SA+ P A+ IYVAD +N+ IRKIT G ++TIAG
Sbjct: 142 INAITGDG------SSATSAQLYNPYGVAISSSDEIYVADYNNNRIRKITTSGIISTIAG 195
Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G+ A +A N + + + + I+D N IR+++
Sbjct: 196 TGTSGYSGDGSSAISAQLYNPYGVA-ISSSDEIYITDTNNNRIRKLS 241
Score = 40.4 bits (93), Expect = 0.79, Method: Composition-based stats.
Identities = 27/71 (38%), Positives = 39/71 (54%), Gaps = 5/71 (7%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKS--NHVIRKIT 131
++ ++ +AG G GY+ DG SA+ P A+ IY+AD N+ IRKIT
Sbjct: 60 ITTSGIISTIAGTGTAGYNGDGSSATSAKLYFPYGVAISSSDEIYIADMDMFNNRIRKIT 119
Query: 132 NLG-VTTIAGG 141
G ++TIAGG
Sbjct: 120 TSGIISTIAGG 130
>gi|351706822|gb|EHB09741.1| NHL repeat-containing protein 2 [Heterocephalus glaber]
Length = 726
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 37/68 (54%), Gaps = 6/68 (8%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ M IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGLFSESTFNSPQGVAM-MDNVIYVADTENHLIRKIDLEAERVSTVAGIGVQGTDKE 325
Query: 147 GRADGPAQ 154
G A G Q
Sbjct: 326 GGAKGEQQ 333
>gi|430760419|ref|YP_007216276.1| YD repeat protein [Thioalkalivibrio nitratireducens DSM 14787]
gi|430010043|gb|AGA32795.1| YD repeat protein [Thioalkalivibrio nitratireducens DSM 14787]
Length = 2395
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 62/134 (46%), Gaps = 10/134 (7%)
Query: 60 IVLDSSRSAFYTLS------FPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAV 111
+ +D+ S F+T F + + V +AG+G + ++ DG P +A F P+ AV
Sbjct: 1125 VAVDAGGSVFFTRRNGGPRIFRANPDGTVHHVAGNGSRTFAGDGGPAVNASFIDPRGIAV 1184
Query: 112 DMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
G++YV D H +R+I G + T+AG G G GPA A +++ P
Sbjct: 1185 GADGSLYVGDFRTHRVRRIGPDGIIRTVAGNGGSGIGGIGGPAVQARIDRTYDVAVGPE- 1243
Query: 171 CALLISDHGNQLIR 184
L I H + R
Sbjct: 1244 GDLYIDSHNYTVFR 1257
>gi|290972172|ref|XP_002668832.1| predicted protein [Naegleria gruberi]
gi|284082360|gb|EFC36088.1| predicted protein [Naegleria gruberi]
Length = 2254
Score = 42.0 bits (97), Expect = 0.31, Method: Composition-based stats.
Identities = 23/60 (38%), Positives = 38/60 (63%), Gaps = 5/60 (8%)
Query: 82 VKRLAGDGVQGYSDGEPG---SARFDKPKS-FAVDMKGNIYVADKSNHVIRKITNLGVTT 137
+ +AG +G SDG+ G +++ + P+ F D+ G+++VAD +NH IRKI N GV +
Sbjct: 86 ISTIAGQMTKG-SDGDGGLATASQMNSPRGVFFDDVTGSLFVADTNNHKIRKIDNFGVIS 144
>gi|290982604|ref|XP_002674020.1| predicted protein [Naegleria gruberi]
gi|284087607|gb|EFC41276.1| predicted protein [Naegleria gruberi]
Length = 499
Score = 41.6 bits (96), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 45/97 (46%), Gaps = 4/97 (4%)
Query: 60 IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
+V+D++ Y F + VV GD ++ + G +FD PK D+ GN+ V
Sbjct: 296 LVIDTTNEDIYICDFD-NHRIVVLSKNGDFLRDFGSKGSGDTQFDGPKDIDFDLDGNLVV 354
Query: 120 ADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNA 156
+D+ NH I ++ L + I + G + P Q A
Sbjct: 355 SDRENHRILVLSKLNGSLIR---KIENGHGNQPGQFA 388
>gi|256833654|ref|YP_003162381.1| alkyl hydroperoxide reductase [Jonesia denitrificans DSM 20603]
gi|256687185|gb|ACV10078.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Jonesia denitrificans DSM 20603]
Length = 628
Score = 41.6 bits (96), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 27/116 (23%), Positives = 55/116 (47%), Gaps = 11/116 (9%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSNHVIR--KITNLGVTT 137
G G +G DG +A+F++P A ++ ++ VAD NH +R ++ V T
Sbjct: 228 GAGERGLVDGPAHTAQFNEPNGLLVLPPDVAAEVGYDVVVADTVNHCVRSLSLSTGAVGT 287
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
+AG G ++ GP ++ +++ + H ++I+ GN + ++ + C
Sbjct: 288 LAGNG--RQYMVGGPDNEGRLTSPWDVVWSAHRKQVIIAMAGNHTLWTLDPRTGQC 341
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 6/80 (7%)
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-- 172
G VAD +H I + GVT A G+ + G DGPA A F+ L +P A
Sbjct: 201 GGYLVADAGHHQIVHVAADGVTEQARYGAGERGLVDGPAHTAQFNEPNGLLVLPPDVAAE 260
Query: 173 ----LLISDHGNQLIRQINL 188
++++D N +R ++L
Sbjct: 261 VGYDVVVADTVNHCVRSLSL 280
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 49/109 (44%), Gaps = 5/109 (4%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAV---DMKGNIYVADKSNHVIRKI-TNLGVTTI 138
+ AG +G DG + F +P AV D ++VAD +R I T+ V T
Sbjct: 342 QWFAGTMNEGLVDGPVAQSWFAQPSGLAVTGVDADERVWVADSETSALRWIDTSDTVHTA 401
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G G G DG A A F + + +P +L+++D N +R+ +
Sbjct: 402 VGAGLFDFGHRDGDADQALFQHPLAVAALPD-NSLVVADTYNGALRRYD 449
>gi|290978917|ref|XP_002672181.1| predicted protein [Naegleria gruberi]
gi|284085756|gb|EFC39437.1| predicted protein [Naegleria gruberi]
Length = 1457
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 8/60 (13%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKIT-NLGVTTIAGGG 142
AGDG G A F++P A+D G++YV D+ NH IRKIT + ++T+AG G
Sbjct: 125 FAGDG------GMAIKALFNRPVHIAIDHSNGDLYVTDQGNHAIRKITWDKTISTVAGIG 178
>gi|374849371|dbj|BAL52388.1| PKD domain protein [uncultured candidate division OP1 bacterium]
gi|374857216|dbj|BAL60069.1| hypothetical protein HGMM_OP4C705 [uncultured candidate division
OP1 bacterium]
Length = 635
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 41/91 (45%), Gaps = 8/91 (8%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNAS 157
PG +F+ P AVD GN+YV D NH I+K + G + G++ G +
Sbjct: 384 PGDGQFNGPTDIAVDAAGNVYVVDSGNHRIQKFDSTG-KFLGKWGTRGSG-------DGQ 435
Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINL 188
F + + ++D GN I++ ++
Sbjct: 436 FETPIGIALDGSGKFIYVADKGNHRIQKFDI 466
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 43/86 (50%), Gaps = 16/86 (18%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV----QGYSDG-EPGSARFD 104
++DR G+ + RS F E + R DG+ Q DG EPG+ F
Sbjct: 271 LVDRQGNRSEQSVLRQRSTF---------EFIAGRKVADGLFFLDQFGQDGSEPGA--FR 319
Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKI 130
+P+ AVD KG+IYVAD NH I++
Sbjct: 320 RPQGIAVDSKGSIYVADTENHRIQRF 345
Score = 40.4 bits (93), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 48/109 (44%), Gaps = 9/109 (8%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
G +F +P++ AVD GN+YV+D NH I+K G + G + G A G F
Sbjct: 504 GDGQFFEPQAIAVDGAGNVYVSDTGNHRIQKFDANGKFLLKWG---RNGLAQG-----QF 555
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVS 207
L F LL+ D N +++ N + + G+ G ++
Sbjct: 556 DVPRGLAFTKQ-GILLVVDQNNNRVQEFNADGTFVRQWGEQGNGEGELN 603
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 59/146 (40%), Gaps = 21/146 (14%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRK--ITNLGV 135
+S K L G +G DG+ F+ P A+D G IYVADK NH I+K I+ V
Sbjct: 417 DSTGKFLGKWGTRGSGDGQ-----FETPIGIALDGSGKFIYVADKGNHRIQKFDISGPTV 471
Query: 136 TTIAGGGSK-------KEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQL 182
+ GS+ GR P F P A + +SD GN
Sbjct: 472 RFVGKWGSECNLTVTPPTGRCIDPDGGGPLQTGDGQFFEPQAIAVDGAGNVYVSDTGNHR 531
Query: 183 IRQINLKPEDCSKSSQSGSALGAVSV 208
I++ + + K ++G A G V
Sbjct: 532 IQKFDANGKFLLKWGRNGLAQGQFDV 557
>gi|124008663|ref|ZP_01693353.1| cohesin domain protein [Microscilla marina ATCC 23134]
gi|123985739|gb|EAY25612.1| cohesin domain protein [Microscilla marina ATCC 23134]
Length = 899
Score = 41.6 bits (96), Expect = 0.37, Method: Composition-based stats.
Identities = 38/115 (33%), Positives = 63/115 (54%), Gaps = 17/115 (14%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHV-IRKI--TNLGV 135
V+ +AG GV G++ G A+ + PK+ A+D + N+Y+AD H+ IRK+ T +
Sbjct: 136 VITPVAGTGVAGFAGDGGLATQAQLNTPKAIALDAENNLYIAD---HIRIRKVDATTGII 192
Query: 136 TTIAGGGSKKEGRADG-PAQNASFS--NDFELTFVPHICALLISDHGNQLIRQIN 187
TT+AG G+ DG PA A+ + +D L + +I + + H IR+I+
Sbjct: 193 TTVAGSGTLT---TDGVPALEANLAPIHDLALDAIGNIYVVEVLKH---RIRKID 241
Score = 38.5 bits (88), Expect = 2.9, Method: Composition-based stats.
Identities = 28/77 (36%), Positives = 38/77 (49%), Gaps = 6/77 (7%)
Query: 81 VVKRLAGDGVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVT 136
++ +AG G +DG P A A+D GNIYV + H IRKI TN+ +T
Sbjct: 191 IITTVAGSGTLT-TDGVPALEANLAPIHDLALDAIGNIYVVEVLKHRIRKIDAQTNI-IT 248
Query: 137 TIAGGGSKKEGRADGPA 153
T+AG + GPA
Sbjct: 249 TVAGSAQQTSNNPIGPA 265
>gi|423214678|ref|ZP_17201206.1| hypothetical protein HMPREF1074_02738 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692584|gb|EIY85821.1| hypothetical protein HMPREF1074_02738 [Bacteroides xylanisolvens
CL03T12C04]
Length = 593
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 43/74 (58%), Gaps = 2/74 (2%)
Query: 72 LSFPLSEES-VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LSF + S V +AG+ +DG+ +A F P+ A+ G+I++A++ N+VIRK+
Sbjct: 281 LSFTYEDLSYTVATVAGNSATTSTDGKGTAASFKFPQGLALAPNGDIWIAERGNNVIRKM 340
Query: 131 -TNLGVTTIAGGGS 143
V+T+A G+
Sbjct: 341 DQEYNVSTVAKSGT 354
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 3/54 (5%)
Query: 79 ESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +K +AG G +G DG +A F+ P S V K +Y+AD+ NH+IR IT
Sbjct: 539 KGTLKTIAGIPGTKGKIDGSALTATFNCPASVLVSDK--VYIADEQNHLIRTIT 590
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 21/35 (60%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
F P S D GN+Y+AD+ N ++KIT+ G T
Sbjct: 399 FKSPMSVTFDSNGNMYIADRDNKAVKKITSGGTVT 433
>gi|294644532|ref|ZP_06722286.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
gi|294805794|ref|ZP_06764671.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
gi|336403611|ref|ZP_08584323.1| hypothetical protein HMPREF0127_01636 [Bacteroides sp. 1_1_30]
gi|345510783|ref|ZP_08790344.1| hypothetical protein BSAG_00302 [Bacteroides sp. D1]
gi|229442801|gb|EEO48592.1| hypothetical protein BSAG_00302 [Bacteroides sp. D1]
gi|292640085|gb|EFF58349.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
gi|294447015|gb|EFG15605.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
gi|335945537|gb|EGN07348.1| hypothetical protein HMPREF0127_01636 [Bacteroides sp. 1_1_30]
Length = 593
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 43/74 (58%), Gaps = 2/74 (2%)
Query: 72 LSFPLSEES-VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LSF + S V +AG+ +DG+ +A F P+ A+ G+I++A++ N+VIRK+
Sbjct: 281 LSFTYEDLSYTVATVAGNSATTSTDGKGTAASFKFPQGLALAPNGDIWIAERGNNVIRKM 340
Query: 131 -TNLGVTTIAGGGS 143
V+T+A G+
Sbjct: 341 DQEYNVSTVAKSGT 354
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 3/54 (5%)
Query: 79 ESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +K +AG G +G DG +A F+ P S V K +Y+AD+ NH+IR IT
Sbjct: 539 KGTLKTIAGIPGTKGKIDGSALTATFNCPASVLVSDK--VYIADEQNHLIRTIT 590
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 21/35 (60%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
F P S D GN+Y+AD+ N ++KIT+ G T
Sbjct: 399 FKSPMSVTFDSNGNMYIADRDNKAVKKITSGGTVT 433
>gi|206603181|gb|EDZ39661.1| NHL repeat domain protein [Leptospirillum sp. Group II '5-way CG']
Length = 494
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 37/69 (53%), Gaps = 4/69 (5%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKS--FAVDMKGNIYVADKSNHVIRKI--TNL 133
E++ ++ + G G+ G +DG A+F P+ F D VAD NH++R++
Sbjct: 210 EKAALREVIGQGIPGSADGSFDQAQFRDPQGVRFCPDDPDTAIVADTGNHLLRRVDFRRR 269
Query: 134 GVTTIAGGG 142
VTTIAG G
Sbjct: 270 SVTTIAGTG 278
>gi|163841552|ref|YP_001625957.1| hypothetical protein RSal33209_2819 [Renibacterium salmoninarum
ATCC 33209]
gi|162955028|gb|ABY24543.1| conserved hypothetical protein [Renibacterium salmoninarum ATCC
33209]
Length = 425
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 42/76 (55%), Gaps = 10/76 (13%)
Query: 79 ESVVKRLAGDGVQGYSDGEPGSARFDKPKS-------FAVDMKGNIYVADKSNHVIRKI- 130
E+VV+ G G +G++DG ARF++P+ A + ++ VAD NH +R I
Sbjct: 88 ETVVRAF-GTGTKGFADGAAQMARFNEPQGLVLLPEVLAAQLGYDVVVADSVNHRLRGIS 146
Query: 131 -TNLGVTTIAGGGSKK 145
T+ V T+AG G ++
Sbjct: 147 LTDGSVRTLAGNGVQR 162
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 51/111 (45%), Gaps = 6/111 (5%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVT--- 136
V LAG G++G DG A F + AVD G+I+VA +R ++T+ V
Sbjct: 226 VSILAGTGLEGLLDGAADQAWFAQSSGLAVDANGDIWVAASETSALRVLRVTDGRVNRVE 285
Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
T G G G DG A A + + +P ++ I+D N +R+ +
Sbjct: 286 TAVGEGLFDFGFQDGDASAARLQHPLGVAALPD-GSVAIADTYNGAVRRYD 335
>gi|395502099|ref|XP_003755423.1| PREDICTED: NHL repeat-containing protein 2 [Sarcophilus harrisii]
Length = 1044
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 49/90 (54%), Gaps = 15/90 (16%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI---TNLGVTTIAG---GGSK 144
G DG + F+ P+ A+ K NI YVAD NH+IRKI T + V+T+AG G+
Sbjct: 587 GRKDGTFLESTFNSPQGVAI--KNNIIYVADTENHLIRKIDLETKM-VSTVAGIGIQGTD 643
Query: 145 KEGRADGPAQNASFSNDFELTF---VPHIC 171
KEG A G Q S+ +++ F V IC
Sbjct: 644 KEGGAKGEEQ--PISSPWDVVFGISVSEIC 671
>gi|290990638|ref|XP_002677943.1| predicted protein [Naegleria gruberi]
gi|284091553|gb|EFC45199.1| predicted protein [Naegleria gruberi]
Length = 1317
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 39/67 (58%), Gaps = 2/67 (2%)
Query: 76 LSEESVVKRLAG-DGVQGY-SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
+S+E ++ L G GV GY ++ + F++PK G++YV D N++IR+I+
Sbjct: 202 VSKEGIISTLYGIPGVSGYNTESDASKTLFNQPKDIHYTNDGDLYVFDSKNNLIRRISKG 261
Query: 134 GVTTIAG 140
+TTI G
Sbjct: 262 IITTIVG 268
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 45/96 (46%), Gaps = 1/96 (1%)
Query: 92 GYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
G DG P + A+ + SF V ++G + +AD +HVIRK++ G+ + G G
Sbjct: 163 GLGDGLPATEAKIQEVSSFFVSVEGEMVIADTQDHVIRKVSKEGIISTLYGIPGVSGYNT 222
Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ + N + + L + D N LIR+I
Sbjct: 223 ESDASKTLFNQPKDIHYTNDGDLYVFDSKNNLIRRI 258
>gi|427384743|ref|ZP_18881248.1| hypothetical protein HMPREF9447_02281 [Bacteroides oleiciplenus YIT
12058]
gi|425728004|gb|EKU90863.1| hypothetical protein HMPREF9447_02281 [Bacteroides oleiciplenus YIT
12058]
Length = 486
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 51/113 (45%), Gaps = 18/113 (15%)
Query: 92 GYSDGEPGSARFDKPKS--------FAVDMKGN---IYVADKSNHVIRKITNLG-VTTIA 139
G+ DG SAR P + + K + Y D+ NH IR +T G VTT A
Sbjct: 361 GWLDGVGASARLSSPYQGIFVKNPDYVAEGKEDQYDFYFCDRDNHAIRVLTPEGIVTTFA 420
Query: 140 GGGS-----KKEGRADGPAQ-NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G GS K EG +G + +A F+ L + A I + GN L+R+I
Sbjct: 421 GRGSTSVDGKPEGYINGAIRADARFNKPVSLAYDEKEDAFYIGEVGNNLLRKI 473
>gi|350403739|ref|XP_003486886.1| PREDICTED: NHL repeat-containing protein 2-like [Bombus impatiens]
Length = 696
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 62/136 (45%), Gaps = 12/136 (8%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIA 139
V+ + G + DG+ +A+F+ P+ + + IYVAD NH IRKI + V TIA
Sbjct: 259 VQHIIGGPNPDFKDGDFETAKFNAPQGVST-LGSLIYVADTKNHAIRKIDLMRRIVITIA 317
Query: 140 GGGSKKEGRADGP-AQNASFSNDFELTFVPH------ICALLISDHGNQLIRQINLKPED 192
G G K G ++ S+ ++L H LLI+ GN I + L+
Sbjct: 318 GTGIKGHDYVGGKIGKDQVLSSPWDLAVYKHEHDNNITPILLIAMAGNHQIWALFLEDTI 377
Query: 193 CSKSSQ--SGSALGAV 206
K+ + G+ L V
Sbjct: 378 WWKNKKYNEGTCLAIV 393
>gi|219853190|ref|YP_002467622.1| SMP-30/Gluconolaconase/LRE domain-containing protein
[Methanosphaerula palustris E1-9c]
gi|219547449|gb|ACL17899.1| SMP-30/Gluconolaconase/LRE domain protein [Methanosphaerula
palustris E1-9c]
Length = 343
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 1/52 (1%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
PG +F P+ AVD GN+YV D N+ I+K T+ G T I G++ G A
Sbjct: 105 PGDGQFKSPEGIAVDNAGNVYVVDTVNNQIQKFTSDG-TFITKWGTRASGIA 155
>gi|402571883|ref|YP_006621226.1| hypothetical protein Desmer_1348 [Desulfosporosinus meridiei DSM
13257]
gi|402253080|gb|AFQ43355.1| conserved repeat protein [Desulfosporosinus meridiei DSM 13257]
Length = 1977
Score = 41.6 bits (96), Expect = 0.39, Method: Composition-based stats.
Identities = 31/86 (36%), Positives = 39/86 (45%), Gaps = 2/86 (2%)
Query: 77 SEESVVKRLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
S + KR AG GV + G+ +F P AVD GN+YVAD N I+K +N
Sbjct: 112 SNHRIQKRNAGTGVWTTWGSSGSGNKQFSSPSGVAVDADGNVYVADTFNSRIQKFDSNGN 171
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSN 160
T G EG+ P A SN
Sbjct: 172 YLTQWGSLGSDEGQLKYPFDLAVSSN 197
Score = 40.8 bits (94), Expect = 0.58, Method: Composition-based stats.
Identities = 26/62 (41%), Positives = 34/62 (54%), Gaps = 4/62 (6%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI---AGGGSKKEGRADGPAQ 154
G+ +F P AVD GN+YVAD SNH I+K GV T +G G+K+ G A
Sbjct: 88 GAGKFYSPAGVAVDGAGNVYVADTSNHRIQKRNAGTGVWTTWGSSGSGNKQFSSPSGVAV 147
Query: 155 NA 156
+A
Sbjct: 148 DA 149
Score = 39.3 bits (90), Expect = 1.7, Method: Composition-based stats.
Identities = 19/39 (48%), Positives = 23/39 (58%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
G +F S AVD NIYVAD NH I+++TNL T
Sbjct: 556 GDGQFSGIYSLAVDRNDNIYVADSQNHRIQRLTNLYTVT 594
Score = 37.7 bits (86), Expect = 5.4, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 34/73 (46%), Gaps = 1/73 (1%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAGGGSKKEGR 148
V+ + G+ +FD VD GN+YVAD +N I+K T+ G T G +G+
Sbjct: 455 VRSWGSQGSGNGQFDNMNGIGVDSSGNVYVADTNNSRIQKFTSEGGYLTQWGSAGSTDGK 514
Query: 149 ADGPAQNASFSND 161
P A S D
Sbjct: 515 FYSPRGIAVDSQD 527
>gi|290970758|ref|XP_002668244.1| predicted protein [Naegleria gruberi]
gi|284081534|gb|EFC35500.1| predicted protein [Naegleria gruberi]
Length = 735
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 1/51 (1%)
Query: 95 DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
+G+P SA F P S V G IYV+D H IRKI N +T IAG G++
Sbjct: 391 NGQPALSAHFKLPTSIFVTDDGTIYVSDTQLHQIRKIQNGIITAIAGTGTQ 441
Score = 37.4 bits (85), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 38/74 (51%), Gaps = 11/74 (14%)
Query: 78 EESVVKRLAGDGVQGYSDG----EPGSA---RFD--KPKSFAVDMKGNIYVADKSNHVIR 128
+ ++ +AG G Q + G + GS R D P V G +Y AD SNH +R
Sbjct: 428 QNGIITAIAGTGTQCTTAGSNTCDHGSTSPLRVDLYTPMGIYVTKSGEVYFADTSNHKVR 487
Query: 129 KITN--LGVTTIAG 140
KI++ L +TT+AG
Sbjct: 488 KISSDGLTITTVAG 501
>gi|290982063|ref|XP_002673750.1| predicted protein [Naegleria gruberi]
gi|284087336|gb|EFC41006.1| predicted protein [Naegleria gruberi]
Length = 1759
Score = 41.6 bits (96), Expect = 0.40, Method: Composition-based stats.
Identities = 31/106 (29%), Positives = 52/106 (49%), Gaps = 5/106 (4%)
Query: 65 SRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVD-MKGNIYVAD 121
+ S+ + F ++ AGDG YS DG S A+ + P +D + N+ +AD
Sbjct: 52 AESSASVIRFVDKSTGIISTYAGDGSPAYSGDGVLASQAKLNYPNFVTIDNLSNNLLIAD 111
Query: 122 KSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
+N V+R + T ++TIAG P++ +SFS+ F +T
Sbjct: 112 TANFVVRYVNRTTRIISTIAGYNVNGNSGDGLPSKQSSFSSLFGIT 157
>gi|426366237|ref|XP_004050167.1| PREDICTED: NHL repeat-containing protein 2 [Gorilla gorilla
gorilla]
Length = 726
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ + IYVAD NH+IRKI V+T+AG G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-INNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325
Query: 147 GRADGPAQNASFSNDFELTF 166
G A G Q S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343
>gi|384248465|gb|EIE21949.1| hypothetical protein COCSUDRAFT_16964 [Coccomyxa subellipsoidea
C-169]
Length = 1021
Score = 41.2 bits (95), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
+ D+IV DS L P ++ ++ +AG G G +DG G+AR +P +G
Sbjct: 773 NGDVIVADSYNHRLKLLD-PATDS--IRTVAGSGAAGLTDGTGGNARVSEPGGLCAGPRG 829
Query: 116 NIYVADKSNHVIR 128
+++AD +N IR
Sbjct: 830 TVFIADTNNSAIR 842
>gi|195641946|gb|ACG40441.1| hypothetical protein [Zea mays]
Length = 79
Score = 41.2 bits (95), Expect = 0.42, Method: Composition-based stats.
Identities = 18/34 (52%), Positives = 23/34 (67%)
Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
GVTTI G S + G DGP+ A FSNDFE+ ++
Sbjct: 43 GVTTIDWGKSGRGGHVDGPSGEAKFSNDFEVHYI 76
>gi|383100947|emb|CCD74491.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis halleri subsp. halleri]
Length = 1028
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 43/103 (41%), Gaps = 2/103 (1%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
G +DG P G IY+ D NH I+K+ + + G+ K G DG
Sbjct: 764 GDNDGVGAEVLLQHPLGVLCAKDGQIYLTDSYNHKIKKLDPVTKRVVTVAGTGKAGFKDG 823
Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDC 193
+ A S L + L ++D N LIR I+L K ED
Sbjct: 824 KVKGAQLSEPAGLAITEN-GRLFVADTNNSLIRYIDLNKGEDA 865
Score = 40.8 bits (94), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 1/56 (1%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITN 132
E + + ++ G +G+ DG A F++P+ A + K N +YVAD NH +R + N
Sbjct: 600 EGNFIVQIGSTGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALRYLLN 655
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 3/55 (5%)
Query: 79 ESVVKRL---AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
+ V KR+ AG G G+ DG+ A+ +P A+ G ++VAD +N +IR I
Sbjct: 803 DPVTKRVVTVAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI 857
>gi|291240969|ref|XP_002740388.1| PREDICTED: tripartite motif protein-like [Saccoglossus kowalevskii]
Length = 720
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 63/145 (43%), Gaps = 9/145 (6%)
Query: 49 SVIDRPGSS-DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD--- 104
++D G D I+LDS R ++ L+ VV V YS+ +F
Sbjct: 496 QILDSNGKYIDQILLDSERPI--GVAMALNSNIVVVEFDTKHVHIYSENREKIKQFTYKE 553
Query: 105 --KPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
KP V+ KG+I VAD H I T+ G + G + EG+ + P A+ ND
Sbjct: 554 FVKPYGVTVNSKGHILVADPKAHCIFVFTSDGDFIMKIGKEEGEGKLNNPHFVATGRNDD 613
Query: 163 ELTFVPHICALLISDHGNQLIRQIN 187
+ + L + HG Q IR+I+
Sbjct: 614 VIVSDTYQGLALFTKHG-QFIRRIH 637
>gi|410477767|ref|YP_006765404.1| NHL repeat domain-containing protein [Leptospirillum ferriphilum
ML-04]
gi|406773019|gb|AFS52444.1| NHL repeat domain protein [Leptospirillum ferriphilum ML-04]
Length = 494
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 63/131 (48%), Gaps = 10/131 (7%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKS--FAVDMKGNIYVADKSNHVIRKI--TNL 133
E++ ++ + G G G +DG A+F P+ F D VAD NH++R++
Sbjct: 210 EKAALREVIGQGTPGSADGSFDQAQFRDPQGIRFCPDDPDIAIVADTGNHLLRRVDFRRR 269
Query: 134 GVTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALL-ISDHGNQLIRQINLKPE 191
VTTIAG G + + PA +A ++ +++ F H +L ++ G I I L PE
Sbjct: 270 SVTTIAGTGVQGWAIFEPVPAMSAVLNSPWDILF--HRDGMLYVAQAGPHQI--IRLDPE 325
Query: 192 DCSKSSQSGSA 202
+GSA
Sbjct: 326 RQEIFPVAGSA 336
>gi|328783461|ref|XP_396242.3| PREDICTED: NHL repeat-containing protein 2 [Apis mellifera]
Length = 699
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 3/63 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIA 139
V+ + G + DG+ +ARF+ P+ + + IYVAD NH IRKI + VTT+
Sbjct: 262 VQHVIGGSNPDFRDGDFENARFNAPQGVCI-LDTFIYVADNENHAIRKIDLIKKMVTTVV 320
Query: 140 GGG 142
G G
Sbjct: 321 GTG 323
>gi|440683842|ref|YP_007158637.1| NHL repeat containing protein [Anabaena cylindrica PCC 7122]
gi|428680961|gb|AFZ59727.1| NHL repeat containing protein [Anabaena cylindrica PCC 7122]
Length = 506
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 62/139 (44%), Gaps = 14/139 (10%)
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ---GYSDGEPGSARFDKPKSFAVDM 113
+L + DS S+ + L E+ V+ + G G+ G DGE R V+
Sbjct: 350 QELFIADSEVSSIRGVE--LDEQGKVRTICGSGMLFGFGDMDGEKLDVRLQH--CLGVEY 405
Query: 114 KGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
N ++VAD NH I+ + T T+ G G G DG +N FS L+ +
Sbjct: 406 FQNYLWVADTYNHKIKLVHPTTGNCQTVLGDGLP--GFQDGQGKNTRFSEPSGLSGID-- 461
Query: 171 CALLISDHGNQLIRQINLK 189
L ISD N IR++NLK
Sbjct: 462 SYLYISDTNNHAIRRVNLK 480
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 34/56 (60%), Gaps = 2/56 (3%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
+ + GDG+ G+ DG+ + RF +P + + +Y++D +NH IR++ NL T+
Sbjct: 431 QTVLGDGLPGFQDGQGKNTRFSEPSGLS-GIDSYLYISDTNNHAIRRV-NLKTLTV 484
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 62/139 (44%), Gaps = 11/139 (7%)
Query: 56 SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
SS L + DS F + + + G G G +DG +F P+ D +
Sbjct: 182 SSGLFIADSGHHRIVMTRF----DGKILYVIGTGKSGLTDGAFNQVQFSAPQGMTFDDES 237
Query: 116 NI-YVADKSNHVIRKI--TNLGVTTIAGGGSKKE--GRADGPAQNASFSNDFELTFVPHI 170
I Y+AD N+ +R+I V TIAG G + G A + ++ ++L V +
Sbjct: 238 QILYIADTENYALRQIDLQRQVVETIAGTGQQNHIISPHGGVALETALNSPWDLQKVGN- 296
Query: 171 CALLISDHGNQLIRQINLK 189
LLI+ G+ I +++L+
Sbjct: 297 -KLLIAMAGSHQIWEMDLE 314
>gi|422325157|ref|ZP_16406193.1| hypothetical protein HMPREF0737_01303 [Rothia mucilaginosa M508]
gi|353343435|gb|EHB87752.1| hypothetical protein HMPREF0737_01303 [Rothia mucilaginosa M508]
Length = 656
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 50/110 (45%), Gaps = 5/110 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT----NLGVTT 137
V AG GV+G +DG A F +P G+++VA +R IT + VT+
Sbjct: 363 VSIFAGSGVEGLADGPAADAWFAQPSGIIEARDGSLWVACSETSGLRHITFTDNGVQVTS 422
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G G G DG + A + LT +P ++ ++D N IR+ +
Sbjct: 423 AVGKGLFDFGFVDGDSDTARMQHPLGLTELPD-GSIAVADTYNGAIRRWD 471
>gi|326429396|gb|EGD74966.1| hypothetical protein PTSG_12546 [Salpingoeca sp. ATCC 50818]
Length = 1448
Score = 41.2 bits (95), Expect = 0.46, Method: Composition-based stats.
Identities = 23/83 (27%), Positives = 45/83 (54%), Gaps = 3/83 (3%)
Query: 80 SVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVT 136
+ +K +AG+G ++ +P ++ + AVD GN+YVA ++ H + +I + +T
Sbjct: 1097 NTIKTVAGNGKTAWTGSSQPTQTGLNQAECVAVDGSGNMYVALRTGHRVVRIARGDGAIT 1156
Query: 137 TIAGGGSKKEGRADGPAQNASFS 159
T+AG G+ + G A +A +
Sbjct: 1157 TVAGTGTAGDSGVGGAAASAQLN 1179
Score = 38.9 bits (89), Expect = 2.6, Method: Composition-based stats.
Identities = 25/115 (21%), Positives = 53/115 (46%), Gaps = 4/115 (3%)
Query: 79 ESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLG 134
+ + +AG G G S G SA+ ++P A+D GN+Y++++ +H + + T+
Sbjct: 1152 DGAITTVAGTGTAGDSGVGGAAASAQLNRPTCVALDAAGNVYISEEGSHRVSVVDATSGD 1211
Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
++ + G G G ++ A L L ++D N ++ ++L+
Sbjct: 1212 LSVLVGTGHSGHRGMGGSSRAADIHTPRGLAVDNDAKVLYVADAENHVVYSVDLR 1266
>gi|290991059|ref|XP_002678153.1| serine/threonine protein kinase [Naegleria gruberi]
gi|284091764|gb|EFC45409.1| serine/threonine protein kinase [Naegleria gruberi]
Length = 1397
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 33/74 (44%), Positives = 39/74 (52%), Gaps = 4/74 (5%)
Query: 117 IYVADKSNHVIRKI-TNLGV-TTIAGG-GSKKEGRADGP-AQNASFSNDFELTFVPHICA 172
+YV D NHVIRKI TN V TTIAG + DG A NASFS +
Sbjct: 181 LYVCDSGNHVIRKIDTNTNVITTIAGNVNTNNTFYGDGILALNASFSYPQGIYVSQSSGE 240
Query: 173 LLISDHGNQLIRQI 186
+ +SD GN IR+I
Sbjct: 241 IFVSDSGNNRIRKI 254
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 62/142 (43%), Gaps = 7/142 (4%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P S++ P + + +L+S R+ +L LS+ + R G + + +P S R
Sbjct: 341 PRSIMRNPFTEEWYILES-RADRISL---LSQTGTISRFCGSPFEPSFNSDPFSIRL--T 394
Query: 107 KSFAVDMKGNI-YVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
K++ V N+ Y+ H I +I N V + G + G N F+N +
Sbjct: 395 KTYGVTKVQNVLYLTQSLTHSILQIKNSQVYSFGKGEAGFAGDGLVLDSNTQFNNPTAVA 454
Query: 166 FVPHICALLISDHGNQLIRQIN 187
F A+ I+D N IR+I+
Sbjct: 455 FSSGTKAIYIADTYNHRIRRID 476
>gi|295837701|ref|ZP_06824634.1| NHL repeat protein [Streptomyces sp. SPB74]
gi|197695928|gb|EDY42861.1| NHL repeat protein [Streptomyces sp. SPB74]
Length = 610
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 66/124 (53%), Gaps = 8/124 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
P + PG + +V D++R A L ++ V+R G G +G DG+ +ARF++P
Sbjct: 183 PGKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGGGERGLLDGDAATARFNEP 238
Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGR-ADGPAQNASFSNDF 162
+ A+ + VAD NH +R++ + VTT+AG G + +G G A+ S S+ +
Sbjct: 239 QGLALLPDRAVVVADTVNHALRRVDLASGEVTTLAGTGRQWWQGEPVAGAAREVSLSSPW 298
Query: 163 ELTF 166
++ +
Sbjct: 299 DVAW 302
>gi|126273410|ref|XP_001377730.1| PREDICTED: NHL repeat-containing protein 2 [Monodelphis domestica]
Length = 777
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 40/70 (57%), Gaps = 10/70 (14%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI---TNLGVTTIAG---GGSK 144
G DG + F+ P+ V +K NI YVAD NH+IRKI T + V+T+AG G+
Sbjct: 317 GSKDGTFLESTFNSPQ--GVSIKNNIIYVADTENHLIRKIDLETEM-VSTVAGIGIQGTD 373
Query: 145 KEGRADGPAQ 154
KEG A G Q
Sbjct: 374 KEGGAKGEEQ 383
>gi|160883851|ref|ZP_02064854.1| hypothetical protein BACOVA_01824 [Bacteroides ovatus ATCC 8483]
gi|156110581|gb|EDO12326.1| NHL repeat protein [Bacteroides ovatus ATCC 8483]
Length = 494
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 51/124 (41%), Gaps = 20/124 (16%)
Query: 89 GVQGYSDGEPGSARFDKP-----------KSFAVDMKGNIYVADKSNHVIRKITNLGVT- 136
G GY D AR +P K+ D + Y D+ NH IRK+T G+T
Sbjct: 356 GQAGYDDKIGAKARLSRPYQGVFVKNPEYKAAGKDEIYDFYFTDRDNHCIRKLTPDGITS 415
Query: 137 TIAGGGS-----KKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK- 189
T AG GS G DG + A F + L + + D N IR+I L+
Sbjct: 416 TFAGRGSVGMNIHANGYVDGALREEARFDSPAALAYDEVNNIFYVGDVNNHRIRKIALEK 475
Query: 190 -PED 192
PED
Sbjct: 476 IPED 479
>gi|124004309|ref|ZP_01689155.1| cohesin domain protein [Microscilla marina ATCC 23134]
gi|123990379|gb|EAY29878.1| cohesin domain protein [Microscilla marina ATCC 23134]
Length = 849
Score = 41.2 bits (95), Expect = 0.50, Method: Composition-based stats.
Identities = 38/114 (33%), Positives = 59/114 (51%), Gaps = 15/114 (13%)
Query: 81 VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
V+ +AG GV G++ G A+ PK+ A+D N+Y+ D + IRK+ + +T
Sbjct: 86 VITPVAGTGVAGFAGDGGLATQAQLVGPKAIALDATNNLYIVDGTR--IRKVDAVTGIIT 143
Query: 137 TIAGGGSKKEGRADG-PAQNASFS--NDFELTFVPHICALLISDHGNQLIRQIN 187
TIAG G R DG PA AS + +D L + +I + + H IR+I+
Sbjct: 144 TIAGSGRL---RTDGVPALEASLALIHDLALDAIGNIYVVEVLKH---RIRKID 191
Score = 37.7 bits (86), Expect = 4.8, Method: Composition-based stats.
Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 6/77 (7%)
Query: 81 VVKRLAGDGVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVT 136
++ +AG G + +DG P A A+D GNIYV + H IRKI TN+ +T
Sbjct: 141 IITTIAGSG-RLRTDGVPALEASLALIHDLALDAIGNIYVVEVLKHRIRKIDAQTNI-IT 198
Query: 137 TIAGGGSKKEGRADGPA 153
T+AG + GPA
Sbjct: 199 TVAGSAQQTSNNPIGPA 215
>gi|317504749|ref|ZP_07962709.1| NHL repeat containing protein [Prevotella salivae DSM 15606]
gi|315664131|gb|EFV03838.1| NHL repeat containing protein [Prevotella salivae DSM 15606]
Length = 448
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 12/105 (11%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIA 139
+++ AG G DG SA F P A D+ GN+Y+ D IRKI + VTT+A
Sbjct: 339 IEQFAGFNGVGSMDGPRMSASFRNPTGMAFDVDGNMYITDSMGFTIRKIGHADGMVTTVA 398
Query: 140 GGGSKKE-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
G +K + + +G +F+ ++++ + D GN I
Sbjct: 399 GIYNKSDNNKVEGLPSEVTFTYPYDIS---------VDDDGNFYI 434
>gi|443708985|gb|ELU03866.1| hypothetical protein CAPTEDRAFT_52631, partial [Capitella teleta]
Length = 507
Score = 40.8 bits (94), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 60/135 (44%), Gaps = 13/135 (9%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-------GYSDGEPGSARFDKPKSFAV 111
L + DS S+ +L LS+ SV K + G + G DG+ A+ P + +
Sbjct: 356 LYIADSESSSIRSLD--LSDGSV-KSVVGADIDPKNLFAFGDVDGKGIEAKLQHPLAVTL 412
Query: 112 DMKGNIYVADKSNHVIRKITN--LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
G ++VAD NH I+K+ V T+ G G + D P Q A + L + PH
Sbjct: 413 AEDGQLFVADSYNHKIKKVVPKFREVRTVFGTGKMGDTLKDSPLQCA-LNEPGGLAYNPH 471
Query: 170 ICALLISDHGNQLIR 184
L I+D N I+
Sbjct: 472 SRRLYIADTNNHYIK 486
Score = 37.0 bits (84), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 36/77 (46%), Gaps = 2/77 (2%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-- 134
S +V+ + G G +DG S++F P+ I VAD NH IRKI +
Sbjct: 188 SMNGMVQHVIGGRGHGLTDGSFISSQFFGPQGVCFHGDDLIIVADTENHAIRKIDLVAGT 247
Query: 135 VTTIAGGGSKKEGRADG 151
V TIAG G + A G
Sbjct: 248 VRTIAGNGKQGTDLAGG 264
>gi|186683945|ref|YP_001867141.1| redoxin domain-containing protein [Nostoc punctiforme PCC 73102]
gi|186466397|gb|ACC82198.1| Redoxin domain protein [Nostoc punctiforme PCC 73102]
Length = 509
Score = 40.8 bits (94), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 35/58 (60%), Gaps = 3/58 (5%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
+ + GDG G DG+ ++RF +P +++ +Y+AD +NH IR++ L VTT+
Sbjct: 432 QTIQGDGTAGLQDGQGKNSRFFEPSGLSIN-GSYLYIADTNNHAIRRVDLNTLEVTTL 488
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 40/89 (44%), Gaps = 7/89 (7%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI- 117
L + DS SF + + L G G G +DG A+F P+ A D + I
Sbjct: 187 LFIADSGHHRLIMSSF----DGEILHLIGTGKSGLTDGAFNEAQFFAPQGMAYDAENQIL 242
Query: 118 YVADKSNHVIRKI--TNLGVTTIAGGGSK 144
YVAD NH +R++ V IAG G +
Sbjct: 243 YVADTENHTLRRVDMKRQVVDAIAGTGEQ 271
>gi|309790072|ref|ZP_07684646.1| NHL repeat-containing protein [Oscillochloris trichoides DG-6]
gi|308227927|gb|EFO81581.1| NHL repeat-containing protein [Oscillochloris trichoides DG6]
Length = 1124
Score = 40.8 bits (94), Expect = 0.54, Method: Composition-based stats.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 2/37 (5%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
GY+ GEPG+ F++P AVD +GN+YVAD N ++
Sbjct: 980 GYAGGEPGA--FNEPVGVAVDAQGNVYVADTWNSRVQ 1014
>gi|91076402|ref|XP_969236.1| PREDICTED: similar to CG12547 CG12547-PA [Tribolium castaneum]
gi|270002449|gb|EEZ98896.1| hypothetical protein TcasGA2_TC004511 [Tribolium castaneum]
Length = 712
Score = 40.8 bits (94), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 30/124 (24%), Positives = 46/124 (37%), Gaps = 22/124 (17%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNL 133
L + + G G G+ DG F+ P+ + ++VAD NH IRKI
Sbjct: 245 LKSDGTILHQVGSGEIGFRDGSFTQCEFNAPQGLVFQNENTLFVADTENHAIRKIDLAKK 304
Query: 134 GVTTIAGGGSKKEGRADGPAQNASF-----------SNDFELTFVPHICALLISDHGNQL 182
V ++ G G + R G + D +++F P GN
Sbjct: 305 TVESVVGSGVQGHDRVGGKQWTQQVISSPWDLCIFRTPDMDMSFYP---------EGNPP 355
Query: 183 IRQI 186
IR+I
Sbjct: 356 IREI 359
>gi|325106982|ref|YP_004268050.1| redoxin [Planctomyces brasiliensis DSM 5305]
gi|324967250|gb|ADY58028.1| Redoxin domain protein [Planctomyces brasiliensis DSM 5305]
Length = 563
Score = 40.8 bits (94), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
++ + G G G DG A FD P+ A+ + +YVAD NH++R I + V T+A
Sbjct: 268 LQAVIGTGKIGRQDGSYEQASFDHPQGMAL-VGNTLYVADTENHLLRTIDLESKQVATLA 326
Query: 140 GGGSKKEGRADG 151
G G + RA G
Sbjct: 327 GTGEQARFRAGG 338
>gi|293372067|ref|ZP_06618462.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
gi|292632971|gb|EFF51556.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
Length = 484
Score = 40.8 bits (94), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 47/100 (47%), Gaps = 10/100 (10%)
Query: 97 EPGSARF---DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEG 147
+PG F ++ K+ D + Y ADK NH IR +T G V+T AG GS K G
Sbjct: 362 KPGQGVFVLNEEYKAANKDDWYDFYFADKENHCIRILTPDGVVSTFAGRGSASASSYKWG 421
Query: 148 RADGPA-QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ +G + A F+ L + + D GN IR+I
Sbjct: 422 KQNGEVRERARFNQPVALAYNEATKTFYVGDSGNYKIRKI 461
>gi|383110973|ref|ZP_09931791.1| hypothetical protein BSGG_2078 [Bacteroides sp. D2]
gi|382949397|gb|EFS31378.2| hypothetical protein BSGG_2078 [Bacteroides sp. D2]
Length = 499
Score = 40.8 bits (94), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 47/100 (47%), Gaps = 10/100 (10%)
Query: 97 EPGSARF---DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEG 147
+PG F ++ K+ D + Y ADK NH IR +T G V+T AG GS K G
Sbjct: 377 KPGQGVFVLNEEYKAANKDDWYDFYFADKENHCIRILTPDGVVSTFAGRGSASASSYKWG 436
Query: 148 RADGPA-QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ +G + A F+ L + + D GN IR+I
Sbjct: 437 KQNGEVRERARFNQPVALAYNEATKTFYVGDSGNYKIRKI 476
>gi|255513298|gb|EET89564.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Candidatus Micrarchaeum acidiphilum ARMAN-2]
Length = 579
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 63/140 (45%), Gaps = 16/140 (11%)
Query: 57 SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMK 114
+++ V DS SA +++ ++ + G G+ G DG SAR P +
Sbjct: 429 NEIYVADSEASAIRSINM---KDGYASTITGSGLFTFGDQDGSLASARLQHPIGVSAK-D 484
Query: 115 GNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNA---SFSNDFELTFVP 168
G IYVAD N IR+I N T ++G G K R P + ND + P
Sbjct: 485 GLIYVADTYNSAIREIDVKANRITTLVSGPGRKSACRFGDPKCDTLQLYEPNDVK----P 540
Query: 169 HICALLISDHGNQLIRQINL 188
AL+I+D N LIR+ ++
Sbjct: 541 FGRALIIADTNNNLIRRFDI 560
>gi|87311714|ref|ZP_01093830.1| NHL repeat protein [Blastopirellula marina DSM 3645]
gi|87285608|gb|EAQ77526.1| NHL repeat protein [Blastopirellula marina DSM 3645]
Length = 652
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 5/68 (7%)
Query: 87 GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
G+G G +DG+ +A F + A+D +YVAD NH++RK+ VTTIAG G
Sbjct: 199 GNGAIGTTDGDYATASFHHLQGMALDGD-TLYVADTENHMLRKVDLKTKQVTTIAGKGV- 256
Query: 145 KEGRADGP 152
+GR P
Sbjct: 257 -QGRNSWP 263
>gi|57107627|ref|XP_544027.1| PREDICTED: NHL repeat-containing protein 2 isoform 1 [Canis lupus
familiaris]
Length = 726
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 10/91 (10%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTI 138
++ + G G DG + F+ P+ A+ + NI YVAD NH+IRKI V+T+
Sbjct: 257 IQYIIGGPNPGRKDGIFSESTFNSPQGVAI--RNNIIYVADTENHLIRKIDLEAEMVSTV 314
Query: 139 AG---GGSKKEGRADGPAQNASFSNDFELTF 166
AG G+ KEG A P + S+ +++ F
Sbjct: 315 AGIGIQGTDKEGGA--PGEEQPISSPWDVVF 343
>gi|383111215|ref|ZP_09932032.1| hypothetical protein BSGG_2315 [Bacteroides sp. D2]
gi|313694780|gb|EFS31615.1| hypothetical protein BSGG_2315 [Bacteroides sp. D2]
Length = 593
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 2/74 (2%)
Query: 72 LSFPLSEES-VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
LSF + S V+ +AG+ +DG+ +A F P+ A+ G+I++A++ N+ IRK+
Sbjct: 281 LSFTYEDLSYTVETVAGNSATTSTDGKGTAASFKFPQGLALAPNGDIWIAERGNNTIRKM 340
Query: 131 -TNLGVTTIAGGGS 143
V+T+A G+
Sbjct: 341 DQEYNVSTVAKSGT 354
Score = 37.7 bits (86), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 21/35 (60%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
F P S D GN+Y+AD+ N ++KIT+ G T
Sbjct: 399 FKSPMSITFDANGNMYIADRDNKAVKKITSGGAVT 433
Score = 37.7 bits (86), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 3/54 (5%)
Query: 79 ESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
+ +K +AG G +G DG +A F+ P S V K +Y+AD+ NH+IR IT
Sbjct: 539 KGTLKTIAGIPGTKGKIDGSALTATFNCPASVLVSDK--VYIADEQNHLIRSIT 590
>gi|290984250|ref|XP_002674840.1| predicted protein [Naegleria gruberi]
gi|284088433|gb|EFC42096.1| predicted protein [Naegleria gruberi]
Length = 831
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 52/111 (46%), Gaps = 5/111 (4%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VT 136
E SVV +AG QGYS G ++ P V + ++ D NH IRKI G +T
Sbjct: 103 ENSVVSFIAGSSSQGYSTGYAIGSQLSSPNG-VVALGSSVVFTDTGNHCIRKINTDGTLT 161
Query: 137 TIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
TIAG + +D A NA S+ + V L +D N +IR++
Sbjct: 162 TIAGVCTTSGYNSDQADATNAWLSSPSGIVEVNG--DLYFTDSMNGIIRKL 210
>gi|320105699|ref|YP_004181289.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319924220|gb|ADV81295.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 1347
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 67/138 (48%), Gaps = 14/138 (10%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARF---DKPKSFAVDMKG 115
L++ + R AF + ++ + + AG G Y +G+ GSA F + P + A+D G
Sbjct: 19 LLLTTTGRGAF---AQAVTLVPIQTKFAGTGAGTY-NGDFGSADFVSLNGPAAVALDSSG 74
Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-- 171
N +++D+ N+ +R+I VTT+ G S G + A NA+ S L I
Sbjct: 75 NTFISDRGNNCVRRIDAATGSVTTLLGLVSSGTGDSCNTATNATPSAAQGLYQPGGIVID 134
Query: 172 ---ALLISDHGNQLIRQI 186
+L I+D G+ +R +
Sbjct: 135 SNDSLYIADTGHNCVRML 152
>gi|167997357|ref|XP_001751385.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697366|gb|EDQ83702.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 663
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 14/95 (14%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRK--ITNLGVTTIAGGGSK-- 144
+ G+ DG SAR +P S D K +Y+AD NH IR+ I++ V T+ +
Sbjct: 218 IAGFEDGSFESARLRRPSSIVFDKEKKLLYIADCENHAIRRAHISSRMVDTLYPKPDECP 277
Query: 145 --------KEGRADGPAQNAS-FSNDFELTFVPHI 170
+ G G +Q S SND+E+T+ H+
Sbjct: 278 GFLQRWLHRLGILQGSSQKRSNVSNDYEMTYPWHL 312
>gi|290971428|ref|XP_002668505.1| predicted protein [Naegleria gruberi]
gi|284081943|gb|EFC35761.1| predicted protein [Naegleria gruberi]
Length = 355
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 55/125 (44%), Gaps = 26/125 (20%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+ ++ +AG+G++GY +G+ +A + P S V IY +D NHVIRKI
Sbjct: 185 IDRNGIIHTIAGNGIEGY-NGDRRNATDTSLNNPDSLFV-HNDEIYFSDLDNHVIRKILP 242
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----------ALLISDHGNQ 181
G+ G K+G ++ DF+L +C + +DH N
Sbjct: 243 NGIIKNIAGIPGKKG----------YNGDFKLATRTKLCWPTGIFVDSKNQVYFADHYNH 292
Query: 182 LIRQI 186
+R+I
Sbjct: 293 CVRKI 297
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 60/115 (52%), Gaps = 8/115 (6%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
+ +++ +AG+G+ Y+ D + + + S K IY+AD ++ IRKI G
Sbjct: 130 IDRNGIIRTIAGNGIMTYNGDNQLATLASLRYPSSVFQYKDEIYIADSFSNRIRKIDRNG 189
Query: 135 VT-TIAGGGSKKEGRADGPAQNASFS--NDFELTFVPHICALLISDHGNQLIRQI 186
+ TIAG G EG +G +NA+ + N+ + FV H + SD N +IR+I
Sbjct: 190 IIHTIAGNGI--EG-YNGDRRNATDTSLNNPDSLFV-HNDEIYFSDLDNHVIRKI 240
>gi|160882120|ref|ZP_02063123.1| hypothetical protein BACOVA_00060 [Bacteroides ovatus ATCC 8483]
gi|156112488|gb|EDO14233.1| IPT/TIG domain protein [Bacteroides ovatus ATCC 8483]
Length = 514
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 8/93 (8%)
Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEGRADGP-AQNASFSNDFELTFVP 168
+ Y D NH IRK+T G V+T AG GS K G+ +G + A F+ L +
Sbjct: 407 DFYFTDTQNHCIRKLTPDGVVSTFAGRGSASTSAYKWGKQNGEIRERARFNEPVALAYDE 466
Query: 169 HICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
+ D GN IR+I K ++ + + GS
Sbjct: 467 ETKTFYVGDTGNFKIRKI-AKEQEADEQGEGGS 498
>gi|283458725|ref|YP_003363360.1| thiol-disulfide isomerase [Rothia mucilaginosa DY-18]
gi|283134775|dbj|BAI65540.1| thiol-disulfide isomerase [Rothia mucilaginosa DY-18]
Length = 656
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 50/110 (45%), Gaps = 5/110 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT----NLGVTT 137
V AG GV+G +DG A F +P G+++VA +R +T + VT+
Sbjct: 363 VSIFAGSGVEGLADGPAADAWFAQPSGIIEARDGSLWVACSETSGLRHVTFTEDGVQVTS 422
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G G G DG + A + LT +P ++ ++D N IR+ +
Sbjct: 423 AVGKGLFDFGFVDGDSDTARMQHPLGLTELPD-GSIAVADTYNGAIRRWD 471
>gi|343780954|ref|NP_001230490.1| NHL repeat-containing protein 2 [Sus scrofa]
Length = 725
Score = 40.4 bits (93), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 45/81 (55%), Gaps = 10/81 (12%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAG---GGSKK 145
G DG + F+ P+ A+ + NI YVAD NH+IRKI V+T+AG G+ K
Sbjct: 267 GRKDGIFSESTFNSPQGVAI--RNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDK 324
Query: 146 EGRADGPAQNASFSNDFELTF 166
EG A+G Q S+ +++ F
Sbjct: 325 EGGANGEEQ--PISSPWDVVF 343
>gi|392965340|ref|ZP_10330759.1| pre-peptidase C-terminal domain-containing protein [Fibrisoma limi
BUZ 3]
gi|387844404|emb|CCH52805.1| pre-peptidase C-terminal domain-containing protein [Fibrisoma limi
BUZ 3]
Length = 480
Score = 40.4 bits (93), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 11/107 (10%)
Query: 54 PGSSDLIVLDSSRSAFYTLS---FPLSEESVVKRLAGDGVQGYSDGEPGSARFD-----K 105
P ++ ++ + +S F + + L+ S+ + G GY DG+ SARF +
Sbjct: 330 PNDANTMLASNGKSMFVFANQTLWKLNGNSLQQVAGSPGQVGYQDGKGASARFQDGGYAR 389
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG--GGSKKEGRA 149
+ A D N+YVAD IRKI G VTT+AG S + GR
Sbjct: 390 GNAIAADGDDNVYVADYGCGCIRKIDKEGNVTTVAGNKNASNRVGRG 436
>gi|119509213|ref|ZP_01628363.1| NHL repeat protein [Nodularia spumigena CCY9414]
gi|119466055|gb|EAW46942.1| NHL repeat protein [Nodularia spumigena CCY9414]
Length = 502
Score = 40.4 bits (93), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 7/118 (5%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNL 133
S + + L G G G +DG A+F P+ A D + +YVAD NH +R++
Sbjct: 201 SYDGEILHLIGTGKSGLTDGAFNEAQFFAPQGMAFDPENQMLYVADTENHTLRRVDLKRQ 260
Query: 134 GVTTIAGGGSKKEG--RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
V TIAG G + G + ++ +++ + L I+ G+ I Q+NL+
Sbjct: 261 IVKTIAGTGEQSRNIYPHGGAGIETALNSPWDVVQLGQ--TLFIAMAGSHQIWQMNLE 316
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 83 KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTI 138
+ + GDG G DG+ + RF +P +V M +Y++D +NH IR+ + VTT+
Sbjct: 433 QTILGDGSAGLQDGQGQNTRFFEPSGLSV-MGDYLYISDTNNHAIRRGDLNKFEVTTL 489
>gi|383781882|ref|YP_005466449.1| hypothetical protein AMIS_67130 [Actinoplanes missouriensis 431]
gi|381375115|dbj|BAL91933.1| hypothetical protein AMIS_67130 [Actinoplanes missouriensis 431]
Length = 332
Score = 40.4 bits (93), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 44/97 (45%), Gaps = 14/97 (14%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNA----- 156
+ P AV G +YV+D NHV+R I TN + T+AG G G + PA A
Sbjct: 1 MNNPTGIAVGPDGTVYVSDSGNHVVRAISTNGTIRTVAGTGRDGAGAPE-PAVGAKATDV 59
Query: 157 --SFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
+F ND V + I+D + +R L P+
Sbjct: 60 PPAFPNDLA---VGQDGTVFIAD--GRTVRVYALAPD 91
>gi|255327417|ref|ZP_05368491.1| NHL repeat containing protein [Rothia mucilaginosa ATCC 25296]
gi|255295697|gb|EET75040.1| NHL repeat containing protein [Rothia mucilaginosa ATCC 25296]
Length = 656
Score = 40.4 bits (93), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 50/110 (45%), Gaps = 5/110 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT----NLGVTT 137
V AG GV+G +DG A F +P G+++VA +R IT + VT+
Sbjct: 363 VSIFAGSGVEGLADGPADDAWFAQPSGIIEARDGSLWVACSETSGLRHITFTEDGVQVTS 422
Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G G G DG + A + LT +P ++ ++D N IR+ +
Sbjct: 423 AVGKGLFDFGFVDGDSDTARMQHPLGLTELPD-GSIAVADTYNGAIRRWD 471
>gi|223940572|ref|ZP_03632418.1| NHL repeat containing protein [bacterium Ellin514]
gi|223890764|gb|EEF57279.1| NHL repeat containing protein [bacterium Ellin514]
Length = 791
Score = 40.4 bits (93), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 2/41 (4%)
Query: 90 VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
V G G PG F P S A D KGN+YVAD NH ++K
Sbjct: 736 VLGGVGGAPGE--FSNPWSIAFDSKGNLYVADSQNHRVQKF 774
>gi|428297525|ref|YP_007135831.1| NHL repeat containing protein [Calothrix sp. PCC 6303]
gi|428234069|gb|AFY99858.1| NHL repeat containing protein [Calothrix sp. PCC 6303]
Length = 511
Score = 40.4 bits (93), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 37/65 (56%), Gaps = 5/65 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN--IYVADKSNHVIRKIT--NLGVTT 137
V ++ G G G+ DG+ +A F P+ D +GN +Y+AD NH +R++ N V T
Sbjct: 206 VLQIIGVGKAGFVDGDFEAATFSTPQGMTFD-EGNQILYIADTGNHALRQVDLGNQLVRT 264
Query: 138 IAGGG 142
IAG G
Sbjct: 265 IAGTG 269
>gi|290995474|ref|XP_002680320.1| predicted protein [Naegleria gruberi]
gi|284093940|gb|EFC47576.1| predicted protein [Naegleria gruberi]
Length = 992
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 47/99 (47%), Gaps = 5/99 (5%)
Query: 45 INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG--YSDGEPGSAR 102
++P+ V+ P + D+ + D + + + +V +AG G G +G A+
Sbjct: 444 VDPYGVVVDPSNGDVFISDGYLNCVRKID---GKSGIVTTVAGTGEAGDVGDNGPSNKAQ 500
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
P ++ G++ +AD N IRK++N +TTI G
Sbjct: 501 LFSPSGLSLTSSGDLLIADNGNQAIRKVSNGIITTIVSG 539
Score = 37.4 bits (85), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 5/75 (6%)
Query: 76 LSEESVVKRLAGD-GVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
+S++ V+ +AG G+ G S DG+ + ++ + P AV KGNI +AD N +R I N
Sbjct: 833 VSKDGVITTIAGQSGLIGLSGDGDKAANSKLNIPTGVAVTKKGNIIIADTINGRLRMINN 892
Query: 133 -LGV-TTIAGGGSKK 145
GV TTIAG + K
Sbjct: 893 DTGVITTIAGAENDK 907
Score = 37.4 bits (85), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 44/84 (52%), Gaps = 4/84 (4%)
Query: 106 PKSFAVD-MKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSNDF 162
P VD G+++++D + +RKI VTT+AG G + +GP+ A +
Sbjct: 446 PYGVVVDPSNGDVFISDGYLNCVRKIDGKSGIVTTVAGTGEAGDVGDNGPSNKAQLFSPS 505
Query: 163 ELTFVPHICALLISDHGNQLIRQI 186
L+ LLI+D+GNQ IR++
Sbjct: 506 GLSLTSS-GDLLIADNGNQAIRKV 528
>gi|397781302|ref|YP_006545775.1| Tripartite motif-containing protein 71 [Methanoculleus bourgensis
MS2]
gi|396939804|emb|CCJ37059.1| Tripartite motif-containing protein 71 AltName: Full=Lin-41 homolog
[Methanoculleus bourgensis MS2]
Length = 786
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
FD PK AVD GN+YVAD+ N+ I+K + G +A GS G+ DG
Sbjct: 78 FDTPKGVAVDAAGNVYVADRGNNRIQKFDSDG-NLLAKWGS--HGKGDG 123
Score = 40.0 bits (92), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
G F P A+D GN+YVADK NH I+K + G
Sbjct: 547 GDGEFRNPSGVAIDAAGNVYVADKENHRIQKFDSDGT 583
>gi|282879766|ref|ZP_06288496.1| NHL repeat protein [Prevotella timonensis CRIS 5C-B1]
gi|281306435|gb|EFA98465.1| NHL repeat protein [Prevotella timonensis CRIS 5C-B1]
Length = 486
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 1/58 (1%)
Query: 75 PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
P ++ +V + G G+ +G P A+F P+ AV+ G+I YVAD N IRK+T
Sbjct: 427 PDNKATVTTVIGLPGSAGFKEGGPDVAQFRYPRGVAVNADGSIVYVADTGNRCIRKLT 484
>gi|424868210|ref|ZP_18291970.1| NHL repeat domain protein [Leptospirillum sp. Group II 'C75']
gi|124516337|gb|EAY57845.1| NHL repeat domain protein [Leptospirillum rubarum]
gi|387221511|gb|EIJ76065.1| NHL repeat domain protein [Leptospirillum sp. Group II 'C75']
Length = 494
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 63/131 (48%), Gaps = 10/131 (7%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKS--FAVDMKGNIYVADKSNHVIRKI--TNL 133
E++ ++ + G G G +DG A+F P+ F D VAD NH++R++
Sbjct: 210 EKAALREVIGQGTPGSADGSFDQAQFRDPQGIRFCPDDPDIAIVADTGNHLLRRVDFRRG 269
Query: 134 GVTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALL-ISDHGNQLIRQINLKPE 191
VTTIAG G + + PA +A ++ +++ F H +L ++ G I I L PE
Sbjct: 270 SVTTIAGTGVQGWAIFEPVPAMSAVLNSPWDILF--HRDGMLYVAQAGPHQI--IRLDPE 325
Query: 192 DCSKSSQSGSA 202
+GSA
Sbjct: 326 RQEIFPVAGSA 336
>gi|350561579|ref|ZP_08930417.1| YD repeat protein [Thioalkalivibrio thiocyanoxidans ARh 4]
gi|349780611|gb|EGZ34929.1| YD repeat protein [Thioalkalivibrio thiocyanoxidans ARh 4]
Length = 2546
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 10/134 (7%)
Query: 60 IVLDSSRSAFYTLS------FPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAV 111
+ +D+ S F+T F + + V +AG+G ++ DG P +A F P+ AV
Sbjct: 1276 VAVDAGGSVFFTRRNGGPRIFRANPDGTVHHVAGNGSSTFAGDGGPAVNASFIDPRGIAV 1335
Query: 112 DMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
G++YV D H +R+I G + T+AG G G GPA A +++ P
Sbjct: 1336 GADGSLYVGDFRTHRVRRIGPDGIIRTVAGNGGSGIGGIGGPAVQARIDRIYDVAVSPE- 1394
Query: 171 CALLISDHGNQLIR 184
L I H + R
Sbjct: 1395 GDLYIDSHNYTVFR 1408
>gi|298386035|ref|ZP_06995592.1| hypothetical protein HMPREF9007_02755 [Bacteroides sp. 1_1_14]
gi|298261263|gb|EFI04130.1| hypothetical protein HMPREF9007_02755 [Bacteroides sp. 1_1_14]
Length = 462
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 84 RLAGDGVQGYSDGE-PGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
R GDG +G++DG+ ARF+ P S D ++G + V D +NH IRKI
Sbjct: 410 RPNGDGTKGFNDGDLRTQARFNYPASIVYDEVRGCLLVGDSNNHRIRKI 458
Score = 37.4 bits (85), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 63/156 (40%), Gaps = 16/156 (10%)
Query: 50 VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV-VKRLA--GDGVQGYSDGEPGSARFDKP 106
V+ P + + ++ +R Y + ++ LA DG DG R ++P
Sbjct: 306 VVIHPTGNYMYIMVRNRHTIYRADYNWETKTFGTPYLACGKDGTSACEDGVGSMVRLNQP 365
Query: 107 KSFAV----DMKG-----NIYVADKSNHVIRKITNLGVTTIAGG---GSKKEGRADGPAQ 154
+ D +G + Y DK NH +R +T G T+ G G +G DG +
Sbjct: 366 QQGCFVKNPDYEGQADEYDFYFVDKQNHCVRVMTPAGKVTLYAGRPNGDGTKGFNDGDLR 425
Query: 155 -NASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
A F+ + + LL+ D N IR+I L+
Sbjct: 426 TQARFNYPASIVYDEVRGCLLVGDSNNHRIRKIALE 461
>gi|290976675|ref|XP_002671065.1| predicted protein [Naegleria gruberi]
gi|284084630|gb|EFC38321.1| predicted protein [Naegleria gruberi]
Length = 343
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 74/173 (42%), Gaps = 26/173 (15%)
Query: 59 LIVLDSSRSAFYTLSFPLSEES-VVKRLAGDGVQGYSDGEPGS----ARFDKPKSFAVDM 113
L + D A +S +++ +++ + G+G G S E GS ++ + P A+
Sbjct: 140 LYIADRDNHAIRKVSLNDDDDNGLIETIVGNGSSGES--ENGSLALNSQINCPYGIAISK 197
Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-- 171
+Y ++ SNH IRKI G G K G + + +F+N L + +IC
Sbjct: 198 DDELYFSEFSNHTIRKIDKNGKLWKVAGIDGKNG--NNEYDHHAFNN--LLNYPAYICFD 253
Query: 172 ---ALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSL 221
L I+D N IR+I+LK + V L S L C V++
Sbjct: 254 NNNNLYIADRDNNSIRKIDLKT----------GLINTAVVKELNSPLGCFVTI 296
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 42/92 (45%), Gaps = 6/92 (6%)
Query: 101 ARFDKPKSFAV----DMK--GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQ 154
A+ ++P S V DM +Y+AD NH IRKI N + TIAG G + A
Sbjct: 5 AKLNRPFSVFVHSEQDMNQCDEVYIADTENHCIRKIINGNIITIAGTGEAGFNGDNIKAT 64
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
A+ + + + SD GN IR+I
Sbjct: 65 QATINKPVCVIVNHRNGNVYFSDLGNNRIRKI 96
>gi|291000670|ref|XP_002682902.1| predicted protein [Naegleria gruberi]
gi|284096530|gb|EFC50158.1| predicted protein [Naegleria gruberi]
Length = 726
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 58/129 (44%), Gaps = 18/129 (13%)
Query: 81 VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
+ +AG G GY+ DG G+ A+ P+ V+ G ++ AD N +RKI+ G+ +
Sbjct: 217 TITTIAGIGSNGYNGDGILGTMAKIFSPRGITVNQLGEVFFADSVNFRVRKISTSGIIST 276
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPH------ICALLISDHGNQLIRQINLKPED 192
G + DG N++ PH + + I+D IR+I P +
Sbjct: 277 FAGTGNQLFNGDGLLGNST-------NLSPHGVSINQLGEIFIADTNYNYIRKIVPCPLN 329
Query: 193 CSKSSQSGS 201
S+QSGS
Sbjct: 330 ---STQSGS 335
>gi|325917587|ref|ZP_08179787.1| NHL repeat protein [Xanthomonas vesicatoria ATCC 35937]
gi|325536221|gb|EGD08017.1| NHL repeat protein [Xanthomonas vesicatoria ATCC 35937]
Length = 472
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 3/67 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIA 139
V R G G DG G A F +P+ A++ + +YVAD NH +R+I N V T+
Sbjct: 198 VLRQFGHGNADLIDGGTGEAAFRRPQGLALE-RDQLYVADTGNHALRRINLRNGQVDTLC 256
Query: 140 GGGSKKE 146
G G E
Sbjct: 257 GTGRSGE 263
>gi|58581916|ref|YP_200932.1| hypothetical protein XOO2293 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58426510|gb|AAW75547.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 472
Score = 40.4 bits (93), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 42/81 (51%), Gaps = 6/81 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---GVTTI 138
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL V T+
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQVDTL 255
Query: 139 AGGGSKKEGRADGPAQNASFS 159
G G E +GP +AS S
Sbjct: 256 CGTGRSGE-PVEGPLASASAS 275
>gi|86742862|ref|YP_483262.1| serine/threonine protein kinase [Frankia sp. CcI3]
gi|86569724|gb|ABD13533.1| serine/threonine protein kinase [Frankia sp. CcI3]
Length = 719
Score = 40.4 bits (93), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 48/107 (44%), Gaps = 6/107 (5%)
Query: 85 LAGDGVQGYSDGEPGSARF---DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
LAG GV G S G+ G A+F +P N+Y+AD NH++R + G+ + A G
Sbjct: 570 LAGAGVPGLS-GDHGPAQFAQLRRPHDVTAPPGRNVYLADTDNHLLRAVDADGIISTAAG 628
Query: 142 GSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
DG PA+ A L P LL++D +R +
Sbjct: 629 MFYGASPDDGAPARIADVGRPHSLAPTPS-GGLLVTDPDRGRVRLVT 674
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 37/86 (43%), Gaps = 2/86 (2%)
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD-GPAQNASFSN 160
R D P S G +YV+ + H + +I G T G K D GPA NA N
Sbjct: 419 RVDYPYSVVAAPDGAVYVSQRLRHRVLRIERDGRTVHVAGSGKSGPHGDGGPAVNAELDN 478
Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
L P +L I+D N IR++
Sbjct: 479 PCGLALGPD-GSLFIADSFNNRIRRV 503
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 85 LAGDGVQG-YSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG G G + DG P +A D P A+ G++++AD N+ IR++ G + T+AG
Sbjct: 456 VAGSGKSGPHGDGGPAVNAELDNPCGLALGPDGSLFIADSFNNRIRRVAPDGRIVTVAGS 515
Query: 142 GSKKEGRADGPAQNASFS 159
G + G GPA + S
Sbjct: 516 G--RHGPPAGPAARHAAS 531
>gi|380693132|ref|ZP_09857991.1| hypothetical protein BfaeM_04015 [Bacteroides faecis MAJ27]
Length = 472
Score = 40.4 bits (93), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 55/122 (45%), Gaps = 20/122 (16%)
Query: 88 DGVQGYSDGEPGSARFDKPK--SFAVDMK--------GNIY---VADKSNHVIRKITNLG 134
+G GYS SARF +P +F + + GNIY + D+ NH I KIT G
Sbjct: 351 EGEPGYSPSPGPSARFQEPYQGTFVKNEEYIKNPRPDGNIYDYYICDRGNHCIFKITPDG 410
Query: 135 -VTTIAGGGS-----KKEGRADGPAQ-NASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
V+ AG GS K G DG + A F N + + I+D N IR I+
Sbjct: 411 NVSLFAGRGSVSSDNKVHGNIDGELKTEARFDNPCGIAYDEETQTFYIADKENHSIRTIS 470
Query: 188 LK 189
++
Sbjct: 471 IE 472
>gi|308159172|gb|EFO61716.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia P15]
Length = 967
Score = 40.0 bits (92), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 6/93 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDK 105
PHS++ P + + + D+ TLS + VK LAG GV GY DG + F++
Sbjct: 143 PHSIVSDPVNGIIYLTDTKNHCIRTLSLMTRK---VKTLAGTPGVFGYKDGL--NPLFNE 197
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
P + GN+ V D N +R I+ TI
Sbjct: 198 PLGLVLTEDGNLIVCDSKNGALRYISRENGETI 230
>gi|188576512|ref|YP_001913441.1| hypothetical protein PXO_00583 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|188520964|gb|ACD58909.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
PXO99A]
Length = 472
Score = 40.0 bits (92), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 42/81 (51%), Gaps = 6/81 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---GVTTI 138
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL V T+
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQVDTL 255
Query: 139 AGGGSKKEGRADGPAQNASFS 159
G G E +GP +AS S
Sbjct: 256 CGTGRSGE-PVEGPLASASAS 275
>gi|84623828|ref|YP_451200.1| hypothetical protein XOO_2171 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|84367768|dbj|BAE68926.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 499
Score = 40.0 bits (92), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 42/81 (51%), Gaps = 6/81 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---GVTTI 138
V R G G DG G A F +P+ A++ + +YVAD NH +R+I NL V T+
Sbjct: 225 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQVDTL 282
Query: 139 AGGGSKKEGRADGPAQNASFS 159
G G E +GP +AS S
Sbjct: 283 CGTGRSGE-PVEGPLASASAS 302
>gi|389844275|ref|YP_006346355.1| NHL repeat protein [Mesotoga prima MesG1.Ag.4.2]
gi|387859021|gb|AFK07112.1| NHL repeat protein [Mesotoga prima MesG1.Ag.4.2]
Length = 496
Score = 40.0 bits (92), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 59/141 (41%), Gaps = 17/141 (12%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGV 135
E + + AG GV+ DG A F +P +D + ++YVAD IR + V
Sbjct: 315 ENNTISAFAGSGVENIGDGHLKDANFAQPSGLFLD-RNSLYVADSEVSAIRFVDLEAGTV 373
Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI--------- 186
T+AG G G DG + + F + + L ++D N IR+I
Sbjct: 374 QTVAGSGLFSFGYVDGILKRSLFQHPIGIHGEGRF--LYVADTYNHAIRKIDLGIRRVET 431
Query: 187 ---NLKPEDCSKSSQSGSALG 204
NL+ C+ + S+LG
Sbjct: 432 VIKNLREGTCTLNGDKCSSLG 452
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 67/141 (47%), Gaps = 8/141 (5%)
Query: 63 DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK 122
++ R LS P +++ + G G+ G DG +AR +KP+ V G +YVAD
Sbjct: 191 NNDRILLTELSTPFVAKTIDQ--IGSGLSGLEDGPFENARLNKPQGI-VYSNGRLYVADT 247
Query: 123 SNHVIR--KITNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHG 179
NH +R + ++T++G G + + +G A A ++ ++L L I+ G
Sbjct: 248 ENHALRIADMNQRCLSTLSGDGFQDNDWNYNGDASKARLNSPWDLQTDGRF--LYIAMAG 305
Query: 180 NQLIRQINLKPEDCSKSSQSG 200
I +++L+ S + SG
Sbjct: 306 MHQIWRLDLENNTISAFAGSG 326
>gi|111220793|ref|YP_711587.1| hypothetical protein FRAAL1337 [Frankia alni ACN14a]
gi|111148325|emb|CAJ59997.1| Hypothetical protein FRAAL1337 [Frankia alni ACN14a]
Length = 722
Score = 40.0 bits (92), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 53/115 (46%), Gaps = 10/115 (8%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
L + + +AG G G+S G+ G A P++ AVD +G + VAD+ + +R++
Sbjct: 562 LDPDGTARVVAGSGTPGHS-GDGGLAIHASLRGPQAVAVDAQGRLLVADQEHRRVRRVDA 620
Query: 133 LG-VTTIAG---GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
G + TIAG GG + A GPA+ L P + NQ++
Sbjct: 621 AGRIETIAGTAYGG--RPATAGGPARATDIGAPTSLAVGPDGIVYVADSANNQVL 673
>gi|423290544|ref|ZP_17269393.1| hypothetical protein HMPREF1069_04436 [Bacteroides ovatus
CL02T12C04]
gi|392665931|gb|EIY59454.1| hypothetical protein HMPREF1069_04436 [Bacteroides ovatus
CL02T12C04]
Length = 499
Score = 40.0 bits (92), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 7/78 (8%)
Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEGRADGPA-QNASFSNDFELTFVP 168
+ Y ADK NH IR +T G V+T AG GS K G+ +G + A F+ L +
Sbjct: 399 DFYFADKENHCIRVLTPDGVVSTFAGRGSASASSYKWGKQNGEVRERARFNQPVALAYDE 458
Query: 169 HICALLISDHGNQLIRQI 186
+ D GN IR+I
Sbjct: 459 ATKTFYVGDSGNYKIRKI 476
>gi|299147167|ref|ZP_07040234.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
gi|298515052|gb|EFI38934.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
Length = 499
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 7/78 (8%)
Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEGRADGPA-QNASFSNDFELTFVP 168
+ Y ADK NH IR +T G V+T AG GS K G+ +G + A F+ L +
Sbjct: 399 DFYFADKENHCIRVLTPDGVVSTFAGRGSASASSYKWGKQNGEVRERARFNQPVALAYDE 458
Query: 169 HICALLISDHGNQLIRQI 186
+ D GN IR+I
Sbjct: 459 ATKTFYVGDSGNYKIRKI 476
>gi|147918859|ref|YP_687415.1| hypothetical protein RRC374 [Methanocella arvoryzae MRE50]
gi|110622811|emb|CAJ38089.1| hypothetical protein RRC374 [Methanocella arvoryzae MRE50]
Length = 514
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%), Gaps = 2/32 (6%)
Query: 97 EPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
+PG FD+P+ AVD +GNIYV+D NH I+
Sbjct: 363 QPG--EFDRPRGIAVDGRGNIYVSDSYNHRIQ 392
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 33/64 (51%), Gaps = 7/64 (10%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH---VIRKITNLGVTTIAGGGSKKEGR 148
G D +PG +F P+ AVD GNIYVAD N+ V K TN TI G + G+
Sbjct: 170 GSPDDDPG--QFSSPRGIAVDGAGNIYVADTYNYRIQVWNKTTN--TWTIIGSDGTEPGQ 225
Query: 149 ADGP 152
D P
Sbjct: 226 FDRP 229
>gi|386841434|ref|YP_006246492.1| serine/threonine protein kinase [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374101734|gb|AEY90618.1| serine/threonine protein kinase [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451794728|gb|AGF64777.1| serine/threonine protein kinase [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 476
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 39/63 (61%), Gaps = 3/63 (4%)
Query: 85 LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
+AG+G++G++ DG AR P + V G ++VAD+ NH IR+I G + T+AG
Sbjct: 28 IAGNGLRGFAGDGADAVRARLGFPSALVVAPDGALFVADEMNHRIRRIDPSGAIGTVAGD 87
Query: 142 GSK 144
GS+
Sbjct: 88 GSR 90
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 2/74 (2%)
Query: 115 GNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
G ++VAD +NH IR+I G +TTIAG G + A A L P AL
Sbjct: 4 GAVFVADHANHRIRRIDGTGTLTTIAGNGLRGFAGDGADAVRARLGFPSALVVAPD-GAL 62
Query: 174 LISDHGNQLIRQIN 187
++D N IR+I+
Sbjct: 63 FVADEMNHRIRRID 76
>gi|160885320|ref|ZP_02066323.1| hypothetical protein BACOVA_03319 [Bacteroides ovatus ATCC 8483]
gi|156108942|gb|EDO10687.1| NHL repeat protein [Bacteroides ovatus ATCC 8483]
Length = 198
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 47/100 (47%), Gaps = 10/100 (10%)
Query: 97 EPGSARF---DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEG 147
+PG F ++ K+ D + Y ADK NH IR +T G V+T AG GS K G
Sbjct: 76 KPGQGVFVFNEEYKAANKDDWYDFYFADKENHCIRILTPDGVVSTFAGRGSASASSYKWG 135
Query: 148 RADGPA-QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ +G + A F+ L + + D GN IR+I
Sbjct: 136 KQNGEVRERARFNQPVALAYDEVTKTFYVGDSGNYKIRKI 175
>gi|303274116|ref|XP_003056382.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462466|gb|EEH59758.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 490
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 2/82 (2%)
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
P A+D G I+VAD N+ IR+I + GV T+ GS G DG A F+N +
Sbjct: 4 PAGIALDGNGAIFVADSGNNRIRQIKSGGVVTVL-AGSGTAGFLDGSLLEAEFNNP-QSV 61
Query: 166 FVPHICALLISDHGNQLIRQIN 187
V + ++D N IR IN
Sbjct: 62 VVTSSGVVYVADTNNHRIRLIN 83
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 34/59 (57%), Gaps = 3/59 (5%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAG 140
L G G G+SD SARF+ P ++ + Y+AD+ N+ IR + T L VT+IAG
Sbjct: 148 LNGAGHAGFSDSSGLSARFNSPSHMTLNTDETVLYIADRRNNRIRAVTLTTLAVTSIAG 206
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 51/108 (47%), Gaps = 4/108 (3%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTI 138
VV LAG G G+ DG A F+ P+S V G +YVAD +NH IR I N VTT
Sbjct: 33 VVTVLAGSGTAGFLDGSLLEAEFNNPQSVVVTSSGVVYVADTNNHRIRLINEENDVVTTY 92
Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
AG G A Q A S L F L +SD G+ IR I
Sbjct: 93 AGAGISDYRDAQVAIQ-AYISKPLGLAFRTLTKDLFVSD-GDSRIRII 138
>gi|262199596|ref|YP_003270805.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
gi|262082943|gb|ACY18912.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
Length = 344
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 37/76 (48%), Gaps = 6/76 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKIT-----NLGV 135
V+ + G G G+ DG+ A P+ AV G +YVAD IR++ + V
Sbjct: 64 VRLVVGSGDPGWRDGDGADAALHAPRGIAVCASGRRLYVADSGTCTIRRVDLDAVGDATV 123
Query: 136 TTIAGGGSKKEGRADG 151
TT+AG G + E A G
Sbjct: 124 TTVAGTGERGEALAPG 139
>gi|428162946|gb|EKX32046.1| hypothetical protein GUITHDRAFT_61646, partial [Guillardia theta
CCMP2712]
Length = 254
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 65/139 (46%), Gaps = 15/139 (10%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR--KITN 132
L+ +V + DG G +G S RF P N + V D SN++I+ +I++
Sbjct: 25 LTTLAVTNFIGLDGTSGNVNGFGASVRFRNPAILLFSPSYNYLIVTDASNNLIKTIRISD 84
Query: 133 LGVTTIAGGGSKKE--------GRADGPAQNASFSNDF-ELTFVPHICALLISDHGNQLI 183
L V+ + G G+ P ++ +FS DF +L F C L++D GN I
Sbjct: 85 LYVSVLIGTGTATNVDNFNALSTTLTNP-KSLAFSEDFTKLYFSTVYC--LVADSGNHAI 141
Query: 184 RQINLKPEDCSKSSQSGSA 202
R++++ S + SG+A
Sbjct: 142 RKVDIVSGYMSTLAGSGTA 160
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 65/154 (42%), Gaps = 16/154 (10%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG-SARFD 104
NP ++ P + LIV D+S + T+ + V L G G D S
Sbjct: 54 NPAILLFSPSYNYLIVTDASNNLIKTIRI---SDLYVSVLIGTGTATNVDNFNALSTTLT 110
Query: 105 KPKSFAVDMK------GNIY--VADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQ 154
PKS A +Y VAD NH IRK+ + ++T+AG G+ G D
Sbjct: 111 NPKSLAFSEDFTKLYFSTVYCLVADSGNHAIRKVDIVSGYMSTLAGSGT--AGYNDAVGV 168
Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
A F+ ++T +SD+ N IR+I+L
Sbjct: 169 LAQFNKPVDVTLDWTESYAYVSDYNNNCIRRIDL 202
>gi|321469552|gb|EFX80532.1| hypothetical protein DAPPUDRAFT_304106 [Daphnia pulex]
Length = 689
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 5/84 (5%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNL 133
+S + +K G G+++G +A+F P+ A ++YVAD NH IR I +
Sbjct: 220 ISLQGEIKVTIGGPTPGWNEGAFSTAKFRSPQGIAFLPPCHLYVADTGNHSIRLIDFSTK 279
Query: 134 GVTTIAG---GGSKKEGRADGPAQ 154
V T+AG G+ KEG G Q
Sbjct: 280 QVKTVAGTGLQGTDKEGGMPGIEQ 303
Score = 37.4 bits (85), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 6/76 (7%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE-PGSARFDKPKSFAVDMKGN- 116
L V DS T+ PL++ + G+G G +DG +F++P +D +G
Sbjct: 464 LYVADSYNHKIKTVD-PLTK--LCSTYLGNGKAGLADGSSEDEIQFNEPGGLCIDEEGGG 520
Query: 117 -IYVADKSNHVIRKIT 131
+YVAD +NH IRK+
Sbjct: 521 ILYVADTNNHCIRKVN 536
>gi|219851493|ref|YP_002465925.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219545752|gb|ACL16202.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 390
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 24/37 (64%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
G FD+P AVD GN+YVAD N+ ++K T+ G+
Sbjct: 180 GVGEFDEPFGIAVDGTGNVYVADTYNNRVQKFTSAGL 216
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 24/45 (53%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
G +FD P AVD GN+YVAD N I+K T+ G G S
Sbjct: 39 GIGQFDHPGGVAVDSAGNVYVADTENDRIQKFTSSGTFVTVWGSS 83
>gi|340725457|ref|XP_003401086.1| PREDICTED: NHL repeat-containing protein 2-like [Bombus terrestris]
Length = 692
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/144 (25%), Positives = 63/144 (43%), Gaps = 25/144 (17%)
Query: 42 QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA 101
Q E + V+ G + ++V+D++ + V+ + G + DG A
Sbjct: 229 QTEQGENLVVADTGHNRILVMDTAGN--------------VQHIIGGPNPDFRDGNFEIA 274
Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGP-AQNASF 158
+F+ P+ + + IYVAD NH IRKI + V TIAG G+K G +
Sbjct: 275 KFNGPQGMST-LDSVIYVADTKNHAIRKIDLIKKTVITIAGTGTKGNDYNGGKIGTDQVL 333
Query: 159 SNDFELTFVPHICALLISDHGNQL 182
++ ++L H +HGN +
Sbjct: 334 ASPWDLAIYKH-------EHGNNI 350
>gi|288923320|ref|ZP_06417454.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288345336|gb|EFC79731.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 748
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 56/119 (47%), Gaps = 8/119 (6%)
Query: 76 LSEESVVKRLAGDGVQGYS-DGEPGSARF----DKPKSFAVDMKGNIYVADKSNHVIRKI 130
L + V+ LAG G ++ DG P +A P S A+D GN++ + S + +R++
Sbjct: 623 LEPDEPVRTLAGGGPAEFTGDGGPATAAGLTLGSGPSSLALDPVGNLFFPEPSVYRVRRV 682
Query: 131 TNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
GV TT+AG G GPA A ++ + V L I + G IR++ L
Sbjct: 683 DTRGVITTVAGTGEYGSDGDGGPATAARLASPARVA-VDTAGVLYIGEVGTA-IRRVGL 739
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 43/88 (48%), Gaps = 9/88 (10%)
Query: 66 RSAFYTLS------FPLSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNI 117
R FY + L + + +AG GV G + DG P +A R A G++
Sbjct: 438 RGGFYVVDRAGLRVLRLDLDGTLSAVAGTGVAGSTGDGGPATAARLRGVSDVAAASDGSV 497
Query: 118 YVADKSNHVIRKITNLGV-TTIAGGGSK 144
++AD N IR+I GV TT+AGGG +
Sbjct: 498 FLADSGNGRIRRIGPDGVITTVAGGGER 525
>gi|423294181|ref|ZP_17272308.1| hypothetical protein HMPREF1070_00973 [Bacteroides ovatus
CL03T12C18]
gi|392676438|gb|EIY69874.1| hypothetical protein HMPREF1070_00973 [Bacteroides ovatus
CL03T12C18]
Length = 499
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 47/100 (47%), Gaps = 10/100 (10%)
Query: 97 EPGSARF---DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEG 147
+PG F ++ K+ D + Y ADK NH IR +T G V+T AG GS K G
Sbjct: 377 KPGQGVFVFNEEYKAANKDDWYDFYFADKENHCIRILTPDGVVSTFAGRGSASASSYKWG 436
Query: 148 RADGPA-QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
+ +G + A F+ L + + D GN IR+I
Sbjct: 437 KQNGEVRERARFNQPVALAYDEVTKTFYVGDSGNYKIRKI 476
>gi|301755504|ref|XP_002913612.1| PREDICTED: NHL repeat-containing protein 2-like [Ailuropoda
melanoleuca]
gi|281347739|gb|EFB23323.1| hypothetical protein PANDA_001408 [Ailuropoda melanoleuca]
Length = 726
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 44/81 (54%), Gaps = 10/81 (12%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAG---GGSKK 145
G DG + F+ P+ A+ + NI YVAD NH++RKI V+T+AG G+ K
Sbjct: 267 GRKDGVFSESTFNSPQGVAI--RNNIIYVADTENHLVRKIDLEAEMVSTVAGIGIQGTDK 324
Query: 146 EGRADGPAQNASFSNDFELTF 166
EG A G Q S+ +++ F
Sbjct: 325 EGGAQGEEQ--PISSPWDVVF 343
>gi|392963930|ref|ZP_10329351.1| hypothetical protein BN8_00315 [Fibrisoma limi BUZ 3]
gi|387846825|emb|CCH51395.1| hypothetical protein BN8_00315 [Fibrisoma limi BUZ 3]
Length = 351
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 3/58 (5%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAV--DMKGNIYVADKSNHVIRKITNLG-VTTIA 139
+ G +GY DG+ +A F+ P+ AV D + NIY+ D N IR+++ G VTTIA
Sbjct: 290 MGGRSKEGYKDGKALAALFNFPQRCAVSIDSRQNIYILDGGNDCIRRLSADGMVTTIA 347
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 44/113 (38%), Gaps = 26/113 (23%)
Query: 93 YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG-----------G 141
Y G A P+ +D G + AD + I K+ N V T+AG G
Sbjct: 232 YVQGTISKAELMAPEDILIDRTGAVIFADGRLNRIIKVANQTVATLAGNSEIQPNNVNMG 291
Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICA--------LLISDHGNQLIRQI 186
G KEG DG A A F+ P CA + I D GN IR++
Sbjct: 292 GRSKEGYKDGKALAALFN-------FPQRCAVSIDSRQNIYILDGGNDCIRRL 337
>gi|309813024|ref|ZP_07706752.1| putative septum site-determining protein MinC [Dermacoccus sp.
Ellin185]
gi|308433096|gb|EFP57000.1| putative septum site-determining protein MinC [Dermacoccus sp.
Ellin185]
Length = 651
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 51/115 (44%), Gaps = 6/115 (5%)
Query: 78 EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKIT----N 132
E + V LAG +G DG A F +P A G +++VAD +R ++
Sbjct: 354 ELNTVAVLAGTSNEGIRDGAAHDAWFAQPSGLATSADGSHVWVADSETSALRSLSVSDEG 413
Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
VTT G G G DG A A + +T +P ++ ++D N IR+ +
Sbjct: 414 FEVTTHVGQGLFDFGHVDGAADAALLQHPLGVTELPD-GSVAVADTYNGAIRRFD 467
>gi|110639199|ref|YP_679408.1| hypothetical protein CHU_2818 [Cytophaga hutchinsonii ATCC 33406]
gi|110281880|gb|ABG60066.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 392
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 51/116 (43%), Gaps = 21/116 (18%)
Query: 96 GEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPA 153
G GSA +FD + VD GN+YVAD N I+K +N G T I G++ G
Sbjct: 31 GGTGSAAGKFDDIGALDVDADGNVYVADWRNFRIQKFSNTG-TYITSWGTEGTG------ 83
Query: 154 QNASFSNDFELTFVPHICA-----LLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
D + V ++C +L+SD G ++++ SK SG G
Sbjct: 84 -------DNQFKRVNNLCVDNDGNILVSDAGASVVKKYTNTGVFISKFGSSGKQEG 132
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 98 PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV--TTIAGGGSKK 145
G+ F+ +D GN+YVAD +N+ I+K TN GV TI G G+ +
Sbjct: 176 TGAGSFNGNFDLTIDASGNVYVADVNNNRIQKFTNTGVYIKTIGGLGTTE 225
>gi|398784412|ref|ZP_10547676.1| redoxin domain-containing protein [Streptomyces auratus AGR0001]
gi|396995335|gb|EJJ06353.1| redoxin domain-containing protein [Streptomyces auratus AGR0001]
Length = 602
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 58/112 (51%), Gaps = 12/112 (10%)
Query: 59 LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
+V D++R L+ E V++R+ G G +G + P S F++P+ A+ G +
Sbjct: 188 FLVSDTTRHQLVELA--ADGEQVLRRI-GSGERGLT---PDS--FNEPQGLALLPDGTVA 239
Query: 119 VADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
VAD NH IR + + T+AG G++ + GPA+ + S+ ++L +
Sbjct: 240 VADTVNHAIRVFDPESGALETVAGTGTQWWQGSPTSGPAREVALSSPWDLAW 291
>gi|42524282|ref|NP_969662.1| hemagglutinin/hemolysin-related protein [Bdellovibrio bacteriovorus
HD100]
gi|39576491|emb|CAE80655.1| putative hemagglutinin/hemolysin-related protein [Bdellovibrio
bacteriovorus HD100]
Length = 1637
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/115 (25%), Positives = 55/115 (47%), Gaps = 5/115 (4%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
L+ ++V + G +DG G++RF +P + G IY D+S IR TN GV
Sbjct: 1018 LANQTVTTIAGNSSLGGITDGPLGTSRFSQPWGV-YKVPGGIYSIDRSAATIRFTTNAGV 1076
Query: 136 TTIAGGGSKKEGRADGPAQN--ASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
T G G + +Q + F+ +L + + ++ + D+ N +R++++
Sbjct: 1077 TRTVAGTPNVHGFIEAASQGDISRFNGPTDLCAIGN--SIYLVDNDNARVRRLDV 1129
>gi|159113256|ref|XP_001706855.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia ATCC
50803]
gi|157434955|gb|EDO79181.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia ATCC
50803]
Length = 969
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 44/93 (47%), Gaps = 6/93 (6%)
Query: 47 PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDK 105
PHS++ P + + + D+ TLS + VK LAG GV GY DG + F++
Sbjct: 143 PHSIVSDPVNGIIYLTDAKNHCIRTLSLMTRK---VKTLAGTPGVFGYKDGL--NPLFNE 197
Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
P + GN+ V D N +R ++ TI
Sbjct: 198 PLGLVLTEDGNVIVCDSKNGALRYVSRENGETI 230
>gi|195030130|ref|XP_001987921.1| GH10881 [Drosophila grimshawi]
gi|193903921|gb|EDW02788.1| GH10881 [Drosophila grimshawi]
Length = 728
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 55/122 (45%), Gaps = 19/122 (15%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTI 138
+V+ G G DG+ +ARF+ P+ A + + VAD NH +R+I+ N V T+
Sbjct: 255 IVEHKIGGLDNGLVDGDLLTARFNNPQGVAFLDEHTLIVADTDNHALRQISLKNGIVETL 314
Query: 139 AGGGSKKEGRADG----------PAQNASF-SNDFELTF------VPHICALLISDHGNQ 181
AG G + R G P A F + D +++F VP +LI+ G
Sbjct: 315 AGTGYQGNERVGGKLGQLQPLSSPWDVAVFRTRDMDMSFHVDELNVPEKTIVLIAMAGTH 374
Query: 182 LI 183
I
Sbjct: 375 QI 376
>gi|355707227|gb|AES02894.1| NHL repeat containing 2 [Mustela putorius furo]
Length = 498
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 38/69 (55%), Gaps = 8/69 (11%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAG---GGSKK 145
G DG + F+ P+ A+ + NI YVAD NH+IRKI V+T+AG G+ K
Sbjct: 192 GRKDGIFSESTFNSPQGVAI--RNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDK 249
Query: 146 EGRADGPAQ 154
EG A G Q
Sbjct: 250 EGGAQGEEQ 258
>gi|434391812|ref|YP_007126759.1| NHL repeat containing protein [Gloeocapsa sp. PCC 7428]
gi|428263653|gb|AFZ29599.1| NHL repeat containing protein [Gloeocapsa sp. PCC 7428]
Length = 509
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 33/66 (50%), Gaps = 3/66 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTI 138
++ + G G G DG A+F P+ D + + YVAD NH IR+I V TI
Sbjct: 205 LQYVIGTGKPGLRDGSFSEAQFFSPQGMCFDTEKQLLYVADTENHAIRRIDFQRQVVETI 264
Query: 139 AGGGSK 144
AG G +
Sbjct: 265 AGTGDQ 270
>gi|380300595|ref|ZP_09850288.1| NHL repeat protein [Brachybacterium squillarum M-6-3]
Length = 644
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 58/126 (46%), Gaps = 5/126 (3%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
LAG +G DG ++ + +P G + VAD + IR + + V+T+ G G
Sbjct: 352 LAGTTQEGLVDGPAVTSWWAQPSGIDELPDGRLVVADSESSAIRVLDPQTMEVSTLVGQG 411
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
G DGPA+ A + +T +P + ++D N IR ++ E+ ++G A
Sbjct: 412 LFDFGHVDGPAEVARLQHPLGVTVLPD-GRIAVADTYNGAIRIVDEHAEE--NPIRTGDA 468
Query: 203 LGAVSV 208
V+V
Sbjct: 469 ATVVTV 474
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 23/110 (20%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPK-------SFAVDMKGNIYVADKSNHVIRKI-- 130
+ V GDG +G DGE RF +P A ++ ++ VAD +NHV+R +
Sbjct: 213 TTVDEEIGDGRRGLLDGESAGVRFAEPNGVLALPSDVAAEVGYDLLVADTANHVLRGVKV 272
Query: 131 ---------TNLGVTTIAGGGSK-KEG----RADGPAQNASFSNDFELTF 166
T V T+AG G++ +G R +G ++ + S ++L +
Sbjct: 273 GQDRFLRSRTATEVWTVAGSGAQWMQGDALPRGEGDPRSYALSTPWDLGW 322
>gi|78047814|ref|YP_363989.1| hypothetical protein XCV2258 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78036244|emb|CAJ23935.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 499
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 36/68 (52%), Gaps = 5/68 (7%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---GVTTI 138
V R G G + DG G A F +P+ A++ + +YVAD NH +R+I NL V T+
Sbjct: 225 VLRQFGHGNADFIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQVDTL 282
Query: 139 AGGGSKKE 146
G G E
Sbjct: 283 CGTGRSGE 290
>gi|333381581|ref|ZP_08473261.1| hypothetical protein HMPREF9455_01427 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829828|gb|EGK02470.1| hypothetical protein HMPREF9455_01427 [Dysgonomonas gadei ATCC
BAA-286]
Length = 455
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 52/114 (45%), Gaps = 13/114 (11%)
Query: 85 LAG-DGVQGYSDGEPGSARFDKPKSFAVDMKG----------NIYVADKSNHVIRKITNL 133
+AG D GY+DG AR ++P+ V +K + Y DK+NHV+R +T
Sbjct: 339 IAGKDQTAGYADGVGSYARVNEPQQ-GVFVKNPEYEGQEDEYDFYFCDKANHVVRIMTPQ 397
Query: 134 GVTTIAGGGSKKEGRADGPAQ-NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
G + G + G DG + +A F+ + + I D N+ IR++
Sbjct: 398 GRVSTFAGTPQASGYRDGDLRLDARFAYPAGIAYDEERQCFYIGDSNNRRIRKV 451
>gi|397781344|ref|YP_006545817.1| Internalin-A [Methanoculleus bourgensis MS2]
gi|396939846|emb|CCJ37101.1| Internalin-A [Methanoculleus bourgensis MS2]
Length = 2759
Score = 39.7 bits (91), Expect = 1.4, Method: Composition-based stats.
Identities = 34/106 (32%), Positives = 51/106 (48%), Gaps = 9/106 (8%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
G +F P S AVD GN+YV+D +N I+K + G T + GS G DG + F
Sbjct: 138 GDGQFSYPDSVAVDAAGNVYVSDTNNGRIQKFDSDG-TFLGKWGS--WGSGDGQLR---F 191
Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
S D + +I ++++GN I++ + K SGS G
Sbjct: 192 SQDLVVDAAGNI---YVAEYGNHRIQKFDSNGNFLWKKGSSGSGDG 234
Score = 37.7 bits (86), Expect = 5.8, Method: Composition-based stats.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
G+ +F+ P+ AV+ GN+YVAD NH I+K + G
Sbjct: 91 GNGQFNGPQGIAVNAAGNVYVADTYNHRIQKFDSSG 126
>gi|29789158|ref|NP_080087.1| NHL repeat-containing protein 2 [Mus musculus]
gi|81898236|sp|Q8BZW8.1|NHLC2_MOUSE RecName: Full=NHL repeat-containing protein 2
gi|26329039|dbj|BAC28258.1| unnamed protein product [Mus musculus]
gi|63146329|gb|AAH95956.1| NHL repeat containing 2 [Mus musculus]
gi|148669816|gb|EDL01763.1| NHL repeat containing 2 [Mus musculus]
gi|187951881|gb|AAI38191.1| NHL repeat containing 2 [Mus musculus]
gi|223459878|gb|AAI38192.1| NHL repeat containing 2 [Mus musculus]
Length = 725
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 6/78 (7%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
G DG + F+ P+ A+ IYVAD NH+IRKI VTT+AG G+ E
Sbjct: 267 GRKDGMFSESSFNSPQGVAI-ADNVIYVADTENHLIRKIDLEAEKVTTVAGVGIQGTDTE 325
Query: 147 GRADGPAQNASFSNDFEL 164
G +G Q S D L
Sbjct: 326 GGEEGDKQPISSPWDVAL 343
>gi|188501663|gb|ACD54780.1| NHL repeat containing protein-like protein [Philodina roseola]
Length = 324
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 47/106 (44%), Gaps = 12/106 (11%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNH--VIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
F+ P A+D GNIYVAD NH V+ + T AG G G + F+
Sbjct: 89 FNNPSDVAIDSVGNIYVADTLNHRVVLYSVGATNGTLFAGTGLPGSGIHE-------FNQ 141
Query: 161 DFELTFVPHICALLISDHGNQLIRQINLKPED---CSKSSQSGSAL 203
L+F + AL I+DH N I + L + + S SG A+
Sbjct: 142 PSGLSFDSNSNALYIADHENHRIMRYYLNSSNGTVVAGGSGSGRAI 187
>gi|148657095|ref|YP_001277300.1| PA14 domain-containing protein [Roseiflexus sp. RS-1]
gi|148569205|gb|ABQ91350.1| PA14 domain protein [Roseiflexus sp. RS-1]
Length = 1293
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 5/72 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
++R + DG + G GS +F P+S AV G +YVAD +NH I++ + G T +
Sbjct: 352 IQRFSADGTFLGAWGAEGSGDGQFVYPRSVAVASDGTVYVADSNNHRIQRFSATG-TFLG 410
Query: 140 GGGSKKEGRADG 151
GS EG +DG
Sbjct: 411 KWGS--EGSSDG 420
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 2/60 (3%)
Query: 77 SEESVVKRLAGDGVQGYSDGEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
S+ ++R + G + G PG+ +F P+S AV G +YVAD NH I++ + +G
Sbjct: 76 SDNHRIQRFSAAGELLGAWGSPGTGDGQFSSPRSVAVAPDGTVYVADTGNHRIQRFSAIG 135
>gi|295132101|ref|YP_003582777.1| hypothetical protein ZPR_0220 [Zunongwangia profunda SM-A87]
gi|294980116|gb|ADF50581.1| conserved hypothetical protein [Zunongwangia profunda SM-A87]
Length = 466
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 54/128 (42%), Gaps = 22/128 (17%)
Query: 81 VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG------------NIYVADKSNHVIR 128
V +L+ DG GY DG S R +KP V +K + Y ++ NH IR
Sbjct: 338 VCGKLSNDG--GYQDGVGSSVRLNKPYQ-GVFVKNEDYVEAGKTDEYDFYFTEQGNHDIR 394
Query: 129 KITNLG-VTTIAGGGSKK-----EGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQ 181
K+T G VTT AG GS G +G + A F + + A I D N
Sbjct: 395 KLTPEGSVTTFAGRGSSSINPDPWGYVNGDLREEARFDQPTGIAYNEEEKAFYIGDKSNH 454
Query: 182 LIRQINLK 189
R+I L+
Sbjct: 455 RFRKIALE 462
>gi|403363814|gb|EJY81656.1| NHL repeat-containing protein 2 [Oxytricha trifallax]
Length = 478
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 12/135 (8%)
Query: 76 LSEESV-VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN---IYVADKSNHVIRKI- 130
++EE++ + + G+G G DG F P+ + N IY+ D NH IR+I
Sbjct: 121 VNEETLECQGVIGNGKIGLVDGNYEEGSFHHPQGMCHIYRDNMHFIYLCDTKNHAIREIN 180
Query: 131 -TNLGVTTIAGGGSK---KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
T V T+ G G K +EG + Q S+ +++ V LLI+ G I +
Sbjct: 181 LTKKEVLTVIGTGEKGFDREGNKEPELQ--KLSSPWDIVAVNRD-TLLIAMAGVHQIWAL 237
Query: 187 NLKPEDCSKSSQSGS 201
NLK C S SG
Sbjct: 238 NLKTNRCFNFSGSGK 252
>gi|302834543|ref|XP_002948834.1| hypothetical protein VOLCADRAFT_80356 [Volvox carteri f.
nagariensis]
gi|300266025|gb|EFJ50214.1| hypothetical protein VOLCADRAFT_80356 [Volvox carteri f.
nagariensis]
Length = 774
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 10/100 (10%)
Query: 46 NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
+P +V+ S + V DS L+ P + E V LAG G G+ DG +A+F +
Sbjct: 498 HPLAVLSSADGSAIYVADSYNHRIKALN-PNTNEIVT--LAGSGAAGFRDGVGTAAQFSE 554
Query: 106 PKSFAVDMKGNIYVADKSNHVIR-------KITNLGVTTI 138
P G I +AD +N +R +++ LG+T +
Sbjct: 555 PAGLCRGPNGTILIADTNNSAVRILDPKTQRVSTLGLTGV 594
Score = 37.4 bits (85), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 38/66 (57%), Gaps = 4/66 (6%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR--KITNLGVTTI 138
++++ G+G DG +A F++P+ + N ++VAD NH +R +T+ V T+
Sbjct: 295 IEQIGGNG-PALRDGSFETAAFNRPQGLVFSPRRNQLFVADTENHAVRCCDLTSNTVITL 353
Query: 139 AGGGSK 144
AG GSK
Sbjct: 354 AGNGSK 359
>gi|414163305|ref|ZP_11419552.1| hypothetical protein HMPREF9697_01453 [Afipia felis ATCC 53690]
gi|410881085|gb|EKS28925.1| hypothetical protein HMPREF9697_01453 [Afipia felis ATCC 53690]
Length = 357
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 80 SVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIR 128
S V ++ DG S GE GS +F+ P S A D KGN+YVAD+ N I+
Sbjct: 184 SRVAKVDKDGNWIKSWGEKGSEPGQFNTPHSIATDAKGNVYVADRGNARIQ 234
>gi|290981371|ref|XP_002673404.1| predicted protein [Naegleria gruberi]
gi|284086987|gb|EFC40660.1| predicted protein [Naegleria gruberi]
Length = 1204
Score = 39.3 bits (90), Expect = 1.6, Method: Composition-based stats.
Identities = 19/42 (45%), Positives = 28/42 (66%), Gaps = 2/42 (4%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGG 142
F+KP+ V GNIY+AD N++I+K I+ ++ IAGGG
Sbjct: 69 FNKPQGVFVTKNGNIYIADTGNNLIKKYEISTQKLSIIAGGG 110
>gi|194767578|ref|XP_001965892.1| GF16360 [Drosophila ananassae]
gi|190619368|gb|EDV34892.1| GF16360 [Drosophila ananassae]
Length = 300
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 2/80 (2%)
Query: 76 LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NL 133
+S +V+ + GD G DG+ ARF P+ + + VAD NH +RKI+ +
Sbjct: 72 VSSRGLVEHIIGDHKAGLIDGKFTEARFKHPQGLTFLDEHTLIVADTENHALRKISLADG 131
Query: 134 GVTTIAGGGSKKEGRADGPA 153
V T+AG G + R G A
Sbjct: 132 IVKTLAGTGVQGCDRIGGNA 151
>gi|423223182|ref|ZP_17209651.1| hypothetical protein HMPREF1062_01837 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392639433|gb|EIY33254.1| hypothetical protein HMPREF1062_01837 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 447
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 4/80 (5%)
Query: 83 KRLAGDGV--QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIA 139
+R AG+G Q ++G +F P VD +GNIYV + IRKI+ G TIA
Sbjct: 338 ERYAGNGWPNQTNTEGYREDIQFAWPNGVTVDNEGNIYVVNAGGQSIRKISYPDGYVTIA 397
Query: 140 GGGSKK-EGRADGPAQNASF 158
GG+ E DG +F
Sbjct: 398 AGGTMSGETETDGLPLECTF 417
>gi|224537023|ref|ZP_03677562.1| hypothetical protein BACCELL_01899 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521331|gb|EEF90436.1| hypothetical protein BACCELL_01899 [Bacteroides cellulosilyticus
DSM 14838]
Length = 447
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 4/80 (5%)
Query: 83 KRLAGDGV--QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIA 139
+R AG+G Q ++G +F P VD +GNIYV + IRKI+ G TIA
Sbjct: 338 ERYAGNGWPNQTNTEGYREDIQFAWPNGVTVDNEGNIYVVNAGGQSIRKISYPDGYVTIA 397
Query: 140 GGGSKK-EGRADGPAQNASF 158
GG+ E DG +F
Sbjct: 398 AGGTMSGETETDGLPLECTF 417
>gi|291232828|ref|XP_002736356.1| PREDICTED: NHL (ring finger b-box coiled coil) domain containing
family member (nhl-1)-like [Saccoglossus kowalevskii]
Length = 871
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 49/121 (40%), Gaps = 25/121 (20%)
Query: 89 GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
G +G DG + + P A D NIYV D+ NH I+ V T++G +K GR
Sbjct: 695 GQEGSEDG-----KLNYPWGVACDRADNIYVCDRDNHRIQ------VFTVSGTFIRKFGR 743
Query: 149 ADGPAQNASFSNDFELTFVPHICAL-----LISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
A PH A+ +SD GN ++ NL + K Q GS
Sbjct: 744 LGHSAGQLE---------SPHYVAIHDDKVFVSDSGNHCVQVFNLDGQFQWKFGQEGSNY 794
Query: 204 G 204
G
Sbjct: 795 G 795
>gi|283779938|ref|YP_003370693.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
gi|283438391|gb|ADB16833.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
Length = 846
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 36/82 (43%), Gaps = 6/82 (7%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
G F P D +GN++ D NHV+R+ + GV T G K G N F
Sbjct: 86 GRGMFKVPHGLRFDKQGNLWTTDNGNHVLRRFNSEGVQTKTIGEENKPGNG-----NLQF 140
Query: 159 SNDFELTFVPHICALLISDHGN 180
+ +L F ++ ++D GN
Sbjct: 141 RSPDDLVFGSD-GSIYVADAGN 161
>gi|428164545|gb|EKX33567.1| hypothetical protein GUITHDRAFT_120259 [Guillardia theta CCMP2712]
Length = 396
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 32/52 (61%), Gaps = 3/52 (5%)
Query: 92 GYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIR--KITNLGVTTIAG 140
GY DG+ SARF++P+ F + G I +AD +N+ +R I+ VTT+ G
Sbjct: 100 GYVDGDGLSARFNQPRDFVMSTDGTKILIADTNNYCLRFFDISTRKVTTLTG 151
>gi|374855656|dbj|BAL58511.1| NHL repeat containing protein [uncultured candidate division OP1
bacterium]
Length = 419
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 99 GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
G +FD+P AVD +G IYVA++ NH I+KI
Sbjct: 385 GDGQFDQPVDLAVDPEGVIYVAEEGNHRIQKI 416
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 39/89 (43%), Gaps = 9/89 (10%)
Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
+F P AVD G +YVAD N I+K ++ G I G G DG F
Sbjct: 340 GQFQAPLGIAVDTYGFVYVADTGNDRIQKFSSDGAFIIKWG---SLGAGDG-----QFDQ 391
Query: 161 DFELTFVPHICALLISDHGNQLIRQINLK 189
+L P + +++ GN I++I K
Sbjct: 392 PVDLAVDPE-GVIYVAEEGNHRIQKIKFK 419
>gi|290979904|ref|XP_002672673.1| predicted protein [Naegleria gruberi]
gi|284086251|gb|EFC39929.1| predicted protein [Naegleria gruberi]
Length = 524
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 55/118 (46%), Gaps = 9/118 (7%)
Query: 76 LSEESVVKRLAGDG---VQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
LSE +V +AG+G + D P SA P S A G + +A++ HVIRK+
Sbjct: 110 LSEMGMVTTIAGNGSTTLTSLIDNVPATSAYLKAPASVAFLSNGEMIIAERQGHVIRKVD 169
Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFS--ND-FELTFVPHICALLISDHGNQLIRQI 186
G ++ G + G +G + NA S ND LT + +L D GN IR +
Sbjct: 170 LTGQISLIAGSFLQLG-LNGDSSNAKISLLNDPLSLTVLKGD-RILFVDRGNNRIRML 225
>gi|187250772|ref|YP_001875254.1| hypothetical protein Emin_0359 [Elusimicrobium minutum Pei191]
gi|186970932|gb|ACC97917.1| hypothetical protein Emin_0359 [Elusimicrobium minutum Pei191]
Length = 448
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 61/119 (51%), Gaps = 5/119 (4%)
Query: 71 TLSFPLSEES-VVKRLAGDGVQGYS-DGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHV 126
TLS L + + +V+ +AG G+ G++ D + G+ + P + AVD +Y+ DK N
Sbjct: 105 TLSMQLIKRTGIVRPVAGKGIAGFNGDDQLGALQSSLSNPCAIAVDNLSALYILDKGNKR 164
Query: 127 IRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
IRK+ G +TT+AG G + A++ FS+ ++ P +I +L++
Sbjct: 165 IRKVFADGMITTLAGNGRSGMFQEGLVAEDFRFSDLQDIALSPEGTIYIIDSGFKRLLK 223
>gi|427796943|gb|JAA63923.1| Putative tlpa-like family, partial [Rhipicephalus pulchellus]
Length = 719
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 50/113 (44%), Gaps = 5/113 (4%)
Query: 82 VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
V ++ G GY DG A F +P I VAD NH IR++ N V+T+A
Sbjct: 279 VLQVVGGYEPGYQDGPLNGALFSEPNGVQWRDPHFILVADTGNHAIREVDLENGTVSTLA 338
Query: 140 GGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
G G + E G Q ++ +++ V ++ L I+ G+ I + E
Sbjct: 339 GTGKQGEDTEGGRLGQQQELNSPWDICLVDNV--LFIAMAGSHQIWAYFFRDE 389
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 6/84 (7%)
Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
K FA +M+ + +AD +H I ITN + G + G DGP A FS + +
Sbjct: 252 KVFATEMR--LIIADTGHHRI-VITNHSGEVLQVVGGYEPGYQDGPLNGALFSEPNGVQW 308
Query: 167 V-PHICALLISDHGNQLIRQINLK 189
PH +L++D GN IR+++L+
Sbjct: 309 RDPHF--ILVADTGNHAIREVDLE 330
>gi|428304361|ref|YP_007141186.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
gi|428245896|gb|AFZ11676.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
Length = 491
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 58/134 (43%), Gaps = 8/134 (5%)
Query: 58 DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ---GYSDGEPGSARFDKPKSFAVDMK 114
+L + DS S+ ++ L VV+ + G G G DG + R A
Sbjct: 362 ELFIADSETSSIRAVT--LGNFPVVRTICGSGQLFGFGDVDGIGENVRLQHCLGVAY-AS 418
Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
G ++V D NH ++++ GS K G DG ++ FS L++ + L
Sbjct: 419 GYLWVTDTYNHKLKRVNPTTGECQTMSGSGKAGLQDGIGKDVYFSEPSGLSYACNY--LY 476
Query: 175 ISDHGNQLIRQINL 188
I+D N IR+INL
Sbjct: 477 IADSNNHAIRRINL 490
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 60/144 (41%), Gaps = 7/144 (4%)
Query: 63 DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-----AVDMKGNI 117
+ R L L EES K+ DG G EP P +F A ++
Sbjct: 138 EGKREFLDNLVQQLIEESSGKKTV-DGQSGKFTLEPNQTLRLSPLAFPSKVIACQQSNSL 196
Query: 118 YVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISD 177
++AD +H + I +L T A G+ G DG + A F + F + ++D
Sbjct: 197 FIADTGHHRL-VIASLDGETKAVIGTGSAGWVDGDLEIAQFCEPMGMVFNHEQQVIYVAD 255
Query: 178 HGNQLIRQINLKPEDCSKSSQSGS 201
N L+R+I+LK S + +G+
Sbjct: 256 TVNHLLRKIDLKTRQVSTLAGTGT 279
>gi|322435783|ref|YP_004217995.1| cell surface receptor IPT/TIG [Granulicella tundricola MP5ACTX9]
gi|321163510|gb|ADW69215.1| cell surface receptor IPT/TIG domain protein [Granulicella tundricola
MP5ACTX9]
Length = 1852
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 14/28 (50%), Positives = 21/28 (75%)
Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
F++P S AVD GN+YVAD N+ +++I
Sbjct: 1159 FNQPYSMAVDAAGNVYVADFKNNAVKEI 1186
>gi|386773539|ref|ZP_10095917.1| NHL repeat protein [Brachybacterium paraconglomeratum LC44]
Length = 634
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 54/110 (49%), Gaps = 23/110 (20%)
Query: 80 SVVKRLAGDGVQGYSDGEPGSARFDKPK-------SFAVDMKGNIYVADKSNHVIRKI-- 130
+ V + G G +G++DG+ +A+F +P A ++ ++ VAD + H +R +
Sbjct: 213 ATVDAIIGTGERGHADGDEDTAQFAEPNGVLALPAEVAHEVGYDLLVADTAGHRLRGVKI 272
Query: 131 ---------TNLGVTTIAGGGSK-KEG----RADGPAQNASFSNDFELTF 166
T VTT+AG G + +G R +G A+ S S ++LT+
Sbjct: 273 GQDRLLRSRTTTEVTTLAGTGEQWMQGEPLPRGEGDARTYSLSTPWDLTW 322
Score = 37.0 bits (84), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 47/105 (44%), Gaps = 3/105 (2%)
Query: 85 LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
LAG +G DG ++ + +P G I +AD + +R + + V+T+ G G
Sbjct: 352 LAGTTQEGLVDGPAVTSWWAQPSGLDEMPDGRIVIADSESSAVRLLDPQTMQVSTLVGKG 411
Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
G DGP A + +T +P + I+D N IR ++
Sbjct: 412 LFDFGHVDGPLDRARLQHPLGVTALPD-GRIAIADTYNGAIRLLD 455
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.134 0.386
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,191,056,213
Number of Sequences: 23463169
Number of extensions: 167193491
Number of successful extensions: 449925
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 611
Number of HSP's successfully gapped in prelim test: 580
Number of HSP's that attempted gapping in prelim test: 444585
Number of HSP's gapped (non-prelim): 4447
length of query: 269
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 129
effective length of database: 9,074,351,707
effective search space: 1170591370203
effective search space used: 1170591370203
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)